doris

Author	SHA1	Message	Date
Yongqiang YANG	3d6a13605d	[improvement](stacktrace) do not captute stack trace for txn error codes (#19817 )	2023-05-18 23:58:56 +08:00
ZhangYu0123	07bbf741fb	[enhence](memory) gc inverted index cache when there is not enough memory (#19622 ) Support to gc inverted index cache when there is not enough memory. previous problem： The inverted index cache (InvertedIndexSearcherCache and InvertedIndexQueryCache) may use 20% memory which can't be released.	2023-05-18 16:41:51 +08:00
Xinyi Zou	068a32bc49	[Improvement](memory) faststring use Allocator #19762 After the outer catch exception, faststring resize reserve build may throw a memory alloc failure exception from the Allocator. Currently page body compress will catch memory alloc failure exception	2023-05-18 15:00:49 +08:00
Xinyi Zou	7c8b7878cd	[fix](memory) Print all query/load memory before memory GC when `memory_debug=true` (#19720 )	2023-05-18 14:55:47 +08:00
Gabriel	851886cc18	[minor](datev2) remove datev2 because datev2 is used by default (#19777 )	2023-05-18 13:36:11 +08:00
yixiutt	943e5fb7e5	[improvement](MOW) use seperated cache for mow pk cache (#19686 ) In mow, primary key cache have a big impact on load performance, so we add a new cache type to seperate it from page cache to make it more flexible in some cases	2023-05-18 13:27:09 +08:00
chenlinzhong	f412aec187	[improvement](load)disable shrink memory by default (#19714 ) disable shrink memory by default, it becomes very slow when importing large amounts of data you can turn on If you think it's necessary	2023-05-18 11:25:39 +08:00
Xinyi Zou	d5d47703fe	[fix](memory) remove auto option in memory config and optimize memtracker logs #19706 fix mem_limit default value memory_gc_sleep_time_s to memory_gc_sleep_time_ms LoadChannelMgr::_handle_mem_exceed_limit process_mem_limit to process soft mem limit fix query mem tracker print	2023-05-18 08:54:03 +08:00
zhengyu	4566281cc3	[fix](sink) disable lazy-open partition by default (#19769 ) Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>	2023-05-18 07:28:04 +08:00
HappenLee	dc18da2ce4	[Log](expr) add DCHECK info for expr close DCHECK (#19683 )	2023-05-17 21:37:38 +08:00
luozenglin	272a7565b8	[improvement](tracing) Remove useless span levels from be side tracing (#19665 ) 1. Remove an exec node method corresponding to a span and replace it with an exec node corresponding to a span; 2. Fix some problems with tracing in pipeline.	2023-05-17 19:04:52 +08:00
Yongqiang YANG	d76e2e2254	[chore](config) ignore_eovercrowded to be true by default (#19282 )	2023-05-17 16:21:32 +08:00
zxealous	2d9cc8fe8f	[improvement](file cache)Support set min file segment size while use block file cache (#19536 )	2023-05-17 10:23:33 +08:00
Gabriel	8fd1eb0d1e	[minor](hash table) parameterize hash table (#19653 )	2023-05-17 09:58:26 +08:00
Yongqiang YANG	610f1c8ef5	[improvement](load) skip compression when memtable is small (#19300 ) * [improvement](load) skip compression when memtable is small * format	2023-05-16 12:08:41 +08:00
luozenglin	6c9c9e9765	[feature-wip](resource-group) Supports memory hard isolation of resource group (#19526 )	2023-05-15 22:45:46 +08:00
AlexYue	0617c7e56b	[enhance](Cold&Heat separation) use file block cache for cold heat separation rowset (#19410 ) For performance issue, we would specify rowset included by cold heat separation table to use file block cache no matter what config user has set. I've tested the config using cold_heat_seperation_case_p2 and it works well.	2023-05-14 22:06:26 +08:00
HHoflittlefish777	f8ef25bb10	[enhancement](load) lazy-open necessary partitions when load (#18874 )	2023-05-14 16:09:55 +08:00
yiguolei	1d421a26d9	[bugfix](memory) merge block may allocate failed (#19507 )	2023-05-11 10:42:47 +08:00
DeadlineFen	a05dbd3f81	[chore](compile) Improves PCH cache hit ratio (#19469 ) Supplement the documentation of be-clion-dev, avoid the problem of undefined DORIS_JAVA_HOME and inability to find jni.h when using clion development without directly compiling through build.sh Complete the classification of header files in pch.h and introduce some header files that are not frequently modified in doris. Separate the declaration and definition in common/config.h. If you need to modify the default configuration now, please modify it in common/config.cpp. gen_cpp/version.h is regenerated every time it is recompiled, which may cause PCH to fail, so now you need to get the version information indirectly rather than directly.	2023-05-10 12:49:01 +08:00
lihangyu	b07053f47d	[chore](simdjson reader) default enable simdjson for json reader (#19375 )	2023-05-09 16:53:21 +08:00
DeadlineFen	e08de52ee7	[chore](compile) using PCH for compilation acceleration under clang (#19303 )	2023-05-08 19:51:06 +08:00
airborne12	f199860dea	[Improvement](inverted index) Enhance compaction performance through direct inverted index merging (#19207 )	2023-05-08 14:07:32 +08:00
zclllyybb	bb462202dc	[Exec] log the fuzzy config of be (#19349 )	2023-05-08 11:01:54 +08:00
奕冷	5bf1396efe	[enhancement](load) merge single-replica related services as non-standalone (#18421 )	2023-05-06 22:54:56 +08:00
Yongqiang YANG	3ddedb676c	[fix](status) do not capture stacktrace for META_KEY_NOT_FOUND (#19308 ) * [fix](status) do not capture stacktrace for META_KEY_NOT_FOUND * handle PUSH_VERSION_ALREADY_EXIST	2023-05-06 10:04:28 +08:00
Xinyi Zou	58cb404661	[fix](memory) Allocator throws Exception instead of std::bad_alloc (#19285 ) W0505 01:31:25.840227 1727715 scanner_scheduler.cpp:340] Scan thread read VScanner failed: [MEM_LIMIT_EXCEEDED]PreCatch error code:11, [E11] Allocator sys memory check failed: Cannot alloc:16384, consuming tracker:<Orphan>, exec node:<>, process memory used 5.87 GB exceed limit 5.64 GB or sys mem available 252.17 GB less than low water mark 1.60 GB, failed alloc size 16.00 KB. @ 0x555c19e0cca8 doris::Exception::Exception() @ 0x555c1c3e0c3f Allocator<>::sys_memory_check() @ 0x555c1c3e1052 Allocator<>::memory_check() @ 0x555c19e0a645 Allocator<>::alloc() @ 0x555c1c34508b COWHelper<>::create<>() @ 0x555c1e23f574 doris::vectorized::ConvertThroughParsing<>::execute<>() @ 0x555c1e23f209 doris::vectorized::FunctionConvertFromString<>::execute_impl() @ 0x555c1e23f4aa doris::vectorized::FunctionConvertFromString<>::execute_impl() @ 0x555c1e15ac29 doris::vectorized::PreparedFunctionImpl::execute_without_low_cardinality_columns() @ 0x555c1e15ac56 doris::vectorized::PreparedFunctionImpl::execute() @ 0x555c1e245276 _ZNSt17_Function_handlerIFN5doris6StatusEPNS0_15FunctionContextERNS0_10vectorized5BlockERKSt6vectorImSaImEEmmEZNKS4_12FunctionCast14create_wrapperINS4_14DataTypeNumberIiEEEESt8functionISC_ERKSt10shared_ptrIKNS4_9IDataTypeEEPKT_bEUlS3_S6_SB_mmE_E9_M_invokeERKSt9_Any_dataOS3_S6_SB_OmSY_ @ 0x555c1e2a9341 _ZZNK5doris10vectorized12FunctionCast23prepare_remove_nullableEPNS_15FunctionContextERKSt10shared_ptrIKNS0_9IDataTypeEES9_bENKUlS3_RNS0_5BlockERKSt6vectorImSaImEEmmE_clES3_SB_SG_mm @ 0x555c1e2a8d42 _ZNSt17_Function_handlerIFN5doris6StatusEPNS0_15FunctionContextERNS0_10vectorized5BlockERKSt6vectorImSaImEEmmEZNKS4_12FunctionCast23prepare_remove_nullableES3_RKSt10shared_ptrIKNS4_9IDataTypeEESJ_bEUlS3_S6_SB_mmE_E9_M_invokeERKSt9_Any_dataOS3_S6_SB_OmSQ_ @ 0x555c1e20e42b doris::vectorized::PreparedFunctionCast::execute_impl() @ 0x555c1e15ac29 doris::vectorized::PreparedFunctionImpl::execute_without_low_cardinality_columns() @ 0x555c1e15ac56 doris::vectorized::PreparedFunctionImpl::execute() @ 0x555c1d63e960 doris::vectorized::IFunctionBase::execute() @ 0x555c1d628700 doris::vectorized::VCastExpr::execute() @ 0x555c1d6163e5 doris::vectorized::VExprContext::execute() @ 0x555c20a83fe1 doris::vectorized::VFileScanner::_convert_to_output_block() @ 0x555c20a809af doris::vectorized::VFileScanner::_get_block_impl() @ 0x555c209b9bc4 doris::vectorized::VScanner::get_block() @ 0x555c209b1a50 doris::vectorized::ScannerScheduler::_scanner_scan() @ 0x555c209b2ac1 _ZNSt17_Function_handlerIFvvEZZN5doris10vectorized16ScannerScheduler18_schedule_scannersEPNS2_14ScannerContextEENK3$_0clEvEUlvE1_E9_M_invokeERKSt9_Any_data @ 0x555c1a8378cf doris::ThreadPool::dispatch_thread() @ 0x555c1a830fac doris::Thread::supervise_thread() @ 0x7f461faa117a start_thread @ 0x7f462033bdf3 __GI___clone @ (nil) (unknown)	2023-05-05 18:01:48 +08:00
xiaojunjie	9813406757	[Enhancement](HttpServer) Add http interface authentication for BE (#17753 )	2023-05-04 23:46:49 +08:00
yiguolei	8eab20d3df	[bugfix](low cardinality) cached code is wrong will result wrong query result when many null pages (#19221 ) Sometimes the dict is not initialized when run comparison predicate here, for example, the full page is null, then the reader will skip read, so that the dictionary is not inited. The cached code is wrong during this case, because the following page maybe not null, and the dict should have items in the future. This will result the dict string column query return wrong result, if there are many null values in the column. I also add some regression test for dict column's equal query, larger than query, less than query. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-04-29 21:28:41 +08:00
Xinyi Zou	1379d7f3e0	[fix](memory) mmap threshold can be modified in conf, Increase to 128M	2023-04-28 18:17:22 +08:00
Pxl	ec517a53a8	[Chore](build) upgrade clang-format version to 16 && move thrift to fe-common (#19155 ) upgrade clang-format version to 16 move thrift to fe-common fix core dump on pipeline engine when operator canceled and not prepared	2023-04-28 14:14:51 +08:00
Zhengguo Yang	52b1bd2c81	[clone](download) fix be clone action download tablet content length overflow (#18851 )	2023-04-28 11:35:17 +08:00
Lightman	1be5dac036	[improve] Refactor file cache and Improve the file cache strategy (#18652 ) 1. Refactor file cache. Before refactor, the file cache config format is "[{"path":"/path/to/file_cache","normal":21474836480,"persistent":10737418240,"query_limit":10737418240}]" and now change to "[{"path":"/mnt/disk3/selectdb_cloud/file_cache","total_size":21474836480,"query_limit":10737418240}]". It will be simpler than before. 2. Support more strategy. Support file cache priority. The file cache will have three queue, name as 'index'/'normal'/'disposable'. We can avoid that the higher priority data is eliminate by the lower priority data.	2023-04-25 23:14:28 +08:00
yiguolei	4e9b32d622	[bugfix](exception) remove fmt code to test if there still exist core (#19009 )	2023-04-25 07:24:14 +08:00
yiguolei	8d7a9fd21b	[refactor](exceptionsafe) add factory creator to some class (#18978 ) make vexprecontext,vexpr,function,query context,runtimestate thread safe. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-04-24 10:32:11 +08:00
AlexYue	c3baa65de3	[feature](io) enable s3 file writer with multi part uploading concurrently (#17585 ) Formerly S3FileWriter has to write each buffer with 5MB or more then upload one part, after all these works are done it could then process the incoming data, it's blocking and inefficient. This pr brings one bufferpool where the data could write into memory buffer immediately if has free buffer and then it would be uploaded into the S3. This pr doesn't provide the ability to elegantly support cases where there is no free buffer, i'll leave it as one future work.	2023-04-23 23:19:44 +08:00
yiguolei	61b44108e2	[bugfix](asan) fix possible asan check bug in exception to string (#18936 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-04-23 12:26:36 +08:00
Jack Drogon	5706bef2b3	[feature](common) Add unexpected/result support (#18312 ) * Add unexpected/result support * Rename result.hpp -> result.h && Add NOLINT in expected.hpp * Add NOLINT in result.h to avoid clang-tidy checker * Rename result.h to expected.h * Add Apache License for be/src/util/expected.hpp * Disable clang-format in be util/expected.hpp	2023-04-21 17:07:20 +08:00
yiguolei	63a76ed115	[refactor](exceptionsafe) disallow call new method explicitly (#18830 ) disallow call new method explicitly force to use create_shared or create_unique to use shared ptr placement new is allowed reference https://abseil.io/tips/42 to add factory method to all class. I think we should follow this guide because if throw exception in new method, the program will terminate. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-04-21 09:13:24 +08:00
Adonis Ling	e412dd12e8	[chore](build) Use include-what-you-use to optimize includes (PART II) (#18761 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-19 23:11:48 +08:00
Xinyi Zou	79c446c89f	[enhancement](exception) Column filter/replicate supports exception safety (#18503 )	2023-04-18 19:23:09 +08:00
HappenLee	b68857902e	[Compile](BE) Fix compile failed with tcmalloc (#18748 )	2023-04-18 09:26:45 +08:00
zhengyu	16cdd9e95a	[feature](segcompaction) enable segcompaction by default (#18722 ) Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>	2023-04-17 18:00:49 +08:00
Adonis Ling	9e960f4c4f	[chore](build) Use include-what-you-use to optimize includes (#18681 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-17 11:44:58 +08:00
xiaojunjie	cbe2e138c3	[Enhancement](HttpServer) Support https in be (#17034 ) * [Enhancement](HttpServer) Support https in be	2023-04-12 10:27:07 +08:00
Mingyu Chen	e848e456be	[config] modify tablet_shard to 4 and add some log (#18416 ) modify the default value of BE config tablet_map_shard_size to 4. To reduce lock contention. Add log when failed writing disk test file, for debug	2023-04-06 17:18:16 +08:00
gitccl	7f8d92656e	[fix](streamload) fix stream load failed when enable profile (#18364 ) #18015 enables stream load profile log, however be will encounter rpc fail when loading tpch data(see #18291). This is because when `is_report_success` is true, be will reportExecStatus to fe, but fe cannot find QueryInfo in `coordinatorMap`, thus it will return error to be.	2023-04-05 01:01:46 +08:00
Xinyi Zou	dd78001cc1	[fix](memory) Fix memtable flush mem tracker #18330	2023-04-03 20:37:14 +08:00
Mingyu Chen	7e61a85331	[refactor](libhdfs) introduce hadoop libhdfs (#18204 ) 1. Introduce hadoop libhdfs 2. For Linux-X86 platform, use the hadoop libhdfs 3. For other platform, use libhdfs3, because currently we don't have hadoop libhdfs binary for other platform Co-authored-by: adonis0147 <adonis0147@gmail.com>	2023-03-31 18:41:39 +08:00
lihangyu	35bae25568	[Improve](row store) add more profile info in log for point query and make row column page size more configurable (#18181 ) save about 20% FE cpu cost for point query with prepared statement which table contains 100 columns	2023-03-31 10:58:59 +08:00

1 2 3 4 5 ...

544 Commits