doris

Author	SHA1	Message	Date
Xinyi Zou	2ed6a00fd1	[opt](memory) Add GlobalMemoryArbitrator and support ReserveMemory (#34985 ) (#35070 )	2024-05-22 09:53:45 +08:00
Xinyi Zou	11ca738261	[fix](memory) Fix thread context init in MacOS and not use memory tracker (#34125 )	2024-05-06 20:11:20 +08:00
Xinyi Zou	f6ec64c6ad	[fix](exception) Fix Block noexcept method not throw exception (#34002 )	2024-04-24 17:13:50 +08:00
Xinyi Zou	c747714c18	[fix](memory) Fix ExecEnv destroy memory tracking (#33781 ) disable memory tracking when ExecEnv destroy. fix memory tracker label convert to query id	2024-04-19 15:03:10 +08:00
Xinyi Zou	cf7595d423	[opt](memory) Optimize mem tracker accuracy (#32039 ) (#33140 )	2024-04-10 11:42:19 +08:00
yiguolei	005f7af21f	[bugfix](deadlock) should not use query cancelled in fragment mgr	2024-04-09 16:09:01 +08:00
Xinyi Zou	d60d804d9c	[fix](memory) Fix task repeat attach task DCHECK failed #32784 (#33343 ) [branch-2.1](memory) Fix CCR task repeat attach task DCHECK failed3 #33366	2024-04-08 16:15:04 +08:00
Xinyi Zou	4268634115	[fix](memory) Fix Allocator cancel pipelinex query #32048	2024-03-12 14:20:18 +08:00
kindred77	3b093cabd1	Fix building issue in be on ubuntu with test enabled. (#31407 ) Co-authored-by: tangye <tangye@bestpay.com.cn>	2024-02-27 10:12:44 +08:00
yiguolei	2573150f6d	[refactor](runtime filter) do not wait runtime filter rpc finished when hash node or pipeline finished (#30970 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-02-16 10:16:40 +08:00
ShowCode	f565f60bc3	[refactor](standard)BE:Initialize pointer variables in the class to nullptr by default (#27587 )	2023-11-28 13:02:30 +08:00
Xinyi Zou	de6ecd2035	[fix](tls) Manually track memory in Allocator instead of mem hook and ThreadContext life cycle to manual control (#26904 ) Manually track query/load/compaction/etc. memory in Allocator instead of mem hook. Can still use Mem Hook when cannot manually track memory code segments and find memory locations during debugging. This will cause memory tracking loss for Query, loss less than 10% compared to the past, but this is expected to be more controllable. Similarly, Mem Hook will no longer track unowned memory to the orphan mem tracker by default, so the total memory of all MemTrackers will be less than before. Not need to get memory size from jemalloc in Mem Hook each memory alloc and free, which would lose performance in the past. Not require caching bthread local in pthread local for memory hook, in the past this has caused core dumps inside bthread, seems to be a bug in bthread. ThreadContext life cycle to manual control In the past, ThreadContext was automatically created when it was used for the first time (this was usually in the Jemalloc Hook when the first malloc memory), and was automatically destroyed when the thread exited. Now instead of manually controlling the create and destroy of ThreadContext, it is mainly created manually when the task thread start and destroyed before the task thread end. Run 43 clickbench query tests. Use MemHook in the past:	2023-11-14 10:30:42 +08:00
plat1ko	25b6e4deb2	[fix](daemon) Fix incorrect initialization order of daemon services (#23578 ) Current initialization dependency: Daemon ───┬──► StorageEngine ──► ExecEnv ──► Disk/Mem/CpuInfo │ │ BackendService ─┘ However, original code incorrectly initialize Daemon before StorageEngine. This PR also stop and join threads of daemon services in their dtor, to ensure Daemon services release resources in reverse order of initialization via RAII.	2023-08-31 19:46:38 +08:00
Xinyi Zou	ba351af452	[enhancement](thirdparty) upgrade thirdparty libs - again (#23414 ) submit again #23290 (not upgrade brpc, because bthread local has error) protobuf 3.15.0 -> 21.11 glog 0.4.0 -> 0.6.0 lz4 1.9.3 -> 1.9.4 curl 7.79.0 -> 8.2.1 zstd 1.5.2 -> 1.5.5 arrow 7.0.0 -> 13.0.0 abseil 20220623.1 -> 20230125.3 orc 1.7.2 -> 1.9.0 jemalloc for arrow 5.2.1 -> 5.3.0 xsimd 7.0.0 -> 13.0.0 opentelemetry-proto 0.19.0 -> 1.0.0 opentelemetry 1.8.3 -> 1.10.0 new: c-ares -> 1.19.1 grpc -> 1.54.3	2023-08-26 22:59:10 +08:00
Xinyi Zou	1f3de0eae3	[fix](memory) fix invalid large memory check && fix memory info thread safety (#22027 ) fix invalid large memory check fix memory info thread safety	2023-07-26 12:18:31 +08:00
Xinyi Zou	38c8657e5e	[improve](memory) more grace logging for memory exceed limit (#21311 ) more grace logging for Allocator and MemTracker when memory exceed limit fix bthread grace exit.	2023-07-05 14:59:06 +08:00
Xinyi Zou	661e1ae7c5	[fix](memory) no switch bthread context in UBSAN compile (#21064 ) When UBSAN is compiled, all memory will be tracked to the orphan (unknown) mem tracker, and the bthread context and mem tracker will no longer be switched. The supplementary fixes are as follows: #20999	2023-06-21 21:14:07 +08:00
Xinyi Zou	622ef63c69	[fix](memory) fix `bthread_setspecific` error in rpc done.run() (#20999 )	2023-06-20 21:00:45 +08:00
Xinyi Zou	e801e3b737	[fix](memory) Fix crash at `bthread_setspecific` in `brpc::Socket::CheckHealth()` (#20450 ) Only switch to bthread local when modifying the mem tracker in the thread context. No longer switches to bthread local by default when bthread starts mem tracker increases brpc IOBufBlockMemory memory remove thread mem tracker metrics	2023-06-08 19:48:19 +08:00
Pxl	7dc7ed97eb	[Chore](build) remove some unused code and remove some wno (#20326 ) remove some unused code about spinlock remove some wno and fix warning remove varadic macro usage	2023-06-05 10:48:07 +08:00
Xinyi Zou	cf7a74f6ec	[fix](memory) query check cancel while waiting for memory in Allocator, and optimize log (#19967 ) After the query check process memory exceed limit in Allocator, it will wait up to 5s. Before, Allocator will not check whether the query is canceled while waiting for memory, this causes the query to not end quickly.	2023-05-24 11:08:48 +08:00
Xinyi Zou	512806f902	[fix](ubsan) UBSAN avoid thread local switch	2023-05-20 07:14:43 +08:00
yiguolei	1d421a26d9	[bugfix](memory) merge block may allocate failed (#19507 )	2023-05-11 10:42:47 +08:00
Xinyi Zou	cf8ceb8586	[fix](scan) fix scanner mem tracker (#19354 )	2023-05-10 09:56:41 +08:00
Xinyi Zou	58cb404661	[fix](memory) Allocator throws Exception instead of std::bad_alloc (#19285 ) W0505 01:31:25.840227 1727715 scanner_scheduler.cpp:340] Scan thread read VScanner failed: [MEM_LIMIT_EXCEEDED]PreCatch error code:11, [E11] Allocator sys memory check failed: Cannot alloc:16384, consuming tracker:<Orphan>, exec node:<>, process memory used 5.87 GB exceed limit 5.64 GB or sys mem available 252.17 GB less than low water mark 1.60 GB, failed alloc size 16.00 KB. @ 0x555c19e0cca8 doris::Exception::Exception() @ 0x555c1c3e0c3f Allocator<>::sys_memory_check() @ 0x555c1c3e1052 Allocator<>::memory_check() @ 0x555c19e0a645 Allocator<>::alloc() @ 0x555c1c34508b COWHelper<>::create<>() @ 0x555c1e23f574 doris::vectorized::ConvertThroughParsing<>::execute<>() @ 0x555c1e23f209 doris::vectorized::FunctionConvertFromString<>::execute_impl() @ 0x555c1e23f4aa doris::vectorized::FunctionConvertFromString<>::execute_impl() @ 0x555c1e15ac29 doris::vectorized::PreparedFunctionImpl::execute_without_low_cardinality_columns() @ 0x555c1e15ac56 doris::vectorized::PreparedFunctionImpl::execute() @ 0x555c1e245276 _ZNSt17_Function_handlerIFN5doris6StatusEPNS0_15FunctionContextERNS0_10vectorized5BlockERKSt6vectorImSaImEEmmEZNKS4_12FunctionCast14create_wrapperINS4_14DataTypeNumberIiEEEESt8functionISC_ERKSt10shared_ptrIKNS4_9IDataTypeEEPKT_bEUlS3_S6_SB_mmE_E9_M_invokeERKSt9_Any_dataOS3_S6_SB_OmSY_ @ 0x555c1e2a9341 _ZZNK5doris10vectorized12FunctionCast23prepare_remove_nullableEPNS_15FunctionContextERKSt10shared_ptrIKNS0_9IDataTypeEES9_bENKUlS3_RNS0_5BlockERKSt6vectorImSaImEEmmE_clES3_SB_SG_mm @ 0x555c1e2a8d42 _ZNSt17_Function_handlerIFN5doris6StatusEPNS0_15FunctionContextERNS0_10vectorized5BlockERKSt6vectorImSaImEEmmEZNKS4_12FunctionCast23prepare_remove_nullableES3_RKSt10shared_ptrIKNS4_9IDataTypeEESJ_bEUlS3_S6_SB_mmE_E9_M_invokeERKSt9_Any_dataOS3_S6_SB_OmSQ_ @ 0x555c1e20e42b doris::vectorized::PreparedFunctionCast::execute_impl() @ 0x555c1e15ac29 doris::vectorized::PreparedFunctionImpl::execute_without_low_cardinality_columns() @ 0x555c1e15ac56 doris::vectorized::PreparedFunctionImpl::execute() @ 0x555c1d63e960 doris::vectorized::IFunctionBase::execute() @ 0x555c1d628700 doris::vectorized::VCastExpr::execute() @ 0x555c1d6163e5 doris::vectorized::VExprContext::execute() @ 0x555c20a83fe1 doris::vectorized::VFileScanner::_convert_to_output_block() @ 0x555c20a809af doris::vectorized::VFileScanner::_get_block_impl() @ 0x555c209b9bc4 doris::vectorized::VScanner::get_block() @ 0x555c209b1a50 doris::vectorized::ScannerScheduler::_scanner_scan() @ 0x555c209b2ac1 _ZNSt17_Function_handlerIFvvEZZN5doris10vectorized16ScannerScheduler18_schedule_scannersEPNS2_14ScannerContextEENK3$_0clEvEUlvE1_E9_M_invokeERKSt9_Any_data @ 0x555c1a8378cf doris::ThreadPool::dispatch_thread() @ 0x555c1a830fac doris::Thread::supervise_thread() @ 0x7f461faa117a start_thread @ 0x7f462033bdf3 __GI___clone @ (nil) (unknown)	2023-05-05 18:01:48 +08:00
Adonis Ling	e412dd12e8	[chore](build) Use include-what-you-use to optimize includes (PART II) (#18761 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-19 23:11:48 +08:00
Xinyi Zou	79c446c89f	[enhancement](exception) Column filter/replicate supports exception safety (#18503 )	2023-04-18 19:23:09 +08:00
Xinyi Zou	c704351273	[enhancement](memory) Refactor memory limit exceeded behavior (#18590 ) No check mem tracker limit and no cancel task in mem hook, only in Allocator. This helps in clearer analysis of memory issues and reduces performance loss. PODArray/hash table/arena memory allocation will use Allocator. Optimize mem limit exceeded log printing Optimize compilation time	2023-04-14 10:42:35 +08:00
Xinyi Zou	5e7ea5e305	[fix](memory) Fix `bthread_setspecific` log fatal on UBSAN build (#18274 )	2023-03-31 19:46:53 +08:00
Xinyi Zou	b7677beab7	[enhancement](memtracker) Add special counter for memtracker and fix thread create and destroy track #17301 Add a special counter for memtracker, faster, but relaxed ordering and not accurate in real time Track thread create and destroy memory, which was previously removed due to performance loss and added back	2023-03-02 08:55:00 +08:00
Xinyi Zou	63042a38bd	[fix](memtracker) Fix high frequency load slow lock in memtracker (#16244 ) Global lock stuck in memtracker when bthread is frequently created	2023-02-02 09:53:44 +08:00
Xinyi Zou	cdbbf1e4ee	[enhancement](memory) Add Memory GC when the available memory of the BE process is lacking (#14712 ) When the system MemAvailable is less than the warning water mark, or the memory used by the BE process exceeds the mem soft limit, run minor gc and try to release cache. When the MemAvailable of the system is less than the low water mark, or the memory used by the BE process exceeds the mem limit, run fucc gc, try to release the cache, and start canceling from the query with the largest memory usage until the memory of mem_limit * 20% is released.	2022-12-07 15:28:52 +08:00
Xinyi Zou	176f519fa1	[enhancement](memtracker) Optimize exec node memory tracking (#14711 )	2022-12-01 14:52:21 +08:00
Xinyi Zou	e1f0fa069c	[enhancement](memory) Refactored process memory statistics periodically refresh, and fix catch bad_alloc (#14580 )	2022-11-29 10:15:25 +08:00
Adonis Ling	249b688663	[chore](github) Add a workflow to check BE UT on macOS (#14506 )	2022-11-23 08:38:28 +08:00
Adonis Ling	333c6390ee	[fix](be-ut) AddressSanitizer detects container-overflow issues (#14255 ) * [chore] Fix the container-overflow errors detected by address sanitizer * Fix compilation errors	2022-11-15 15:49:55 +08:00
Xinyi Zou	cffdeff4ec	[fix](memory) Fix memory leak by calling boost::stacktrace (#14269 ) boost::stacktrace::stacktrace() has memory leak, so use glog internal func to print stacktrace. The reason for the memory leak of boost::stacktrace is that a state is saved in the thread local of each thread but not actively released. The test found that each thread leaked about 100M after calling boost::stacktrace. refer to: boostorg/stacktrace#118 boostorg/stacktrace#111	2022-11-15 08:58:57 +08:00
Xinyi Zou	dd11d5c0a5	[enhancement](memory) Support try catch bad alloc (#14135 )	2022-11-13 11:22:56 +08:00
Xinyi Zou	0b945fe361	[enhancement](memtracker) Refactor mem tracker hierarchy (#13585 ) mem tracker can be logically divided into 4 layers: 1)process 2)type 3)query/load/compation task etc. 4)exec node etc. type includes enum Type { GLOBAL = 0, // Life cycle is the same as the process, e.g. Cache and default Orphan QUERY = 1, // Count the memory consumption of all Query tasks. LOAD = 2, // Count the memory consumption of all Load tasks. COMPACTION = 3, // Count the memory consumption of all Base and Cumulative tasks. SCHEMA_CHANGE = 4, // Count the memory consumption of all SchemaChange tasks. CLONE = 5, // Count the memory consumption of all EngineCloneTask. Note: Memory that does not contain make/release snapshots. BATCHLOAD = 6, // Count the memory consumption of all EngineBatchLoadTask. CONSISTENCY = 7 // Count the memory consumption of all EngineChecksumTask. } Object pointers are no longer saved between each layer, and the values of process and each type are periodically aggregated. other fix: In [fix](memtracker) Fix transmit_tracker null pointer because phamp is not thread safe #13528, I tried to separate the memory that was manually abandoned in the query from the orphan mem tracker. But in the actual test, the accuracy of this part of the memory cannot be guaranteed, so put it back to the orphan mem tracker again.	2022-11-08 09:52:33 +08:00
yiguolei	32fea672b0	[chore](gutil) remove some gutil macros and solve some macro conflict with brpc (#13954 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-11-07 13:39:52 +08:00
Xinyi Zou	c7b2b90504	[fix](memtracker) Fix DCHECK !std::count(_consumer_tracker_stack.begin(), _consumer_tracker_stack.end(), tracker)	2022-11-06 16:41:03 +08:00
Xinyi Zou	32a029d9dc	[enhancement](memtracker) Refactor load channel + memtable mem tracker (#13795 )	2022-11-03 09:47:12 +08:00
Xinyi Zou	9f7c76a0d6	[fix](memtracker) Fix the usage of bthread mem tracker (#13708 ) bthead context init has performance loss, temporarily delete it first, it will be completely refactored in #13585.	2022-10-30 19:51:00 +08:00
Xinyi Zou	9dc5dd382a	[enhancement](memtracker) Fix Brpc mem count and refactored thread context macro (#13469 )	2022-10-21 12:01:38 +08:00
Xinyi Zou	87a6b1a13b	[enhancement](memtracker) Fix bthread local consume mem tracker (#13368 ) Previously, bthread_getspecific was called every time bthread local was used. In the test at #10823, it was found that frequent calls to bthread_getspecific had performance problems. So a cache is implemented on pthread local based on the btls key, but the btls key cannot correctly sense bthread switching. So, based on bthread_self to get the bthread id to implement the cache.	2022-10-17 18:31:07 +08:00
Xinyi Zou	c494ca0ed4	[enhancement](memtracker) Print query memory usage log every second when `memory_verbose_track` is enabled (#13302 )	2022-10-13 09:11:23 +08:00
Xinyi Zou	df54c6b63a	[enhancement](memtracker) Add independent and unique scanner mem tracker for each query (#13262 )	2022-10-11 19:47:12 +08:00
Xinyi Zou	80e1f401f0	[enhancement](memory) Fix `USE_MEM_TRACKER=OFF` compile (#13085 )	2022-10-05 12:10:49 +08:00
Xinyi Zou	16bb5cb430	[enhancement](memory) Jemalloc performance optimization and compatibility with MemTracker #12496	2022-09-28 12:04:29 +08:00
Yongqiang YANG	a7d42b5d81	[fix](streamload&sink) release and allocate memory in the same tracker (#12820 ) 1. HttpServer threads allocate bytebuffer and put them into streamload pipe, but scanner thread release them with query tracker. 2. We can assume brpc allocate memory in doris thread. Above problems leads to wrong result of memtracker.	2022-09-23 17:51:44 +08:00

1 2

75 Commits