doris

Author	SHA1	Message	Date
github-actions[bot]	c4bd0e8fa6	branch-2.1: [fix](memory) Fix compatibility with CgroupV2 #44579 (#44934 ) Cherry-picked from #44579 Co-authored-by: Xinyi Zou <zouxinyi@selectdb.com>	2024-12-04 22:09:16 +08:00
Xinyi Zou	6714936f8b	[pick](branch-2.1) pick #39962 #40304 (#44931 )	2024-12-04 17:56:58 +08:00
Xinyi Zou	d319dafb5c	[pick](branch-2.1) pick #41123 (#42541 ) cgroup memory usage should be refreshed frequently.	2024-10-28 19:21:19 +08:00
Xinyi Zou	9d5468d198	[branch-2.1](memory) BE memory info compatible with CgroupV2 (#39799 ) pick #39256	2024-08-23 02:03:00 +08:00
Xinyi Zou	13b882a4cc	[branch-2.1](memory) Add memory metrics to bvar (#39763 ) pick #38391	2024-08-22 17:34:30 +08:00
Xinyi Zou	9861f81630	[branch-2.1](memory) Fix Jemalloc Cache Memory Tracker (#37905 ) pick #37464	2024-07-16 19:01:31 +08:00
Xinyi Zou	747172237a	[branch-2.1](memory) Pick some memory GC patch (#37725 ) pick #36768 #37164 #37174 #37525	2024-07-14 15:19:40 +08:00
Xinyi Zou	ef031c5fb2	[branch-2.1](memory) Fix reserve memory compatible with memory GC and logging (#37682 ) pick #36307 #36412	2024-07-12 11:43:26 +08:00
Xinyi Zou	f7f0c20f00	[branch-2.1](cgroup memory) Correct cgroup mem info cache (#37440 ) pick #36966 Co-authored-by: Hongkun Xu <xuhongkun666@163.com>	2024-07-09 16:19:37 +08:00
yiguolei	4294b7360e	Revert "Revert "[fix](memory) Fix nested scoped tracker and nested reserve memory (#35257 )"" This reverts commit 95393b531d340a865bfd2711ea77d39a04e61993.	2024-05-29 20:16:16 +08:00
yiguolei	95393b531d	Revert "[fix](memory) Fix nested scoped tracker and nested reserve memory (#35257 )" This reverts commit f8fcd17f33deab0605c9378850a21714293ef1b5.	2024-05-28 23:14:19 +08:00
Xinyi Zou	f8fcd17f33	[fix](memory) Fix nested scoped tracker and nested reserve memory (#35257 ) SCOPED_ATTACH_TASK cannot be nested, but SCOPED_SWITCH_THREAD_MEM_TRACKER_LIMITER can continue to be called, so attach_limiter_tracker may be nested.	2024-05-28 13:12:03 +08:00
Xinyi Zou	b6eaf95720	[fix](memory) Fix BE memory info compatible with Cgroup (#35412 ) (#35425 ) 1. `memory.usage_in_bytes ~= free.used + free.(buff/cache) - (buff)`, free cache can be reused, so, modify cgroup_memory_usage = memory.usage_in_bytes - memory.meminfo["Cached"]. 2. If system not configured with cgroup, find cgroup file path will failed, refactor refresh cgroup memory info, compatible with find failed.	2024-05-27 12:31:44 +08:00
Xinyi Zou	9d7c65b4d8	[fix](memory) Avoid frequently refresh cgroup memory info (#35083 ) (#35182 ) pick #35083	2024-05-22 11:42:08 +08:00
Xinyi Zou	2ed6a00fd1	[opt](memory) Add GlobalMemoryArbitrator and support ReserveMemory (#34985 ) (#35070 )	2024-05-22 09:53:45 +08:00
Xinyi Zou	aa156f0781	[opt](memory) BE memory info compatible with Cgroup (#34262 )	2024-05-06 20:11:20 +08:00
wangbo	8abd136ba2	[Improvement](executor)Refactor Workload group memory GC (#33797 ) * just gc group's overcommit query when minor gc * add process usage	2024-04-30 19:34:31 +08:00
Xinyi Zou	2b1ab89b5b	[fix](memory) Fix memory log compile by ASAN (#33162 ) ASAN compiles BE, add markers in memory logs	2024-04-10 15:26:09 +08:00
Pxl	8fd6d4c41b	[Chore](build) add -Wconversion and remove some unused code (#33127 ) add -Wconversion and remove some unused code	2024-04-10 15:26:08 +08:00
zy-kkk	c318c48a38	[fix](compile) fix implicit float-to-int conversion in mem_info calculation (#33311 )	2024-04-08 07:34:22 +08:00
yiguolei	62023d705d	[refactor](rename) rename task group to workload group in be (#32204 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-03-15 18:04:02 +08:00
yiguolei	70304bffd2	[refactor](wg) move memory gc logic to workload group (#31334 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-02-23 23:12:09 +08:00
yiguolei	2c99c53812	[refactor](taskqueue) remove old task scheduler based wg (#30832 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-02-05 22:00:27 +08:00
Xinyi Zou	83e7235bab	[fix](memory) Add thread asynchronous purge jemalloc dirty pages (#28655 ) jemallctl purge all arena dirty pages may take several seconds, which will block memory GC and cause OOM. So purge asynchronously in a thread.	2023-12-22 12:05:20 +08:00
Xinyi Zou	226a0c3b1d	[chore](memory) Warning in log when turning on THP (#28122 )	2023-12-08 17:38:38 +08:00
Kaijie Chen	84a651d976	[improve](load) rewrite memtable memory limiter rules (#27759 )	2023-12-07 17:26:26 +08:00
Xinyi Zou	2548e27c97	[fix](memory) Fix work load group meaningless GC #27307	2023-11-21 09:59:21 +08:00
wangbo	70e070182f	[feature](executor)Make workload group property not required (#27229 ) * Make workload group property not required * remove useless UT	2023-11-19 17:01:51 +08:00
Xinyi Zou	022762d5f0	[fix](memory) Fix work load group GC and add logs to locate slow GC #24975 Fix work load group GC, add cancel load and add logs. Unify the format and change all to lowercase of GC logs, avoid unnecessary trouble when grep or less Add logs to help locate the cause of slow GC.	2023-10-12 10:33:56 +08:00
Xinyi Zou	21beebde7d	[fix](taskgroup) Fix task group overcommit memory GC profile (#22764 )	2023-08-09 18:29:46 +08:00
wangbo	66e540bebe	[Fix](executor)Fix incorrect mem_limit return value type (#22415 )	2023-07-31 22:28:41 +08:00
Xinyi Zou	1f3de0eae3	[fix](memory) fix invalid large memory check && fix memory info thread safety (#22027 ) fix invalid large memory check fix memory info thread safety	2023-07-26 12:18:31 +08:00
Xinyi Zou	4b30485d62	[improvement](memory) Refactor doris cache GC (#21522 ) Abstract CachePolicy, which controls the gc of all caches. Add stale sweep to all lru caches, including page caches, etc. I0710 18:32:35.729460 2945318 mem_info.cpp:172] End Full GC Free, Memory 3866389992 Bytes. cost(us): 112165339, details: FullGC: FreeTopMemoryQuery: - CancelCostTime: 1m51s - CancelTasksNum: 1 - FindCostTime: 0.000ns - FreedMemory: 2.93 GB WorkloadGroup: Cache name=DataPageCache: - CostTime: 15.283ms - FreedEntrys: 9.56K - FreedMemory: 691.97 MB - PruneAllNumber: 1 - PruneStaleNumber: 1	2023-07-11 20:21:31 +08:00
Xinyi Zou	38c8657e5e	[improve](memory) more grace logging for memory exceed limit (#21311 ) more grace logging for Allocator and MemTracker when memory exceed limit fix bthread grace exit.	2023-07-05 14:59:06 +08:00
Xinyi Zou	d2c42ec638	[fix](memory) Purge Jemalloc arena dirty pages when memory insufficient (#21237 ) Jemalloc dirty page only use madvise MADV_FREE, memory is not release back to system, RSS won't reduce in time, So when the process memory exceed limit or system available memory is insufficient, manually transfer dirty page to the muzzy page, which will call MADV_DONTNEED to release the physical memory back to the system. https://jemalloc.net/jemalloc.3.html#opt.dirty_decay_ms	2023-06-28 16:49:45 +08:00
Xinyi Zou	0396f78590	[fix](memory) Remove ChunkAllocator & fix Allocator no use mmap (#21259 )	2023-06-28 16:10:24 +08:00
Lijia Liu	76bdcf1d26	[improvement](pipeline) task group scan entity (#19924 )	2023-06-25 14:43:35 +08:00
Xinyi Zou	e801e3b737	[fix](memory) Fix crash at `bthread_setspecific` in `brpc::Socket::CheckHealth()` (#20450 ) Only switch to bthread local when modifying the mem tracker in the thread context. No longer switches to bthread local by default when bthread starts mem tracker increases brpc IOBufBlockMemory memory remove thread mem tracker metrics	2023-06-08 19:48:19 +08:00
ZhangYu0123	1c950d6930	[fix](config) fix memory config enable_query_memroy_overcommit spell problem #19898	2023-05-22 00:32:20 +08:00
luozenglin	33fd965b5c	[feature-wip](resouce-group) Supports memory soft isolation of resource group (#19802 ) create resource groups name properties( 'enable_memory_overcommit' = 'true' // whether to enable memory soft isolation )	2023-05-21 19:33:57 +08:00
Xinyi Zou	65807f888b	[fix](memory) Remind log if `vm/overcommit_memory`=2 when be start (#19795 ) Expect vm overcommit memory value to be 1, system will no longer throw bad_alloc, memory alloc are always accepted, memory limit check is handed over to Doris Allocator, make sure throw exception position is controllable, otherwise bad_alloc can be thrown anywhere and it will be difficult to achieve exception safety.	2023-05-19 15:01:08 +08:00
ZhangYu0123	07bbf741fb	[enhence](memory) gc inverted index cache when there is not enough memory (#19622 ) Support to gc inverted index cache when there is not enough memory. previous problem： The inverted index cache (InvertedIndexSearcherCache and InvertedIndexQueryCache) may use 20% memory which can't be released.	2023-05-18 16:41:51 +08:00
Xinyi Zou	7c8b7878cd	[fix](memory) Print all query/load memory before memory GC when `memory_debug=true` (#19720 )	2023-05-18 14:55:47 +08:00
yixiutt	943e5fb7e5	[improvement](MOW) use seperated cache for mow pk cache (#19686 ) In mow, primary key cache have a big impact on load performance, so we add a new cache type to seperate it from page cache to make it more flexible in some cases	2023-05-18 13:27:09 +08:00
Xinyi Zou	d5d47703fe	[fix](memory) remove auto option in memory config and optimize memtracker logs #19706 fix mem_limit default value memory_gc_sleep_time_s to memory_gc_sleep_time_ms LoadChannelMgr::_handle_mem_exceed_limit process_mem_limit to process soft mem limit fix query mem tracker print	2023-05-18 08:54:03 +08:00
Dongyang Li	0a28959675	[config](mem) change default mem_limit from 90% to 80% (#19602 ) With the default config of 90%, be may meet OOM when the load pressure is big. when set to 80%, be works well with the same load pressure in my cluster.	2023-05-15 17:48:43 +08:00
Adonis Ling	e412dd12e8	[chore](build) Use include-what-you-use to optimize includes (PART II) (#18761 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-19 23:11:48 +08:00
Xinyi Zou	335c1e5953	[fix](memory) Fix MacOS mem_limit parse error and GC after env Init #17528 Fix MacOS mem_limit parse result is 0. Fix GC after env Init, otherwise, when the memory is insufficient, BE will start failure. * Query id: 0-0 * * Aborted at 1677833773 (unix time) try "date -d @1677833773" if you are using GNU date * * Current BE git commitID: 8ee5f45 * * SIGSEGV address not mapped to object (@0x70) received by PID 24145 (TID 0x7fa53c9fd700) from PID 112; stack trace: * 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t, void) at be/src/common/signal_handler.h:420 1# os::Linux::chained_handler(int, siginfo, void) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 2# JVM_handle_linux_signal in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 3# signalHandler(int, siginfo, void) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 4# 0x00007FA56295A400 in /lib64/libc.so.6 5# doris::MemTrackerLimiter::log_process_usage_str(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool) at be/src/runtime/memory/mem_tracker_limiter.cpp:208 6# doris::MemTrackerLimiter::print_log_process_usage(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool) at be/src/runtime/memory/mem_tracker_limiter.cpp:226 7# doris::Daemon::memory_maintenance_thread() at be/src/common/daemon.cpp:245 8# doris::Thread::supervise_thread(void*) at be/src/util/thread.cpp:455 9# start_thread in /lib64/libpthread.so.0 10# clone in /lib64/libc.so.6	2023-03-08 14:00:57 +08:00
Xinyi Zou	9617f46fa5	[improvement](memory) Modify `mem_limit` default value (#17322 ) Modify the default value of mem_limit to auto. auto means process mem limit is equal to max(physical mem * 0.9, 6.4G). 6.4G is the maximum memory reserved for the system.	2023-03-06 10:53:27 +08:00
Xinyi Zou	3871e989ac	[fix](memory) Avoid repeating meaningless memory gc #17258	2023-03-01 19:23:33 +08:00

1 2

81 Commits