doris

Author	SHA1	Message	Date
yujun	0934fbee7e	[improvement](query) prefer to chose tablet on alive disk #39467 (#39654 ) cherry pick from #39467	2024-08-23 12:23:12 +08:00
yiguolei	5271042a7d	[bugfix](gccompile) fix gcc compile error (#34546 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-05-10 22:06:00 +08:00
yiguolei	32cbd4a583	[chore](status) unify error code between thrift,pb, status.h (#34397 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-05-10 14:41:01 +08:00
zxealous	9aa08d8deb	[improve](disk) Not add disk path to broken list if check status is not IO_ERROR (#34111 )	2024-04-26 07:44:12 +08:00
StarryVerse	4f6b9db7a7	Update doris_main.cpp (#34128 ) * Update doris_main.cpp Log(FATAL) introduces a core dump, which is confusing for users. We should print error msg and exit without a core dump. * Update doris_main.cpp	2024-04-26 07:43:40 +08:00
TengJianPing	615765c1c0	[improvement](spill) improve spill directory and fix bugs (#33900 ) * [improvement](spill) improve spill directory and fix bugs * fix	2024-04-22 11:28:22 +08:00
Xinyi Zou	cf7595d423	[opt](memory) Optimize mem tracker accuracy (#32039 ) (#33140 )	2024-04-10 11:42:19 +08:00
Xinyi Zou	06e5c6c966	[fix](grace-exit) Stop incorrectly of reportwork cause heap use after free #32929	2024-04-10 11:34:28 +08:00
TengJianPing	3358f76a7f	[feature](spill) Implement spill to disk for hash join, aggregation and sort for pipelineX (#31910 ) Co-authored-by: Jerry Hu <mrhhsg@gmail.com>	2024-03-12 14:12:09 +08:00
zhangstar333	c40c16b8b3	[improve](conf)refactor fuzzy mode in BE (#31412 ) refactor the code of fuzzy in BE, and will be add more variables in it, then could test case at different mode.	2024-02-29 19:51:07 +08:00
HappenLee	52c45e38af	[Refactor](RF) refactor the profile of rf and pipeline-x support local ignore (#31287 ) * [Refactor](RF) refactor the profile of rf and pipeline-x support local ignore * fix local merge filter	2024-02-23 19:05:06 +08:00
feifeifeimoon	d17ac99abe	[feature](coverage): refresh the coverage file before exiting the program (#28354 )	2023-12-19 10:54:57 +08:00
Yongqiang YANG	ffa4ea66d5	[enhancement](main) donot coredump when be can not start (#27928 )	2023-12-05 20:11:24 +08:00
Xinyi Zou	de6ecd2035	[fix](tls) Manually track memory in Allocator instead of mem hook and ThreadContext life cycle to manual control (#26904 ) Manually track query/load/compaction/etc. memory in Allocator instead of mem hook. Can still use Mem Hook when cannot manually track memory code segments and find memory locations during debugging. This will cause memory tracking loss for Query, loss less than 10% compared to the past, but this is expected to be more controllable. Similarly, Mem Hook will no longer track unowned memory to the orphan mem tracker by default, so the total memory of all MemTrackers will be less than before. Not need to get memory size from jemalloc in Mem Hook each memory alloc and free, which would lose performance in the past. Not require caching bthread local in pthread local for memory hook, in the past this has caused core dumps inside bthread, seems to be a bug in bthread. ThreadContext life cycle to manual control In the past, ThreadContext was automatically created when it was used for the first time (this was usually in the Jemalloc Hook when the first malloc memory), and was automatically destroyed when the thread exited. Now instead of manually controlling the create and destroy of ThreadContext, it is mainly created manually when the task thread start and destroyed before the task thread end. Run 43 clickbench query tests. Use MemHook in the past:	2023-11-14 10:30:42 +08:00
zhiqiang	a5565f68b2	[Refactor](opentelemetry) Remove opentelemetry (#26605 )	2023-11-09 18:05:34 +08:00
AlexYue	59ebbb351e	[feature](merge-cloud) Enable write into cache when uploading file to s3 using s3 file writer (#24364 )	2023-10-16 21:31:02 +08:00
Yongqiang YANG	8eb14eec7c	[enhancement](baddisk) record bad disk in be_custom.conf to handle (#24639 )	2023-09-21 18:31:58 +08:00
Xinyi Zou	fc12362a6d	[feature-wip](arrow-flight)(step2) FE support Arrow Flight server (#24314 ) This is a POC, the design documentation will be updated soon	2023-09-20 14:42:54 +08:00
zhiqqqq	c7ae2a7d22	[Refactor & Bugfix](static variables) move some static vairables to exec_env (#24029 )	2023-09-13 09:27:03 +08:00
zhiqqqq	8f7e7a7b31	[Fix](signal) fix signal handler (#24144 )	2023-09-11 13:18:49 +08:00
神技圈子	0143ae8266	[fix]Add logging before _builtin_unreachable() (#24101 ) Co-authored-by: 宋光璠 <songguangfan@sf.com>	2023-09-09 00:30:11 +08:00
yiguolei	1d1a9e2bfc	[improvement](graceful shutdown) waiting for all query finished when graceful shutdown (#23865 ) In some cloud native deployment scenario, BE(especially the Compute Node BE) will be add to cluster and remove from cluster very frequently. User's query will fail if there is a fragment is running on the shutting down BE. Users could use stop_be.sh --grace, then BE will wait all running queries to stop to avoiding running query failure, but if the waiting time exceed the limit, then be will exit directly. During this period, FE will not send any queries to BE and waiting for all running queries to stop	2023-09-05 09:52:28 +08:00
Xinyi Zou	babd0430c7	[fix](stacktrace) Fix StackTraceCache initialized before ExecEnv (#23828 )	2023-09-05 09:06:16 +08:00
Xinyi Zou	039c76cbc0	[feature-wip] (arrow-flight) (step1) BE support Arrow Flight server, read data only (#23765 )	2023-09-04 19:19:55 +08:00
Ashin Gau	eaf2a6a80e	[fix](date) return right date value even if out of the range of date dictionary(#23664 ) PR(https://github.com/apache/doris/pull/22360) and PR(https://github.com/apache/doris/pull/22384) optimized the performance of date type. However hive supports date out of 1970~2038, leading wrong date value in tpcds benchmark. How to fix: 1. Increase dictionary range: 1900 ~ 2038 2. The date out of 1900 ~ 2038 is regenerated.	2023-09-01 14:40:20 +08:00
plat1ko	25b6e4deb2	[fix](daemon) Fix incorrect initialization order of daemon services (#23578 ) Current initialization dependency: Daemon ───┬──► StorageEngine ──► ExecEnv ──► Disk/Mem/CpuInfo │ │ BackendService ─┘ However, original code incorrectly initialize Daemon before StorageEngine. This PR also stop and join threads of daemon services in their dtor, to ensure Daemon services release resources in reverse order of initialization via RAII.	2023-08-31 19:46:38 +08:00
Xinyi Zou	ba351af452	[enhancement](thirdparty) upgrade thirdparty libs - again (#23414 ) submit again #23290 (not upgrade brpc, because bthread local has error) protobuf 3.15.0 -> 21.11 glog 0.4.0 -> 0.6.0 lz4 1.9.3 -> 1.9.4 curl 7.79.0 -> 8.2.1 zstd 1.5.2 -> 1.5.5 arrow 7.0.0 -> 13.0.0 abseil 20220623.1 -> 20230125.3 orc 1.7.2 -> 1.9.0 jemalloc for arrow 5.2.1 -> 5.3.0 xsimd 7.0.0 -> 13.0.0 opentelemetry-proto 0.19.0 -> 1.0.0 opentelemetry 1.8.3 -> 1.10.0 new: c-ares -> 1.19.1 grpc -> 1.54.3	2023-08-26 22:59:10 +08:00
Lightman	30e3c5bbe6	[bugfix](file cache) Fix the init file cache coredump (#23464 ) * [bugfix](file cache) Fix the init file cache coredump * fix compile	2023-08-26 16:50:50 +08:00
Mingyu Chen	330f369764	[enhancement](file-cache) limit the file cache handle num and init the file cache concurrently (#22919 ) 1. the real value of BE config `file_cache_max_file_reader_cache_size` will be the 1/3 of process's max open file number. 2. use thread pool to create or init the file cache concurrently. To solve the issue that when there are lots of files in file cache dir, the starting time of BE will be very slow because it will traverse all file cache dirs sequentially.	2023-08-17 16:52:08 +08:00
Xinyi Zou	96a46302e8	[fix](stacktrace) Fix Jemalloc enable profile fail to run BE after rewrites dl_iterate_phdr (#22549 ) Jemalloc heap profile follows libgcc's way of backtracing by default. rewrites dl_iterate_phdr will cause Jemalloc to fail to run after enable profile. TODO, two solutions: - Jemalloc specifies GNU libunwind as the prof backtracing way, but my test failed, --enable-prof-libunwind not work: --enable-prof-libunwind not work jemalloc/jemalloc#2504 - ClickHouse/libunwind solves Jemalloc profile backtracing, but the branch of ClickHouse/libunwind has been out of touch with GNU libunwind and LLVM libunwind, which will leave the fate to others.	2023-08-03 19:32:36 +08:00
HappenLee	3a11de889f	[Opt](exec) opt the performance of date parquet convert by date dict (#22384 ) before： mysql> select count(l_commitdate) from lineitem; +---------------------+ \| count(l_commitdate) \| +---------------------+ \| 600037902 \| +---------------------+ 1 row in set (0.86 sec) after: mysql> select count(l_commitdate) from lineitem; +---------------------+ \| count(l_commitdate) \| +---------------------+ \| 600037902 \| +---------------------+ 1 row in set (0.36 sec)	2023-08-01 12:24:00 +08:00
Xinyi Zou	d180ed418d	[fix](stacktrace) Speed up stack trace (#21755 ) Introduce libunwind get stack trace, cost is negligible and has line numbers. use StackTraceCache, PHDRCache speed up, is customizable and has some optimizations. Other stack trace tools remain: glog, boost, glibc, in case for need. TODO: currently support linux __x86_64__, __arm__, __powerpc__, not supported __FreeBSD__, APPLE Note: __arm__, __powerpc__ not been verified Support signal handle libunwid support unw_backtrace for jemalloc Use of undefined compile option USE_MUSL for later	2023-07-19 15:43:14 +08:00
Xinyi Zou	38c8657e5e	[improve](memory) more grace logging for memory exceed limit (#21311 ) more grace logging for Allocator and MemTracker when memory exceed limit fix bthread grace exit.	2023-07-05 14:59:06 +08:00
zhangdong	41ccf77c7d	[feature][fix](fs)(s3)add fs_s3 benchmark tool and fix s3 file writer bug (#20926 ) 1. Fix bug that the field of s3_file_write_bufferpool is not initialized, causing undefined behavior. 2. add fs_s3 benchmark tool，Reference to the usage of tools https://github.com/apache/doris/pull/20770 And opt the output: `sh bin/run-fs-benchmark.sh --conf=conf/s3.conf --fs_type=s3 --operation=single_read --threads=1 --iterations=1` ``` ------------------------------------------------------------------------------------------------------------------------------ Benchmark Time CPU Iterations UserCounters... ------------------------------------------------------------------------------------------------------------------------------ S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1 7366 ms 123 ms 1 ReadRate(B/S)=12.1823M/s ReadTime(S)=7.36572 ReadTotal(B)=89.7314M S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1 6163 ms 116 ms 1 ReadRate(B/S)=14.5597M/s ReadTime(S)=6.16299 ReadTotal(B)=89.7314M S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1 6048 ms 110 ms 1 ReadRate(B/S)=14.8366M/s ReadTime(S)=6.04796 ReadTotal(B)=89.7314M S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1_mean 6526 ms 116 ms 3 ReadRate(B/S)=13.8596M/s ReadTime(S)=6.52556 ReadTotal(B)=89.7314M S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1_median 6163 ms 116 ms 3 ReadRate(B/S)=14.5597M/s ReadTime(S)=6.16299 ReadTotal(B)=89.7314M S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1_stddev 730 ms 6.68 ms 3 ReadRate(B/S)=1.45914M/s ReadTime(S)=0.729876 ReadTotal(B)=0 S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1_cv 11.18 % 5.75 % 3 ReadRate(B/S)=10.53% ReadTime(S)=11.18% ReadTotal(B)=0.00% S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1_max 7366 ms 123 ms 3 ReadRate(B/S)=14.8366M/s ReadTime(S)=7.36572 ReadTotal(B)=89.7314M S3ReadBenchmark/iterations:1/repeats:3/manual_time/threads:1_min 6048 ms 110 ms 3 ReadRate(B/S)=12.1823M/s ReadTime(S)=6.04796 ReadTotal(B)=89.7314M ```	2023-06-29 19:03:49 +08:00
Mingyu Chen	643db55a78	[improvement](thread) stop threads when BE exit gracefully (#19506 )	2023-05-15 21:54:21 +08:00
奕冷	5bf1396efe	[enhancement](load) merge single-replica related services as non-standalone (#18421 )	2023-05-06 22:54:56 +08:00
Lightman	1be5dac036	[improve] Refactor file cache and Improve the file cache strategy (#18652 ) 1. Refactor file cache. Before refactor, the file cache config format is "[{"path":"/path/to/file_cache","normal":21474836480,"persistent":10737418240,"query_limit":10737418240}]" and now change to "[{"path":"/mnt/disk3/selectdb_cloud/file_cache","total_size":21474836480,"query_limit":10737418240}]". It will be simpler than before. 2. Support more strategy. Support file cache priority. The file cache will have three queue, name as 'index'/'normal'/'disposable'. We can avoid that the higher priority data is eliminate by the lower priority data.	2023-04-25 23:14:28 +08:00
hulk	7c099c5747	[bugfix](be) Fix segment fault if the PID_DIR wasn't set (#18789 ) Doris BE would crash if the PID_DIR wasn't set	2023-04-20 10:39:54 +08:00
Adonis Ling	e412dd12e8	[chore](build) Use include-what-you-use to optimize includes (PART II) (#18761 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-19 23:11:48 +08:00
HappenLee	b68857902e	[Compile](BE) Fix compile failed with tcmalloc (#18748 )	2023-04-18 09:26:45 +08:00
Adonis Ling	9e960f4c4f	[chore](build) Use include-what-you-use to optimize includes (#18681 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-17 11:44:58 +08:00
Mingyu Chen	05db6e9b55	[refactor](file-system)(step-2) remove env, file_utils and filesystem_utils (#18009 ) Follow #17586. This PR mainly changes: Remove env/ Remove FileUtils/FilesystemUtils Some methods are moved to LocalFileSystem Remove olap/file_cache Add s3 client cache for s3 file system In my test, the time of open s3 file can be reduced significantly Fix cold/hot separation bug for s3 fs. This is the last PR of #17764. After this, all IO operation should be in io/fs. Except for tests in #17586, I also tested some case related to fs io: clone concurrency query on local/s3/hdfs load error log create and clean disk metrics	2023-03-29 09:00:52 +08:00
Ashin Gau	2f471de675	[fix](FileCache) load file cache before start up daemon threads (#17199 ) Daemon threads in doris_main.cpp will upload tablet metrics periodically, which will use StorageEngine::instance(). However loading file cache is a process in main thread, when it takes a lot of time to load file cache, StorageEngine::instance() will be a null pointer in daemon threads.	2023-03-01 08:35:57 +08:00
Tiewei Fang	f17d69e450	[feature](file cache)Import `file cache` for remote file reader (#15622 ) The main purpose of this pr is to import `fileCache` for lakehouse reading remote files. Use the local disk as the cache for reading remote file, so the next time this file is read, the data can be obtained directly from the local disk. In addition, this pr includes a few other minor changes Import File Cache: 1. The imported `fileCache` is called `block_file_cache`, which uses lru replacement policy. 2. Implement a new FileRereader `CachedRemoteFilereader`, so that the logic of `file cache` is hidden under `CachedRemoteFilereader`. Other changes: 1. Add a new interface `fs()` for `FileReader`. 2. `IOContext` adds some statistical information to count the situation of `FileCache` Co-authored-by: Lightman <31928846+Lchangliang@users.noreply.github.com>	2023-01-10 12:23:56 +08:00
plat1ko	f3aea7f0f0	[Enhancement](status) Unify error code and enable customed err msg for BE internal errors (#14744 )	2022-12-11 23:33:18 +08:00
Dongyang Li	22883e7e08	[fuzzy](test) be fuzzy conf (#14654 )	2022-11-29 19:38:40 +08:00
Xinyi Zou	e1f0fa069c	[enhancement](memory) Refactored process memory statistics periodically refresh, and fix catch bad_alloc (#14580 )	2022-11-29 10:15:25 +08:00
Yongqiang YANG	0702277196	[improvement](tcmalloc) add moderate mode and avoid oom with a lot of cache (#14374 ) ReleaseToSystem aggressively when there are little free memory.	2022-11-28 20:17:51 +08:00
Xinyi Zou	21416f9947	[enhancement](memory) Support Jemalloc metrics and default allocator changed to Jemalloc (#14384 )	2022-11-18 21:02:54 +08:00
Xinyi Zou	bd5a593403	[enhancement](memtracker) Use proc/meminfo MemAvailable to control memory and optimize MemTracker log printing (#14335 )	2022-11-17 22:46:07 +08:00

1 2 3

122 Commits