* Revert "[fix](profile) Fix reporting the profile while building the pipeline profile. (#34215)"
This reverts commit eb0d963389e1b7d150cbc18c927091648e0a60f7.
* Revert "[feature](profile) sort pipelineX task by total time #34053"
This reverts commit 67b394f2b0dddab3801d2faa82a91c52ef875e76.
"operator_id" should be invisible, but the local shuffle is a planned operator in the BE (Backend), without a plan node ID. We use it in profiles and other places, and there might be duplicates. Therefore, we switch it to a negative number here to distinguish it as a plan node ID.
During the load process, if the original data has problems, we store the erroneous rows in an error_log file on disk for later debugging. However, if there are many error rows, they can occupy a lot of disk space, so we now want to limit the amount of error data saved to disk.
* Be familiar with the usage of Doris' import function and its internal implementation process.
* Add a new BE configuration item `load_error_log_limit_bytes` (default value: 200MB).
* Use the newly added threshold to limit the amount of data that `RuntimeState::append_error_msg_to_file` writes to disk (see the sketch after this list).
* Write regression cases for testing and verification.
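A minimal sketch of the size check (the class and member names are hypothetical; the real logic lives in `RuntimeState::append_error_msg_to_file`):

```cpp
#include <cstdint>
#include <string>

// Hypothetical constant mirroring the new BE option
// load_error_log_limit_bytes (default 200MB).
static constexpr int64_t kLoadErrorLogLimitBytes = 200 * 1024 * 1024;

class RuntimeStateSketch {
public:
    // Append one error message unless the per-load error log has
    // already reached the configured byte limit.
    void append_error_msg_to_file(const std::string& msg) {
        if (_error_log_bytes_written >= kLoadErrorLogLimitBytes) {
            return;  // limit reached: drop further error rows
        }
        _error_log_bytes_written += static_cast<int64_t>(msg.size());
        // ... write msg to the on-disk error_log file ...
    }

private:
    int64_t _error_log_bytes_written = 0;
};
```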
Co-authored-by: xy720 <22125576+xy720@users.noreply.github.com>
Manually track query/load/compaction/etc. memory in the Allocator instead of via the mem hook.
* The Mem Hook can still be used for code segments where memory cannot be tracked manually, and for locating memory during debugging.
* This causes some memory-tracking loss for queries, less than 10% compared to the past, but the tracking is expected to be more controllable.
* Similarly, the Mem Hook no longer tracks unowned memory in the orphan mem tracker by default, so the total memory of all MemTrackers will be less than before.
* There is no longer a need to get the memory size from jemalloc in the Mem Hook on every alloc and free, which cost performance in the past.
* There is no longer a need to cache the bthread local in a pthread local for the memory hook; in the past this caused core dumps inside bthread, which appears to be a bug in bthread. A sketch of Allocator-based tracking follows this list.
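A minimal sketch of the idea, with hypothetical names (the real Doris Allocator is more involved): the allocator consumes/releases bytes on an explicitly attached tracker instead of relying on a global malloc hook.

```cpp
#include <atomic>
#include <cstdint>
#include <cstdlib>

// Hypothetical tracker: a per-task byte counter that is attached explicitly.
struct MemTrackerSketch {
    std::atomic<int64_t> consumed{0};
    void consume(int64_t bytes) { consumed += bytes; }
    void release(int64_t bytes) { consumed -= bytes; }
};

// Allocator that tracks the exact requested size manually, so no call
// into jemalloc is needed to discover the allocation size afterwards.
class TrackingAllocatorSketch {
public:
    explicit TrackingAllocatorSketch(MemTrackerSketch* tracker) : _tracker(tracker) {}

    void* alloc(size_t size) {
        _tracker->consume(static_cast<int64_t>(size));
        return std::malloc(size);
    }

    // The caller passes the size back, avoiding a size lookup in a hook.
    void free(void* ptr, size_t size) {
        std::free(ptr);
        _tracker->release(static_cast<int64_t>(size));
    }

private:
    MemTrackerSketch* _tracker;
};
```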
Change the ThreadContext life cycle to manual control.
In the past, a ThreadContext was created automatically on first use (usually in the jemalloc hook, on the first malloc) and destroyed automatically when the thread exited.
Now the creation and destruction of the ThreadContext are controlled manually: it is created when the task thread starts and destroyed before the task thread ends (see the sketch below).
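A minimal RAII sketch of the manual life cycle (hypothetical types; Doris uses its own scoped attach helpers):

```cpp
#include <memory>

struct ThreadContextSketch {
    // Per-thread state: attached task id, mem tracker, etc.
};

// Each task thread owns its context explicitly: created at thread start,
// destroyed before the thread ends, instead of lazily in a malloc hook.
class ScopedThreadContext {
public:
    ScopedThreadContext() : _ctx(std::make_unique<ThreadContextSketch>()) {}
    ~ScopedThreadContext() { _ctx.reset(); }  // destroyed before thread exit

private:
    std::unique_ptr<ThreadContextSketch> _ctx;
};

void task_thread_main() {
    ScopedThreadContext scoped_ctx;  // created manually at thread start
    // ... run the task; allocations are tracked via the attached context ...
}   // scoped_ctx destroyed here, before the thread ends
```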
Ran the 43 ClickBench query tests to compare the new manual tracking with the previous Mem Hook behavior.
Fix workload group GC, add load cancellation, and add logs.
Unify the format of the GC logs and change them all to lowercase, to avoid unnecessary trouble when using grep or less.
Add logs to help locate the cause of slow GC.
```
start time: Wed 07 Jun 2023 06:50:14 PM CST
*** Query id: e9000000e9-eb00000073 ***
*** Aborted at 1686136356 (unix time) try "date -d @1686136356" if you are using GNU date ***
*** Current BE git commitID: 5c33dd7a2c ***
*** SIGSEGV address not mapped to object (@0x23000000235) received by PID 2131238 (TID 2132258 OR 0x7f708eff7700) from PID 565; stack trace: ***
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /mnt/hdd01/repo_center/doris_branch-2.0-beta/doris/be/src/common/signal_handler.h:413
1# 0x00007F727BBE3090 in /lib/x86_64-linux-gnu/libc.so.6
2# doris::AttachTask::AttachTask(doris::RuntimeState*) at /mnt/hdd01/repo_center/doris_branch-2.0-beta/doris/be/src/runtime/thread_context.cpp:43
3# std::_Function_handler<void (doris::PTabletWriterAddBlockResult const&, bool), doris::stream_load::VNodeChannel::open_wait()::$_1>::_M_invoke(std::_Any_data const&, doris::PTabletWriterAddBlockResult const&, bool&&) at /var/local/ldb_toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
4# doris::stream_load::ReusableClosure<doris::PTabletWriterAddBlockResult>::Run() at /mnt/hdd01/repo_center/doris_branch-2.0-beta/doris/be/src/vec/sink/vtablet_sink.h:176
5# brpc::Controller::EndRPC(brpc::Controller::CompletionInfo const&) in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
6# brpc::Controller::OnVersionedRPCReturned(brpc::Controller::CompletionInfo const&, bool, int) in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
7# brpc::policy::ProcessRpcResponse(brpc::InputMessageBase*) in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
8# brpc::InputMessenger::InputMessageClosure::~InputMessageClosure() in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
9# brpc::InputMessenger::OnNewMessages(brpc::Socket*) in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
10# brpc::Socket::ProcessEvent(void*) in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
11# bthread::TaskGroup::task_runner(long) in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
12# bthread_make_fcontext in /root/20230607171843-doris-branch-2.0-beta-5c33dd7a/be/lib/doris_be
```
This work is at an early stage. The current progress is not accurate because the scan ranges used for gathering information are too coarse-grained; moreover, only the file scan node and import jobs support the new progress manager.
## How it works
For example, when we submit the following load query:
```
LOAD LABEL test_broker_load
(
DATA INFILE("XXX")
INTO TABLE `XXX`
......
)
```
Initial progress: the query calls `BrokerLoadJob` to create the job, and the `Coordinator` is then invoked to calculate the scan ranges and their locations.
Update progress: the BE reports its runtime state to the FE, and the FE updates the progress status according to the job ID and fragment ID (see the sketch below).
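To make the update path concrete, here is a minimal bookkeeping sketch with hypothetical names (the actual FE implementation differs): progress is keyed by (job ID, fragment ID) and aggregated into a finished/total scan-range ratio such as 14.29% (1/7).

```cpp
#include <cstdint>
#include <cstdio>
#include <map>
#include <utility>

class ProgressManagerSketch {
public:
    void register_fragment(int64_t job_id, int64_t fragment_id, int total_ranges) {
        _total[{job_id, fragment_id}] = total_ranges;
        _finished[{job_id, fragment_id}] = 0;
    }

    // Called when the BE reports runtime state for one fragment.
    void update(int64_t job_id, int64_t fragment_id, int finished_ranges) {
        _finished[{job_id, fragment_id}] = finished_ranges;
    }

    // Renders e.g. "14.29% (1/7)" across all fragments of a job.
    void print(int64_t job_id) const {
        int done = 0, total = 0;
        for (const auto& [key, t] : _total) {
            if (key.first != job_id) continue;
            total += t;
            done += _finished.at(key);
        }
        std::printf("%.2f%% (%d/%d)\n", total ? 100.0 * done / total : 0.0, done, total);
    }

private:
    std::map<std::pair<int64_t, int64_t>, int> _total;
    std::map<std::pair<int64_t, int64_t>, int> _finished;
};
```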
We can use `show load` to see the progress:
PENDING:
```
State: PENDING
Progress: 0.00%
```
LOADING:
```
State: LOADING
Progress: 14.29% (1/7)
```
FINISHED:
```
State: FINISHED
Progress: 100.00% (7/7)
```
At the current time, the full output of `show load\G` looks like this:
```
*************************** 1. row ***************************
JobId: 25052
Label: test_broker
State: LOADING
Progress: 0.00% (0/7)
Type: BROKER
EtlInfo: NULL
TaskInfo: cluster:N/A; timeout(s):250000; max_filter_ratio:0.0
ErrorMsg: NULL
CreateTime: 2023-05-03 20:53:13
EtlStartTime: 2023-05-03 20:53:15
EtlFinishTime: 2023-05-03 20:53:15
LoadStartTime: 2023-05-03 20:53:15
LoadFinishTime: NULL
URL: NULL
JobDetails: {"Unfinished backends":{"5a9a3ecd203049bc-85e39a765c043228":[10080]},"ScannedRows":39611808,"TaskNumber":1,"LoadBytes":7398908902,"All backends":{"5a9a3ecd203049bc-85e39a765c043228":[10080]},"FileNumber":1,"FileSize":7895697364}
TransactionId: 14015
ErrorTablets: {}
User: root
Comment:
```
## TODO:
1. The current partition granularity of the scan range is too coarse, resulting in uneven progress updates during loading.
2. Only broker load supports the new Progress Manager; add progress support for other query types.