doris

Author	SHA1	Message	Date
bobhan1	642e5cdb69	[Fix](Status) Make `Status` `[[nodiscard]]` and handle returned `Status` correctly (#23395 )	2023-09-29 22:38:52 +08:00
Jerry Hu	8a85a75b8b	[chore](scanner) check columns' nullable with schema (#24724 ) Add a validation to prevent potential schema inconsistency issues.	2023-09-22 11:34:53 +08:00
Yongqiang YANG	71dcb58db9	[improvement](scanner_schedule) reduce memory consumption of scanner (#24199 ) * [improvement](scanner_schedule) reduce memory consumption of scanner 1. limit scanner by memory consumptin rather than blocks. 2. scheduler run correcty instread of at lest 1.	2023-09-19 21:36:23 +08:00
Gabriel	d24f3efd4a	[pipelineX](profile) Phase 1: refactor pipelineX detailed profile (#24322 )	2023-09-15 16:14:05 +08:00
Pxl	35c5d71549	[Improvement](join) some improvement of hash join (#23972 ) some improvement of hash join	2023-09-14 17:55:35 +08:00
meiyi	82dc970916	[feature](insert) Support group commit insert (#22829 )	2023-09-08 15:51:03 +08:00
Gabriel	3317909141	[pipelineX](join) support nested loop join operator (#23756 )	2023-09-04 10:08:22 +08:00
Gabriel	65f41f71c1	[pipelineX](refactor) refine codes (#23726 )	2023-09-01 07:57:35 +08:00
TengJianPing	962221cb18	[test](log) add log for debug case failure (#23506 )	2023-08-28 10:45:25 +08:00
Gabriel	dcd6c3c022	[pipelineX](refactor) propose a new pipeline execution model (#22562 )	2023-08-21 15:38:45 +08:00
HappenLee	433a6103ab	[Enhancement](scanner) allocate blocks in scanner_context on demand and free them on close (#23182 ) Introduced #19389 , removed #20785	2023-08-19 12:13:24 +08:00
Mingyu Chen	c9dc715c5d	[fix](broker-load) fix error when using multi data description for same table in load stmt (#22666 ) For load request, there are 2 tuples on scan node, input tuple and output tuple. The input tuple is for reading file, and it will be converted to output tuple based on user specified column mappings. And the broker load support different column mapping in different data description to same table(or partition). So for each scanner, the output tuples are same but the input tuple can be different. The previous implements save the input tuple in scan node level, causing different scanner using same input tuple, which is incorrect. This PR remove the input tuple from scan node and save them in each scanners.	2023-08-07 20:03:03 +08:00
Pxl	c1c38c956d	[exec] fix coredump when limit<0 and limit!=-1 with 1.2 fe (#22622 )	2023-08-04 22:18:45 +08:00
Gabriel	23e7423748	[pipeline](refactor) refactor pipeline task schedule logics (#22028 )	2023-07-25 17:18:26 +08:00
Pxl	3089e4b3b6	[Bug](excution) fix ScannerContext is done make query failed (#21923 ) fix ScannerContext is done make query failed	2023-07-18 17:58:00 +08:00
Pxl	b3d3ffa2de	[Bug](pipeline) adjust scanner scheduler.submit and _num_scheduling_ctx maintain (#21843 ) adjust scanner scheduler.submit and _num_scheduling_ctx maintain	2023-07-18 11:55:21 +08:00
Gabriel	e348b9464e	[scan](freeblocks) use ConcurrentQueue to replace vector for free blocks (#21241 )	2023-06-28 15:10:07 +08:00
Lijia Liu	76bdcf1d26	[improvement](pipeline) task group scan entity (#19924 )	2023-06-25 14:43:35 +08:00
Gabriel	81abdeffbc	[Improvement](pipeline) Improve shared scan performance (#20785 )	2023-06-21 14:36:05 +08:00
Chuanle Chen	6efe6ef6e8	[Enhancement](scanner) allocate blocks in scanner_context on demand and free them on close (#19389 ) Firstly, to reduce memory usage, we do not pre-allocate blocks, instead we lazily allocate block when upper call get_free_block. And when upper call return_free_block to return free block, we add the block to a queue for memory reuse, and we will free the blocks in the queue when the scanner_context was closed instead of destructed. Secondly, to limit the memory usage of the scanner, we introduce a variable _free_blocks_capacity to indicate the current number of free blocks available to the scanners. The number of scanners that can be scheduled will be calculated based on this value. ssb flat test previous lineorder 1.2G: load time: 3s, query time: 0.355s lineorder 5.8G: load time: 330s, query time: 0.970s load time: 349s, query time: 0.949s load time: 349s, query time: 0.955s load time: 360s, query time: 0.889s (pipeline enabled) after lineorder 1.2G: load time: 3s, query time: 0.349s lineorder 5.8G: load time: 342s, query time: 0.929s load time: 337s, query time: 0.913s load time: 345s, query time: 0.946s load time: 346s, query time: 0.865s (pipeline enabled)	2023-05-23 18:17:21 +08:00
yiguolei	a262f42a28	[refactor](exceptionsafe) make scanner and scancontext exception safe (#19057 )	2023-04-27 09:23:01 +08:00
HappenLee	b2c26e17e1	[Compile](vec) Fix compile by BHREAD_SCANNER (#18979 )	2023-04-24 17:07:06 +08:00
yiguolei	3736530585	[refactor](query context) rename query fragments context to query context and make query context safe (#18950 ) * [refactor](query context) rename query fragments context to query context and make query context safe --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-04-23 22:53:56 +08:00
yiguolei	63a76ed115	[refactor](exceptionsafe) disallow call new method explicitly (#18830 ) disallow call new method explicitly force to use create_shared or create_unique to use shared ptr placement new is allowed reference https://abseil.io/tips/42 to add factory method to all class. I think we should follow this guide because if throw exception in new method, the program will terminate. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-04-21 09:13:24 +08:00
Adonis Ling	e412dd12e8	[chore](build) Use include-what-you-use to optimize includes (PART II) (#18761 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-19 23:11:48 +08:00
HappenLee	eb128753ac	[Opt](pipeline) opt pipeline shared scan (#18715 )	2023-04-17 13:06:39 +08:00
wangbo	0f00ad4d2a	[fix](executor)Fix scanner's _max_thread_num may == 0 #18465	2023-04-16 18:17:18 +08:00
HappenLee	69ae14f228	[Bug](pipeline) regression heap use after free (#18701 )	2023-04-16 16:22:41 +08:00
HappenLee	56d84739c1	[Opt](pipeline) opt the scanner ctx schedule in pipeline engine (#18545 )	2023-04-14 09:59:03 +08:00
HappenLee	40a352959d	[Pipeline](exec) Support shared scan in colo agg (#18457 )	2023-04-13 17:25:41 +08:00
morrySnow	e29fc3b46b	[fix](chore) fix compile failed in JdbcExecutor and revert #18306 since be crash randomly (#18371 ) fix 2 problems: 1. PR #18187 use the api resizeColumn in JNINativeMethod has been removed by #17960 2. revert PR #18306 to fix pipeline core when load	2023-04-04 20:04:28 +08:00
wangbo	fc407f4afe	[improvement](executor) Reduce ScannnerCtx Scheduling times (#18306 ) * remove sche in scan operator	2023-04-03 22:54:34 +08:00
HappenLee	8be43857ef	[feature](executor) Add memory limit for pip_scanner_context (#18238 ) Co-authored-by: wangbo <506340561@qq.com>	2023-03-31 09:36:57 +08:00
HappenLee	39b5682d59	[Pipeline](shared_scan_opt) Support shared scan opt in pipeline exec engine	2023-03-13 10:33:57 +08:00
yiguolei	1b83829cff	[improvement](block exception safe) make block queue exception safe (#16657 ) * [improvement](block exception safe) make block queue exception safe This is part of exception safe: #16366. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-02-14 10:50:21 +08:00
yiguolei	646ba2cc88	[bugfix](scannode) 1. make rows_read correct 2. use single scanner if has limit clause (#16473 ) make rows_read correct so that the scheduler could using this correctly. use single scanner if has limit clause. Move it from fragment context to scannode. --------- Co-authored-by: yiguolei <yiguolei@gmail.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2023-02-09 14:12:18 +08:00
Xiaocc	0142ef8b95	[improvement](scanner) Supports bthread scanner (#16031 )	2023-02-09 10:24:56 +08:00
lihangyu	3894de49d2	[Enhancement](topn) support two phase read for topn query (#15642 ) This PR optimize topn query like `SELECT * FROM tableX ORDER BY columnA ASC/DESC LIMIT N`. TopN is is compose of SortNode and ScanNode, when user table is wide like 100+ columns the order by clause is just a few columns.But ScanNode need to scan all data from storage engine even if the limit is very small.This may lead to lots of read amplification.So In this PR I devide TopN query into two phase: 1. The first phase we just need to read `columnA`'s data from storage engine along with an extra RowId column called `__DORIS_ROWID_COL__`.The other columns are pruned from ScanNode. 2. The second phase I put it in the ExchangeNode beacuase it's the central node for topn nodes in the cluster.The ExchangeNode will spawn a RPC to other nodes using the RowIds(sorted and limited from SortNode) read from the first phase and read row by row from storage engine. After the second phase read, Block will contain all the data needed for the query	2023-01-19 10:01:33 +08:00
Lijia Liu	c57fa7c930	[Pipeline] Fix PipScannerContext::can_finish return wrong status (#15259 ) Now in ScannerContext::push_back_scanner_and_reschedule, _num_running_scanners-- is before _num_scheduling_ctx++. InPipScannerContext::can_finish, we check _num_running_scanners == 0 && _num_scheduling_ctx == 0 without obtaining _transfer_lock. In follow case, PipScannerContext::can_finish will return wrong result. _num_running_scanners-- Check _num_running_scanners == 0 && _num_scheduling_ctx == 0` return true. _num_scheduling_ctx++ So, we can set _num_running_scanners-- in the last of this func. Describe your changes. PipScannerContext::get_block_from_queue not block. Set _num_running_scanners-- in the last of ScannerContext::push_back_scanner_and_reschedule.	2023-01-09 08:46:58 +08:00
yiguolei	0e651365ca	[profile](scanner) add per scanner running time profile (#15321 ) * [profile](scanner) add per scanner running time profile Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-12-26 08:55:07 +08:00
TengJianPing	8c0e13ab51	[improvement](profile) add detail memory counter for exec nodes (#14806 ) * [improvement](profile) improve accuraccy of memory usage and add detail memory counter * fix	2022-12-05 11:51:52 +08:00
HappenLee	12304bc0ee	[Pipeline](exec) Support pipeline exec engine (#14736 ) Co-authored-by: Lijia Liu <liutang123@yeah.net> Co-authored-by: HappenLee <happenlee@hotmail.com> Co-authored-by: Jerry Hu <mrhhsg@gmail.com> Co-authored-by: Pxl <952130278@qq.com> Co-authored-by: shee <13843187+qzsee@users.noreply.github.com> Co-authored-by: Gabriel <gabrielleebuaa@gmail.com> ## Problem Summary: ### 1. Design DSIP: https://cwiki.apache.org/confluence/display/DORIS/DSIP-027%3A+Support+Pipeline+Exec+Engine ### 2. How to use: Set the environment variable `set enable_pipeline_engine = true; `	2022-12-02 17:11:34 +08:00
starocean999	95591ce49a	[refactor](cv)wait on condition variable more gently (#12620 )	2022-11-08 08:40:31 +08:00
Mingyu Chen	2fb218173e	[improvement](scan) change the max thread num and num of free blocks in new scan (#13793 ) 1. In the previous implementation, the max thread num of olap scanner was set relatively small, such as 3. which would slow down some of queries. In this PR, I changed the max thread num to a quarter of the scaner thread pool(default is 12), which is less than the old scan node's max thread num, but larger than the previous implementation. The upper limit of the max thread num of the old scan node is too high, which is not reasonable. 2. Lower down the number of pre allocated free blocks.	2022-10-31 14:00:06 +08:00
Mingyu Chen	c1ce48ffe4	[fix](new-scann) scanner may be marked close twice (#13263 )	2022-10-11 15:37:15 +08:00
slothever	820ec435ce	[feature-wip](parquet-reader) refactor parquet_predicate (#12896 ) This change serves the following purposes: 1. use ScanPredicate instead of TCondition for external table, it can reuse old code branch. 2. simplify and delete some useless old code 3. use ColumnValueRange to save predicate	2022-09-28 21:27:13 +08:00
Mingyu Chen	efd2bdb203	[improvement](new-scan) avoid too many scanner context scheduling (#12491 ) When select large number of data from a table, the profile will show that: - ScannerCtxSchedCount: 2.82664M(2826640) But there is only 8 times of ScannerSchedCount, most of them are busy running. After improvement, the ScannerCtxSchedCount will be reduced to only 10.	2022-09-12 10:22:54 +08:00
Mingyu Chen	a16cf0e2c8	[feature-wip](scan) add profile for new olap scan node (#12042 ) Copy most of profiles from VOlapScanNode and VOlapScanner to NewOlapScanNode and NewOlapScanner. Fix some blocking bug of new scan framework. TODO: Memtracker Opentelemetry spen The new framework is still disabled by default, so it will not effect other feature.	2022-08-30 10:55:48 +08:00
Mingyu Chen	05da3d947f	[feature-wip](new-scan) add scanner scheduling framework (#11582 ) There are currently many types of ScanNodes in Doris. And most of the logic of these ScanNodes is the same, including: Runtime filter Predicate pushdown Scanner generation and scheduling So I intend to unify the common logic of all ScanNodes. Different data sources only need to implement different Scanners for data access. So that the future optimization for scan can be applied to the scan of all data sources, while also reducing the code duplication. This PR mainly adds 4 new class: VScanner All Scanners' parent class. The subclasses can inherit this class to implement specific data access methods. VScanNode The unified ScanNode, and is responsible for common logic including RuntimeFilter, predicate pushdown, Scanner generation and scheduling. ScannerContext ScannerContext is responsible for recording the execution status of a group of Scanners corresponding to a ScanNode. Including how many scanners are being scheduled, and maintaining a producer-consumer blocks queue between scanners and scan nodes. ScannerContext is also the scheduling unit of ScannerScheduler. ScannerScheduler schedules a ScannerContext at a time, and submits the Scanners to the scanner thread pool for data scanning. ScannerScheduler Unified responsible for all Scanner scheduling tasks Test: This work is still in progress and default is disabled. I tested it with jmeter with 50 concurrency, but currently the scanner is just return without data. The QPS can reach about 9000. I can't compare it to origin implement because no data is read for now. I will test it when new olap scanner is ready. Co-authored-by: morningman <morningman@apache.org>	2022-08-23 08:45:18 +08:00

49 Commits