doris

Author	SHA1	Message	Date
Mryange	f7d52b5b1c	[feature](expr) add type check when expr prepare (#33330 )	2024-04-11 09:31:50 +08:00
Mryange	8e19cdd745	[featrue](expr) support common subexpression elimination be part (#32673 )	2024-04-10 11:56:21 +08:00
Mryange	baf3ae1a93	[refactor](nereids)unify outputTupleDesc and projection be part (#32439 )	2024-03-22 16:35:43 +08:00
wangbo	0433b8730d	[Feature](profile)add shuffle send rows/bytes #30456	2024-01-28 18:25:08 +08:00
zclllyybb	24ed3e4103	[Fix](Expr&code-style) check prepare&open before every VExpr execute (#26673 )	2024-01-23 10:09:54 +08:00
Jerry Hu	1b1e088e83	[fix](exec_node) crashing caused by cancelled query in ExecNode (#30192 )	2024-01-23 10:09:54 +08:00
wangbo	0d691c638b	[Feature](profile)Support report runtime workload statistics #29591	2024-01-12 11:59:27 +08:00
caiconghui	7081139bdc	[fix](block) fix be core while mutable block merge may cause different row size between columns in origin block (#27943 )	2023-12-25 20:35:22 +08:00
Mryange	10483ea12c	[fix](profile) fix error set with peak_memory_usage in pipeline #27749	2023-12-02 14:12:38 +08:00
yiguolei	6ed0be8e3c	[refactor](profilev2) unify the counter name in shuffle operator and normal operator (#27267 ) using blocksproduced and rowsproduced to unify the counter name in DataStreamSender and other exec node, or exchange operator and other operators. blocks produced and rows produced are more easy to understand. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-11-20 14:21:39 +08:00
yiguolei	836cda65d8	[refactor](profilev2) split merged profile to a single runtime profile to make the logic more clear (#27184 )	2023-11-19 13:21:50 +08:00
zhiqiang	d3fd923447	[opt](pipeline) Return InternalError to FE instead of doing a useless DCHECK in ExecNode #27035 Effect: Client will see error message like below when BE meeting plan logical error. RROR 1105 (HY000): errCode = 2, detailMessage = ([xxx]())[CANCELLED]Logical error during processing VNewOlapScanNode(dr_case_tag), output of projections 2 mismatches with exec node output 3	2023-11-15 18:15:21 +08:00
zhiqiang	a5565f68b2	[Refactor](opentelemetry) Remove opentelemetry (#26605 )	2023-11-09 18:05:34 +08:00
Mingyu Chen	e20cab64f4	[improvement](scan) avoid too many scanners for file scan node (#25727 ) In previous, when using file scan node(eq, querying hive table), the max number of scanner for each scan node will be the `doris_scanner_thread_pool_thread_num`(default is 48). And if the query parallelism is N, the total number of scanner would be 48 * N, which is too many. In this PR, I change the logic, the max number of scanner for each scan node will be the `doris_scanner_thread_pool_thread_num / query parallelism`. So that the total number of scanners will be up to `doris_scanner_thread_pool_thread_num`. Reduce the number of scanner can significantly reduce the memory usage of query.	2023-10-29 17:41:31 +08:00
zhiqiang	d6c64d305f	[chore](log) Add log to trace query execution #25739	2023-10-26 14:09:25 +08:00
Mryange	6b2eed779c	[feature](AuditLog) add scanRows scanBytes in auditlog (#25435 )	2023-10-25 10:00:35 +08:00
Jerry Hu	b5ee4a9dbb	[enhancement](profilev2) add some fields for profile v2 (#25611 ) Add 3 counters for ExecNode: ExecTime - Total execution time(excluding the execution time of children). OutputBytes - The total number of bytes output to parent. BlockCount - The total count of blocks output to parent.	2023-10-23 15:55:40 +08:00
Mryange	0631ed61b0	[feature](profilev2) Preliminary support for profilev2. (#24881 ) You can set the level of counters on the backend using ADD_COUNTER_WITH_LEVEL/ADD_TIMER_WITH_LEVEL. The profile can then merge counters with level 1. set profile_level = 1; such as sql select count() from customer join item on c_customer_sk = i_item_sk profile Simple profile PLAN FRAGMENT 0 OUTPUT EXPRS: count() PARTITION: UNPARTITIONED VRESULT SINK MYSQL_PROTOCAL 7:VAGGREGATE (merge finalize) \| output: count(partial_count())[#44] \| group by: \| cardinality=1 \| TotalTime: avg 725.608us, max 725.608us, min 725.608us \| RowsReturned: 1 \| 6:VEXCHANGE offset: 0 TotalTime: avg 52.411us, max 52.411us, min 52.411us RowsReturned: 8 PLAN FRAGMENT 1 PARTITION: HASH_PARTITIONED: c_customer_sk STREAM DATA SINK EXCHANGE ID: 06 UNPARTITIONED TotalTime: avg 106.263us, max 118.38us, min 81.403us BlocksSent: 8 5:VAGGREGATE (update serialize) \| output: partial_count()[#43] \| group by: \| cardinality=1 \| TotalTime: avg 679.296us, max 739.395us, min 554.904us \| BuildTime: avg 33.198us, max 48.387us, min 28.880us \| ExecTime: avg 27.633us, max 40.278us, min 24.537us \| RowsReturned: 8 \| 4:VHASH JOIN \| join op: INNER JOIN(PARTITIONED)[] \| equal join conjunct: c_customer_sk = i_item_sk \| runtime filters: RF000[bloom] <- i_item_sk(18000/16384/1048576) \| cardinality=17,740 \| vec output tuple id: 3 \| vIntermediate tuple ids: 2 \| hash output slot ids: 22 \| RowsReturned: 18.0K (18000) \| ProbeRows: 18.0K (18000) \| ProbeTime: avg 862.308us, max 1.576ms, min 666.28us \| BuildRows: 18.0K (18000) \| BuildTime: avg 3.8ms, max 3.860ms, min 2.317ms \| \|----1:VEXCHANGE \| offset: 0 \| TotalTime: avg 48.822us, max 67.459us, min 30.380us \| RowsReturned: 18.0K (18000) \| 3:VEXCHANGE offset: 0 TotalTime: avg 33.162us, max 39.480us, min 28.854us RowsReturned: 18.0K (18000) PLAN FRAGMENT 2 PARTITION: HASH_PARTITIONED: c_customer_id STREAM DATA SINK EXCHANGE ID: 03 HASH_PARTITIONED: c_customer_sk TotalTime: avg 753.954us, max 1.210ms, min 499.470us BlocksSent: 64 2:VOlapScanNode TABLE: default_cluster:tpcds.customer(customer), PREAGGREGATION: ON runtime filters: RF000[bloom] -> c_customer_sk partitions=1/1, tablets=12/12, tabletList=1550745,1550747,1550749 ... cardinality=100000, avgRowSize=0.0, numNodes=1 pushAggOp=NONE TotalTime: avg 18.417us, max 41.319us, min 10.189us RowsReturned: 18.0K (18000) --------- Co-authored-by: yiguolei <676222867@qq.com>	2023-10-07 11:16:53 +08:00
Lijia Liu	864a0f9bcb	[opt](pipeline) Make pipeline fragment context send_report asynchronized (#23142 )	2023-09-28 17:55:53 +08:00
zhiqqqq	09e03247ec	[chore](readability) Better readability of ExecNode.cpp #24733	2023-09-22 08:54:57 +08:00
meiyi	82dc970916	[feature](insert) Support group commit insert (#22829 )	2023-09-08 15:51:03 +08:00
HappenLee	a1223218f3	[pipeline](exec) Support shared scan in jdbc and odbc scan node (#22826 ) Support shared scan in jdbc and odbc scan node to improve exec performance	2023-08-10 18:34:45 +08:00
Xinyi Zou	96f42ca20a	[fix](memory) Independent count exec node memory profile (#22598 ) Independent count exec node memory profile, after #22582	2023-08-06 10:56:31 +08:00
zhangstar333	1c6246f7ee	[improve](agg) support distinct agg node (#22169 ) select c_name from customer union select c_name from customer this sql used agg node to get distinct row of c_name, so it's no need to wait for inserted all data to hash map, could output the data which it's inserted into hash map successed.	2023-07-28 13:54:10 +08:00
Mryange	6875ef4b8b	[refactor](mem_reuse) refactor mem_reuse in MutableBlock (#21564 )	2023-07-20 22:53:19 +08:00
Lijia Liu	d86c67863d	Remove unused code (#21735 )	2023-07-12 14:48:13 +08:00
yiguolei	31a4f96f01	[refactor](exprcontext) move close to expr context's dector method (#20747 ) The close method does nothing. But I am not sure we could remove it. So that I add it to dector method and remove many many calls.	2023-06-14 18:01:07 +08:00
HappenLee	51bbf17786	[Refactor](Profile) Add and refactor the join profile (#20693 )	2023-06-13 09:06:51 +08:00
HappenLee	576288cc89	[Profile](exec) Remove unless profile in pipeline exec engine (#20337 )	2023-06-02 11:39:11 +08:00
Jerry Hu	9f8de89659	[refactor](exec) replace the single pointer with an array of 'conjuncts' in ExecNode (#19758 ) Refactoring the filtering conditions in the current ExecNode from an expression tree to an array can simplify the process of adding runtime filters. It eliminates the need for complex merge operations and removes the requirement for the frontend to combine expressions into a single entity. By representing the filtering conditions as an array, each condition can be treated individually, making it easier to add runtime filters without the need for complex merging logic. The array can store the individual conditions, and the runtime filter logic can iterate through the array to apply the filters as needed. This refactoring simplifies the codebase, improves readability, and reduces the complexity associated with handling filtering conditions and adding runtime filters. It separates the conditions into discrete entities, enabling more straightforward manipulation and management within the execution node.	2023-05-29 11:47:31 +08:00
zhangstar333	53ae24912f	[vectorized](feature) support partition sort node (#19708 )	2023-05-25 11:22:02 +08:00
luozenglin	272a7565b8	[improvement](tracing) Remove useless span levels from be side tracing (#19665 ) 1. Remove an exec node method corresponding to a span and replace it with an exec node corresponding to a span; 2. Fix some problems with tracing in pipeline.	2023-05-17 19:04:52 +08:00
yiguolei	8ef9212ddc	[enhancement](exceptionsafe) force check exec node method's return value (#19538 )	2023-05-12 10:21:00 +08:00
yiguolei	4e4fb33995	[refactor](conjuncts) simplify conjuncts in exec node (#19254 ) Co-authored-by: yiguolei <yiguolei@gmail.com> Currently, exec node save exprcontext*, but the object is in object pool, the code is very unclear. we could just use exprcontext.	2023-05-04 18:04:32 +08:00
Adonis Ling	e412dd12e8	[chore](build) Use include-what-you-use to optimize includes (PART II) (#18761 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-19 23:11:48 +08:00
yongjinhou	b59c4b4702	[fix](build) Fix missing header files (#18740 )	2023-04-17 21:22:15 +08:00
Adonis Ling	9e960f4c4f	[chore](build) Use include-what-you-use to optimize includes (#18681 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-17 11:44:58 +08:00
yiguolei	ac5b47e515	[bugfix](addlog) expr context is not closed and will core during deconstructor (#18134 )	2023-03-27 21:59:46 +08:00
Gabriel	06788bc2d0	[Bug](pipeline) Fix projection on streaming operator (#16592 )	2023-02-10 15:57:26 +08:00
yiguolei	eba70f972e	[improvement](global context) remove some unused method from runtime state (#16329 ) This is part of #16296. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-02-02 10:24:55 +08:00
luozenglin	62ff20d462	[fix](compile) fix the compile error when WITH_MYSQL is ON. (#16195 )	2023-01-29 22:52:44 +08:00
Pxl	2b5f95f08a	[Bug](function) remove datev2 signature of hour_ceil/hour_floor #16168	2023-01-29 11:27:56 +08:00
yiguolei	79ad74637d	[refactor](remove expr) remove non vectorized Expr and ExprContext related codes (#16136 )	2023-01-24 10:45:35 +08:00
yiguolei	a3cd0ddbdc	[refactor](remove broker scan node) it is not useful any more (#16128 ) remove broker scannode remove broker table remove broker scanner remove json scanner remove orc scanner remove hive external table remove hudi external table remove broker external table, user could use broker table value function instead Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-23 19:37:38 +08:00
Gabriel	d062ca2944	[refactor](vectorized) remove unnecessary vectorization check (#15984 )	2023-01-17 12:21:46 +08:00
Xinyi Zou	97fcad76f8	[enhancement](memtracker) Improve readability (#15716 )	2023-01-16 16:30:35 +08:00
yiguolei	16862d9b43	[refactor](remove unused code) remove buffer pool and disk io mgr (#15853 ) * [refactor](remove buffer pool and disk io mgr) remove unused code Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-13 09:42:58 +08:00
yiguolei	d857b4af1b	[refactor](remove row batch) remove impala rowbatch structure (#15767 ) * [refactor](remove row batch) remove impala rowbatch structure Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-11 09:37:35 +08:00
slothever	90a92f0643	[feature-wip](multi-catalog) add iceberg tvf to read snapshots (#15618 ) Support new table value function `iceberg_meta("table" = "ctl.db.tbl", "query_type" = "snapshots")` we can use the sql `select * from iceberg_meta("table" = "ctl.db.tbl", "query_type" = "snapshots")` to get snapshots info of a table. The other iceberg metadata will be supported later when needed. One of the usage: Before we use following sql to time travel: `select * from ice_table FOR TIME AS OF "2022-10-10 11:11:11"`; `select * from ice_table FOR VERSION AS OF "snapshot_id"`; we can use the snapshots metadata to get the `committed time` or `snapshot_id`, and then, we can use it as the time or version in time travel clause	2023-01-10 22:37:35 +08:00
Gabriel	d0e8f84279	[feature](vectorized) Support MemoryScratchSink on vectorized engine (#15612 )	2023-01-10 10:38:35 +08:00

1 2 3

148 Commits