doris

Author	SHA1	Message	Date
Chester	7402fee1fc	[feature](function) support ip function ipv6_string_to_num(_or_default, _or_null), inet6_aton (#28361 )	2024-01-05 19:24:45 +08:00
Xin Liao	67b9d38d83	[fix](delete) fix incorrect tablet schema of delete predicate rowset (#29536 )	2024-01-05 18:26:30 +08:00
morrySnow	2b3e75bb27	[fix](Nereids) exists should not return null (#29435 )	2024-01-05 18:13:21 +08:00
zhangstar333	132ff6c6de	[opt](Nereids) add float type signature for sum aggregate function (#29503 ) * [opt](Nereids) add float type signature for sum aggregate function	2024-01-05 18:06:16 +08:00
morrySnow	64696829d1	[fix](Nereids) mark join should not eliminate join when child is empty (#29409 )	2024-01-05 11:55:37 +08:00
zhangstar333	f3bbc7b876	[enchancement](delete) fix delete stmt return error with fold on be (#28557 )	2024-01-05 11:27:21 +08:00
morrySnow	baec2657dd	[fix](Nereids) should cast NOT's child to boolean when analyze (#29433 )	2024-01-05 11:20:39 +08:00
Jerry Hu	c0f63915f7	[chore](test) make configuartion of parallel scan be fuzzy (#29356 )	2024-01-05 11:09:43 +08:00
HHoflittlefish777	9aafcb18bd	[fix](move-memtable) disable move memtable when light schema change is false (#29362 )	2024-01-04 23:03:35 +08:00
seawinde	6a836a53df	[feature](mv) add mv rewrite info to explain (#29153 ) In query rewrite by mv process, we may want know the mv rewrite process info such as which materializedView is used by rewrite, which materializedView is rewritten successfully, and chose which materializedView by cost finally. We can run sql as following to see the mv rewrite process summary info `explain <your_query_sql>` MaterializedView rewrite info is under the MATERIALIZATIONS tag. For example as following: we can see that materializedView with name `mv2_3` is rewritten successfuly and chosen finally. and materializedView with name `mv2_4` and `mv1_3` is avaliable but rewrite fail Materialized View MaterializedViewRewriteFail: name: mv2_4 FailSummary: The graph logic between query and view is not consistent name: mv1_3 FailSummary: Match mode is invalid MaterializedViewRewriteSuccessButNotChose: Names: MaterializedViewRewriteSuccessAndChose: Names: mv2_3 `MaterializedViewRewriteFail`: it means that it's failure when try to use this materilaized view to represnt the query, `NAME` is the name of MTMV. `FAIL_SUMMARY` is the summary for the fail reason. `MaterializedViewRewriteSuccessButNotChose` it means that try to use this materilaized view to represnt the query successfully, but cbo optimizer doesn't chose it finally. `MaterializedViewRewriteSuccessAndChose` it means that try to use this materilaized view to represnt the query successfully and cbo optimizer chose it finally. If want to see detail info, we can also run sql as following to see the mv rewrite process detail info `explain memo plan <your_query_sql>` MaterializedView rewrite info is under the MATERIALIZATIONS tag, For example as following: we can see the materializedView with name `mv2_3` is rewritten successfuly and chosen finally. and materializedViews with name of `mv2_4` and `mv1_3` is failed with falil reason. ========== MATERIALIZATIONS ========== materializationContexts: MaterializationContext[mv1_3] ( rewriteSuccess=false, failReason=[ ObjectId : ObjectId#257. Summary : Match mode is invalid. Reason : matchMode is VIEW_PARTIAL. ObjectId : ObjectId#260. Summary : Match mode is invalid. Reason : matchMode is VIEW_PARTIAL. ObjectId : ObjectId#251. Summary : Match mode is invalid. Reason : matchMode is VIEW_PARTIAL. ObjectId : ObjectId#254. Summary : Match mode is invalid. Reason : matchMode is VIEW_PARTIAL. ] ) MaterializationContext[mv2_4] ( rewriteSuccess=false, failReason=[ ObjectId : ObjectId#771. Summary : The graph logic between query and view is not consistent. Reason : graph logical is not equal query join edges is [<{0} --LEFT_OUTER_JOIN-- {1}>], view join edges is [<{0} --LEFT_OUTER_JOIN-- {1}>], query filter edges is [<{0} --FILTER-- {}>, <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]], view filter edges is [<{0} --FILTER-- {}>, <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]] inferred edge with conditions {} with error edge <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]=[(o_orderdate#20 = 2023-12-01)]. ObjectId : ObjectId#762. Summary : The graph logic between query and view is not consistent. Reason : graph logical is not equal query join edges is [<{0} --LEFT_OUTER_JOIN-- {1}>], view join edges is [<{0} --LEFT_OUTER_JOIN-- {1}>], query filter edges is [<{0} --FILTER-- {}>, <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]], view filter edges is [<{0} --FILTER-- {}>, <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]] inferred edge with conditions {} with error edge <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]=[(o_orderdate#20 = 2023-12-01)]. ] ) MaterializationContext[mv2_3] ( rewriteSuccess=true, failReason=[ ] ) `ObjectId` is the id of group expression. `Summary`is is the summary for the fail reason. `Reason` is the detail fail reason such as the info as above MaterializationContext[mv2_4] ( rewriteSuccess=false, failReason=[ ObjectId : ObjectId#762. Summary : The graph logic between query and view is not consistent. Reason : graph logical is not equal query join edges is [<{0} --LEFT_OUTER_JOIN-- {1}>], view join edges is [<{0} --LEFT_OUTER_JOIN-- {1}>], query filter edges is [<{0} --FILTER-- {}>, <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]], view filter edges is [<{0} --FILTER-- {}>, <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]] inferred edge with conditions {} with error edge <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]=[(o_orderdate#20 = 2023-12-01)]. ] `0` represent table lineitem `1` represent table orders `[<{0} --LEFT_OUTER_JOIN-- {1}>]` means the edge which is lineitem left outer join orders `[<{0} --FILTER-- {}>, <{1} --FILTER-- {}>[[] , [<{0} --LEFT_OUTER_JOIN-- {1}>]]]` means there is filter above orders which can not pull up because the edge `[<{0} --LEFT_OUTER_JOIN-- {1}>]`. this can not rewrite because `[(o_orderdate#20 = 2023-12-01)]` in query is not found in mv2_4 mv1_3 def as following: CREATE MATERIALIZED VIEW mv1_3 BUILD IMMEDIATE REFRESH auto ON SCHEDULE EVERY 1 hour DISTRIBUTED BY RANDOM BUCKETS 12 PROPERTIES ('replication_num' = '1') as select o_orderstatus, o_clerk from orders where O_ORDERDATE = '2023-12-01' group by o_orderstatus, o_clerk; mv2_3 def as following: CREATE MATERIALIZED VIEW mv2_3 BUILD IMMEDIATE REFRESH auto ON SCHEDULE EVERY 1 hour DISTRIBUTED BY RANDOM BUCKETS 12 PROPERTIES ('replication_num' = '1') as select l_linestatus, o_clerk, from ( select * from lineitem where l_shipdate = '2023-12-01' ) t1 left join ( select * from orders where o_orderdate = '2023-12-01' ) t2 on l_orderkey = o_orderkey group by l_linestatus, o_clerk; mv2_4 def as following: CREATE MATERIALIZED VIEW mv2_4 BUILD IMMEDIATE REFRESH auto ON SCHEDULE EVERY 1 hour DISTRIBUTED BY RANDOM BUCKETS 12 PROPERTIES ('replication_num' = '1') as select l_linestatus, o_clerk, from ( select * from lineitem where l_shipdate >= '2023-12-01' and l_shipdate <= '2023-12-05' ) t1 left join ( select * from orders where o_orderdate >= '2023-12-01' and o_orderdate <= '2023-12-05' ) t2 on l_orderkey = o_orderkey group by l_linestatus, o_clerk;	2024-01-04 23:01:55 +08:00
LiBinfeng	abd9000368	[Feat](Nereids) add distribute hint to leading hint (#28562 ) add distribute hint to leading hint, we can use leading like: /+ leading(t1 broadcase{t2 t3}) / after this commit	2024-01-04 17:51:06 +08:00
Pxl	441fb49345	[Bug](load) fix load failed on stream load tvf into agg state (#28420 ) fix load failed on stream load tvf into agg state	2024-01-04 17:38:31 +08:00
wuwenchi	bfe65565d8	[feature](paimon)support native reader (#29339 ) Support native reader fro paimon. Upgrade paimon 0.5 to 0.6 : apache/doris-shade#32	2024-01-04 14:31:48 +08:00
Pxl	b26f3c37bd	[Bug](config) set enabe_agg_state to need forward (#29454 ) set enabe_agg_state to need forward	2024-01-04 10:47:29 +08:00
seawinde	5e39cdf053	[doc](nereids)Add query rewrite by materialized view feature summary and desc doc (#29370 )	2024-01-04 10:38:30 +08:00
Jerry Hu	4ba4767eef	[improvement](scan) make global runtime filter support in-list filter (#29394 )	2024-01-04 09:12:30 +08:00
seawinde	49a3bab399	[fix](nereids) fix aggregate function roll up when expression arguments is not equals (#29256 ) when aggregate function roll up, we should check the qury and mv function argument is equal such as mv def and query sql as following, it should not rewrite success, because the bitmap_union_basic field augument is not equal to the `count(distinct case when o_shippriority > 10 and o_orderkey IN (1, 3) then o_custkey else null end)` field in query mv def: > select l_shipdate, o_orderdate, l_partkey, l_suppkey, > sum(o_totalprice) as sum_total, > max(o_totalprice) as max_total, > min(o_totalprice) as min_total, > count() as count_all, > bitmap_union(to_bitmap(case when o_shippriority > 1 and o_orderkey IN (1, 3) then o_custkey else null end)) as bitmap_union_basic > from lineitem > left join orders on lineitem.l_orderkey = orders.o_orderkey and l_shipdate = o_orderdate > group by > l_shipdate, > o_orderdate, > l_partkey, > l_suppkey; query sql: > select t1.l_partkey, t1.l_suppkey, o_orderdate, > sum(o_totalprice), > max(o_totalprice), > min(o_totalprice), > count(), > count(distinct case when o_shippriority > 10 and o_orderkey IN (1, 3) then o_custkey else null end) > from (select * from lineitem where l_shipdate = '2023-12-11') t1 > left join orders on t1.l_orderkey = orders.o_orderkey and t1.l_shipdate = o_orderdate > group by > o_orderdate, > l_partkey, > l_suppkey;	2024-01-03 18:58:18 +08:00
LiBinfeng	d19530c4c2	[Fix](Nereids) fix leading hint dealing with big brace (#29405 ) Co-authored-by: libinfeng <libinfeng@selectdb.com>	2024-01-03 18:13:38 +08:00
Jibing-Li	44628d37c8	Enable minmax push down for unique table while doing analyze. (#29462 )	2024-01-03 18:10:38 +08:00
starocean999	193b7518ab	[enhancement](nereids)throw readable exception when meet missing column in agg's output (#29243 )	2024-01-03 12:59:32 +08:00
wuwenchi	2a9b4a0f76	[enhancement](paimon)support predict for null and notnull (#29134 )	2024-01-03 12:53:39 +08:00
zy-kkk	79eb575d7c	[Improvement](nereids)Support ODBC table for new planner. (#29129 )	2024-01-03 12:51:07 +08:00
zclllyybb	c0db8533af	[fix](load) fix single replica load with auto partition	2024-01-03 11:53:09 +08:00
DuRipeng	08353f6027	[Enhance](fe) Iceberg table in HMS catalog supports broker scan (#28107 ) My organization uses HMS catalog to accelerate Lake query. Sine we have custom distributed file system and hard to integrate to FE / BE, we introduce HMS Catalog broker scan support (#24830) and implement custom distributed file system adaption in broker. We want to expand the scope of use to Iceberg table scan in HMS Catalog. This PR introduces broker-scan-related `IcebergBrokerIO`, `BrokerInputFile`, `BrokerInputStream` for Iceberg table scan	2024-01-03 11:29:12 +08:00
HappenLee	14e7eb7624	[Opt](rf) Opt broadcast join remote runtime filter merge and wait (#29439 )	2024-01-03 11:21:28 +08:00
morrySnow	cdf2bb24a4	[fix](Nereids) topn runtime filter only support simplest case (#29312 ) only support simple case: select ... from tbl [where ...] order by ... limit ...	2024-01-03 09:55:00 +08:00
airborne12	97234a6485	[Enhancement](inverted index) strictly checkout inverted index properties (#29421 )	2024-01-03 09:54:31 +08:00
meiyi	69524ccf98	[improve](group commit) Group commit support commit by data size (#29428 )	2024-01-02 23:20:23 +08:00
meiyi	d54cde51ba	[fix](group commit) fix http stream group commit begin a txn (#29399 )	2024-01-02 21:58:00 +08:00
jakevin	f2fa62f524	[minor](Nereids): cleanup code (#29413 )	2024-01-02 20:13:59 +08:00
zhangstar333	af39217d14	[improve](agg)support push down min/max on unique table (#29242 )	2024-01-02 19:40:23 +08:00
Jibing-Li	3eca457edd	[fix](statistics)Fix set partition loaded replayer try to write edit log bug (#29395 ) While observer or checkpointer replay SetPartitionLoaded log, it shouldn't try to write the log. This pr is to fix the bug.	2024-01-02 19:38:34 +08:00
Pxl	d474d4e701	[Bug](rollup) fallback and disable tulpe when rollup select failed on old planner (#29226 ) * fallback and disable tulpe when rollup select failed on old planner * add case	2024-01-02 17:54:56 +08:00
starocean999	4692a62e47	[fix](planner)nullable info of agg node's intermediate and output slots should be same (#29315 )	2024-01-02 15:53:28 +08:00
meiyi	706463781c	[refactor](group commit) refactor group commit wal code (#29375 )	2024-01-02 15:52:03 +08:00
谢健	c07e3b7b47	[feature](Nereids): eliminate left outer join by unique (#28853 )	2024-01-02 15:50:49 +08:00
Calvin Kirs	eac96005b3	[Fix](Job)Concurrency may result in event loss (#29385 )	2024-01-02 14:06:16 +08:00
xzj7019	90b2ee90b2	[nereids] consider numNulls in filter estimation (#29184 ) consider numNulls in filter estimation	2024-01-02 13:51:11 +08:00
Mingyu Chen	4cbbd25d8c	[fix](ctx) manager the lifecycle of connection context (#29346 ) In FrontendService, we may create some connection context and set it as a thread local varaible. These context should be removed from thread local after call. Otherwise, it may be reused by other thread incorrectly.	2024-01-01 23:32:28 +08:00
zhangdong	01a0f0915f	[fix](mtmv)fix insert overwrite will generate garbage temporary partition when restarting FE (#29075 )	2024-01-01 08:12:26 +08:00
zhangdong	5985d216f3	[feature](mtmv)support cancel mtmv task command (#29252 ) - `CANCEL MATERIALIZED VIEW TASK taskId on mvName` - CANCEL MATERIALIZED VIEW TASK, tasks("type"="mv") and jobs("type"="mv") support check auth use priv of mv - tasks and jobs add column mvName and mvDbName，you can use `select * from tasks("type"="mv") where MvName="xxx"` get all tasks of mv - fix `desc mv all` error - fix p0 The task sequence is incorrect	2023-12-31 23:10:30 +08:00
Nitin-Kashyap	6c74313f60	[Fix](ut)(show) added UT cases for show backup stmt (#28604 ) * Added UT for show backup * fixed some toSql() space and missing CLAUSE * Updated Doc with relevant syntax	2023-12-31 15:49:57 +08:00
Guangming Lu	f92c0acba0	[Bug] When using the hive catalog the hive table name is uppercase, the Doris query is reported wrong #29058 (#29059 )	2023-12-31 13:48:08 +08:00
minghong	ac636a2aba	[fix](nereids) partition prunner evaluates "not x=const" on single node range #29164	2023-12-31 08:50:16 +08:00
morrySnow	6aaa829870	[fix](Nereids) connection_id() should return bigint type (#29287 )	2023-12-30 22:25:27 +08:00
lw112	f5e2ea1699	[fix][fe]Fixed show proc tablet inaccuracies (#29186 ) Both tables simply need to change the tablet id to get the same result from the show proc statement. before： image Just change the tablet id to get the result, 10139 does not belong to the tablet in the table id of 10127 later: image	2023-12-30 20:49:01 +08:00
walter	80fb496523	[improve](proc) Show journal size in bdbje journal dir (#29314 )	2023-12-30 16:43:06 +08:00
huanghaibin	03901b9a7a	[enhancement](group_commit): refector relay wal code (#29183 )	2023-12-30 12:59:46 +08:00
qiye	2c4e52e44e	[fix](es catalog) only es_query function can push down to ES (#29320 ) Issue Number: close #29318 1. Only push down `es_query` function to ES 2. Add null check where ES query result not have `_source` or `fields` fields.	2023-12-30 09:33:26 +08:00
Xiangyu Wang	8407490053	[feature-wip](nereids) Support some spark-sql built-in functions when set dialect=spark_sql (#28531 )	2023-12-30 00:10:35 +08:00

1 2 3 4 5 ...

5734 Commits