doris

Author	SHA1	Message	Date
Jack Drogon	7d49d9cf99	[improvement](dynamic partition) Fix dynamic partition no bucket (#18300 ) Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>	2023-04-02 15:51:21 +08:00
slothever	97aab138aa	[fix](parquet-reader) reset value idx in bool rle decoder and support iceberg datetime(3) (#18245 ) 1. Fix value idx in bool rle decoder 2. Iceberg table support datetimev2(3). In the previous version, we converted hive timestamp to datetimev2(0) default.	2023-04-01 21:00:01 +08:00
jakevin	9e087622ab	[fix](Nereids): fix JoinReorderContext in withXXX() of LogicalJoin. (#18299 )	2023-04-01 16:51:27 +08:00
Mingyu Chen	7e61a85331	[refactor](libhdfs) introduce hadoop libhdfs (#18204 ) 1. Introduce hadoop libhdfs 2. For Linux-X86 platform, use the hadoop libhdfs 3. For other platform, use libhdfs3, because currently we don't have hadoop libhdfs binary for other platform Co-authored-by: adonis0147 <adonis0147@gmail.com>	2023-03-31 18:41:39 +08:00
mch_ucchi	3ea98b65df	[Fix](Nereids) fix nereids failed to parse set operation with query in parenthesis (#18062 ) sql like the format (q1, q2, q3 is a query): ``` sql (q1) UNION ALL (q2) UNION ALL (q3) ORDER BY keys ``` cannot be parsed by nereids, because order will be recognized as an alias of query, we add queryOrganization to avoid it.	2023-03-31 15:55:52 +08:00
morrySnow	1a56c56e90	[fix](planner) lateral view do not support lower case table name config (#18165 ) TableFunctionNode lower_case_table_names set to 1 and 2	2023-03-31 13:42:24 +08:00
yongkang.zhong	1c2f95b887	[improve](clickhouse jdbc) support clickhouse jdbc 4.x version (#18258 ) In clickhouse's 4.x version of jdbc, some UInt types use special Java types, so I adapted Doris's ClickHouse JDBC External ``` com.clickhouse.data.value.UnsignedByte; com.clickhouse.data.value.UnsignedInteger; com.clickhouse.data.value.UnsignedLong; com.clickhouse.data.value.UnsignedShort; ```	2023-03-31 13:40:10 +08:00
gitccl	20b3bdb000	[vectorized](function) support array_first_index function (#18175 ) mysql> select array_first_index(x->x+1>3, [2, 3, 4]); +-------------------------------------------------------------------+ \| array_first_index(array_map([x] -> x(0) + 1 > 3, ARRAY(2, 3, 4))) \| +-------------------------------------------------------------------+ \| 2 \| +-------------------------------------------------------------------+ mysql> select array_first_index(x -> x is null, [null, 1, 2]); +----------------------------------------------------------------------+ \| array_first_index(array_map([x] -> x(0) IS NULL, ARRAY(NULL, 1, 2))) \| +----------------------------------------------------------------------+ \| 1 \| +----------------------------------------------------------------------+ mysql> select array_first_index(x->power(x,2)>10, [1, 2, 3, 4]); +---------------------------------------------------------------------------------+ \| array_first_index(array_map([x] -> power(x(0), 2.0) > 10.0, ARRAY(1, 2, 3, 4))) \| +---------------------------------------------------------------------------------+ \| 4 \| +---------------------------------------------------------------------------------+	2023-03-31 12:51:29 +08:00
Pxl	307170030c	[Bug](materialized-view) fix core dump when create mv have case different with base table (#18206 ) fix core dump when create mv have case different with base table	2023-03-31 12:32:09 +08:00
zhangstar333	1b2aaab2f2	[vectorized](bug) fix some case in enable fold constant (#17997 ) fix some case in enable fold constant	2023-03-31 11:41:31 +08:00
Pxl	e7bcd970f5	[Bug](materialized-view) fix isDisableTuplesMVRewriter rreturn true when expr is literal (#18246 ) fix isDisableTuplesMVRewriter rreturn true when expr is literal	2023-03-31 11:30:47 +08:00
jakevin	8e15388074	[fix](Nereids): use CBO rule instead of using rewrite rule. (#18256 )	2023-03-31 11:23:26 +08:00
Kang	4e1e0ce06d	[bugfix](topn) fix topn optimzation wrong result for NULL values (#18121 ) 1. add PassNullPredicate to fix topn wrong result for NULL values 2. refactor RuntimePredicate to avoid using TCondition 3. refactor using ordering_exprs in fe and vsort_node	2023-03-31 10:01:34 +08:00
minghong	1abb19d0fd	filter estimation refactor (#18170 )	2023-03-31 08:49:38 +08:00
abmdocrt	a88e80f8ee	[fix](ssl)refactor some SSL info logs to debug logs (#18234 )	2023-03-31 08:41:02 +08:00
AKIRA	b5ea299697	[fix](planner) Fix agg on inlineview which with constant slot (#18201 ) Since slot that reference to constant has been marked as constant expr either, just add condition check to make sure such slot wouldn't be eliminated as constant from group exprs	2023-03-30 23:54:37 +08:00
jakevin	28793b6441	[fix](Nereids): fix copyIn() in Memo when useless project with groupplan (#18223 )	2023-03-30 23:49:21 +08:00
Xiangyu Wang	0c2ff09fcf	[Fix](multi-catalog) fix hms automatic update error. (#18252 ) Co-authored-by: wangxiangyu@360shuke.com <wangxiangyu@360shuke.com>	2023-03-30 23:09:07 +08:00
jakevin	3d2c70f75d	[fix](Nereids): fix merge_group(). (#18250 )	2023-03-30 20:34:47 +08:00
mch_ucchi	fefc0d6814	[Fix](planner)fix create view ignore order by info bug. (#18197 )	2023-03-30 20:17:46 +08:00
liujinhui	ce79ff947a	[Enhancement](spark load)Support spark time out config (#17108 )	2023-03-30 20:12:46 +08:00
morrySnow	99bd5ec022	[fix](Nereids) fix some bugs in Subquery to window rule (#18233 ) we introduce this rule by PR #17968, but some corner case do not be processed correctly. This PR fix these bugs: 1. fix window function generation method, replace inner slot with equivalent outer slot 2. forbid below scenes a. inner has a mapping project b. inner has an unexpected filter c. outer has a mapping project d. outer has an unexpected filter e. outer has additional table f. outer has same table g. outer and inner with different join condition h. outer and inner has same table with different join condition	2023-03-30 16:09:16 +08:00
amory	ea41d94582	[Improve](complex-type) Support Count(complexType) (#17868 ) Support count function for ARRAY/MAP/STRUCT type	2023-03-30 15:43:32 +08:00
Gabriel	b7af110f61	[Bug](bloomfilter) Fix bloom filter for date type (#18205 )	2023-03-30 14:15:06 +08:00
Pxl	cec983b7ef	[Chore](materialized-view) forbiden create mv with where clause contained aggregate column (#18168 ) forbiden create mv with where clause contained aggregate column create table a_table( k1 int null, k2 int not null, k3 bigint null, k4 bigint sum null, k5 bitmap bitmap_union null, k6 hll hll_union null ) aggregate key (k1,k2,k3) distributed BY hash(k1) buckets 3 properties("replication_num" = "1"); create materialized view where_1 as select k1,k4 from a_table where k4 =1; // invalid, mv on agg table need group by create materialized view where_2 as select k1,sum(k4) from a_table where k4 =1 group by k1; // invalid, k4 is agg column create materialized view where_2 as select k1,sum(k4) from a_table where k1+k4 =1 group by k1; // invalid, k4 is agg column	2023-03-30 13:03:03 +08:00
Pxl	c8ad62a3cd	[Enchancement](materialized-view) enchance materialized view where clause match (#18179 ) enchance materialized view where clause match	2023-03-30 13:02:21 +08:00
mch_ucchi	c8ea5bff1d	[Fix](planner) fix nested udf bind arguments exception (#18188 ) nested alias function will cause bind argument exception, sql like: ``` sql CREATE ALIAS FUNCTION f1(DATETIMEV2(3), INT) with PARAMETER (datetime1, int1) as date_trunc(days_sub(datetime1, int1), 'day') CREATE ALIAS FUNCTION f2(DATETIMEV2(3), int) with PARAMETER (datetime1, int1) as DATE_FORMAT(HOURS_ADD( date_trunc(datetime1, 'day'), add(multiply(floor(divide(HOUR(datetime1), divide(24,int1))), 1), 1) ), '%Y%m%d:%H') select f2(f1(now(3), 2), 3) ``` bug in FunctionCallExpr#rewriteExpr(), the retExpr will be replaced to originExpr to change the alias function to builtin function, but the retExpr.fn is not null, so when return to outer scope, the fn will be covered. That's the example: ``` f1(f1()) -> date_trunc(days_sub(date_trunc(days_sub()))) is correct and f1(f1()) -> date_trunc(days_sub(days_sub())) is bug. ``` we fix it.	2023-03-30 11:39:02 +08:00
starocean999	b3657959c9	[fix](planner )need add LateralViewRef's id into TableRef's allTableRefIds (#18220 ) 1. add LateralViewRef's id into TableRef's allTableRefIds, so the caller won't miss LateralViewRef when trying to get all the tableref ids. 2. TableFunctionNode should use child node's output tuple id as the input tuple id	2023-03-30 11:32:18 +08:00
lihangyu	9c1aad06ea	[Improve](query) improve column match performance by introducing a column name map in `MaterializedIndexMeta` (#18203 ) improve column match performance by introducing a column name map in `MaterializedIndexMeta` `getColumnByName` is slow due to the linear search process, using a map to speed up search.	2023-03-30 11:24:51 +08:00
zhangstar333	525f15dddf	[vectorized](function) support array_sortby function (#18071 )	2023-03-30 11:07:49 +08:00
ElvinWei	c0d55b54c0	[Improvement](statistics) Support for statistics removing and incremental collection (#18069 ) * Support for removing statistics and incremental collection * Fix syntax	2023-03-30 10:40:43 +08:00
Kang	58bc18af54	[Improvement](log) avoid warn log in specializeTemplateFunction at initialization (#18070 ) avoid warn log in specializeTemplateFunction at initialization like this: ``` 2023-03-23 21:17:54,931 INFO (leaderCheckpointer\|89) [FunctionSet.specializeTemplateFunction():1337] specializeTemplateFunction exception at initialize org.apache.doris.catalog.TypeException: ARRAY<DECIMALV3(9, 0)> is not MapType at org.apache.doris.catalog.MapType.specializeTemplateType(MapType.java:137) ~[fe-common-1.2-SNAPSHOT.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.FunctionSet.specializeTemplateFunction(FunctionSet.java:1321) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.FunctionSet.getFunction(FunctionSet.java:1251) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.FunctionSet.getFunction(FunctionSet.java:1216) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.FunctionSet.addBuiltinBothScalaAndVectorized(FunctionSet.java:1449) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.FunctionSet.addScalarAndVectorizedBuiltin(FunctionSet.java:1432) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.builtins.ScalarBuiltins.initBuiltins(ScalarBuiltins.java:108) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.FunctionSet.init(FunctionSet.java:87) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.Env.<init>(Env.java:585) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.catalog.Env.getCurrentEnv(Env.java:665) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.master.Checkpoint.doCheckpoint(Checkpoint.java:143) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.master.Checkpoint.runAfterCatalogReady(Checkpoint.java:77) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.common.util.MasterDaemon.runOneCycle(MasterDaemon.java:58) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.common.util.Daemon.run(Daemon.java:116) ~[doris-fe.jar:1.2-SNAPSHOT] ```	2023-03-30 10:17:21 +08:00
huangzhaowei	55bf38dbab	[feature-wip](MTMV) Use SSB ddl to test (#18150 ) Add regression tests for MTMV.	2023-03-30 00:11:38 +08:00
jakevin	58de8ec2df	[enhance](Nereids): add variable to enable Bushy Tree (#18202 )	2023-03-29 21:53:24 +08:00
Xinyi Zou	6964d9f99c	[fix](function) resubmit-fix AES/SM3/SM4 encrypt/ decrypt algorithm initialization vector bug (#17907 ) * Revert "[fix](function) fix AES/SM3/SM4 encrypt/ decrypt algorithm initialization vector bug (#17420)" This reverts commit 397cc011c4f1ba5a25c770258c13f1cd3f28b47d. * [fix-resubmit](function) fix AES/SM3/SM4 encrypt/ decrypt algorithm initialization vector bug (#17420) ECB algorithm, block_encryption_mode does not take effect, it only takes effect when init vector is provided. Solved: 192/256 supports calculation without init vector For other algorithms, an error should be reported when there is no init vector Initialization Vector. The default value for the block_encryption_mode system variable is aes-128-ecb, or ECB mode, which does not require an initialization vector. The alternative permitted block encryption modes CBC, CFB1, CFB8, CFB128, and OFB all require an initialization vector. Reference: https://dev.mysql.com/doc/refman/8.0/en/encryption-functions.html#function_aes-decrypt Note: This fix does not support smooth upgrades. during upgrade process, query may report error: funciton not found	2023-03-29 21:13:01 +08:00
minghong	f24174ebf1	when estimated rowCount is 0, adjust to 1 (#18174 )	2023-03-29 19:03:53 +08:00
zhengshiJ	b92087dee8	[Fix](Nereids) ReorderJoin rule cannot process MarkJoin correctly (#18159 ) Fix two problems, 1. The logical join containing the MarkJoinSlotRefrance column will generate a plan->MarkJoinSlotreference structure when reorderJoin is executed, and the MarkJoinSlotreference column will be restored after the reorder is completed. But when filter+crossJoin exists, it will be transformed into innerJoin in the rules, causing the map to fail, and the corresponding plan cannot be found, thus losing the MarkJoinSlotreference column. 2. Originally, the MarkJoinSlotReference column was used as the NonUserVisibleOutput of logicalJoin. At the same time, when logicalApply was generated, the added logicalProject did not include the MarkJoinSlotReference column, and the invalid logicalProject was deleted based on other rules, so as to ensure that LogicalApply was under the logicalFilter and could recognize the MarkJoinSlotReference column. But there will be problems if logicalProject cannot be deleted. Repair method 1. For logicalJoin containing MarkJoinSlotreference, the rules of reorderJoin are not executed. 2. Use MarkJoinSlotreference as the output of logicalJoin and also as the output of LogicalApply. 3. When generating LogicalApply, if MarkJoinSlotreference is included, you need to add an additional logicalProject to logicalFilter, and remove the MarkJoinSlotreference column. eg ``` logicalFilter(subquery with disconjunct) after SubqueryToApply logicalProject(without markJoinSlotReference) +-- logicalFilter(markJoinSlotReference) +-- logicalProject(with markJoinSlotReference) +-- logicalApply() ``` ``` SELECT * FROM sub_query_correlated_subquery1 WHERE k1 IN (SELECT k1 FROM sub_query_correlated_subquery3) OR k1 < 10; +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ \| Explain String \| +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ \| LogicalProject[60] ( distinct=false, projects=[k1#0, k2#1], excepts=[], canEliminate=true ) \| \| +--LogicalProject[59] ( distinct=false, projects=[k1#0, k2#1], excepts=[], canEliminate=true ) \| \| +--LogicalFilter[58] ( predicates=($c$1#7#false OR (k1#0 < 10)) ) \| \| +--LogicalProject[57] ( distinct=false, projects=[k1#0, k2#1, $c$1#7#false], excepts=[], canEliminate=true ) \| \| +--LogicalApply ( correlationSlot=[], correlationFilter=Optional.empty, isMarkJoin=true, MarkJoinSlotReference=$c$1#7#false, scalarSubCorrespondingSlot=empty ) \| \| \|--LogicalOlapScan ( qualified=default_cluster:regression_test_nereids_syntax_p0.sub_query_correlated_subquery1, indexName=<index_not_selected>, selectedIndexId=63105, preAgg=ON ) \| \| +--LogicalProject[34] ( distinct=false, projects=[k1#2], excepts=[], canEliminate=true ) \| \| +--LogicalOlapScan ( qualified=default_cluster:regression_test_nereids_syntax_p0.sub_query_correlated_subquery3, indexName=<index_not_selected>, selectedIndexId=63115, preAgg=ON ) \| +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ ```	2023-03-29 16:12:42 +08:00
Pxl	503c6bf38e	[Chore](materialized-view) forbiden create mv with some constant expr and curdate() (#18145 ) forbiden create mv with some constant expr and curdate()	2023-03-29 16:08:48 +08:00
Lei Zhang	f7f7958d35	[fix](bdbje) fix handle bdb RollbackException incorrectly (#17483 )	2023-03-29 16:02:55 +08:00
morrySnow	545160a343	[refactor](planner) Separate the planning process for the legacy planner and Nereids (#17991 ) 1. separate the planning process for legacy planner and Nereids in StmtExecutor 2. add forward to master logic to Nereids 3. refactor Command process for Nereids, add run interface to Command 4. internal query could run on Nereids as normal query 5. fix CreatePolicyCommand syntax, let it exactlly same with legacy planner 6. let Nereids session variables forward to master	2023-03-29 11:36:38 +08:00
starocean999	db25165498	[fix](nereids)move AdjustAggregateNullableForEmptySet before NormalizeAggregate (#18147 )	2023-03-29 11:34:01 +08:00
Pxl	0c01df6bb2	[Bug](view) fix AES_ENCRYPT have wrong result on view (#18034 )	2023-03-29 10:49:39 +08:00
Pxl	fd18e34c0c	[Chore](planner) add error information for OnClause contain ExistsPredicates (#18090 )	2023-03-29 10:47:41 +08:00
zhangdong	7e9e02a173	[Enhancement](auth)Desc table check col auth (#18114 ) 1.Change permission exception format 2.when desc table ,we show different cols by auth 3.delete unused code	2023-03-29 10:42:18 +08:00
Mingyu Chen	05db6e9b55	[refactor](file-system)(step-2) remove env, file_utils and filesystem_utils (#18009 ) Follow #17586. This PR mainly changes: Remove env/ Remove FileUtils/FilesystemUtils Some methods are moved to LocalFileSystem Remove olap/file_cache Add s3 client cache for s3 file system In my test, the time of open s3 file can be reduced significantly Fix cold/hot separation bug for s3 fs. This is the last PR of #17764. After this, all IO operation should be in io/fs. Except for tests in #17586, I also tested some case related to fs io: clone concurrency query on local/s3/hdfs load error log create and clean disk metrics	2023-03-29 09:00:52 +08:00
WenYao	c3fe113894	rename PaloFe to DorisFE (#18167 )	2023-03-29 00:30:16 +08:00
奕冷	5d218388f3	[enhancement](stmt-forward) make fe follower err msg shown to client be consistent with master (#18180 ) Found that RPC timeout is too short that RPC client will close before execute result is return. Therefore, use a coefficient to prolong the RPC client timeout, so that it can wait for the real cause to be recieved.	2023-03-28 21:27:45 +08:00
Liqf	012f7bd031	[feature](function)Add ST_Area function (#18138 )	2023-03-28 19:36:09 +08:00
jakevin	cff6a7195b	[feature](Nereids): add bushy tree rule; (#18130 )	2023-03-28 19:32:53 +08:00
GoGoWen	b9161295b7	[Fix](plan) fix bug that the case sensibility of column name may impact join method (#17904 ) Issue Number: close #17876	2023-03-28 15:18:30 +08:00

1 2 3 4 5 ...

3078 Commits