Commit Graph

8289 Commits

Author SHA1 Message Date
b91a3b5a72 [fix](planner) should not bind slot on brother's tuple in subquery (#17813)
Consider a query like this:
```sql
SELECT
    k3, k4
FROM
    test
WHERE
    EXISTS( SELECT
            d.*
        FROM
            (SELECT
                k1 AS _1234, SUM(k2)
            FROM
                `test` d
            GROUP BY _1234) d
                LEFT JOIN
            (SELECT
                k1 AS _1234,
                    SUM(k2)
            FROM
                `test`
            GROUP BY _1234) temp ON d._1234 = temp._1234) 
ORDER BY k3, k4
```

When we analyze the GROUP BY exprs in the `temp` inline view, we bind `_1234` to `d._1234` by mistake.
That is because, when we analyze a **SUB-QUERY**, we resolve a SlotRef against its own tuples **AND** the parent's tuples; meanwhile, we register each child's tuple in the parent's analyzer. So, in a **SUB-QUERY**, a sibling's tuple can affect the resolution of the current inline view's slots.

This PR:

1. Add a flag to the `resolveColumnRef` overloads in `Analyzer`:
```java
private TupleDescriptor resolveColumnRef(String colName, boolean requestFromChild);
private TupleDescriptor resolveColumnRef(TableName tblName, String colName, boolean requestByChild);
``` 

2. Add a flag to record whether the tuple comes from a child:
```java
// alias name -> <from child, tupleDesc>
private final Multimap<String, Pair<Boolean, TupleDescriptor>> tupleByAlias;
```

When `requestByChild == true`, we **SKIP** tuples from other children to avoid the resolution error.
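The skip logic can be sketched with plain collections (a hypothetical simplification, not the real Doris `Analyzer`; `Entry` and the string tuple descriptors stand in for the actual `Pair<Boolean, TupleDescriptor>` values):

```java
import java.util.*;

// Minimal sketch: tuples are registered per alias together with a flag
// saying whether they were registered by a child query block.
class AnalyzerSketch {
    private static final class Entry {
        final boolean fromChild;
        final String tupleDesc;
        Entry(boolean fromChild, String tupleDesc) {
            this.fromChild = fromChild;
            this.tupleDesc = tupleDesc;
        }
    }

    // alias name -> [<from child, tupleDesc>, ...], mirroring the Multimap
    private final Map<String, List<Entry>> tupleByAlias = new HashMap<>();

    void registerTuple(String alias, boolean fromChild, String desc) {
        tupleByAlias.computeIfAbsent(alias, k -> new ArrayList<>())
                    .add(new Entry(fromChild, desc));
    }

    // When the request comes from a child, SKIP tuples registered by other
    // children, so a sibling inline view cannot capture the slot.
    String resolveColumnRef(String colName, boolean requestFromChild) {
        for (Entry e : tupleByAlias.getOrDefault(colName, Collections.emptyList())) {
            if (requestFromChild && e.fromChild) {
                continue; // sibling's tuple: invisible to this child
            }
            return e.tupleDesc;
        }
        return null; // unresolved
    }
}
```

With this guard, `temp`'s GROUP BY `_1234` no longer resolves against the sibling inline view `d`, while references to the parent's own tuples still resolve.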
2023-03-22 11:00:55 +08:00
a4b151e469 [fix](planner) should always execute projection plan (#17885)
1. Always execute the projection plan, whatever kind of statement it is.
2. Always execute the projection plan, since we only have the vectorized engine now.
2023-03-22 10:53:15 +08:00
6fa239384d [refactor](Nereids) remove tabletPruned flag in LogicalOlapScan. (#17983) 2023-03-22 10:45:14 +08:00
7ddba7bf54 [fix](multi-catalog) when checkProperties fails, there will be dirty data (#17877) 2023-03-22 09:40:07 +08:00
545d3b1c3e [Enhancement](auth)support ranger col priv (#17915)
1. When querying data, it is no longer necessary to verify permissions on the entire table; instead, the
permissions of the queried columns are verified. Currently, Ranger already supports column permissions,
and the internal catalog provides a dummy implementation of column permissions (the actually verified
permissions are still table permissions).

2. Delete roles in userIdentity.

3. Change the trigger logic of initAccessController.
2023-03-22 09:00:17 +08:00
8df4a94826 [fix](MTMV) Tasks leak when dropping job (#17984)
1. Divide MTMV regression tests into 4 suites
2. Try to remove tasks which were killed by dropping job actions in running map.
2023-03-21 23:22:17 +08:00
cb79e42e5c [refactor](file-system)(step-1) refactor file system on BE and remove storage_backend (#17586)
See #17764 for details
I have tested:
- Unit test for local/s3/hdfs/broker file system: be/test/io/fs/file_system_test.cpp
- Outfile to local/s3/hdfs/broker.
- Load from local/s3/hdfs/broker.
- Query file on local/s3/hdfs/broker file system, with table value function and catalog.
- Backup/Restore with local/s3/hdfs/broker file system

Not tested:
- cold & hot data separation case.
2023-03-21 21:08:38 +08:00
82716ec99d [fix](Nereids) type coercion for subquery (#17661)
Complete the type coercion of the subquery in the function Binder process.

Expressions generated when subqueries are nested are uniformly converted to implicit types in the analyze stage.
Method: Add a typeCoercionExpr field to the subquery expression to store the generated cast information.

Fix scenario where scalarSubQuery handles arithmetic expressions when implicitly converting types
2023-03-21 20:38:06 +08:00
4193884a32 [feature](array_zip) Support array_zip function (#17696) 2023-03-21 18:44:30 +08:00
861d9c985c [refactor](Nereids): refactor Join Reorder Rule. (#17809) 2023-03-21 16:12:07 +08:00
ed7c880e18 [fix](Nereids) should turn off parallel scan when do local finalize agg (#17961) 2023-03-21 11:55:35 +08:00
1f569b7a7d [enhancement](topn explain) display explain two phase read more precise (#17946) 2023-03-21 10:53:47 +08:00
4023670f35 [BugFix](DOE) Add http prefix when it's not set in hosts properties. (#17745)
* Add http prefix when it's not set in hosts properties
2023-03-21 10:08:20 +08:00
6c8ed9135d [fix](truncate) fix unable to truncate table due to wrong storage medium (#17917)
When the FE config default_storage_medium is set to SSD, and all BE storage paths are set as SSD,
tables will be stored with storage medium SSD.
But there is an FE config storage_cooldown_second whose default value is 30 days.
So after 30 days, the storage medium of the table will be changed to HDD, which is unexpected.

This PR removes storage_cooldown_second, and uses a max value as the cooldown time of the SSD
storage medium when default_storage_medium is SSD.
2023-03-21 10:04:47 +08:00
11a0ae9a87 [fix](ctas) fix show load throw NPE after ctas (#17937)
Missing userinfo

java.lang.NullPointerException: null
        at org.apache.doris.load.loadv2.LoadJob.getShowInfo(LoadJob.java:816) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.load.loadv2.LoadManager.getLoadJobInfosByDb(LoadManager.java:557) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.ShowExecutor.handleShowLoad(ShowExecutor.java:1094) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.ShowExecutor.execute(ShowExecutor.java:280) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.StmtExecutor.handleShow(StmtExecutor.java:1862) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.StmtExecutor.execute(StmtExecutor.java:619) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.StmtExecutor.execute(StmtExecutor.java:435) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.ConnectProcessor.handleQuery(ConnectProcessor.java:414) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.ConnectProcessor.dispatch(ConnectProcessor.java:558) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.qe.ConnectProcessor.processOnce(ConnectProcessor.java:799) ~[doris-fe.jar:1.2-SNAPSHOT]
        at org.apache.doris.mysql.ReadListener.lambda$handleEvent$0(ReadListener.java:52) ~[doris-fe.jar:1.2-SNAPSHOT]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[?:1.8.0_131]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) ~[?:1.8.0_131]
        at java.lang.Thread.run(Thread.java:748) ~[?:1.8.0_131]
2023-03-21 08:49:51 +08:00
bae9d8d7f2 [Feature-Wip](MySQL LOAD)Add trim quotes property for mysql load (#17775)
Add trim quotes property for mysql load to trim double quotes in the load files.
2023-03-21 00:32:58 +08:00
5cac64413a [Feature](ES): Support es get alias field type. (#17783)
Support es get alias field type.
2023-03-21 00:32:24 +08:00
dc284b62d9 [vectorized](function) support array_filter function (#17832) 2023-03-20 23:18:10 +08:00
8ffc85b6ff [fix](planner)project should be done inside inlineview (#17831)
* [fix](planner)project should be done inside inlineview

* add src column for slots in scan node's output tuple
2023-03-20 21:12:45 +08:00
Pxl
a92115f709 [Bug](materialized-view) fix select mv rollback fail on left join (#17850)
fix select mv rollback fail on left join
2023-03-20 19:14:17 +08:00
b4634342aa [bug](txn) fix concurrent txns's status data race when aborting txn (#17893) 2023-03-20 17:55:03 +08:00
223d7a36eb adjust distribution stats derive, fix bug in join estimation (#17916) 2023-03-20 13:04:29 +08:00
93cfd5cd2b [Enhance](ComputeNode)support k8s watch (#17442)

1. Add a watch mechanism to listen for changes in k8s statefulSets and update nodes in time.
2. For broker, there is only one name by default when using deployManager.
3. Refactor code to make it easier to understand and maintain.
4. Fix jar package conflicts between okhttp-ws and okhttp.

Previously, k8sDeployManager.getGroupHostInfos called the endpoints() interface of k8s. As a result,
if a pod restarted unexpectedly, k8sDeployManager would delete the pre-restart pod from the FE or BE
list and add the post-restart pod to that list, which obviously does not meet our expectations.
Now, after FQDN is enabled, we call the statefulSets() interface of k8s and listen to the replica count
to determine whether nodes need to go online or offline.
In addition, a watch mechanism is added to avoid the possible A-B-A problem caused by timed polling.
For the sake of stability, when the watch mechanism receives no messages for a period of time,
it degrades to polling mode.

Several environment variables have been added for the statefulset names: ENV_FE_STATEFULSET, ENV_FE_OBSERVER_STATEFULSET, ENV_BE_STATEFULSET, ENV_BROKER_STATEFULSET, and ENV_CN_STATEFULSET. They correspond one-to-one with ENV_FE_SERVICE, ENV_FE_OBSERVER_SERVICE, ENV_BE_SERVICE, ENV_BROKER_SERVICE, and ENV_CN_SERVICE. If a serviceName is configured, the corresponding statefulsetName must also be configured, otherwise the program cannot start.
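The degrade-to-polling behavior can be sketched as follows (an illustrative simplification, not the actual Doris k8sDeployManager code; class and method names are invented for the example):

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;

// Sketch: consume watch events, but if the watch stays silent past a
// timeout, run one polling round instead, then go back to watching.
class WatchWithFallback {
    private final BlockingQueue<String> watchEvents = new LinkedBlockingQueue<>();
    private int pollRounds = 0;

    void offerEvent(String e) { watchEvents.offer(e); }

    int getPollRounds() { return pollRounds; }

    // Returns the event handled, or null if we fell back to polling.
    String runOneIteration(long timeoutMs) {
        String event;
        try {
            event = watchEvents.poll(timeoutMs, TimeUnit.MILLISECONDS);
        } catch (InterruptedException ie) {
            Thread.currentThread().interrupt();
            return null;
        }
        if (event == null) {
            pollRounds++; // watch silent too long: one polling round
            return null;
        }
        return event; // apply the statefulSet change
    }
}
```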
2023-03-20 11:36:32 +08:00
5c990fb737 [fix](nereids) Analyze failed for SQL that has count distinct with same col (#17928)
This problem is caused by slots with the same hash code being collapsed in a HashSet, which resulted in the wrong rule being selected. Use a List instead of a Set as the return type of the getDistinctArguments method.
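The effect can be reproduced with plain collections (column names here are illustrative, not taken from the actual test case):

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// For count(distinct c1, c1): a Set collapses the repeated argument,
// while a List keeps both occurrences, so downstream rule matching
// sees the real argument count.
class DistinctArgs {
    static Set<String> asSet(List<String> args) {
        return new LinkedHashSet<>(args);
    }
    static List<String> asList(List<String> args) {
        return new ArrayList<>(args);
    }
}
```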
2023-03-19 21:31:47 +08:00
74dfdc00dc [nereids](statistics) remove lock in statistics cache loader #17833
Remove the redundant lock in the CacheLoader, since it uses the ForkJoinPool by default.
Add execute-time logging for collecting stats.
Avoid submitting a duplicate task when there is already a task loading the same column.
2023-03-19 20:30:21 +08:00
295b26db00 [chore](fe) update aspectj-maven-plugin to 1.14.0 version (#17890)
In #17797, we introduced AspectJ to help log exceptions easily.
However, plugin version 1.11 does not support JDK 9 and later.
To support compiling FE with JDK 11:

update aspectj-maven-plugin to 1.14.0 version
add new dependency org.aspectj.aspectjrt 1.9.7 to fe-core
according to:

aspectj java version compatibility
aspectj-maven-plugin issue
aspectj release note
intro to aspectj
2023-03-19 14:50:09 +08:00
e359e412e1 [vectorized](udaf) fix java udaf meet error of std::bad_alloc (#17848)
Previously, if the user code of a Java UDAF threw an exception, no C++ code on the aggregate
function side could handle it, so we might get a std::bad_alloc error.
2023-03-19 11:52:15 +08:00
14dcdd188e [fix](fe) fix MetricRepo.THRIFT_COUNTER_RPC_ALL NullPointException (#17552) 2023-03-19 11:39:19 +08:00
b111f9a518 [fix](insert) Session variables don't work for transaction insert (#17551) 2023-03-19 10:43:02 +08:00
c5c89f3016 [Improve](hana catalog) Currently logged-in users should only see the schemas they can access (#17918)
In the case of the Hana catalog, I think the currently logged-in user should only see the schemas they can access.
2023-03-18 22:21:01 +08:00
c95eb8a67f [enhancement] Function(create/drop) support the global operation (#16973) (#17608)
Support create/drop global function.
When you create a custom function, it can only be used within one database; it cannot be used in another database/catalog. When there are many databases/catalogs, the function has to be created one by one.

## Problem summary

1. When a function is created or dropped, the GLOBAL keyword can be added:

CREATE [GLOBAL] [AGGREGATE] [ALIAS] FUNCTION function_name (arg_type [, ...]) [RETURNS ret_type] [INTERMEDIATE inter_type] [WITH PARAMETER(param [,...]) AS origin_function] [PROPERTIES ("key" = "value" [, ...]) ]

DROP [GLOBAL] FUNCTION function_name (arg_type [, ...])

2. A fully global function is introduced, and the global function metadata is stored in the image. The function lookup strategy is to search the database first and, if not found, fall back to the global functions.
Co-authored-by: lexluo <lexluo@tencent.com>
2023-03-18 22:06:48 +08:00
88713037bf [Bug][Fix] pipeline exec engine get wrong result when run regression test (#17896)
Fix regression p1:regression-test/suites/datev2/tpcds_sf1_p1/sql/pipeline case
2023-03-18 20:41:10 +08:00
3593b82498 [fix](schema change) Fix fe restart failed because of replay schema change alter job failed (#17825) 2023-03-17 20:54:50 +08:00
46d88ede02 [Refactor](Metadata tvf) Reconstruct Metadata table-value function into a more general framework. (#17590) 2023-03-17 19:54:50 +08:00
8debc96d74 [enhancement](nereids) update FilterEstimation and Agg in stats derive (#17790)
1. update ndv in Stats
2. skip __DORIS_DELETE_SIGN__ = 0 in stats derive
3. equalTo in stats derive
4. update agg stats derive; support the case where all column stats are unknown

* computeSize

* fix ut
2023-03-17 18:01:50 +08:00
5bd5402378 [bug](udf) add synchronized to test resolve error of zip file closed (#17812) 2023-03-17 14:35:26 +08:00
1080a413a2 [fix](metric) Fix bug for that register txn replica failed (#17855) 2023-03-17 11:42:40 +08:00
5d3de05976 [feature](map) basic functions for map datatype (#16916)
basic functions for map datatype:
- MAP<K, V> map(K k1, V v1, ...)
- BIGINT map_size(MAP<K, V> m)
- BOOL map_contains_key(MAP<K, V> m, K k1)
- BOOL map_contains_value(MAP<K, V> m, V v1)
- ARRAY< K> map_keys(MAP<K, V> m)
- ARRAY< V> map_values(MAP<K, V> m)
2023-03-17 10:28:17 +08:00
0ec10d4836 [Enhancement](fe exception) write a java annotation to catch throwable from a method and print log (#17797)
How does it work?
AspectJ implements the aspect behavior of the annotations. During compilation, the aspectj-maven-plugin automatically weaves the code carrying aspect annotations into the generated class files.
When to use it?
When a method wants to add a try-catch to save exception information, the LogException annotation can be used. When a method must not fail, the NoException annotation can be used.
What is the result of adding these annotations?
Use the LogException annotation to automatically capture exceptions into the log file with more concise code. Use the NoException annotation to automatically capture the exception into the log file and exit the program when an exception occurs.
2023-03-17 08:52:27 +08:00
b17c421f52 [fix](datetime) will get String index out of range exception (#17735)
A "String index out of range" exception was thrown when using erroneous datetime values like '2020-02-01'.
before:
MySQL [test]> select test121.k1 from test121 where k1 != ('9102-12-');
ERROR 1105 (HY000): errCode = 2, detailMessage = Unexpected exception: String index out of range: 8

after:
MySQL [test]> select test121.k1 from test121 where k1 = '9102-12-';
ERROR 1105 (HY000): errCode = 2, detailMessage = Incorrect datetime value: '9102-12-' in expression: k1 = '9102-12-'
Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-03-16 16:13:47 +08:00
5dde910931 [feature](profile) add clean all profile sql (#17751)
Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-03-16 16:12:21 +08:00
cbfbe67508 [enhancement](fe query schedule) use try write lock to avoid too much wait time for planner (#17822)
* [enhancement](fe query schedule) use try write lock to avoid too much wait time for the planner; print ASCII id instead of big int
---------

Co-authored-by: yiguolei <yiguolei@gmail.com>
2023-03-16 16:01:33 +08:00
c29582bd57 [pipeline](split by segment)support segment split by scanner (#17738)
* support segment split by scanner

* change code by cr
2023-03-16 15:25:52 +08:00
b3d8be7cac [fix](cooldown)add push conf for alter storage policy (#17818)
* add push conf for alter storage policy
2023-03-16 14:27:27 +08:00
ee7226348d [FIX](Map) fix map compaction error (#17795)
During compaction, in-memory map offsets arrive at the same OLAP convertor ranging from 0 to 0+size,
but they should be continuous across different pages within one segment writer.
E.g.:
last block with map offsets: [3, 6, 8, ..., 100]
this block with map offsets: [5, 10, 15, ..., 100]
The same convertor should record the last offset so that later incoming offsets follow it.
So after conversion, the current offsets should be [105, 110, 115, ..., 200]; then the column writer just
calls append_data() to append the pages with correct offset data.
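The arithmetic of the fix is just a shift by the convertor's last written offset; a simplified Java sketch of the idea (the actual convertor is C++ in BE, and this class name is invented for illustration):

```java
// Sketch of offset continuation: each newly arriving block's offsets are
// shifted by the last offset already written in this segment, so offsets
// stay monotonically increasing across pages.
class OffsetContinuation {
    private long lastOffset = 0;

    long[] convert(long[] blockOffsets) {
        long[] adjusted = new long[blockOffsets.length];
        for (int i = 0; i < blockOffsets.length; i++) {
            adjusted[i] = blockOffsets[i] + lastOffset;
        }
        if (adjusted.length > 0) {
            lastOffset = adjusted[adjusted.length - 1]; // remember for next block
        }
        return adjusted;
    }
}
```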
2023-03-16 13:54:01 +08:00
0086fdbbdb [enhancement](planner) support delete from using syntax (#17787)
Support the DELETE ... USING syntax; this syntax only supports the UNIQUE KEY model.

Use the result of `t2` join `t3` to remove rows from `t1`:

```sql
-- create t1, t2, t3 tables
CREATE TABLE t1
  (id INT, c1 BIGINT, c2 STRING, c3 DOUBLE, c4 DATE)
UNIQUE KEY (id)
DISTRIBUTED BY HASH (id)
PROPERTIES('replication_num'='1', "function_column.sequence_col" = "c4");

CREATE TABLE t2
  (id INT, c1 BIGINT, c2 STRING, c3 DOUBLE, c4 DATE)
DISTRIBUTED BY HASH (id)
PROPERTIES('replication_num'='1');

CREATE TABLE t3
  (id INT)
DISTRIBUTED BY HASH (id)
PROPERTIES('replication_num'='1');

-- insert data
INSERT INTO t1 VALUES
  (1, 1, '1', 1.0, '2000-01-01'),
  (2, 2, '2', 2.0, '2000-01-02'),
  (3, 3, '3', 3.0, '2000-01-03');

INSERT INTO t2 VALUES
  (1, 10, '10', 10.0, '2000-01-10'),
  (2, 20, '20', 20.0, '2000-01-20'),
  (3, 30, '30', 30.0, '2000-01-30'),
  (4, 4, '4', 4.0, '2000-01-04'),
  (5, 5, '5', 5.0, '2000-01-05');

INSERT INTO t3 VALUES
  (1),
  (4),
  (5);

-- remove rows from t1
DELETE FROM t1
  USING t2 INNER JOIN t3 ON t2.id = t3.id
  WHERE t1.id = t2.id;
```

The expected result is that only the row where id = 1 is removed from table t1:

```
+----+----+----+--------+------------+
| id | c1 | c2 | c3     | c4         |
+----+----+----+--------+------------+
| 2  | 2  | 2  |    2.0 | 2000-01-02 |
| 3  | 3  | 3  |    3.0 | 2000-01-03 |
+----+----+----+--------+------------+
```
2023-03-16 13:12:00 +08:00
bece027135 [enhancement](profile) Add HTTP interface for q-error (#17786)
1. Add an HTTP interface for query q-error.
2. Fix the selectivity calculation of inner join; previously it was always 0 when there was only one join condition.
2023-03-16 12:19:23 +08:00
c2edca7bda [fix](Nereids) construct project with all slots in semi-semi-transpose-project rule (#17811)
error msg in tpch 20
```
SlotRef have invalid slot id: , desc: 22, slot_desc: tuple_desc_map: [Tuple(id=10 slots=[Slot(id=51 type=DECIMALV2(27, 9) col=-1, colname= null=(offset=0 mask=80)), Slot(id=52 type=INT col=-1, colname= null=(offset=0 mask=0)), Slot(id=53 type=INT col=-1, colname= null=(offset=0 mask=0)), Slot(id=54 type=INT col=-1, colname= null=(offset=0 mask=0)), Slot(id=55 type=INT col=-1, colname= null=(offset=0 mask=0))] has_varlen_slots=0)] tuple_id_map: [-1, -1, -1, -1, -1, -1, -1, -1, -1, -1, 0] tuple_is_nullable: [0] , desc_tbl: Slot(id=22 type=INT col=-1, colname= null=(offset=0 mask=0))
```

Before, we only used slots in `hashJoin` conditions to construct projects, which may lose some slots in `project`, such as
```
LOGICAL_SEMI_JOIN_LOGICAL_JOIN_TRANSPOSE_PROJECT

LogicalJoin[1135] ( type=LEFT_SEMI_JOIN, markJoinSlotReference=Optional.empty, hashJoinConjuncts=[(PS_PARTKEY#0 = P_PARTKEY#6)], otherJoinConjuncts=[] )
|--LogicalProject[1128] ( distinct=false, projects=[PS_PARTKEY#0, PS_SUPPKEY#1], excepts=[], canEliminate=true )
|  +--LogicalJoin[1120] ( type=LEFT_SEMI_JOIN, markJoinSlotReference=Optional.empty, hashJoinConjuncts=[(L_PARTKEY#17 = PS_PARTKEY#0), (L_SUPPKEY#18 = PS_SUPPKEY#1)], otherJoinConjuncts=[(cast(PS_AVAILQTY#2 as DECIMAL(27, 9)) > (0.5 * sum(L_QUANTITY))#33)] )
|     |--GroupPlan( GroupId#2 )
|     +--GroupPlan( GroupId#7 )
+--GroupPlan( GroupId#12 )
----------------------after----------------------
LogicalJoin[1141] ( type=LEFT_SEMI_JOIN, markJoinSlotReference=Optional.empty, hashJoinConjuncts=[(L_PARTKEY#17 = PS_PARTKEY#0), (L_SUPPKEY#18 = PS_SUPPKEY#1)], otherJoinConjuncts=[(cast(PS_AVAILQTY#2 as DECIMAL(27, 9)) > (0.5 * sum(L_QUANTITY))#33)] )
|--LogicalProject[1140] ( distinct=false, projects=[PS_PARTKEY#0, PS_SUPPKEY#1], excepts=[], canEliminate=true )
|  +--LogicalJoin[1139] ( type=LEFT_SEMI_JOIN, markJoinSlotReference=Optional.empty, hashJoinConjuncts=[(PS_PARTKEY#0 = P_PARTKEY#6)], otherJoinConjuncts=[] )
|     |--GroupPlan( GroupId#2 )
|     +--GroupPlan( GroupId#12 )
+--GroupPlan( GroupId#7 )
```
`PS_AVAILQTY#2` was lost in the project.

Now we use all slots to construct projects.
2023-03-16 11:53:32 +08:00
ebe651dae9 [Fix](Planner)Add call once logic to analyze of function aes_decrypt #17829
The problem is an exception when doing analyze:
java.lang.IllegalStateException: exceptions :
errCode = 2, detailMessage = select list expression not produced by aggregation output (missing from GROUP BY clause?): xxx

The scenario is:
select aes_decrypt(xxx,xxx) as c0 from table group by c0;

Analysis of the problem:
The direct problem is a mismatched SlotRef, caused by a mismatched parameter count of the aes_decrypt function. When debugging, we can see the SlotRef of the group column is added to the ExprSubstitutionMap but cannot match the select result columns. This is because substituting an expr analyzes it again, so the implicit parameter is added twice. The resulting function mismatch means the expr is not substituted as a SlotRef, and the exception is thrown.

Fix:
Add call-once logic when adding the third parameter of aes_decrypt-type functions: compare the child to be added with the function's last child; if they are the same, do not add it.
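A guard of this shape makes the addition idempotent (an illustrative sketch, not the actual planner code; the children are modeled as strings and the mode value is hypothetical):

```java
import java.util.List;

// Sketch of the call-once guard: append the implicit third argument only
// if it is not already the last child, so re-analyzing the expression
// during substitution does not add it a second time.
class CallOnceGuard {
    static void addModeArgOnce(List<String> children, String modeArg) {
        if (children.isEmpty()
                || !children.get(children.size() - 1).equals(modeArg)) {
            children.add(modeArg);
        }
    }
}
```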
2023-03-16 11:04:21 +08:00
1da3e7596e [fix](point query) Fix NegativeArraySizeException when prepared statement contains a long string (#17651) 2023-03-16 10:24:33 +08:00