Update the broadcast join cost estimate to match the BE implementation.
There is an enhancement on the BE side: in a broadcast join, the BE now builds only one hash table, not instanceNum hash tables.
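A rough sketch of the corrected build-side estimate (illustrative names, not the planner's actual cost variables):

    old: buildCost ≈ buildRows × instanceNum   (assumed one hash table per instance)
    new: buildCost ≈ buildRows                 (one shared hash table per BE)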
1. Support more expression types
2. Support deriving with histograms
3. Use StatisticRange to abstract the logic
4. Use Statistics rather than StatsDeriveResult
After adding a unique ID, unRankTest fails because each plan has a different ID in its string representation.
To avoid the effect of the unique ID, compare the plan with the expected output rather than comparing the strings.
* [Feature](vectorized)(quantile_state): support vectorized quantile state functions
1. Currently the quantile column only supports non-nullable values (see the usage sketch below)
2. Add some regression test cases
3. Set enable_quantile_state_type = true by default
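A minimal usage sketch, assuming the documented quantile state functions TO_QUANTILE_STATE, QUANTILE_UNION, and QUANTILE_PERCENT (table name and values are illustrative):

    CREATE TABLE quantile_demo (
        dt DATE,
        q QUANTILE_STATE QUANTILE_UNION NOT NULL  -- quantile column must be non-nullable
    ) AGGREGATE KEY(dt)
    DISTRIBUTED BY HASH(dt) BUCKETS 1
    PROPERTIES ("replication_num" = "1");

    -- 2048 is the compression parameter of the underlying TDigest
    INSERT INTO quantile_demo VALUES ('2023-01-01', TO_QUANTILE_STATE(10, 2048));
    SELECT QUANTILE_PERCENT(QUANTILE_UNION(q), 0.5) FROM quantile_demo;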
---------
Co-authored-by: spaces-x <weixiang06@meituan.com>
1. introduce a new type `VARIANT` to encapsulate dynamically generated columns, hiding the details of the types and names of newly generated columns (see the sketch below)
2. introduce a new expression `SchemaChangeExpr` for doing schema changes, for extensibility
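A minimal sketch of what this enables, assuming the `VARIANT` type is exposed in DDL (table name, column names, and the JSON subcolumns are illustrative):

    CREATE TABLE var_demo (
        id BIGINT,
        v VARIANT
    ) DUPLICATE KEY(id)
    DISTRIBUTED BY HASH(id) BUCKETS 1
    PROPERTIES ("replication_num" = "1");

    -- subcolumns generated from loaded JSON stay hidden behind the single VARIANT column
    SELECT v['user']['name'] FROM var_demo;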
WITH t0 AS (
    SELECT report.date1 AS date2 FROM (
        SELECT DATE_FORMAT(date, '%Y%m%d') AS date1 FROM cir_1756_t1
    ) report GROUP BY report.date1
),
t3 AS (
    SELECT DATE_FORMAT(date, '%Y%m%d') AS date3
    FROM cir_1756_t2
)
SELECT row_number() OVER (ORDER BY date2)
FROM (
    SELECT t0.date2 FROM t0 LEFT JOIN t3 ON t0.date2 = t3.date3
) tx;
The DATE_FORMAT(date, '%Y%m%d') expression was calculated in the GROUP BY node, which is wrong; this expression should be calculated inside the subquery.
Some HDP/CDH Hive versions use gzip to compress the message body of the HMS NotificationEvent,
so com.qihoo.finance.hms.event.MetastoreEventFactory cannot handle it correctly.
1. If a BE is dead and its IP has not been changed by FQDNManager, after a while the old IP may be reused by another newly alive pod; two BEs would then share the same IP, which is unexpected.
2. When enable_fqdn is false, users can still set a hostname for a BE when adding a backend.
Co-authored-by: caiconghui1 <caiconghui1@jd.com>
This PR introduces a Splitter interface for external tables.
The Splitter interface contains one method, getSplits, which is used by QueryScanProvider to get the external file splits.
For Hive/Iceberg/TVF, a split is a file block; for ES, it is a shard.
This PR also moves the getSplits logic in FileScanProviderIf to the new Splitter interface.
In the future, we may unify internal tables as well.
For some reason, the number of replicas that successfully publish a transaction may be less than the quorum,
so the transaction's status cannot become VISIBLE. When the last publish failed, the publish task for that
replica of that tablet on that backend needs to be retried until publish succeeds, so that the transaction
can become VISIBLE.
Signed-off-by: nextdreamblue <zxw520blue1@163.com>
If a table has no partitions, backup reports an error:
2023-03-06 17:35:32,971 ERROR (backupHandler|24) [Daemon.run():118] daemon thread got exception. name: backupHandler
java.util.NoSuchElementException: No value present
at java.util.Optional.get(Optional.java:135) ~[?:1.8.0_152]
at org.apache.doris.catalog.OlapTable.selectiveCopy(OlapTable.java:1259) ~[doris-fe.jar:1.0-SNAPSHOT]
at org.apache.doris.backup.BackupJob.prepareBackupMeta(BackupJob.java:505) ~[doris-fe.jar:1.0-SNAPSHOT]
at org.apache.doris.backup.BackupJob.prepareAndSendSnapshotTask(BackupJob.java:398) ~[doris-fe.jar:1.0-SNAPSHOT]
at org.apache.doris.backup.BackupJob.run(BackupJob.java:301) ~[doris-fe.jar:1.0-SNAPSHOT]
at org.apache.doris.backup.BackupHandler.runAfterCatalogReady(BackupHandler.java:188) ~[doris-fe.jar:1.0-SNAPSHOT]
at org.apache.doris.common.util.MasterDaemon.runOneCycle(MasterDaemon.java:58) ~[doris-fe.jar:1.0-SNAPSHOT]
at org.apache.doris.common.util.Daemon.run(Daemon.java:116) ~[doris-fe.jar:1.0-SNAPSHOT]
In the past, only simple predicates (slot = const), AND, LIKE, OR (only with a bitmap index) could be pushed down to the storage layer. Scan process:
1. Read part of the columns first, and calculate the row ids with the simple pushed-down predicates.
2. Use the row ids to read the remaining columns and pass them to the scanner; the scanner filters with the remaining predicates.
This PR also pushes the remaining predicates (functions, nested predicates, ...) in the scanner down to the storage layer for filtering (illustrated below). Scan process:
1. Read part of the columns first, and use the simple pushed-down predicates to calculate the row ids (same as above).
2. Use the row ids to read the columns needed by the remaining predicates, and use the pushed-down remaining predicates to reduce the number of row ids again.
3. Use the row ids to read the remaining columns and pass them to the scanner.
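As an illustration (table and predicates are hypothetical):

    SELECT * FROM example_tbl
    WHERE k1 = 10          -- simple predicate: pushed down before and after this PR
      AND abs(k2) > 100;   -- function predicate: previously filtered in the scanner, now also filtered in the storage layer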
Notable changes:
1. Support canceling queries for MySQL load (the statement form is sketched below)
2. Change the thread pool for the MySQL load manager
3. Fix the secure path check logic
4. Fix some doc errors
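For context, MySQL load is driven by the MySQL client's LOAD DATA syntax; a typical statement looks like this (database, table, and file path are illustrative):

    LOAD DATA LOCAL INFILE '/path/to/client_data.csv'
    INTO TABLE example_db.example_tbl
    COLUMNS TERMINATED BY ','
    LINES TERMINATED BY '\n';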
Before this PR, adding or dropping an inverted index did not change the table state, so multiple alter jobs could be executed at the same time, which may lead to some unexpected problems.
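For reference, the alter statements in question take this form (index, table, and column names are illustrative); after this PR they should hold the table in an altering state while the job runs, so that alter jobs do not overlap:

    CREATE INDEX idx_comment ON example_tbl (comment) USING INVERTED;
    DROP INDEX idx_comment ON example_tbl;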