doris

Author	SHA1	Message	Date
xy720	7948c8ede2	[bug](function) Fix bug in the process of generating template functions #17676 If users compile fe with python version <= 2.6 They will meet a compile error in gen_builtins_functions.py with following msg:	2023-03-13 07:29:32 +08:00
HappenLee	6386458498	[Refactor](exec) remove useless attr of slot ref (#17688 ) Remove useless attr of slot ref	2023-03-12 23:45:32 +08:00
Mingyu Chen	b0d1166989	[fix](meta) fix concurrent modification exception and potential NPE (#17602 )	2023-03-12 22:12:07 +08:00
Mingyu Chen	46dcf69644	[fix](jdbc-catalog) avoid calculate driver's md5 when replaying edit log (#17693 )	2023-03-12 22:11:45 +08:00
AKIRA	54e5c71e52	[fix](planner) Fix NPE when update stats by profile	2023-03-12 21:40:47 +08:00
caiconghui	a651926ba9	[fix](fqdn) Add UnknownHostException handle logic in FQDNManager to avoid that active ip could be incorrectly assigned to dead be or dead fe (#17689 ) 1. If a BE is dead and its IP is not changed by FQDNManager, the old IP may after a while be used by another new alive pod, so two BEs may end up sharing the same IP, which is unexpected. 2. When enable_fqdn is false, the user can still set a hostname on the BE when adding a backend. Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2023-03-12 21:12:33 +08:00
Tiewei Fang	13e05c4a5d	[Enhencement](stream load) add some regression test for json format streamload (#17520 )	2023-03-12 20:13:07 +08:00
Jibing-Li	0d05e4cce0	[Improvement](multi-catalog) The interface of external Splitter. WIP (#17390 ) This PR introduces a Splitter interface for external tables. The Splitter interface contains one method, getSplits, which is used by QueryScanProvider to get the external file splits. For Hive/Iceberg/TVF, a split is a file block. For ES, it is a shard. This PR also moves the getSplits logic in FileScanProviderIf to the new Splitter interface. In the future, we may unify internal tables as well.	2023-03-12 20:11:08 +08:00
slothever	455c800405	[feature](parquet-reader) add rle bool and delta decoder to read AWS Glue (#17112 ) Support delta encoding and RLE (bool) to read Glue data: add a delta bit pack decoder, a delta length byte array decoder, a delta byte array decoder, and an RLE bool decoder. We found that some data types are read with delta encoding on AWS Glue, so it should be supported. For the definition of delta encoding, refer to the delta encoding in Parquet.	2023-03-12 20:09:58 +08:00
zhangdong	a452db35da	[improvement](filecache)Change the hash field of the backend (#17499 ) The IP of a backend may change, so use the id as the hash field.	2023-03-12 20:04:25 +08:00
jakevin	b93e553958	[enhance](Nereids): allow empty hash condition (#17699 )	2023-03-12 18:51:22 +08:00
Weijie Guo	11fbe07221	[refactor](Nereids) Refactor all rewrite logical unit tests by match-pattern (#17691 )	2023-03-12 18:49:12 +08:00
abmdocrt	9b687026bd	[Doc](TLS) add doc for TLS connection (#17683 )	2023-03-12 10:01:07 +08:00
jakevin	d774162a53	[minor](Nereids): rename rule (#17509 )	2023-03-12 00:17:07 +08:00
maheshnikam	4566de1cec	Enhancement in some Hyperlinks (#17654 )	2023-03-11 22:56:43 +08:00
Yulei-Yang	9745ee60a7	[fix](priv) fix bug of grant priv on ctl.db.* not work (#17612 ) Currently, granting xxx_priv on ctl.db.* to user_a does not work: when user_a switches to ctl, they cannot see or use any database.	2023-03-11 22:27:26 +08:00
caiconghui	692d510edb	[fix](schema_hash) remove useless schema_hash param in tablet and replica url (#17489 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2023-03-11 21:34:47 +08:00
minghong	d7cb5cf3db	[feature](nereids) add session var: dump_nereids_memo (#17666 ) * dump_nereids_memo * print groupexpr id	2023-03-11 13:40:15 +08:00
minghong	3231fab8c2	[feature](nereids) add unique id for groupExpression and plan node (#17628 ) * add unique id for groupExpression and plan node * fix ut	2023-03-11 13:23:41 +08:00
jakevin	db9692a114	[feature](Nereids): convert CrossJoin to InnerJoin. (#17681 )	2023-03-11 13:23:28 +08:00
zhangdong	9c7854f1ff	[Enhancement](k8s) Add k8s yaml demo (#17281 )	2023-03-11 10:56:57 +08:00
Pxl	8328ab69ad	[Chore](Materialized-View) add some mv regression test case (#17345 ) 1. add some mv regression test case 2. rename materialized_view_p0 to mv_p0 (avoid create database failed because long db name)	2023-03-11 10:55:11 +08:00
huanghaibin	697cba9a85	[fix](broker-load) fix broker's Dockerfile (#17657 ) There are some spelling mistakes in the broker's Dockerfile that need to be fixed.	2023-03-11 10:43:09 +08:00
Bowen Liang	a74ef2377f	typo fix in kyuubi doc (#17672 )	2023-03-11 09:11:10 +08:00
superche	48a2fe68ad	[typo](docs) Fix some display errors (#17663 ) * [fix](docs) fix some errors in docs	2023-03-11 09:10:48 +08:00
谢健	3745e6c18a	[fix](Nereids): order of project's logical properties is different with that of project expression (#17648 )	2023-03-11 00:26:54 +08:00
jakevin	051ab7a9c6	[refactor](Nereids): refactor Join-Dependent Predicate Duplication. (#17653 )	2023-03-10 22:19:45 +08:00
Weijie Guo	566d133610	[enhancement](Nereids) Refactor EliminateLimitTest and EliminateFilterTest by match-pattern (#17631 )	2023-03-10 21:24:36 +08:00
camby	6dcd791b74	[feature](struct-type) support CAST AS Struct type (#17553 ) 1. add support `CAST AS Struct` from Struct type; 2. fix crash while `CAST('{}' AS Struct)`; 3. `CAST('' AS complext_type)` should return NULL instead of empty object; --------- Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2023-03-10 21:21:16 +08:00
zhengyu	2739a44eaf	[fix](segcompaction) heap overflow when doing segcompaction for cancelling load(#17529 ) Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>	2023-03-10 20:52:05 +08:00
zhangguoqiang	47e9217c1e	[improvement](pipeline)Avoid duplicate trigger teamcity build (#17647 ) * add clickbench and arm pipeline trigger; test merge check * set compile required; add clickbench and arm pipeline trigger to buildall; * avoid duplicate builds * simplify auto trigger code and avoid repeated triggers 0310	2023-03-10 19:56:14 +08:00
WuWQ98	948654ad38	[fix](ui)format the text file of profile #17645	2023-03-10 19:54:28 +08:00
yongjinhou	9cfa61b402	[Enhancement](HttpServer) Provide authentication interface for BE (#17073 ) Add an authentication interface in FE for BE	2023-03-10 16:34:47 +08:00
minghong	9ae5ec4dc5	[fix](nereids) PushdownExpressionsInHashCondition contains duplicate column and WindowExpression miss column stats (#17624 ) tpcds: q47 and q57 1. PushdownExpressionsInHashCondition: project contains duplicate columns 2. WindowExpression stats calculation: missing column stats	2023-03-10 16:08:43 +08:00
morrySnow	365c8eed7e	[fix](function) width_bucket should get min and max from each tuple (#17466 )	2023-03-10 13:14:12 +08:00
xueweizhang	739e043c8d	[fix](publish) add retry publish when succeed replica num less than quorum and transaction not VISIBLE (#17453 ) For some reason, the number of replicas on which a transaction's publish succeeded can be less than the quorum, so the transaction's status cannot become VISIBLE. When the last publish failed, the publish task for that replica of the tablet on that backend needs to be retried until it succeeds, so that the transaction can become VISIBLE. Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-03-10 12:02:15 +08:00
lihangyu	a79b8ede88	[Bug](ColumnArray) Fix array column replicate `replicate_offsets` not matched (#17616 ) The input replicate_offsets should be the same size as ColumnArray's offset. ``` IColumn::Offsets replicate_offsets(get_offsets().size(), 0); // |---------------------|-------------------------|-------------------------| // [0, begin) [begin, begin + count_sz) [begin + count_sz, size()) // do not need to copy copy counts[n] times do not need to copy ``` we should	2023-03-10 11:52:22 +08:00
Pxl	1a549edac2	[Chore](third-party) upgrade thrift from 0.13 to 0.16 (#17202 ) upgrade thrift from 0.13 to 0.16 There is thrift's release notes https://github.com/apache/thrift/blob/master/CHANGES.md	2023-03-10 11:33:16 +08:00
lihangyu	fcd25b53bf	[Optimize](Random distribution) Improve the performance of tablet sin… (#17389 ) The current distribution model for Doris is as follows: OlapTableSink separates the original Block into several subblocks, one per node (BE), according to the tablet distribution, and distributes the subblocks to the storage engines of the backends. The storage engine then separates each subblock across multiple tablet channels, and each delta writer handles part of the block. This model causes blocks to be split according to tablets, and the splitting process can be a relatively heavy operation. After splitting, the blocks are distributed to different DeltaWriters (Memtables) through RPCs to TabletChannels. The distribution operation on TabletChannels is also a relatively heavy operation. If the distribution property of the table is RANDOM distribution, then we have the opportunity to distribute whole blocks without splitting them. The advantage of doing so is to reduce memory copying and improve write locality, similar to appending the entire block to the memtable. This optimization could save 10% ~ 20% of the CPU cost of loading a RANDOM distribution table when load_to_single_tablet is enabled	2023-03-10 10:52:40 +08:00
Yulei-Yang	f84b8b7c8b	[fix](priv) fix extract real user name when do privilege check (#17488 ) fix extract real user name of root/admin	2023-03-10 10:22:13 +08:00
Mingyu Chen	fe6361f4b5	[regression-test](p0) fix some unstable p0 cases (#17518 ) drop database before create remove some large, unused debug log	2023-03-10 10:21:39 +08:00
bobhan1	e1bf9411de	[feature](array function) add support for array_enumerate_uniq (#17541 ) add support for array_enumerate_uniq()	2023-03-10 10:20:49 +08:00
Mingyu Chen	c7aa3f9717	[fix](backup) backup throw NPE when no partition in table (#17546 ) If a table has no partition, backup will report an error: 2023-03-06 17:35:32,971 ERROR (backupHandler|24) [Daemon.run():118] daemon thread got exception. name: backupHandler java.util.NoSuchElementException: No value present at java.util.Optional.get(Optional.java:135) ~[?:1.8.0_152] at org.apache.doris.catalog.OlapTable.selectiveCopy(OlapTable.java:1259) ~[doris-fe.jar:1.0-SNAPSHOT] at org.apache.doris.backup.BackupJob.prepareBackupMeta(BackupJob.java:505) ~[doris-fe.jar:1.0-SNAPSHOT] at org.apache.doris.backup.BackupJob.prepareAndSendSnapshotTask(BackupJob.java:398) ~[doris-fe.jar:1.0-SNAPSHOT] at org.apache.doris.backup.BackupJob.run(BackupJob.java:301) ~[doris-fe.jar:1.0-SNAPSHOT] at org.apache.doris.backup.BackupHandler.runAfterCatalogReady(BackupHandler.java:188) ~[doris-fe.jar:1.0-SNAPSHOT] at org.apache.doris.common.util.MasterDaemon.runOneCycle(MasterDaemon.java:58) ~[doris-fe.jar:1.0-SNAPSHOT] at org.apache.doris.common.util.Daemon.run(Daemon.java:116) ~[doris-fe.jar:1.0-SNAPSHOT]	2023-03-10 10:19:37 +08:00
huangzhaowei	4ba93efc98	[Enhance](DOE)Support parse default es iso datetime string (#17412 ) * support parse default es iso datetime string	2023-03-10 09:59:20 +08:00
morrySnow	006f7a91ac	[fix](planner) should not turn on push agg op when olapscan has conjuncts on it (#17598 ) we should not set PushAggOp to any type, if olap scan already has conjunct on it.	2023-03-10 09:33:08 +08:00
WenYao	a745ab1703	[fix](schema scanner) fix query some schema table report invalid parameter (#17626 ) Example: SELECT ROUTINE_SCHEMA AS PROCEDURE_CAT, NULL AS PROCEDURE_SCHEM,ROUTINE_NAME AS PROCEDURE_NAME,NULL AS NUM_INPUT_PARAMS,NULL AS NUM_OUTPUT_PARAMS,NULL AS NUM_RESULT_SETS,ROUTINE_COMMENT AS REMARKS,IF(ROUTINE_TYPE = 'FUNCTION', 2,IF(ROUTINE_TYPE= 'PROCEDURE', 1, 0)) AS PROCEDURE_TYPE FROM INFORMATION_SCHEMA.ROUTINES WHERE ROUTINE_SCHEMA = DATABASE(); ERROR 1105 (HY000): errCode = 2, detailMessage = invalid parameter This is wrong, and some BI tools could not work correctly.	2023-03-10 08:52:09 +08:00
Jerry Hu	08f0170895	[fix](olap) The 'scan key' generated by the 'is null' expression causes incorrect query results (#17569 )	2023-03-10 08:51:06 +08:00
luozenglin	c3c7bc4340	[fix](profile) fix profile sort child list exception (#17613 )	2023-03-10 08:44:32 +08:00
Xinyi Zou	f9baf9c556	[improvement](scan) Support pushdown execute expr ctx (#15917 ) In the past, only simple predicates (slot=const), and, like, or (only bitmap index) could be pushed down to the storage layer. Scan process: 1. Read part of the columns first, and calculate the row ids with the simple pushed-down predicates. 2. Use the row ids to read the remaining columns and pass them to the scanner, and the scanner filters by the remaining predicates. This PR also pushes down the remaining predicates (functions, nested predicates...) in the scanner to the storage layer for filtering. Scan process: 1. Read part of the columns first, and use the simple pushed-down predicates to calculate the row ids (same as above). 2. Use the row ids to read the columns needed for the remaining predicates, and use the pushed-down remaining predicates to reduce the number of row ids again. 3. Use the row ids to read the remaining columns and pass them to the scanner.	2023-03-10 08:35:32 +08:00
xueweizhang	0334cde2b1	[fix](merge-on-write) when if publish and be down, need recalc delete bitmap for MoW (#17617 ) * (merge-on-write) when if publish and be down, need recalc delete bitmap for MoW Signed-off-by: nextdreamblue <zxw520blue1@163.com> * fix code Signed-off-by: nextdreamblue <zxw520blue1@163.com> --------- Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-03-10 07:55:00 +08:00

9246 Commits