doris

Author	SHA1	Message	Date
Pxl	87e64115ae	[Chore](materialized-view) add case about insert data imidiately after create mv(#21281 ) add case about insert data imidiately after create mv	2023-06-29 11:17:38 +08:00
Jibing-Li	3a12b67517	[Improvement](statistics, multi catalog)Implement hive table statistic connector (#21053 ) This pr is to add the collecting hive statistic function. While the CBO fetching hive table statistics, statistic cache will first load from internal stats olap table. If not found, then using this pr's function to fetch from remote Hive metastore.	2023-06-29 10:50:54 +08:00
Pxl	45f1909bc3	[Bug](lateral-view) make lateral view function's nullable mode work (#21242 ) make lateral view function's nullable mode work	2023-06-29 10:50:07 +08:00
zy-kkk	449c8d4568	[fix](jdbc) Handling Zero DateTime Values in Non-nullable Columns for JDBC Catalog Reading MySQL (#21296 )	2023-06-28 22:51:17 +08:00
starocean999	016870b673	[opt](nereids) use Expression's isConstant to check whether could be remove from group by key (#21195 )	2023-06-28 19:12:36 +08:00
shuke	f77c69ab95	[fix](test) case bug, streamload without sync. (#21161 )	2023-06-28 18:22:19 +08:00
morrySnow	7588abe76b	[refactor](Nereids) refactor physical properties and plan translator (#21168 ) this PR 1. refactor physical properties, property deriver and property regular to ensure Nereids could generate plan with sufficent PhysicalDistribute. 2. refactor PhyscialPlanTranslator to ensure all ExchangeNode generated by PhysicalDistribute, except CTEConsumer. We will refactor all cte related node later. the detail changes of this PR: 1. update DistributionSpec of physical properties: - Any: random distribution, used in output and require - StorageAny: random distribution but constrained by where the data is stored, used in output - ExecutionAny: random distribution to present random shuffle, used in output - Gather: gather distribution, used in output and require - StorageGather: gather distribution but constrained by where the data is stored, used in output - Replicated: broadcast distribution - Hash: bucket distribution 2. update shuffle type of DistributionSpecHash - REQUIRE: used in require - NATURAL: distribution as storage engine hash algorithm, constrained by where the data is stored - STORAGE_BUCKETED: distribution as storage engine hash algorithm - EXECUTION_BUCKETED: distribution as execution engine hash algorithm 3. update HideOneRowRelationUnderSetOperation to MergeOneRowRelationIntoSetOperation 4. update property deriver of SetOperation to ensure suitable PhysicalDistribute be added at top and below of SetOperation 5. refactor PhysicalPlanTranslator to ensure no unplanned exchange node will be added	2023-06-28 15:15:11 +08:00
Pxl	1fc1e76fc7	[Bug](alter table) return error status to avoid core dump on schema change meet invalid input (#21273 ) return error status to avoid core dump on schema change meet invalid input	2023-06-28 14:20:16 +08:00
amory	b1e973b721	[Improve](func)support array to window-func first-last-value arg type (#21201 ) * support array to windown-func first-last-value arg type * add regress test for first-last-value of array type * update * format be:	2023-06-28 10:02:00 +08:00
zy-kkk	d871df64ca	[improvement](oracle jdbc)Support for automatically obtaining the precision of the oracle timestamp type (#21252 )	2023-06-28 00:19:01 +08:00
Gabriel	5506faa7b4	[datetimev2](minor) Add scale parameter for datetimev2 (#21176 )	2023-06-27 19:55:35 +08:00
AKIRA	acba8648a5	[enhancement](nereids) Add log for stats (#21164 ) 1. LOG sql when analyze failed 2. Return directly for analyze_test suite when there is more than one frontend 3. Set query_timeout for tpcds suites to avoid unneccessary failed caused by analyze sync	2023-06-27 19:17:22 +08:00
mch_ucchi	64a1eb77f0	[opt](planner) support delete with a subquery in predicate by construct an insert. (#20983 ) complex predicate in delete stmt like: ```sql delete from t1 where t1.id in (select id from t2); ``` will be replaced to an insert stmt. ```sql insert into t1(id, __DORIS_DELETE_SIGN__) select id, 1 from t1 where id in (select id from t2); ```	2023-06-27 17:51:13 +08:00
starocean999	c52c73c1c6	[fix](nereids)return original expr if cast to decimal literal overflow (#21189 )	2023-06-27 17:25:04 +08:00
mch_ucchi	1fb0bdf66c	[fix](regression-test) fix unstable case complex_insert.groovy caused by create mv (#21140 ) currently, insert into a table creating a mv will rise an exception, we fix it by use the create mv action to ensure when we insert to a table, it will not be creating a mv.	2023-06-27 17:24:17 +08:00
starocean999	84554ec0fd	[fix](planner) the resultExprs should be substituted using table function node's outputSmap (#21182 )	2023-06-27 17:19:49 +08:00
zhangdong	7b93b26b8c	[feature-wip](MTMV) optimize lock of mtmv job & task, to avoid dead lock (#21054 )	2023-06-27 16:23:50 +08:00
DeadlineFen	4b94d34ec2	[fix](regression) Add get master token into regression framework (#21198 )	2023-06-27 11:54:31 +08:00
Pxl	70ddf64126	[Chore](agg-state) add documentation about agg_state, add group_concat agg_state test case (#21147 ) add documentation about agg_state, add group_concat agg_state test case	2023-06-27 11:28:19 +08:00
Yulei-Yang	e0b20f0437	[feature](function) add ip function ipv4numtostring (alias inet_ntoa) (#20936 )	2023-06-27 10:17:40 +08:00
zy-kkk	c9306e9c48	[improvement](ms jdbc)Support for automatically obtaining the precision of the sqlserver datetime type (#21145 )	2023-06-26 23:10:46 +08:00
lihangyu	50c1d55769	[Improve](dynamic schema) support filtering invalid data (#21160 ) * [Improve](dynamic schema) support filtering invalid data 1. Support dynamic schema to filter illegal data. 2. Expand the regular expression for ColumnName to support more column names. 3. Be compatible with PropertyAnalyzer and support legacy tables. 4. Default disable parse multi dimenssion array, since some bug unresolved	2023-06-26 19:32:43 +08:00
starocean999	102b7f8873	remove useless case (#21166 )	2023-06-26 16:27:32 +08:00
Armando Zhu	5d2b69b06d	[Enhancement](regression) let test case fail fast when job is cancelled (#20578 ) (#21103 ) In doris regression-test/suites, a lot of test cases quit immediately only if "FINISHED", otherwise they will wait till timeout. For example: while (max_try_secs--) { String res = getJobState(tbName1) if (res == "FINISHED") { sleep(3000) break } else { Thread.sleep(1000) if (max_try_secs < 1) { println "test timeout," + "state:" + res assertEquals("FINISHED", res) } } } This PR added checks so that these test cases can quit immediately also if "CANCELLED", which is the only unchanging status other than "FINISHED".	2023-06-26 12:58:51 +08:00
ZhangYu0123	66005570c9	[fix](regression) fix p1 test_backup_restore fail caused by http download 401 invalid token error #21107	2023-06-26 12:56:46 +08:00
TengJianPing	baf9a2107b	[fix](regression) fix case failure by adding sync after stream load (#21155 )	2023-06-26 10:38:46 +08:00
starocean999	58b3e5ebdb	[fix](nereids)scan node's smap should use materiazlied slots and project list as left and right expr list (#21142 )	2023-06-25 22:34:43 +08:00
airborne12	1ac8cdec7e	[Fix](inverted index) fix inverted query cache for chinese tokenizer (#21106 ) 1. query cache for chinese tokenizer is confusing when just converting w_char to char. 2. seperate query_type from inverted_index_reader to clean code.	2023-06-25 22:04:02 +08:00
minghong	2d1163c4d8	[refactor](nereids) update Agg stats derive method #21036 This pr has no effect on tpch queries. Some tpcds queries are impacted. They are 4/11/23/24/47/51/57/65/74, in which 4 and 51 are improved	2023-06-25 21:47:32 +08:00
AKIRA	638aa41988	[fix](planner) fix push filter through agg #21080 In the previous implementation, the check for groupby exprs was ignored. Add this necessary check to make sure it would work You could reproduce it by runnning belowing sql: CREATE TABLE t_push_filter_through_agg (col1 varchar(11451) not null, col2 int not null, col3 int not null) UNIQUE KEY(col1) DISTRIBUTED BY HASH(col1) BUCKETS 3 PROPERTIES( "replication_num"="1" ); CREATE VIEW `view_i` AS SELECT `b`.`col1` AS `col1`, `b`.`col2` AS `col2` FROM ( SELECT `col1` AS `col1`, sum(`cost`) AS `col2` FROM ( SELECT `col1` AS `col1`, sum(CAST(`col3` AS INT)) AS `cost` FROM `t_push_filter_through_agg` GROUP BY `col1` ) a GROUP BY `col1` ) b; SELECT SUM(`total_cost`) FROM view_a WHERE `dt` BETWEEN '2023-06-12' AND '2023-06-18' LIMIT 1;	2023-06-25 19:14:20 +08:00
shuke	55e7af1e31	[fix](test) fix two case bug #21124	2023-06-25 18:53:20 +08:00
Siyang Tang	46f0295b78	[feature](load-refactor-with-tvf) S3 load with S3 tvf and native insert (#19937 )	2023-06-25 17:45:31 +08:00
Mryange	6896776034	[test](regression) update some case in p2 (#21094 ) update some case in p2	2023-06-25 11:16:56 +08:00
shuke	6a2226867c	[fix](test) load case bug in tpch_sf10_unique_p2 #21093	2023-06-22 15:19:46 +08:00
shuke	d5455dd04b	[fix](regression-test) test case variable scope bug (#21071 )	2023-06-22 09:07:44 +08:00
starocean999	8b561cfb03	[fix](nereids)create datev2 and datetimev2 literal if enable_date_conversion is true (#21065 )	2023-06-21 20:29:36 +08:00
airborne12	6ac0bfeceb	[Feature](inverted index) add unicode parser for inverted index (#21035 )	2023-06-21 20:14:06 +08:00
zhannngchen	cc53391c9a	Revert "[feature](merge-on-write) enable merge on write by default (#… (#21041 )	2023-06-21 18:36:46 +08:00
HHoflittlefish777	2beed11256	[Bug](streamload) fix inconsistent load result of be and fe (#20950 )	2023-06-21 18:12:51 +08:00
Mryange	8bcd42d3f6	[test](regression) update some case in brown_p2 #21037	2023-06-21 16:25:07 +08:00
shuke	4766e75d4e	[test](regression) add analyze after stream load (#21066 )	2023-06-21 16:03:46 +08:00
DeadlineFen	4d84cd8ca1	Revert "Revert "[Test](regression) CCR syncer thrift interface regression test (#20935 )" (#20990 )" (#21022 ) This reverts commit 2a294801f1324a999570158eea3224239eefbb29.	2023-06-21 15:20:21 +08:00
Qi Chen	bad22dd4e2	[Fix](orc-reader) Fix orc dict filter null value issue in `_convert_dict_cols_to_string_cols` which caused incorrect result. (#21047 ) Query results should not have empty values. ``` use regresssion.multi_catalog; select commit_id from github_events_orc WHERE (event_type = 'CommitCommentEvent') AND commit_id != "" limit 10; ``` ``` +------------------------------------------+ \| commit_id \| +------------------------------------------+ \| 685c1fd8dbbdc10c042932f9a9f88be00ff96c75 \| \| 685c1fd8dbbdc10c042932f9a9f88be00ff96c75 \| \| 4e3ab2ff2d2474f5d51334b9b0fdf17e9845a166 \| \| \| \| \| \| \| \| \| \| \| \| \| \| 7191c20cb49da07a7fc16aa32dc0de4faff528b2 \| +------------------------------------------+ 10 rows in set (0.54 sec) ```	2023-06-21 14:54:01 +08:00
lihangyu	62fb0e642e	[chore](dynamic schema) deprecated create dynamic schema table (#21058 )	2023-06-21 14:44:57 +08:00
Pxl	5f0bb49d46	[Feature](materialized-view) support create mv contain aggstate column (#20812 ) support create mv contain aggstate column	2023-06-21 13:06:52 +08:00
amory	18beb822a3	[FIX](array-type) fix array string output with fe const expr (#21042 ) fe foldconstRule make array() function expr with const literal , and would not pass this array literal to be . but we should make fe array string output format is same with be array string output	2023-06-21 11:52:02 +08:00
Pxl	b4773e1195	[Chore](materialized-view) enable nereids planner on regression test mv_p0 (#21023 ) enable nereids planner on regression test mv_p0	2023-06-21 10:01:27 +08:00
dujl	0cf9de8cef	[fix](decimalv3) fix result error when cast a round decimalv3 to double (#20678 )	2023-06-21 00:02:48 +08:00
Kang	2c11ce0a02	[bugfix](topn) fix key topn merge block conflict with index predicate result columns (#20820 )	2023-06-20 21:23:00 +08:00
LiBinfeng	f10258577b	[Fix](Planner) Fix group concat with multi distinct and segs (#20912 ) Problem: when use select group_concat(distinct a, 'seg1'), group_concat(distinct b, 'seg2') ... Error would rised Reason: Group_concat function regard 'seg' as arguments also, so multi distinct column error would rised Solved: let Multi Distinct group_concat function only get first argument as real argument	2023-06-20 21:00:18 +08:00

1 2 3 4 5 ...

1948 Commits