doris

Author	SHA1	Message	Date
starocean999	fff1983f40	[fix](planner)use tupleId of agg node to get its unsigned conjuncts (#21949 )	2023-07-19 00:46:49 +08:00
TengJianPing	a9ea138caf	[fix](two level hash table) fix dead loop when converting to two level hash table for zero value (#21899 ) When enable two level hash table , if there is zero value in the existing one level hash table, it will cause dead loop when converting to two level hash table, because the PartitionedHashTable::_is_partitioned flag is not set correctly when doing the converting.	2023-07-18 19:50:30 +08:00
zhangstar333	87556b5741	[bug](test) fix regression test case failed with curdate (#21922 ) fix regression test case failed with curdate	2023-07-18 19:10:55 +08:00
Pxl	417e3e5616	[Feature](delete) support fold constant on delete stmt (#21833 ) support fold constant on delete stmt	2023-07-18 12:56:28 +08:00
Pxl	19492b06c1	[Bug](decimalv3) fix failed on test_dup_tab_decimalv3 due to wrong precision (#21890 ) fix failed on test_dup_tab_decimalv3 due to wrong precision	2023-07-18 12:53:09 +08:00
Jibing-Li	489171e4c1	[Fix](multi catalog)Fix hive partition value contains special character such as / bug (#21876 ) Hive escapes some special characters in partition value to %XX, for example, / is escaped to %2F. Doris didn't handle this case which will cause doris failed to list the files under partition with special characters. This pr is to fix this bug.	2023-07-18 11:20:38 +08:00
Mryange	b656f31cf2	[Enchancement](compatible) show decimalv3 to decimal (#21782 )	2023-07-18 09:17:14 +08:00
Jibing-Li	a92508c3f9	[Fix](statistics) Fix analyze db always use internal catalog bug (#21850 ) `Analyze database db_name ` command couldn't use current catalog, it is always using the internal catalog. This will cause the command failed to find the db. This pr is to fix this bug.	2023-07-17 15:28:54 +08:00
zy-kkk	03b575842d	[Feature](table function) support explode_json_array_json (#21795 )	2023-07-17 11:40:02 +08:00
Pxl	86841d8653	[Bug](materialized-view) fix some problems of mv and make ssb mv work on nereids (#21559 ) fix some problems of mv and make ssb mv work on nereids	2023-07-17 10:08:25 +08:00
abmdocrt	c409fa0f58	[Feature](Compaction)Support full compaction (#21177 )	2023-07-16 13:21:15 +08:00
Kang	83ce4379ff	[regression] add order by in test case for stable output (#21815 )	2023-07-14 18:01:43 +08:00
mch_ucchi	c9a99ce171	[Feature](Nereids) support udf for Nereids (#18257 ) Support alias function, Java UDF, Java UDAF for Nereids. Implementation: UDFs(alias function, Java UD(A)F) are saved in database object, we get it by FunctionDesc, which requires function name and arg types. So firstly we bind expressions of its children so that we can get the return type of args. Then we get the best selection. Secondly: For alias function: The original function of the alias function is represented as original planner-style function, it's too hard to translate it to nereids-style expression hence we transfer it to the corresponding sql and parse it. Now we get the nereids-style function, and try to bind the function. the bound function will also change the type by add cast node of its children to its expecting input types, so that if we travel a bound function more than one times, the cast node will be different. To solve the problem, we add a flag isAnalyzedFunction. it's set false by default and will be set true when return from the visitor function. If the flag is true, it will return immediately in visitor function. Now we can ensure that the bound functions in children will be the same though we travel it more than one time. we can replace the alias function to its original function and bind the unbound functions. For JavaUDF and JavaUDAF JavaUDF and JavaUDAF can be recognized as a catalog function and hard to be entirely translated to Nereids-style function, we create a nereids expression object JavaUdf and JavaUdaf to wrap it. All in all, now Nereids support UDFs and nesting them.	2023-07-14 17:02:01 +08:00
minghong	f95d728d3e	[shape](nereids) TPCDS check all query shape, except ds64 (#21742 ) there is a known bug on ds64 analyze. add ds 64 shape check latter	2023-07-14 16:56:46 +08:00
Pxl	4d44cea784	[Bug](materialized-view) check group expr at create mv (#21798 ) check group expr at create mv	2023-07-14 15:39:38 +08:00
daidai	ca6e33ec0c	[feature](table-value-functions)add catalogs table-value-function (#21790 ) mysql> select * from catalogs() order by CatalogId;	2023-07-14 10:25:16 +08:00
Qi Chen	6fd8f5cd2f	[Fix](parquet-reader) Fix parquet string column min max statistics issue which caused query result incorrectly. (#21675 ) In parquet, min and max statistics may not be able to handle UTF8 correctly. Current processing method is using min_value and max_value statistics introduced by PARQUET-1025 if they are used. If not, current processing method is temporarily ignored. A better way is try to read min and max statistics if it contains only ASCII characters. I will improve it in the future PR.	2023-07-14 00:09:41 +08:00
Xin Liao	35fa9496e7	[fix](merge-on-write) fix wrong result when query with prefix key predicate (#21770 )	2023-07-13 19:56:00 +08:00
Kang	abc21f5d77	[bugfix](ngram bf index) process differently for normal bloom filter index and ngram bf index (#21310 ) * process differently for normal bloom filter index and ngram bf index * fix review comments for readbility * add test case * add testcase for delete condition	2023-07-13 17:31:45 +08:00
mch_ucchi	d4bdd6768c	[Feature](Nereids) support select into outfile (#21197 )	2023-07-13 17:01:47 +08:00
YueW	00c48f7d46	[opt](regression case) add more index change case (#21734 )	2023-07-12 21:52:48 +08:00
amory	be55cb8dfc	[Improve](jsonb_extract) support jsonb_extract multi parse path (#21555 ) support jsonb_extract multi parse path	2023-07-12 21:37:36 +08:00
AKIRA	88c719233a	[opt](nereids) convert OR expression to IN expression (#21326 ) Add new rule named "OrToIn", used to convert multi equalTo which has same slot and compare to a literal of disjunction to a InPredicate so that it could be pushdown to storage engine. for example: ```sql col1 = 1 or col1 = 2 or col1 = 3 and (col2 = 4) col1 = 1 and col1 = 3 and col2 = 3 or col2 = 4 (col1 = 1 or col1 = 2) and (col2 = 3 or col2 = 4) ``` would be converted to ```sql col1 in (1, 2) or col1 = 3 and (col2 = 4) col1 = 1 and col1 = 3 and col2 = 3 or col2 = 4 (col1 in (1, 2) and (col2 in (3, 4))) ```	2023-07-12 10:53:06 +08:00
daidai	ff42cd9b49	[feature](hive)add read of the hive table textfile format array type (#21514 )	2023-07-11 22:37:48 +08:00
zhangy5	cb69349873	[regression] add bitmap filter p1 regression case (#21591 )	2023-07-11 14:27:03 +08:00
zy-kkk	5ed42705d4	[fix](jdbc scan) `1=1` does not translate to `TRUE` (#21688 ) For most database systems, they recognize where 1=1 but not where true, so we should send the original 1=1 to the database	2023-07-11 14:04:49 +08:00
zy-kkk	d3be10ee58	[improvement](column) Support for the default value of current_timestamp in microsecond (#21487 )	2023-07-11 14:04:13 +08:00
bobhan1	7b403bff62	[feature](partial update)support insert new rows in non-strict mode partial update with nullable unmentioned columns (#21623 ) 1. expand the semantics of variable strict_mode to control the behavior for stream load: if strict_mode is true, the stream load can only update existing rows; if strict_mode is false, the stream load can insert new rows if the key is not present in the table 2. when inserting a new row in non-strict mode stream load, the unmentioned columns should have default value or be nullable	2023-07-11 09:38:56 +08:00
TengJianPing	736d6f3b4c	[improvement](timezone) support mixed uppper-lower case of timezone names (#21572 )	2023-07-11 09:37:14 +08:00
Mryange	8973610543	[feature](datetime) "timediff" supports calculating microseconds (#21371 )	2023-07-10 19:21:32 +08:00
acnot	202a5c636f	[fix](create table) modify varchar default length 1 to 65533 (#21302 ) modify archer default length 1 to varchar.max.length , when create table. ```mysql create table t2 ( k1 CHAR, K2 CHAR(10) , K3 VARCHAR , K4 VARCHAR(1024) ) duplicate key (k1) distributed by hash(k1) buckets 1 properties('replication_num' = '1'); desc t2; ``` \| Field \| Type \| Null \| Key \| Default \| Extra \| \| -- \|--\|--\| -\| -\| -\| \| k1 \| CHAR(1) \| Yes \| true \| NULL \| \| \| K2 \| CHAR(10) \| Yes \| false \| NULL \| NONE \| \| K3 \| VARCHAR(65533) \| Yes \| false \| NULL \| NONE \| \| K4 \| VARCHAR(1024) \| Yes \| false \| NULL \| NONE \|	2023-07-10 17:57:21 +08:00
zy-kkk	0be349e250	[feature](jdbc) Support jdbc catalog to read json types (#21341 )	2023-07-10 16:21:00 +08:00
Jibing-Li	f9c56d59fc	[improvement](statistics)Support external table show table stats, modify column stats and drop stats (#21624 ) Support external table show table stats, modify column stats and drop stats.	2023-07-10 11:33:06 +08:00
Pxl	77336bff44	[Bug](materialized-view) adjust limit for create materialized view on uniq/agg table (#21580 ) adjust limit for create materialized view on uniq/agg table	2023-07-10 10:04:17 +08:00
YueW	c58d5cd81b	[opt](regression case) add more index change regression case (#21633 )	2023-07-08 22:23:09 +08:00
morrySnow	2d445bbb6d	[opt](Nereids) forbid some bad case on agg plans (#21565 ) 1. forbid all candidates that need to gather process except must do it 2. forbid do local agg after reshuffle of two phase agg of distinct 3. forbid one phase agg after reshuffle 4. forbid three or four phase agg for distinct if any stage need reshuffle 5. forbid multi distinct for one distinct agg if do not need reshuffle	2023-07-07 17:45:55 +08:00
Mingyu Chen	0b7b5dc991	[fix](catalog) wrong required slot info causing BE crash (#21598 ) For file scan node, this is a special field `requiredSlot`, this field is set depends on the `isMaterialized` info of slot. But `isMaterialized` info can be changed during the plan process, so we must update the `requiredSlot` in `finalize` phase of scan node, otherwise, it may causing BE crash due to mismatching slot info.	2023-07-07 17:10:50 +08:00
morrySnow	f908ea5573	[fix](Nereids) union distinct should not prune any column (#21610 )	2023-07-07 14:38:28 +08:00
bobhan1	2a721be4f7	[fix](partial update) correct col_nums when init agg state in memtable (#21592 )	2023-07-07 14:03:33 +08:00
starocean999	fba3ae96b9	Revert "[Fix](planner) Set inline view output as non constant after analyze (#21212 )" (#21581 ) This reverts commit 0c3acfdb7c744decb7b60e372007707a55d14e00.	2023-07-06 20:30:27 +08:00
starocean999	2e651bbc9a	[fix](nereids) fix some planner bugs (#21533 ) 1. allow cast boolean as date like type in nereids, the result is null 2. PruneOlapScanTablet rule can prune tablet even if a mv index is selected. 3. constant conjunct should not be pushed through agg node in old planner	2023-07-06 16:13:37 +08:00
LiBinfeng	0c3acfdb7c	[Fix](planner) Set inline view output as non constant after analyze (#21212 ) Problem: Select list should be non const when from list have tables or multiple tuples. Or upper query will regard wrong of isConstant And make wrong constant folding For example： when using nullif funtion with subquery which result in two alternative constant, planner would treat it as constant expr. So analyzer would report an error of order by clause can not be constant Solusion: Change inline view output to non constant, because (select 1 a from table) as view , a in output is no constant when we see view.a outside	2023-07-06 15:37:43 +08:00
HHoflittlefish777	6a0a21d8b0	[regression-test](load) add streamload default value test (#21536 )	2023-07-06 10:14:13 +08:00
morrySnow	4d414c649a	[fix](Nereids) set operation physical properties derive is wrong (#21496 )	2023-07-05 15:44:40 +08:00
abmdocrt	48bfb8e9cf	[Enhancement](regression-test)Add regression test for MoW backup and restore (#21223 )	2023-07-05 15:16:04 +08:00
xzj7019	f9bc433917	[fix](nereids) fix runtime filter expr order (#21480 ) Current runtime filter pushing down to cte internal, we construct the runtime filter expr_order with incremental number, which is not correct. For cte internal rf pushing down, the join node will be always different, the expr_order should be fixed as 0 without incrementation, otherwise, it will lead the checking for expr_order and probe_expr_size illegal or wrong query result. This pr will revert 2827bc1 temporarily, it will break the cte rf pushing down plan pattern.	2023-07-05 14:27:35 +08:00
DeadlineFen	0469c02202	[Test](regression) Temporarily disable quickTest for SHOW CREATE TABLE to adapt to enable_feature_binlog=true (#21247 )	2023-07-05 10:12:02 +08:00
morrySnow	90dd8716ed	[refactor](multicast) change the way multicast do filter, project and shuffle (#21412 ) Co-authored-by: Jerry Hu <mrhhsg@gmail.com> 1. Filtering is done at the sending end rather than the receiving end 2. Projection is done at the sending end rather than the receiving end 3. Each sender can use different shuffle policies to send data	2023-07-04 16:51:07 +08:00
starocean999	599ba4529c	[fix](nereids) need run ConvertInnerOrCrossJoin rule again after EliminateNotNull (#21346 ) after running EliminateNotNull rule, the join conjuncts may be removed from inner join node. So need run ConvertInnerOrCrossJoin rule to convert inner join with no join conjuncts to cross join node.	2023-07-04 10:52:36 +08:00
Qi Chen	f80df20b6f	[Fix](multi-catalog) Fix read error in mixed partition locations. (#21399 ) Issue Number: close #20948 Fix read error in mixed partition locations(for example, some partitions locations are on s3, other are on hdfs) by `getLocationType` of file split level instead of the table level.	2023-07-03 15:14:17 +08:00

1 2 3 4 5 ...

1415 Commits