doris

Author	SHA1	Message	Date
starocean999	2e651bbc9a	[fix](nereids) fix some planner bugs (#21533 ) 1. allow cast boolean as date like type in nereids, the result is null 2. PruneOlapScanTablet rule can prune tablet even if a mv index is selected. 3. constant conjunct should not be pushed through agg node in old planner	2023-07-06 16:13:37 +08:00
starocean999	599ba4529c	[fix](nereids) need run ConvertInnerOrCrossJoin rule again after EliminateNotNull (#21346 ) after running EliminateNotNull rule, the join conjuncts may be removed from inner join node. So need run ConvertInnerOrCrossJoin rule to convert inner join with no join conjuncts to cross join node.	2023-07-04 10:52:36 +08:00
amory	b1e973b721	[Improve](func)support array to window-func first-last-value arg type (#21201 ) * support array to windown-func first-last-value arg type * add regress test for first-last-value of array type * update * format be:	2023-06-28 10:02:00 +08:00
starocean999	84554ec0fd	[fix](planner) the resultExprs should be substituted using table function node's outputSmap (#21182 )	2023-06-27 17:19:49 +08:00
Pxl	a0d4f11667	[Bug](function) catch error state in function cast to avoid core dump (#20751 ) catch error state in function cast to avoid core dump	2023-06-14 17:34:34 +08:00
starocean999	7942bd0bf9	[fix](planner) cast string literal to date like type should not be an implict cast (#20709 ) 1. cast string literal to date like type should not be an implict cast 2. the string representation of float like type should not be scientific notation 3. the data type of like function's regex expr should be string type even if it's a null literal 4. add -Xss4m in fe.conf to prevent stack overflow in some case	2023-06-13 17:57:14 +08:00
Xujian Duan	0b228b3414	[fix](load)Support load json data with default value (#20624 ) * support json default value --------- Co-authored-by: duanxujian <duanxujian@jd.com>	2023-06-12 14:51:31 +08:00
starocean999	bcc37c9405	[fix](planner)the common type of floating and decimal should be floating type (#20634 ) * [fix](planner)the common type of floating and decimal should be floating type * fix test cases	2023-06-12 11:32:23 +08:00
starocean999	5b2efd196b	[fix](execution) result_filter_data should be filled by 0 when can_filter_all is true (#20438 )	2023-06-05 17:05:35 +08:00
Mryange	519f01133a	[feature](decimal)support cast rounding half up and div precision increment in decimalv3. (#19811 )	2023-06-01 13:09:58 +08:00
starocean999	4cbb6ece10	[fix](fe)ordering exprs should be substituted in the same way as select part (#20091 )	2023-05-27 21:00:57 +08:00
starocean999	558f625d3b	[fix](planner) The group by part should be substituted in the same way as select part (#20019 )	2023-05-26 11:05:02 +08:00
ZI-MA	f43e8cc98f	[regressiontest](unionall) Regression_test_similar_query_boolean (#19553 ) * regression_test_similar_query * add the ORDER BY * update ORDER BY to comfirm correctness --------- Co-authored-by: ZI-MA <chime316@qq.com>	2023-05-18 12:21:32 +08:00
Kang	88ca4f3e6b	[feature](like) make like regexp used as a sql function (#19755 )	2023-05-18 10:03:12 +08:00
Weijie Guo	9535ed01aa	[feature](tvf) Support compress file for tvf hdfs() and s3() (#19530 ) We can support this by add a new properties for tvf, like : `select * from hdfs("uri" = "xxx", ..., "compress_type" = "lz4", ...)` User can: Specify compression explicitly by setting `"compression" = "xxx"`. Doris can infer the compression type by the suffix of file name(e.g. `file1.gz`) Currently, we only support reading compress file in `csv` format, and on BE side, we already support. All need to do is to analyze the `"compress_type"` on FE side and pass it to BE.	2023-05-16 08:50:43 +08:00
zclllyybb	92bf485abd	[Bug] Fix doris pipeline shared scan and top n opt (#19599 )	2023-05-15 10:00:44 +08:00
lvshaokang	af04c3acab	[fix](sequence-column) Fix sequence_col column used default expr insert failed (#18933 )	2023-05-08 17:18:25 +08:00
starocean999	3e3262361c	[fix](fe)havingClause should be substituted the same way as resultExprs (#19261 ) substituted havingClause in the same way as resultExprs to prevent " HAVING clause not produced by aggregation output" error	2023-05-05 18:03:43 +08:00
starocean999	aacc075f09	[fix](planner) SetOperationNode's slots' nullability calculation is wrong (#19108 ) SetOperationNode's slots' nullability should consider slots info from all children, even some children have EmptyResultSet	2023-04-26 21:18:37 +08:00
Qi Chen	61b7a52444	[Enhancement](multi-catalogs) Use decimal V3 type in multi-catalogs module. (#18926 ) 1. Use decimal V3 type in JDBC and Iceberg tables. 2. Fix hdfs TVF decimal V3 type and regression test.	2023-04-25 14:49:40 +08:00
Qi Chen	3328a65b75	[Fix](mutli-catalog) Use decimal v3 type to fix decimal loss issue in multi-catalog module. (#18835 ) Fix decimal v3 precision loss issues in the multi-catalog module. Now it will use decimal v3 to represent decimal type in the multi-catalog module. Regression Test: `test_load_with_decimal.groovy`	2023-04-20 11:02:53 +08:00
starocean999	aa6b3cc537	[fix](planner)keep all agg functions if there is any virtual slots in group by list (#18630 ) Because of the limitation of ProjectPlanner, we have to keep set agg functions materialized if there is any virtual slots in the group by list, such as 'GROUPING_ID' in the group by list etc.	2023-04-13 19:44:46 +08:00
zclllyybb	43392918cd	[Optimization](functions)Optimize function call for const columns. (#18310 )	2023-04-12 11:11:01 +08:00
Mingyu Chen	ecd3fd07f6	[feature](colocate) support cross database colocate join (#18152 )	2023-04-03 14:03:42 +08:00
Pxl	e77833bfa1	[Bug](materialized-view) fix where clause persistence replay incorrect (#18228 ) fix where clause persistence replay incorrect	2023-04-03 12:49:01 +08:00
Jerry Hu	d27201f331	[fix](nested_loop_join)got incorrect result from nested loop join without condition (#18139 )	2023-03-28 16:20:05 +08:00
Tiewei Fang	d7dcdfcba9	[Fix](Create View) support create view from tvf (#18087 ) Support create view as select * from tvf()	2023-03-28 15:07:32 +08:00
starocean999	8ffc85b6ff	[fix](planner)project should be done inside inlineview (#17831 ) * [fix](planner)project should be done inside inlineview * add src column for slots in scan node's output tuple	2023-03-20 21:12:45 +08:00
starocean999	782001c75b	[fix](planner) project should be done inside subquery (#17630 ) WITH t0 AS( SELECT report.date1 AS date2 FROM( SELECT DATE_FORMAT(date, '%Y%m%d') AS date1 FROM cir_1756_t1 ) report GROUP BY report.date1 ), t3 AS( SELECT date_format(date, '%Y%m%d') AS date3 FROM cir_1756_t2 ) SELECT row_number() OVER(ORDER BY date2) FROM( SELECT t0.date2 FROM t0 LEFT JOIN t3 ON t0.date2 = t3.date3 ) tx; The DATE_FORMAT(date, '%Y%m%d') was calculated in GROUP BY node, which is wrong. This expr should be calculated inside the subquery.	2023-03-13 11:10:27 +08:00
Tiewei Fang	13e05c4a5d	[Enhencement](stream load) add some regression test for json format streamload (#17520 )	2023-03-12 20:13:07 +08:00
Jerry Hu	08f0170895	[fix](olap) The 'scan key' generated by the 'is null' expression causes incorrect query results (#17569 )	2023-03-10 08:51:06 +08:00
Jerry Hu	caacee253d	[fix](olap)Crashing caused by IS NULL expression (#17463 ) Issue Number: close #17462	2023-03-07 15:32:52 +08:00
starocean999	479d57df88	[fix](planner) the project expr should be calculated in join node in some case (#17035 ) Consider the sql bellow: select sum(cc.qlnm) as qlnm FROM outerjoin_A left join (SELECT outerjoin_B.b, coalesce(outerjoin_C.c, 0) AS qlnm FROM outerjoin_B inner JOIN outerjoin_C ON outerjoin_B.b = outerjoin_C.c ) cc on outerjoin_A.a = cc.b group by outerjoin_A.a; The coalesce(outerjoin_C.c, 0) was calculated in the agg node, which is wrong. This pr correct this, and the expr is calculated in the inner join node now.	2023-02-24 15:20:05 +08:00
xueweizhang	90af1b0113	[fix](subquery) fix bug of using constexpr and some agg func(like count,max) as subquery's output (#16579 ) Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-02-14 00:11:56 +08:00
TengJianPing	f6a20f844b	[fix](hashjoin) join produce blocks with rows larger than batch size: handle join with other conjuncts (#16402 )	2023-02-08 14:26:35 +08:00
starocean999	df3a6e2412	[fix](fe)only set column info for slots in sortTupleDesc (#16407 )	2023-02-04 23:14:25 +08:00
starocean999	dd63897757	[fix](be)the set operation node should accept both nullable and non-nullable data from child node (#16126 )	2023-02-04 23:08:59 +08:00
xy720	cd457312e4	[Enhancement](grouping) Add a switch for users to force using alias name in group by and having clause (#15748 )	2023-01-31 23:46:31 +08:00
yiguolei	5eaa995704	[refactor](some mempool) not memset 0 in default value iterator (#16194 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-29 22:50:39 +08:00
gnehil	25046fabec	[regression-test](sub query) add regression test for subquery with limit (#16051 ) * [regression-test](sub query) add regression test for subquery with limit * add lisence header	2023-01-21 08:06:49 +08:00
Gabriel	6e090e4daf	[Bug](predicate) fix date predicate (#16053 )	2023-01-19 14:14:48 +08:00
AKIRA	c5beab39c0	[fix](nereids) Bind slot in having to its direct child instead of grand child (#16047 ) For example, in this case, the `date` in having clause should be bind to alias which has same name, instead of `date` field of the relation SELECT date_format(date, '%x%v') AS `date` FROM `tb_holiday` WHERE `date` between 20221111 AND 20221116 HAVING date = 202245 ORDER BY date;	2023-01-19 13:19:16 +08:00
camby	47097a3db8	[fix](having) revert 15143 and fix having clause with multi-conditions (#15745 ) Describe your changes. Firstly having clause of Mysql is really very complex, we are hard to follow all rules, so we revert pr15143 to keep the logic the same as before. Secondly the origin implementation has problem while having clause has multi-conditions. For example: case1: here v2 inside having clause use table column test_having_alias_tb.v2 SELECT id, v1-2 as v, sum(v2) v2 FROM test_having_alias_tb GROUP BY id,v having(v2>1); ERROR 1105 (HY000): errCode = 2, detailMessage = HAVING clause not produced by aggregation output (missing from GROUP BY clause?): (`v2` > 1) case2: here v2 inside having clause use alias name v2 =sum(test_having_alias_tb.v2), another condition make logic of v2 differently. SELECT id, v1-2 as v, sum(v2) v2 FROM test_having_alias_tb GROUP BY id,v having(v>0 AND v2>1) ORDER BY id,v; +------+------+------+ \| id \| v \| v2 \| +------+------+------+ \| 2 \| 1 \| 3 \| +------+------+------+ So here we try to make the having clause rules simple: Rule1: if alias name inside having clause is the same as column name, we use column name not alias name; Rule2: if alias name inside having clause do not have same name as column name, we use alias name; Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2023-01-10 15:57:29 +08:00
luozenglin	05d72e8919	[fix](join) fix anti join incorrectly outputs null values (#15567 )	2023-01-06 09:55:48 +08:00
camby	59f34be41f	[fix](having-clause) having clause do not works correct with same alias name (#15143 )	2023-01-05 10:15:15 +08:00
morrySnow	917b266799	[fix](planner) table valued function could not used in subquery (#15496 )	2022-12-30 10:01:25 +08:00
Xin Liao	e72404c537	[fix](scan) fix that be may core dump when the predicates are all false (#15332 )	2022-12-24 15:27:43 +08:00
jakevin	bfaaa2bd7c	[feature](Nereids) support digital_masking function (#15252 )	2022-12-23 18:59:08 +08:00
Tiewei Fang	e7a077a81f	[fix](jdbc catalog) fix bugs of jdbc catalog and table valued function (#15216 ) * fix bugs * add `desc function` test * add test * fix	2022-12-23 16:46:39 +08:00
starocean999	82fbfab77f	[fix](union)the union node should not pass through children in some case (#15286 ) the union node will make children pass through in wrong condition. If the children's materialized slots are different from union node, children can't be passed through.	2022-12-23 10:27:49 +08:00

1 2

98 Commits