doris

Author	SHA1	Message	Date
Mryange	4c6df9062e	[fix](DECIMALV3)fix cumulative precision when literal and DECIMALV3 operations in Legacy (#20354 ) The precision handling for division with DECIMALV3 is as follows (excluding cases where division increases precision): (p1, s1) / (p2, s2) ----> (p1 + s2, s1) However, due to precision loss in division, it is considered to increase the precision of the left operand: (p1, s1) / (p2, s2) =====> (p1 + s2, s1 + s2) / (p2, s2) ----> (p1 + s2, s1) However, the legacy optimizer repeats the analyze and substitute steps for an expression, which can result in the accumulation of precision: (p1, s1) / (p2, s2) =====> (p1 + s2, s1 + s2) / (p2, s2) =====> (p1 + s2 + s2, s1 + s2 + s2) / (p2, s2) To address this, the previous approach was to forcibly convert the left operand of DECIMALV3 calculations. This results in rewriting the expression as: (p1, s1) / (p2, s2) =====> cast((p1, s1) as (p1 + s2, s1 + s2)) / (p2, s2) Then, during the substitution step, a check is performed. If it is a cast expression, the expression modified by the cast is extracted: cast((p1, s1) as (p1 + s2, s1 + s2)) =====> (p1, s1) protected Expr substituteImpl(ExprSubstitutionMap smap, ExprSubstitutionMap disjunctsMap, Analyzer analyzer) { if (isImplicitCast()) { return getChild(0).substituteImpl(smap, disjunctsMap, analyzer); } This way, there won't be repeated analysis, preventing the continuous increase in precision. However, if the left expression is a constant (literal), theoretically, the precision would continue to increase. Unfortunately, the code that was removed in this PR (#19926) obscured this issue. for (Expr child : children) { if (child instanceof DecimalLiteral && child.getType().isDecimalV3()) { ((DecimalLiteral)child).tryToReduceType(); } } An attempt will be made to reduce the precision of literals in the expressions. However, this code snippet can cause such a bug. mysql [test]>select cast(1 as DECIMALV3(16, 2)) / cast(3 as DECIMALV3(16, 2)); +-----------------------------------------------------------+ \| CAST(1 AS DECIMALV3(16, 2)) / CAST(3 AS DECIMALV3(16, 2)) \| +-----------------------------------------------------------+ \| 0.00 \| +-----------------------------------------------------------+ 1.00 / 3.00, due to reduced precision, becomes 1 / 3. <--Describe your changes.-->	2023-06-09 08:58:55 +08:00
Mryange	519f01133a	[feature](decimal)support cast rounding half up and div precision increment in decimalv3. (#19811 )	2023-06-01 13:09:58 +08:00
Mryange	94e1072d14	Revert "[fix](DECIMALV3) Fix the error in DECIMALV3 when explicitly casting. (#19926 )" (#20204 ) This reverts commit 8ca4f9306763b5a18ffda27a07ab03cc77351e35.	2023-05-30 10:35:33 +08:00
Mryange	8ca4f93067	[fix](DECIMALV3) Fix the error in DECIMALV3 when explicitly casting. (#19926 ) before mysql [test]>select cast(1 as DECIMALV3(16, 2)) / cast(3 as DECIMALV3(16, 2)); +-----------------------------------------------------------+ \| CAST(1 AS DECIMALV3(16, 2)) / CAST(3 AS DECIMALV3(16, 2)) \| +-----------------------------------------------------------+ \| 0.00 \| +-----------------------------------------------------------+ mysql [test]>select * from divtest; +------+------+ \| id \| val \| +------+------+ \| 3 \| 5.00 \| \| 2 \| 4.00 \| \| 1 \| 3.00 \| +------+------+ mysql [test]>select cast(1 as decimalv3(16,2)) / val from divtest; +-------------------------------------+ \| CAST(1 AS DECIMALV3(16, 2)) / `val` \| +-------------------------------------+ \| 0 \| \| 0 \| \| 0 \| +-------------------------------------+ after mysql [test]>select cast(1 as DECIMALV3(16, 2)) / cast(3 as DECIMALV3(16, 2)); +-----------------------------------------------------------+ \| CAST(1 AS DECIMALV3(16, 2)) / CAST(3 AS DECIMALV3(16, 2)) \| +-----------------------------------------------------------+ \| 0.33 \| +-----------------------------------------------------------+ mysql [test]>select cast(1 as decimalv3(16,2)) / val from divtest; +-------------------------------------+ \| CAST(1 AS DECIMALV3(16, 2)) / `val` \| +-------------------------------------+ \| 0.250000 \| \| 0.200000 \| \| 0.333333 \| +-------------------------------------+ This is because in the previous code, the constant 1.000 would be transformed into 1. remove "ReduceType	2023-05-29 19:51:12 +08:00
Mryange	a86134cb39	[fix](executor) Fixed an error with cast as time. #20144 before mysql [(none)]>select cast("10:10:10" as time); +-------------------------------+ \| CAST('10:10:10' AS TIMEV2(0)) \| +-------------------------------+ \| 00:00:00 \| +-------------------------------+ after mysql [(none)]>select cast("10:10:10" as time); +-------------------------------+ \| CAST('10:10:10' AS TIMEV2(0)) \| +-------------------------------+ \| 10:10:10 \| +-------------------------------+ In the past, we supported this syntax. mysql [(none)]>select cast("2023:05:01 13:14:15" as time); +------------------------------------------+ \| CAST('2023:05:01 13:14:15' AS TIMEV2(0)) \| +------------------------------------------+ \| 13:14:15 \| +------------------------------------------+ However, "10:10:10" is also a valid datetime. mysql [(none)]>select cast("10:10:10" as datetime); +-----------------------------------+ \| CAST('10:10:10' AS DATETIMEV2(0)) \| +-----------------------------------+ \| 2010-10-10 00:00:00 \| +-----------------------------------+ So here, the order of parsing has been adjusted.	2023-05-29 12:17:21 +08:00
yangshijie	ed8a4b4120	[feature-wip](duplicate_no_keys) skip sort function if the table is duplicate without keys (#19483 )	2023-05-11 14:44:16 +08:00
Mryange	5fd6d8ebd4	[fix](function) Support more behaviors of cast time in MySQL	2023-04-26 07:49:54 +08:00
Mryange	de0e89d1b4	[feature](function) Modified cast as time to behave more like MySQL (#18565 ) Because the underlying type of time was float64, select cast("19:22:18" as time) would result in a null value in the past. Results in the following:	2023-04-22 06:11:59 +08:00
Xinyi Zou	f9baf9c556	[improvement](scan) Support pushdown execute expr ctx (#15917 ) In the past, only simple predicates (slot=const), and, like, or (only bitmap index) could be pushed down to the storage layer. scan process: Read part of the column first, and calculate the row ids with a simple push-down predicate. Use row ids to read the remaining columns and pass them to the scanner, and the scanner filters the remaining predicates. This pr will also push-down the remaining predicates (functions, nested predicates...) in the scanner to the storage layer for filtering. scan process: Read part of the column first, and use the push-down simple predicate to calculate the row ids, (same as above) Use row ids to read the columns needed for the remaining predicates, and use the pushed-down remaining predicates to reduce the number of row ids again. Use row ids to read the remaining columns and pass them to the scanner.	2023-03-10 08:35:32 +08:00
htyoung	69c62b6c6c	[Fix](vectorization) fixed that when a column's _fixed_values exceeds the max_pushdown_conditions_per_column limit, the column will not perform predicate pushdown, but if there are subsequent columns that need to be pushed down, the subsequent column pushdown will be misplaced in _scan_keys and it causes query results to be wrong (#17405 ) the max_pushdown_conditions_per_column limit, the column will not perform predicate pushdown, but if there are subsequent columns that need to be pushed down, the subsequent column pushdown will be misplaced in _scan_keys and it causes query results to be wrong Co-authored-by: tongyang.hty <hantongyang@douyu.tv>	2023-03-08 07:23:56 +08:00
TengJianPing	fb0d08ff4c	[fix](mark join) fix bug of mark join with other conjuncts (#16655 ) Fix bug that probe_index is not increased for mark hash join with other conjuncts.	2023-02-14 14:47:15 +08:00
morrySnow	a512469537	[fix](planner) cannot process more than one subquery in disjunct (#16506 ) before this PR, Doris cannot process sql like that ```sql CREATE TABLE `test_sq_dj1` ( `c1` int(11) NULL, `c2` int(11) NULL, `c3` int(11) NULL ) ENGINE=OLAP DUPLICATE KEY(`c1`) COMMENT 'OLAP' DISTRIBUTED BY HASH(`c1`) BUCKETS 3 PROPERTIES ( "replication_allocation" = "tag.location.default: 1", "in_memory" = "false", "storage_format" = "V2", "disable_auto_compaction" = "false" ); CREATE TABLE `test_sq_dj2` ( `c1` int(11) NULL, `c2` int(11) NULL, `c3` int(11) NULL ) ENGINE=OLAP DUPLICATE KEY(`c1`) COMMENT 'OLAP' DISTRIBUTED BY HASH(`c1`) BUCKETS 3 PROPERTIES ( "replication_allocation" = "tag.location.default: 1", "in_memory" = "false", "storage_format" = "V2", "disable_auto_compaction" = "false" ); insert into test_sq_dj1 values(1, 2, 3), (10, 20, 30), (100, 200, 300); insert into test_sq_dj2 values(10, 20, 30); -- core SELECT * FROM test_sq_dj1 WHERE c1 IN (SELECT c1 FROM test_sq_dj2) OR c1 IN (SELECT c1 FROM test_sq_dj2) OR c1 < 10; -- invalid slot SELECT * FROM test_sq_dj1 WHERE c1 IN (SELECT c1 FROM test_sq_dj2) OR c1 IN (SELECT c2 FROM test_sq_dj2) OR c1 < 10; ``` there are two problems: 1. we should remove redundant sub-query in one conjuncts to avoid generate useless join node 2. when we have more than one sub-query in one disjunct. we should put the conjunct contains the disjunct at the top node of the set of mark join nodes. And pop up the mark slot to the top node.	2023-02-08 18:46:06 +08:00
Gabriel	91229bb87d	[Bug](makr join) Fix mark join with other conjuncts (#16435 )	2023-02-07 09:31:41 +08:00
Gabriel	5ff5b8fc98	[feature](mark join) Support mark join for hash join node (#15569 ) * [feature](mark join) Support mark join for hash join node	2023-01-05 09:32:26 +08:00
morrySnow	5cf21fa7d1	[feature](planner) mark join to support subquery in disjunction (#14579 ) Co-authored-by: Gabriel <gabrielleebuaa@gmail.com>	2022-12-20 15:22:43 +08:00
Kikyou1997	283b23f6da	[fix](planner) wrong results when select from view which has with clause (#14747 )	2022-12-02 18:10:52 +08:00
Yongqiang YANG	6b6d548df9	[enhancement](test) add more p0 cases (#12285 )	2022-09-29 10:45:17 +08:00
minghong	f949262ddf	[fix](planner) a slot id is bounded on a wrong tuple id, if cross join has a hash join as child (#12156 )	2022-08-31 09:07:55 +08:00
yinzhijian	3ca6f34c87	[fix](view) Fix view not showing specific lengths for varchar type (#12107 )	2022-08-29 12:09:48 +08:00
starocean999	5219d2aab0	[fix](union)the result exprs of union node should substitute by child node's smap (#11933 ) union node's result exprs should be substitued by child node's smap first, then the following "computePassthrough" method would have correct information to do its job.	2022-08-24 19:43:40 +08:00
Gabriel	1f9eec5462	[Regression](datev2) Add test cases for datev2/datetimev2 (#11831 )	2022-08-19 10:57:55 +08:00
starocean999	6c6328fc6d	[fix](join)fix outer join bug when a subquery as nullable side #11700	2022-08-12 11:50:15 +08:00
minghong	2d5ffac590	[fix](optimization) InferFiltersRule bug: a self inner join on a view, which contains where clause, will cause mis-inference. (#11566 )	2022-08-11 17:13:26 +08:00
minghong	02a3f21b65	[fix](analyzer) InferFilterRule bug: equations in on clause of outer/anti join are not inferable. (#11515 )	2022-08-11 09:36:43 +08:00
morrySnow	1701ffa7c0	[fix](planner)push constant expr in predicate to outer join's other conjuncts by mistake (#11527 ) constant expr in predicate should not be pushed to outer join's other conjuncts	2022-08-08 20:56:08 +08:00
Yongqiang YANG	60b5ed16a8	[improvement](test) move correctness and account suites to p0 while tpcds_sf1 t0 p1 (#11350 )	2022-08-02 11:23:01 +08:00
morrySnow	d9fab77100	[fix](planner)LateralViewRef#toSql throw NPE if it is not analyzed (#11221 )	2022-07-27 14:44:27 +08:00
TengJianPing	2ed46eee64	[bugfix] fix coredump caused by nullable const column compare to non-nullable const column (#11227 )	2022-07-27 12:00:26 +08:00
Jerry Hu	4d95bb4888	[regression-test]Add order-by to qt_groupby in correctness/table_valued_function (#11219 )	2022-07-27 10:07:34 +08:00
starocean999	3e3b2d15d4	[bug]string pad functions should always be nullable (#11140 ) * string pad functions should always be nullable	2022-07-26 10:20:11 +08:00
starocean999	ca0906626f	[BUG] fix bitmap function bug (#10502 ) * fix bitmap function bug * add regression test	2022-07-01 15:30:16 +08:00
Tiewei Fang	17eb8c00d3	[feature] add table valued function framework and numbers table valued function (#10214 )	2022-06-28 14:01:57 +08:00
morrySnow	59b3023adf	[fix](regression)bucket shuffle join with collocate table should use order_qt (#10082 )	2022-06-14 15:34:39 +08:00
zy-kkk	bf984c1e80	[test] Add drop force to cases associated with ALTER operations (#10049 )	2022-06-12 21:27:44 +08:00
morrySnow	3f575e3e7c	[fix](planner) produce wrong result when use bucket shuffle join with colocate left table (#10045 ) When plan bucket shuffle join, we need to know left table bucket number. Currently, we use tablet number directly based on the assumption that left table has only one partition. But, when left table is colocated table, it could have more than one partition. In this case, some data in right table will be dropped incorrectly and produce wrong result for query. reproduce could follow regression test in PR.	2022-06-11 21:44:47 +08:00
starocean999	bf8b4fb2d3	[Bugfix] be crash when executing sql contains bitmap_intersect function (#9910 ) * fix bitmap serialize bug * add regression test for bitmap seralize bugfix * add missing regression test out file * fix reggresion test failed issue	2022-06-09 08:45:46 +08:00
BePPPower	99fb830023	[feature] datetime column type support auto-initialized with default … (#9972 )	2022-06-09 00:28:03 +08:00
zhengshengjun	49d4798276	[fix](function) fix bug in time_round function (#9712 )	2022-06-06 08:58:22 +08:00
morrySnow	3031919e8f	[fix] (planner) slot nullable does not set correctly when plan outer join with inline view (#9927 ) - set inline view's slot descriptor to nullable in register column ref - propagate slot nullable when generate inline view's query node in SingleNodePlanner	2022-06-03 17:50:10 +08:00
zhangstar333	e896fffd76	[Vectorized][Function] fix bitmap_intersect get wrong result (#9907 )	2022-06-01 23:51:52 +08:00
morrySnow	a3183ec45c	[fix](planner) unnecessary cast will be added on children in CaseExpr sometimes (#9600 ) unnecessary cast will be added on children in CaseExpr because use symbolized equal to compare to `Expr`'s type. it will lead to expression compare mistake and then lead to expression substitute failed when use `ExprSubstitutionMap`	2022-05-18 22:44:51 +08:00
zhangstar333	953429e370	[fix](function) fix last_value get wrong result when have order by clause (#9247 )	2022-05-15 23:56:01 +08:00
zhangstar333	fa6e4db4ca	[fix](Function) fix case when function return null with abs function (#9493 )	2022-05-14 09:50:45 +08:00
zhangstar333	fd11a6b493	[fix][feature](Function) fix return type && support hll_union_agg/group_concat agg to window function (#9119 )	2022-05-07 20:44:04 +08:00
zhangstar333	cdd1b6d6dd	[fix](function) fix lag/lead function return invalid data (#9076 )	2022-04-26 09:34:46 +08:00
zhangy5	498f50a837	[regression-test] update test case dir which divided by basic functions (#9084 ) 1. Add test case dir. 2. Add some test suites.	2022-04-21 11:55:41 +08:00
Zhengguo Yang	7634e55513	[fix] fix p0 test failed because of char type cannot convert to datetime (#8996 ) fix p0 test failed because of char type cannot convert to datetime	2022-04-15 15:16:00 +08:00
zhangstar333	9ac6d23a44	[Feature]support stddev/variance agg functions to window function (#8962 )	2022-04-14 12:07:26 +08:00
Pxl	8a066e2586	[fix](vectorized) core dump on ST_AsText (#8870 )	2022-04-11 09:39:32 +08:00
Pxl	453485abfb	[Bug] Fix some bugs(rewrite rule/symbol transport) of `like predicate` (#8770 )	2022-04-08 14:32:09 +08:00

1 2

51 Commits