doris

Author	SHA1	Message	Date
谢健	d8f598eeab	[enhancement](Nereids) add timestampadd, timestampdiff functions (#16072 )	2023-01-19 01:05:25 +08:00
mch_ucchi	baf62b4418	[test](Nereids) add regression-test for running_difference and regexp_extract_all (#16049 )	2023-01-18 22:24:52 +08:00
谢健	feeb69438b	[opt](Nereids) optimize DistributeSpec generator of OlapScan (#15965 ) use the size of selected partitions instead of olap table partition size to decide whether generate hashDistributeSpec	2023-01-18 20:18:11 +08:00
AKIRA	0916cbcb10	[ehancement](nereids) Made the parse for named expression more complete (#16010 ) After this PR, we could support such grammar. SELECT SUBSTRING("dddd编", 0, 3) AS "测试"; SELECT SUBSTRING("dddd编", 0, 3) "测试";	2023-01-18 19:44:51 +08:00
Mingyu Chen	4035bd83c3	[fix](jdbc) fix jdbc driver bug and external datasource p2 test case issue (#16033 ) Fix bug that when create jdbc resource with only jdbc driver file name, it will failed to do checksum This is because we forgot the pass the full driver url to JdbcClient. Add ResultSet.FETCH_FORWARD and set AutoCommit to false to jdbc connection, so to avoid OOM when fetching large amount of data set useCursorFetch in jdbc url for both MySQL and PostgreSQL. Fix some p2 external datasource bug	2023-01-18 17:48:06 +08:00
谢健	1fa2b662cf	[opt](Nereids) add date_add/sub function (#16048 ) 1. add week_add week_diff function 2. register all date_add/date_diff function	2023-01-18 17:11:44 +08:00
Hong Liu	94628f09e9	[regression-test](spark-connector) Add the regression case of the spark doris connector (#14877 ) * [regression-test](spark-connector) Add the regression case of the spark doris connector	2023-01-18 16:41:41 +08:00
Kang	65d9293fa9	[testcase](bitmap index)bitmap index testcase (#15975 ) * add bitmap index testcases for all scalar types * add bitmap index testcases for all scalar types	2023-01-18 14:17:24 +08:00
minghong	46ce97a190	[enhance](planner)convert 'or' into 'in-predicate' (#15737 ) in previous [PR 12872](https://github.com/apache/doris/pull/12872), we convert multi equals on same slot into `in predicate`. for example, `a =1 or a = 2` => `a in (1, 2)` This pr makes 4 changes about convert or to in: 1. fix a bug: `Not IN` is merged with equal. `a =1 or a not in (2, 3)` => `a in (1, 2, 3)` 2. extends this rule on more cases - merge for more than one slot: 'a =1 or a = 2 or b = 3 or b = 4' => `a in (1, 2) or b in (3, 4)` - merge skip not-equal and not-in: 'a =1 or a = 2 or b !=3 or c not in (1, 2)' => 'a in (1, 2) or b!=3 or c not in (1,2)` 3. rewrite recursively. 4. OrToIn is implemented in ExtractCommonFactorsRule. This rule will generate new exprs. OrToIn should apply on such generated exprs. for example `(a=1 and b=2) or (a=3 and b=4)` => `(a=1 or a=3) and (b=2 or b=4) and [(a=1 and b=2) or (a=3 and b=4)]` => `a in (1,3) and b in (2 ,4) and [(a=1 and b=2) or (a=3 and b=4)]` In addition, this pr add toString() for some Expr.	2023-01-18 12:33:20 +08:00
starocean999	96b9115286	[fix](nereids) fix bug of invalid column in olap scan node when a materialized view is selected (#15976 ) if a materialized view is selected, the olap scan node's NonUserVisibleOutput property may contains column from other materialized view. This pr remove invalid column	2023-01-18 01:02:12 +08:00
starocean999	0c8255d9b8	[fix](nereids)nest loop join should support filter conjuncts like hash join (#15979 )	2023-01-17 20:38:38 +08:00
starocean999	e2d145cf5d	[fix](fe)fix anti join bug (#15955 ) * [fix](fe)fix anti join bug * fix fe ut	2023-01-17 20:25:00 +08:00
AKIRA	38663526b7	[fix](planner) Keep type of null literal expr when register conjuncts (#15878 ) For now, type information of child expr which is NullLiteral would get lost in the CastExpr#getResultValue, this will produce a NullLiteral with Null type which cause BE core when doing cast	2023-01-17 16:48:02 +08:00
morrySnow	7e4bc1fee6	[fix](Nereids) add a rule to adjust nullable of all expressions (#15791 ) we have some rules that change output's nullable in rewrite step. So we need a rule to adjust nullable at the end of rewrite step. TODO - remove the output slot map - add nullable compare into slot reference - use exprid to compare two slot if do not need to compare nullable - merge all rules into one to adjust all type plans	2023-01-17 15:51:25 +08:00
morrySnow	d98abb12f9	[fix](Nereids)set oepration type coercion is diff with legacy planner (#15982 )	2023-01-17 11:41:41 +08:00
morrySnow	ce1d19b373	[fix](Nereids) lateral view cannot bind function nested in generators (#15960 )	2023-01-17 11:37:56 +08:00
morrySnow	6609eb804d	[fix](regression) result of withUnionAll in query_p0/select_no_from is unstable (#15958 )	2023-01-17 11:34:41 +08:00
minghong	8d25b156aa	[fix](nereids) bind slot using exactly match (#15950 ) example: unbound slot k bounded [k, t.k] In previous binding algorithm, there are 2 candidate bindings, in which bounded k is exactly matched unbound slot k, it has higher priority than that of t1.k	2023-01-17 11:25:08 +08:00
YueW	b1caa68706	[Feature-WIP](inverted index) inverted index reader's implementation, and add mysql_fulltext regression case to test fulltext query (#15823 ) Issue Number: Step2 of DSIP-023: Add inverted index for full text search implementation of inverted index reader dependency pr: #14211 #15807 #15821	2023-01-17 09:13:56 +08:00
Kang	806cd9fb3c	[regression-test](topn)add test cases for nonkey topn query for each scalar type (#15790 ) related to #15558 #15693 1. dup key table with 17 scalar datatypes 2. unique key table with mow enabled 3. unique key table with mow disabled	2023-01-16 16:49:59 +08:00
minghong	fa03c8a241	[feature](nereids) const folding for in-predicate with null literal (#15880 ) select 1 in (2 , null) => null select 1 in (1 , null) => true select 1 not in (2 , null) => null select 1 not in (1 , null) => false	2023-01-16 13:48:45 +08:00
Pxl	81bab55d43	[Bug](function) catch function calculation error on aggregate node to avoid core dump (#15903 )	2023-01-16 11:21:28 +08:00
AlexYue	0d61ca7cdd	[chore](regression) remove redundant case (#15935 )	2023-01-15 11:06:33 +08:00
Gabriel	5af7bcaa55	[Bug](decimalv3) Fix missing precision and scale in predicates (#15930 )	2023-01-15 00:01:48 +08:00
Lightman	313e14d220	[Bugfix] (ROLLUP) fix the coredump when add rollup by link schema change (#15654 ) Because of the rollup has the same keys and the keys's order is same, BE will do linked schema change. The base tablet's segments will link to the new rollup tablet. But the unique id from the base tablet is starting from 0 and as the rollup tablet also. In this case, the unique id 4 in the base table is column 'city', but in the rollup tablet is 'cost'. It will decode the varcode page to bigint page so that be coredump. It needs to be rejected. I think that if a rollup add by link schema change, it means this rollup is redundant. It brings no additional revenue and wastes storage space. So It needs to be rejected.	2023-01-14 10:20:07 +08:00
Tiewei Fang	2580c88c1b	[feature](multi-catalog) support oracle jdbc catalog (#15862 )	2023-01-14 00:01:33 +08:00
caoliang-web	a788623ee2	doris largeint type execute where query, the result is incorrect (#15034 )	2023-01-13 23:12:02 +08:00
AlexYue	514de605b6	[Bug](predicate) add double predicate creator (#15762 ) Add one double predicator the same as integer predicate creator.	2023-01-13 18:34:09 +08:00
AlexYue	049f8ad2f9	[Bug](sort)fix merge sorter might div zero when block bytes less than block rows (#15859 ) If block bytes are bigger than the corresponding block's rows, then the avg_size_per_row would be zero. Which would end up diving zero in the following logic.	2023-01-13 18:33:40 +08:00
minghong	67378a2dc3	[fix](nereids) fix bug in SequenceFunction legality check (#15812 ) 1. fix bug in sequence_match function 2. do type promotion instead of explicit cast for - varcharLiteral -> stringLiteral - charLiteral->stringLiteral	2023-01-13 12:09:53 +08:00
luozenglin	b1fb1277dd	[fix](bitmap) fix bitmap iterator comparison error (#15779 ) Fix the bug that bitmap.begin() == bitmap.end() is always true when the bitmap contains a single value.	2023-01-13 11:37:07 +08:00
HappenLee	9468711f9f	[Bug](join) fix bug null aware left anti join not correct result (#15841 )	2023-01-13 10:18:05 +08:00
Adonis Ling	14e3879c4b	[regression-test](MTMV) Make the case test_create_mtmv more robust (#15866 ) ## Proposed changes 1. Check the state of MTMV task as the loop condition. 2. Check the data in materialized view. ## Problem summary There are some minor issues with #15546. 1. The case used a retry strategy as the loop condition, it may not be stable while the host machine is busy. 2. The case didn't check the final data in materialized view.	2023-01-13 00:13:24 +08:00
abmdocrt	7441b4dc96	[Feature](function) Support width_bucket function (#14396 )	2023-01-12 13:59:21 +08:00
谢健	39697bb83e	[fix](Nereids) make the type of the first parameter in window_funnel is intergerLike (#15810 )	2023-01-12 11:53:28 +08:00
starocean999	cfb110c905	[fix](nereids) fix some nereids bugs (#15714 ) 1. remove forcing nullable for slot on EmptySetNode. 2. order by xxx desc should use nulls last as default order. 3. don't create runtime filter if runtime filter mode is OFF. 4. group by constant value need check the corresponding expr shouldn't have any aggregation functions. 5. fix two left outer join reorder bug( A left join B left join C). 6. fix semi join and left outer join reorder bug.( A left join B semi join C ). 7. fix group by NULL bug. 8. change ceil and floor function to correct signature. 9. add literal comparasion for string and date type. 10. fix the getOnClauseUsedSlots method may not return valid value. 11. the tightness common type of string and date should be date. 12. the nullability of set operation node's result exprs is not set correctly. 13. Sort node should remove redundent ordering exprs.	2023-01-11 17:18:44 +08:00
starocean999	006b3bd61a	[fix](nereids) orthogonal_bitmap_intersect's return type should be bitmap (#15784 )	2023-01-11 12:53:37 +08:00
mch_ucchi	7f2c433e08	[feature](Nereids) add relation id to unboundTVFRelation to avoid incorrect group expression comparison (#15740 )	2023-01-11 12:49:14 +08:00
Pxl	2587095811	[Bug](mv) fix mv selector check group expr && forbid create dup mv with bitmap/hll && add some case (#15738 )	2023-01-11 11:38:56 +08:00
mch_ucchi	bc34a44f06	[Fix](Nereids) fix type coercion for binary arithmetic (#15185 ) support sql like: select true + 1 + '2.0' and prevent select true + 1 + 'x';	2023-01-11 02:55:44 +08:00
minghong	280603b253	[fix](nereids) bind sort key priority problem (#15646 ) `a.b.c` should only bind on `a.b.c`, not on `b.c` or `c`	2023-01-11 02:03:09 +08:00
starocean999	fec89ad58c	[fix](nereids) week should be able to recognized as function name in function call context (#15735 )	2023-01-10 19:54:59 +08:00
AKIRA	7767931aca	[ehancement](nereids) let parser support utf8 identifier (#15721 ) After this PR, below SQL could be parsed well too - SELECT k1 AS 测试 FROM test; - SELECT k1 AS テスト FROM test;	2023-01-10 19:43:04 +08:00
camby	bb28144c76	[fix](schema change) bugfix for light schema change while with rollup (#15681 ) Describe your changes. this problem come from pr: #11494 After add column to rollup index, it also change column UniqueId inside base index.	2023-01-10 19:03:06 +08:00
caoliang-web	672d11522b	[regression](flink)add flink doris connector case (#15676 ) * add flink doris connector case	2023-01-10 17:25:06 +08:00
camby	47097a3db8	[fix](having) revert 15143 and fix having clause with multi-conditions (#15745 ) Describe your changes. Firstly having clause of Mysql is really very complex, we are hard to follow all rules, so we revert pr15143 to keep the logic the same as before. Secondly the origin implementation has problem while having clause has multi-conditions. For example: case1: here v2 inside having clause use table column test_having_alias_tb.v2 SELECT id, v1-2 as v, sum(v2) v2 FROM test_having_alias_tb GROUP BY id,v having(v2>1); ERROR 1105 (HY000): errCode = 2, detailMessage = HAVING clause not produced by aggregation output (missing from GROUP BY clause?): (`v2` > 1) case2: here v2 inside having clause use alias name v2 =sum(test_having_alias_tb.v2), another condition make logic of v2 differently. SELECT id, v1-2 as v, sum(v2) v2 FROM test_having_alias_tb GROUP BY id,v having(v>0 AND v2>1) ORDER BY id,v; +------+------+------+ \| id \| v \| v2 \| +------+------+------+ \| 2 \| 1 \| 3 \| +------+------+------+ So here we try to make the having clause rules simple: Rule1: if alias name inside having clause is the same as column name, we use column name not alias name; Rule2: if alias name inside having clause do not have same name as column name, we use alias name; Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2023-01-10 15:57:29 +08:00
luozenglin	05f6e4c48a	[fix](predicate) fix be core dump caused by pushing down the double column predicate (#15693 )	2023-01-09 19:31:04 +08:00
AKIRA	5ceb5441f4	[feature](nereids) let set operation syntax campatible with lagecy planner (#15664 ) Though this syntax doesn't get suppoted in many other systems since the order by clause here almost redandunt and useless but we have to keep consistent with the legacy doris syntax Here is a example: SELECT * FROM (SELECT k1, k3 FROM tbl1 ORDER BY k3 UNION ALL SELECT k1, k5 FROM tbl2) t;	2023-01-09 15:31:29 +08:00
AKIRA	7543d677fa	[fix](nereids) Fix the bugs of data distribution calculation on OlapScan (#15699 ) when need to scan more than one olap table partition and it is not a colocate table or its colocate group is unstable, we need to make it as any distribution even if its distribution type is Hash	2023-01-09 15:25:54 +08:00
Gabriel	e2492cf7fc	[Bug](DECIMALV3) Fix binary predicate between decimalv3 and float (#15696 )	2023-01-09 15:16:59 +08:00

1 2 3 4 5 ...

697 Commits