doris

Author	SHA1	Message	Date
minghong	74c0677d62	[fix](planner) fix bugs in uncheckedCastChild (#15905 ) 1. `uncheckedCastChild` may generate redundant `CastExpr` like `cast( cast(XXX as Date) as Date)` 2. generate DateLiteral to replace cast(IntLiteral as Date)	2023-01-19 15:51:08 +08:00
AKIRA	21b78cb820	[fix](nereids) Fix bind failed of the slots in the group by clause (#16077 ) Child's slot with same name to the slots in the outputexpression would be discarded which would cause the bind failed, since the slots in the group by expressions cannot find the corresponding bound slots from the child's output	2023-01-19 15:36:13 +08:00
minghong	0144c51ddb	[fix](nereids) fix bug in CaseWhen.getDataType and add some missing case for findTightestCommonType (#15776 )	2023-01-19 15:30:25 +08:00
qiye	47aa53fa72	[fix](multi-catalog)switching catalogs after dropping will get NPE. (#16067 ) Issue Number: close #16066	2023-01-19 15:13:21 +08:00
AKIRA	c5beab39c0	[fix](nereids) Bind slot in having to its direct child instead of grand child (#16047 ) For example, in this case, the `date` in having clause should be bind to alias which has same name, instead of `date` field of the relation SELECT date_format(date, '%x%v') AS `date` FROM `tb_holiday` WHERE `date` between 20221111 AND 20221116 HAVING date = 202245 ORDER BY date;	2023-01-19 13:19:16 +08:00
morrySnow	abdf56bfa5	[fix](Nereids) wrong result of group_concat with order by or null args (#16081 ) 1. signatures without order element are wrong 2. signature with one arg is miss 3. group_concat should be NullableAggregateFunction 4. fold constant on fe should not fold NullableAggregateFunction with null arg TODO 1. reorder rewrite rules, and then only forbid fold constant on NullableAggregateFunction with alwaysNullable == true	2023-01-19 11:22:30 +08:00
jakevin	e846e8c0fd	[enhance](Nereids): remove Group constructor for UT. (#16005 )	2023-01-19 11:13:23 +08:00
lihangyu	3894de49d2	[Enhancement](topn) support two phase read for topn query (#15642 ) This PR optimize topn query like `SELECT * FROM tableX ORDER BY columnA ASC/DESC LIMIT N`. TopN is is compose of SortNode and ScanNode, when user table is wide like 100+ columns the order by clause is just a few columns.But ScanNode need to scan all data from storage engine even if the limit is very small.This may lead to lots of read amplification.So In this PR I devide TopN query into two phase: 1. The first phase we just need to read `columnA`'s data from storage engine along with an extra RowId column called `__DORIS_ROWID_COL__`.The other columns are pruned from ScanNode. 2. The second phase I put it in the ExchangeNode beacuase it's the central node for topn nodes in the cluster.The ExchangeNode will spawn a RPC to other nodes using the RowIds(sorted and limited from SortNode) read from the first phase and read row by row from storage engine. After the second phase read, Block will contain all the data needed for the query	2023-01-19 10:01:33 +08:00
Liqf	c7a72436e6	[Feature](multi-catalog)Add support for JuiceFS (#15969 ) The broker implements the interface to juicefs,It supports loading data from juicefs to doris through broker. At the same time, it also implements the multi catalog to read the hive data stored in juicefs	2023-01-19 08:54:16 +08:00
wxy	7288f1f1d4	[Fix](profile) do not send export profile when enable_profile=false. (#15996 )	2023-01-19 08:06:39 +08:00
pengxiangyu	c43edbdfea	[bug](cooldown)fix bug for single cooldown (#16040 ) * fix bug for single cooldown * fix bug for single cooldown	2023-01-19 08:03:32 +08:00
jakevin	76622bcab4	[enhance](FE): remove constructor just used for UT and useless ERROR code (#16080 ) * [enhance](FE): remove constructor just used for UT. * [enhance](FE): remove useless ERROR Code * fix checkstyle	2023-01-19 08:00:48 +08:00
谢健	d8f598eeab	[enhancement](Nereids) add timestampadd, timestampdiff functions (#16072 )	2023-01-19 01:05:25 +08:00
jakevin	2acf634f84	[CleanUp](FE): cleanup useless code in FE. (#16058 )	2023-01-18 22:25:41 +08:00
mch_ucchi	78ba446487	[Enhancement](Nereids) add more clear message when parse failed (#16056 )	2023-01-18 22:19:46 +08:00
jakevin	cbcd5228b7	[enhance](nereids): polish code for mergeGroup(). (#16057 )	2023-01-18 21:03:46 +08:00
谢健	feeb69438b	[opt](Nereids) optimize DistributeSpec generator of OlapScan (#15965 ) use the size of selected partitions instead of olap table partition size to decide whether generate hashDistributeSpec	2023-01-18 20:18:11 +08:00
Drogon	34075368ec	(improvement)[bucket] Add auto bucket implement (#15250 )	2023-01-18 19:50:18 +08:00
AKIRA	0916cbcb10	[ehancement](nereids) Made the parse for named expression more complete (#16010 ) After this PR, we could support such grammar. SELECT SUBSTRING("dddd编", 0, 3) AS "测试"; SELECT SUBSTRING("dddd编", 0, 3) "测试";	2023-01-18 19:44:51 +08:00
Mingyu Chen	4035bd83c3	[fix](jdbc) fix jdbc driver bug and external datasource p2 test case issue (#16033 ) Fix bug that when create jdbc resource with only jdbc driver file name, it will failed to do checksum This is because we forgot the pass the full driver url to JdbcClient. Add ResultSet.FETCH_FORWARD and set AutoCommit to false to jdbc connection, so to avoid OOM when fetching large amount of data set useCursorFetch in jdbc url for both MySQL and PostgreSQL. Fix some p2 external datasource bug	2023-01-18 17:48:06 +08:00
谢健	5265f5142f	[fix](Nereids) add string and character type (#16044 )	2023-01-18 17:27:45 +08:00
谢健	1fa2b662cf	[opt](Nereids) add date_add/sub function (#16048 ) 1. add week_add week_diff function 2. register all date_add/date_diff function	2023-01-18 17:11:44 +08:00
morrySnow	bd0d650c3d	[fix](Nereids) prohibit cross join with on clause (#16035 )	2023-01-18 16:21:01 +08:00
yiguolei	d257059e6b	[refactor](remove hadoop dpp) remove hadoop dpp code since it is not used (#16009 )	2023-01-18 15:01:04 +08:00
starocean999	de0e402e52	[fix](nereids) bucket shuffle join use wrong shuffled column info (#16011 )	2023-01-18 14:44:36 +08:00
minghong	46ce97a190	[enhance](planner)convert 'or' into 'in-predicate' (#15737 ) in previous [PR 12872](https://github.com/apache/doris/pull/12872), we convert multi equals on same slot into `in predicate`. for example, `a =1 or a = 2` => `a in (1, 2)` This pr makes 4 changes about convert or to in: 1. fix a bug: `Not IN` is merged with equal. `a =1 or a not in (2, 3)` => `a in (1, 2, 3)` 2. extends this rule on more cases - merge for more than one slot: 'a =1 or a = 2 or b = 3 or b = 4' => `a in (1, 2) or b in (3, 4)` - merge skip not-equal and not-in: 'a =1 or a = 2 or b !=3 or c not in (1, 2)' => 'a in (1, 2) or b!=3 or c not in (1,2)` 3. rewrite recursively. 4. OrToIn is implemented in ExtractCommonFactorsRule. This rule will generate new exprs. OrToIn should apply on such generated exprs. for example `(a=1 and b=2) or (a=3 and b=4)` => `(a=1 or a=3) and (b=2 or b=4) and [(a=1 and b=2) or (a=3 and b=4)]` => `a in (1,3) and b in (2 ,4) and [(a=1 and b=2) or (a=3 and b=4)]` In addition, this pr add toString() for some Expr.	2023-01-18 12:33:20 +08:00
morrySnow	c0ea9b0b81	[fix](Nereids) running_difference return type is not right (#16028 )	2023-01-18 11:35:02 +08:00
morrySnow	121f4d6ac0	[fix](Nereids) cannot put two same table value function into one memo (#16026 )	2023-01-18 11:32:09 +08:00
jakevin	18f71180ce	[fix](Nereids) avoid same group expression add to one group when do merge (#15999 )	2023-01-18 11:22:18 +08:00
huangzhaowei	40fa5b4019	[fix](MTMV) Show MTMV statement on table raises exceptions (#15882 )	2023-01-18 10:25:33 +08:00
starocean999	96b9115286	[fix](nereids) fix bug of invalid column in olap scan node when a materialized view is selected (#15976 ) if a materialized view is selected, the olap scan node's NonUserVisibleOutput property may contains column from other materialized view. This pr remove invalid column	2023-01-18 01:02:12 +08:00
Adonis Ling	388d623506	[fix](MTMV) Refine the process of refreshing data (#16006 ) 1. Remove some redundant code. 2. Fix the issue with the state of MTMV task. 3. Fix the case - test_create_mtmv. ## Problem summary 1. We used a retry policy to re-run the failed MTMV tasks, but we set the state to `FAILURE` during re-running the tasks. We should do this after all the retry runs fail. 2. There are some redundant code can be removed. 3. In the case test_create_mtmv, we created many background tasks to refresh the data. Some task may fail due to the concurrency and cause the test fail. Actually, we only need single one task to verify the functionality.	2023-01-17 23:08:12 +08:00
starocean999	0c8255d9b8	[fix](nereids)nest loop join should support filter conjuncts like hash join (#15979 )	2023-01-17 20:38:38 +08:00
谢健	3d05ffb10e	[fix](Nereids) add 'integer' as alias of int type (#15983 )	2023-01-17 20:33:26 +08:00
starocean999	e2d145cf5d	[fix](fe)fix anti join bug (#15955 ) * [fix](fe)fix anti join bug * fix fe ut	2023-01-17 20:25:00 +08:00
wxy	061b28b32e	[Fix](profile) fix /rest/v1/query_profile action. (#15981 ) Co-authored-by: wangxiangyu@360shuke.com <wangxiangyu@360shuke.com>	2023-01-17 20:21:48 +08:00
morrySnow	02a7995171	[fix](planner)wrong result when has order by under join (#15974 )	2023-01-17 20:20:56 +08:00
AKIRA	38663526b7	[fix](planner) Keep type of null literal expr when register conjuncts (#15878 ) For now, type information of child expr which is NullLiteral would get lost in the CastExpr#getResultValue, this will produce a NullLiteral with Null type which cause BE core when doing cast	2023-01-17 16:48:02 +08:00
morrySnow	7e4bc1fee6	[fix](Nereids) add a rule to adjust nullable of all expressions (#15791 ) we have some rules that change output's nullable in rewrite step. So we need a rule to adjust nullable at the end of rewrite step. TODO - remove the output slot map - add nullable compare into slot reference - use exprid to compare two slot if do not need to compare nullable - merge all rules into one to adjust all type plans	2023-01-17 15:51:25 +08:00
chenlinzhong	82e2102e18	[fix](MTMV) Exceptions occur when dropping meterialized view with if exists (#15568 )	2023-01-17 15:29:39 +08:00
Gabriel	d062ca2944	[refactor](vectorized) remove unnecessary vectorization check (#15984 )	2023-01-17 12:21:46 +08:00
morrySnow	b469efdb17	[fix](Nereids) all slot in grouping sets in repeat node should be nullable (#15991 ) according to be's code, all slot in grouping set should be nullable. reference to be code (`be3482e6d6/be/src/vec/exec/vrepeat_node.cpp (L113)`)	2023-01-17 11:47:55 +08:00
morrySnow	d98abb12f9	[fix](Nereids)set oepration type coercion is diff with legacy planner (#15982 )	2023-01-17 11:41:41 +08:00
morrySnow	ce1d19b373	[fix](Nereids) lateral view cannot bind function nested in generators (#15960 )	2023-01-17 11:37:56 +08:00
minghong	8d25b156aa	[fix](nereids) bind slot using exactly match (#15950 ) example: unbound slot k bounded [k, t.k] In previous binding algorithm, there are 2 candidate bindings, in which bounded k is exactly matched unbound slot k, it has higher priority than that of t1.k	2023-01-17 11:25:08 +08:00
zhannngchen	b6d9e73c59	[feature](merge-on-write) enable by default (#15920 )	2023-01-17 11:15:42 +08:00
Gabriel	1ea11aa120	[Bug](datev2) Fix wrong cast expr (#15985 ) Found by regression tests when I turn on enable_date_conversion	2023-01-17 10:18:20 +08:00
Mingyu Chen	4b49d05e97	[refactor](fe) remove type related class to fe-common to reduce java-udf jar size (#15808 )	2023-01-17 00:01:15 +08:00
slothever	525f990d2b	[feture-wip](multi-catalog) upgrade iceberg pom version to 1.1.0, for rest catalog api (#15964 ) Co-authored-by: jinzhe <jinzhe@selectdb.com>	2023-01-16 23:10:41 +08:00
zhangdong	899f5f5cf5	[feature](multi-catalog) support hive metastore more events (#15702 ) support hive metastore more events	2023-01-16 14:16:12 +08:00

1 2 3 4 5 ...

3577 Commits