doris

Author	SHA1	Message	Date
Qi Chen	73ad885e19	[Feature][Fix](multi-catalog) Implements transactional hive full acid tables. (#20679 ) After supporting insert-only transactional hive full acid tables #19518, #19419, this PR supports transactional hive full acid tables. Hive3 transactional hive full acid tables are supported. Hive2 transactional hive full acid tables need to run major compactions.	2023-06-13 08:55:16 +08:00
mch_ucchi	6652287b52	[Fix](regression-test) fix unstable test case nereids_p0/update (#20692 )	2023-06-12 20:55:22 +08:00
LiBinfeng	c25c19bddc	[test](regression) Add cases to test join condition push and not like (#20453 ) Add testing cases to issue #19613	2023-06-12 18:26:23 +08:00
zhangdong	daf18a4b0e	[fix](MTMV) Support refreshing data manually (#20108 )	2023-06-12 17:57:06 +08:00
jiawei liang	99c0592157	[Feature](array-function) Support array_pushback function #17417 (#19988 ) Implement array_pushback. mysql> select array_pushback([1, 2], 3); +--------------------------------+ \| array_pushback(ARRAY(1, 2), 3) \| +--------------------------------+ \| [1, 2, 3] \| +--------------------------------+ 1 row in set (0.01 sec)	2023-06-12 16:51:12 +08:00
Xujian Duan	0b228b3414	[fix](load)Support load json data with default value (#20624 ) * support json default value --------- Co-authored-by: duanxujian <duanxujian@jd.com>	2023-06-12 14:51:31 +08:00
zxealous	10134ea8c6	[fix](planner) fix RewriteInPredicateRule may be useless (#20668 ) Issue Number: close #20669 RewriteInPredicateRule may cast both children of an InPredicate expr to the same type. For example, in where cast(age as char) in ('11'), where the type of age is int, RewriteInPredicateRule casts both children of the expr to int. In this example, child 0 has the following structure: ``` child 0: type: int \|--- child: type : char \|-- child: type : int ``` Because RewriteInPredicateRule casts the type of the expr to int, the stmt is reanalyzed, but the stmt is reset before it is reanalyzed, and the reset changes child 0 to the following structure: ``` child: type : char \|-- child: type : int ``` As a result, both children are cast to varchar in func castAllToCompatibleType, and the logic of RewriteInPredicateRule becomes useless. In 1.1-lts and 1.2-lts, a case such as "where cast(age as char) in ('11')" does not work, because func castAllToCompatibleType casts int to char but int can't be cast to char (master works because func castAllToCompatibleType casts int to varchar in this case). ``` MySQL [test]> select user_id from test_cast where cast(age as char) in ('45'); ERROR 1105 (HY000): errCode = 2, detailMessage = type not match, originType=INT, targeType=CHAR(*) ```	2023-06-12 14:39:01 +08:00
Mingyu Chen	f90d5dbacf	[fix](test) fix unstable dynamic partition regression test (#20674 ) Add to define variable with def keyword	2023-06-12 14:28:30 +08:00
Pxl	7f8c5c81e7	[Feature](agg_state) support agg_state combinator on nereids (#20164 ) support agg_state combinator on nereids	2023-06-12 12:49:26 +08:00
starocean999	bcc37c9405	[fix](planner)the common type of floating and decimal should be floating type (#20634 ) * [fix](planner)the common type of floating and decimal should be floating type * fix test cases	2023-06-12 11:32:23 +08:00
GoGoWen	4c340f2851	[Feature] (Multi-Catalog) support query hll column in doris jdbc table - part 1 (#19413 ) Issue Number: close #17895	2023-06-12 11:16:19 +08:00
Pxl	ab7ac31d89	[Chore](case) fix failed on test_big_pad when enable pipeline engine #20644	2023-06-12 09:15:55 +08:00
Xinyi Zou	a347063390	[fix](case expr) fix coredump of case for null value 2 (#20635 ) fix coredump of case for null value 2	2023-06-11 23:08:53 +08:00
catpineapple	c9b08d5c20	[feature](planner) multi partition create by integer column (#19597 ) Create partitions using: ``` PARTITION BY RANGE(integer_col)( FROM (10) TO (1000) INTERVAL 50 ) ```	2023-06-11 22:42:21 +08:00
zhannngchen	ca1e2ddf43	[fix](regression) tests in unique_with_mow_p0/partial_update are flaky (#20633 )	2023-06-11 13:51:49 +08:00
AlexYue	8a2e0504e4	[chore](coldHeatCases) drop table first to enhance robustness (#20629 )	2023-06-11 13:51:05 +08:00
minghong	def6a8ec94	[regression](nereids) check tpch sf1T and sf500 plan shape on 3 BE environment #20610	2023-06-09 22:46:40 +08:00
YueW	656b9ad3da	[enhancement](index) Nereids support no need to read raw data for index column that only in filter conditions (#20605 )	2023-06-09 21:54:48 +08:00
Jack Drogon	70819fae22	[feature](alter) Add AlterDatabasePropertyStmt binlog impl (#20550 )	2023-06-09 17:29:21 +08:00
xueweizhang	a1a587fec6	[fix](replay) fix truncate partition name need case insensitive (#20098 ) The partition name in truncate table needs to be case insensitive.	2023-06-09 09:34:55 +08:00
Mryange	4c6df9062e	[fix](DECIMALV3)fix cumulative precision when literal and DECIMALV3 operations in Legacy (#20354 ) The precision handling for division with DECIMALV3 is as follows (excluding cases where division increases precision): (p1, s1) / (p2, s2) ----> (p1 + s2, s1) However, because division loses precision, the precision of the left operand is increased first: (p1, s1) / (p2, s2) =====> (p1 + s2, s1 + s2) / (p2, s2) ----> (p1 + s2, s1) However, the legacy optimizer repeats the analyze and substitute steps for an expression, which can result in the accumulation of precision: (p1, s1) / (p2, s2) =====> (p1 + s2, s1 + s2) / (p2, s2) =====> (p1 + s2 + s2, s1 + s2 + s2) / (p2, s2) To address this, the previous approach was to forcibly convert the left operand of DECIMALV3 calculations. This results in rewriting the expression as: (p1, s1) / (p2, s2) =====> cast((p1, s1) as (p1 + s2, s1 + s2)) / (p2, s2) Then, during the substitution step, a check is performed. If it is a cast expression, the expression modified by the cast is extracted: cast((p1, s1) as (p1 + s2, s1 + s2)) =====> (p1, s1) protected Expr substituteImpl(ExprSubstitutionMap smap, ExprSubstitutionMap disjunctsMap, Analyzer analyzer) { if (isImplicitCast()) { return getChild(0).substituteImpl(smap, disjunctsMap, analyzer); } This way, there won't be repeated analysis, preventing the continuous increase in precision. However, if the left expression is a constant (literal), theoretically, the precision would continue to increase. Unfortunately, the code that was removed in this PR (#19926) obscured this issue: for (Expr child : children) { if (child instanceof DecimalLiteral && child.getType().isDecimalV3()) { ((DecimalLiteral)child).tryToReduceType(); } } That code attempts to reduce the precision of literals in the expressions. However, it can cause a bug like the following: mysql [test]>select cast(1 as DECIMALV3(16, 2)) / cast(3 as DECIMALV3(16, 2)); +-----------------------------------------------------------+ \| CAST(1 AS DECIMALV3(16, 2)) / CAST(3 AS DECIMALV3(16, 2)) \| +-----------------------------------------------------------+ \| 0.00 \| +-----------------------------------------------------------+ 1.00 / 3.00, due to reduced precision, becomes 1 / 3.	2023-06-09 08:58:55 +08:00
Qi Chen	845d459f05	[Fix](orc-reader) Fix some bugs of orc lazy materialization. (#20410 ) Fix some bugs of orc lazy materialization(#18615) - Fix issue causing column size to continuously increase after `execute_conjuncts()` by calling `Block::erase_useless_column()`. - Fix partition issues of orc lazy materialization. - Fix lazy materialization not being used when the predicate column is inconsistent with the orc file.	2023-06-09 08:53:01 +08:00
lihangyu	234be0c517	[regression-test](test_point_query) fix output (#20604 )	2023-06-09 00:13:18 +08:00
TengJianPing	dd71e101d3	[fix](case expr) fix coredump of case for null value (#20564 ) BE coredumps when the when expr is null.	2023-06-08 20:05:23 +08:00
LiBinfeng	a759b6535b	[test](regression) Add cases to test cast function substitution (#20481 ) This mirrors PR #20479. Master does not have this problem, but the test cases still need to be added.	2023-06-08 19:56:51 +08:00
Qi Chen	4faee4d8fd	[Fix](multi-catalog) Fix be crashed when query hive table after schema changed(new column added). (#20537 ) Fix be crashed when query hive table after schema changed(new column added). Regression Test: test_hive_schema_evolution.groovy	2023-06-08 18:10:36 +08:00
mch_ucchi	41d7c535f2	[fix](regression-test) add sync after insert into table for nereids case (#20516 )	2023-06-08 17:52:36 +08:00
Pxl	a56449f86e	[Bug](Agg-state) try to make test_agg_state stable (#20574 ) try to make test_agg_state stable	2023-06-08 17:17:51 +08:00
Pxl	5fe7106b83	[Bug](planner) fix pre condition check fail on max(null) (#20509 ) fix pre condition check fail on max(null)	2023-06-08 14:49:52 +08:00
lihangyu	24fb05ec83	[Bug](row-store) Fix row store with materialize index (#20356 ) If a query hits a materialized view that has row storage enabled, but the row storage column is not present in the materialized view, it will result in a query crash. Therefore, it is necessary to include the row storage column when creating the materialized view, and serialize the row storage column during the execution of SchemaChange.	2023-06-08 10:55:22 +08:00
starocean999	d2d6ce5d0b	[fix](nereids) add push down filter and project through cte anchor rules (#20547 ) We should not plan any Filter or Project above CteAnchor, because there is sometimes a project or filter under the anchor, and the whole plan then cannot be translated into a valid plan for BE.	2023-06-08 10:34:42 +08:00
Gabriel	325ddab34e	[conf](pipeline) turn pipeline on by default (#20458 )	2023-06-08 09:20:51 +08:00
bobhan1	187bf14d81	[feature-wip](auto-inc)(step-1) add syntax support for duplicate table (#20284 ) Co-authored-by: yifeng <cnissnzg@126.com>	2023-06-07 22:01:28 +08:00
zhannngchen	53970192aa	[fix](regression) unique_with_mow_p2/test_pk_uk_case (#20497 )	2023-06-07 21:34:34 +08:00
Pxl	36216f0925	[Bug](Agg-State) fix coredump when state combinator input const column (#20510 ) fix coredump when state combinator input const column	2023-06-07 11:28:55 +08:00
starocean999	cd70c37402	[fix](nereids) filter and project node should be pushed down through cte (#20508 ) 1. Move PushdownFilterThroughCTEAnchor and PushdownProjectThroughCTEAnchor into the PUSH_DOWN_FILTERS rule set. 2. Move PushdownFilterThroughProject before MergeProjectPostProcessor.	2023-06-07 10:36:32 +08:00
Jerry Hu	49f8f20fb1	[fix](regex) String with Chinese characters matching failed (#20493 )	2023-06-07 07:27:47 +08:00
AlexYue	a68afd0672	[fix](cooldown) fix bug due to tablets info changed (#20465 )	2023-06-06 22:15:17 +08:00
mch_ucchi	05bdbce8fc	[Feature](Nereids) support update unique table statement (#20313 )	2023-06-06 20:32:43 +08:00
zgxme	61d9bd2ba1	[fix](regression) fix export file test cases (#20463 )	2023-06-06 20:07:31 +08:00
camby	7df8459e21	[fix](regression-test) add retry time to avoid regression test failed (#20487 ) After alter table ${tbl} set('dynamic_partition.end'='5'), dynamic partitions are now added asynchronously, so the test needs to wait for the dynamic scheduler.	2023-06-06 15:50:11 +08:00
Chengpeng Yan	ae428c29e2	[feature](planner)(nereids) support user defined variable (#20334 ) Support user-defined variables. After this PR, we can use `set @a = xx` to define a user variable and use it in a query like `select @a`. The changes in this PR: 1. Support the grammar for `set user variable` in the parser. 2. Add `userVars` to `VariableMgr` to store the user-defined variables. 3. For `set @a = xx`, store the variable name and its value in the `userVars` of `VariableMgr`. 4. For `select @a`, look up the value for the variable name in `userVars`.	2023-06-06 14:35:16 +08:00
amory	1f032a551d	[Improve](array-functions) support array first function (#20397 ) add array_first(lambda, [1,2,3,null]) function for doris	2023-06-06 12:08:46 +08:00
TengJianPing	1b94b6368f	[fix](load) in strict mode, return error for insert if datatype convert fails (#20378 ) * [fix](load) in strict mode, return error for load and insert if datatype convert fails Revert "[fix](MySQL) the way Doris handles boolean type is consistent with MySQL (#19416)" This reverts commit 68eb420cabe5b26b09d6d4a2724ae12699bdee87. That commit changed other behaviours: e.g. in strict mode, insert into t_int values ("a") inserts 0 into the table, but it should return an error instead. * fix be ut * fix regression tests	2023-06-06 12:04:03 +08:00
morrySnow	e553615a27	[opt](Nereids) prefer to use datev2 / datetimev2 in date related functions (#20224 ) 1. Update the signature order of all date-related functions. 1.1. If the return value needs time info to be computed, signatures with datetimev2 args go at the top of the list, followed by datev2, datetime and date. 1.2. If the return value needs only date info to be computed, signatures with datev2 args go at the top of the list, followed by datetimev2, date and datetime. 2. Prefer datev2 if we must cast date to datev2 or datetime/datetimev2.	2023-06-06 11:42:29 +08:00
Yang, Xu	d02737a293	[feature](struct-type) support struct_element function (#19045 ) This commit supports a function that returns a field column of a named struct column. Since the function can return any type, this commit also supports ANY_STRUCT_TYPE and ANY_ELEMENT_TYPE.	2023-06-06 10:44:08 +08:00
Mingyu Chen	f839c90c27	[fix][refactor](backend-policy)(compute) refactor the hierarchy of external scan node and fix compute node bug #20402 There should be 2 kinds of ScanNode: OlapScanNode and ExternalScanNode. The Backends used for ExternalScanNode should be controlled by FederationBackendPolicy. But currently, only FileScanNode is controlled by FederationBackendPolicy; other scan nodes such as MysqlScanNode and JdbcScanNode will use Mix Backend even if we enable and prefer to use Compute Backend. In this PR, I modified the hierarchy of ExternalScanNode. The new hierarchy is: ScanNode OlapScanNode SchemaScanNode ExternalScanNode MetadataScanNode DataGenScanNode EsScanNode OdbcScanNode MysqlScanNode JdbcScanNode FileScanNode FileLoadScanNode FileQueryScanNode MaxComputeScanNode IcebergScanNode TVFScanNode HiveScanNode HudiScanNode Previously, the BackendPolicy was a member of FileScanNode; now I moved it to ExternalScanNode, so that all subtypes of ExternalScanNode can use BackendPolicy to choose Compute Backend to execute the query. All ExternalScanNode subtypes should implement the abstract method createScanRangeLocations(). For scan nodes like jdbc scan node/mysql scan node, the scan range locations will be selected randomly from compute nodes (if preferred). For compute node selection: if all scan nodes are external scan nodes, and prefer_compute_node_for_external_table is set to true, the BE for this query will only select compute nodes.	2023-06-06 10:35:30 +08:00
zclllyybb	378ffa133e	[fix](regression-test) Add lost ddl file for tpcds_sf1_p2 #20288	2023-06-06 09:57:38 +08:00
Kaijie Chen	6c96e1dc9f	[fix](regression) add sync after streamload in test_stream_load (#20425 ) Add sync after streamload in test_stream_load to fix following error: Exception in load_p0/stream_load/test_stream_load.groovy(line 180): throw exception } log.info("Stream load result: ${result}".toString()) def json = parseJson(result) assertEquals("success", json.Status.toLowerCase()) assertEquals(1, json.NumberTotalRows) assertEquals(0, json.NumberFilteredRows) } } order_qt_sql1 " SELECT * FROM ${tableName2}" ^^^^^^^^^^^^^^^^^^^^^^^^^^ERROR LINE^^^^^^^^^^^^^^^^^^^^^^^^^^ // test common case def tableName3 = "test_all" def tableName4 = "test_less_col" def tableName5 = "test_bitmap_and_hll" def tableName6 = "test_unique_key" def tableName7 = "test_unique_key_with_delete" def tableName8 = "test_array" def tableName10 = "test_struct" sql """ DROP TABLE IF EXISTS ${tableName3} """ Exception: java.lang.IllegalStateException: Check tag 'sql1' failed: Check tag 'sql1' failed, line 1 mismatch, real line is empty, but expect is 2019 9 9 9 7.700 a 2019-09-09 1970-01-01T08:33:39 k7 9.0 9.0 sql: SELECT * FROM load_nullable_to_not_nullable	2023-06-06 08:32:25 +08:00
yangshijie	0a90a9d507	[feature-wip](duplicate_no_keys) Add some test cases of all the duplicate tables in test case tpcds_sf100_dup_without_key_p2 and make them duplicate tables without keys (#20431 )	2023-06-05 21:04:41 +08:00

1711 Commits