doris

Author	SHA1	Message	Date
zhangstar333	43c6428aea	[Function](string) support sub_replace function (#13736 ) * [Function](string) support sub_replace function * remove conf	2022-10-28 08:40:08 +08:00
carlvinhust2012	36053d2419	[fix](array-type) fix the be core dump when select the invalid array format (#13514 ) 1. this pr is used to fix the be core dump when select the invalid array. 2. before the change, we run "select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]');" will cause be core dump. MySQL [example_db]> select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]'); ERROR 1105 (HY000): RpcException, msg: io.grpc.StatusRuntimeException: UNAVAILABLE: Network closed for unknown reason 3. after the change, we run "select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]');" will get error message. MySQL [example_db]> select array_intersect([1, 2, 3, 1, 2, 3], '1[3, 2, 5]'); errCode = 2, detailMessage = No matching function with signature: array_intersect(array<tinyint(4)>, varchar(-1))" Co-authored-by: hucheng01 <hucheng01@baidu.com>	2022-10-27 23:11:12 +08:00
Ashin Gau	45b31506c7	[improvement](delete) support delete from partitioned table without partition specified (#13533 ) Support delete from partitioned table without partition specified in [DELETE] stmt. ## Usage If it is a partitioned table, you can specify a partition. If not specified, Doris will infer partition from the given conditions. In two cases, Doris cannot infer the partition from conditions: 1) the conditions do not contain partition columns; 2) The operator of the partition column is `not in`. When a partition table does not specify the partition, or the partition cannot be inferred from the conditions, the session variable `delete_without_partition` needs to be `true` to make delete statement be applied to all partitions. ## Test case Test case is added in `regression-test/suites/delete_p0/test_delete_from_partition.groovy`, user can delete from partitioned table without partition specified now.	2022-10-27 21:32:45 +08:00
camby	738da0b139	[bugfix](join) inner join return wrong result (#13608 ) * bug fix for vhash join * add regression test Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-10-27 11:48:41 +08:00
starocean999	c874931ac8	[fix](join)output all value from no-null side of outer join (#13655 ) * [fix](join)output all value from no-null side of outer join * add regression test	2022-10-27 10:48:36 +08:00
Mingyu Chen	7557980d64	[improvement](regression-test) avoid query empty result after loading finished (#13682 ) When running regression tests, we always found that the query returned an empty result after loading finished, even if we called "sync" before the query. This is because for `stream load`, the load task result is returned immediately after the txn's status changes to VISIBLE, but before the edit log is written. So if we run the query right after getting the load task result, we may not see the latest loaded data. The same issue applies to the `insert` operation.	2022-10-27 09:47:18 +08:00
Mingyu Chen	5bd66243ee	[minor](log) remove some unused logs (#13689 ) 1. When running regression test with specific suites or group, do not print other suite name or file name 2. Remove unused alter table job log.	2022-10-27 09:37:32 +08:00
camby	bed759b3f5	[Fix](array-type) support CTAS for ARRAY column from collect_list and collect_set (#13627 ) Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-10-26 19:42:15 +08:00
luozenglin	3548d0b824	[fix](statistics) fix cross join statistics exception (#13645 )	2022-10-26 14:10:57 +08:00
Tiewei Fang	c418bbd2d1	[feature-wip](new-scan) support Json reader (#13546 ) Issue Number: close #12574 This pr adds `NewJsonReader` which implements GenericReader interface to support read json format file. TODO: 1. modify `_scann_eof` later. 2. Rename `NewJsonReader` to `JsonReader` when `JsonReader` is deleted.	2022-10-26 12:52:21 +08:00
morrySnow	15130c469f	[fix](planner) cannot recognize column's table when analyze rewrite expr (#13597 ) We save mv column with alias as table name, and search it with original table name.	2022-10-26 11:15:48 +08:00
minghong	e5b33abd3c	[fix](planner) inlineView alias error (#13600 )	2022-10-26 10:14:04 +08:00
Yongqiang YANG	295d887cf5	[improvement](thread) set name for priority thread pool (#13552 )	2022-10-26 09:32:15 +08:00
Gabriel	e00734348b	[Chore](regression) Fix wrong result for decimal (#13644 )	2022-10-26 09:24:46 +08:00
Mingyu Chen	6f18726f01	[improvement](test) add sync for test_agg_keys_schema_change_datev2 (#13643 ) 1. add "sync" to avoid some potential meta sync problem when running regression test on multi-node cluster 2. Use /tmp dir as dest dir of outfile test, to avoid "No such file or directory" error.	2022-10-25 22:29:05 +08:00
yinzhijian	f209b7ab6e	[fix](Nereids) add exchange node check between local and global agg in plan translator (#12913 ) ### table schema CREATE TABLE `t1` ( `k1` int(11) NULL, `v1` int(11) NULL ) ENGINE=OLAP DUPLICATE KEY(`k1`, `v1`) COMMENT 'OLAP' DISTRIBUTED BY HASH(`k1`) BUCKETS 3 PROPERTIES('replication_num'='1') ### query select k1,count(distinct v1+1) from t1 group by k1; ### error java.lang.ClassCastException: org.apache.doris.planner.OlapScanNode cannot be cast to org.apache.doris.planner.AggregationNode	2022-10-25 16:55:29 +08:00
starocean999	e103531e69	[fix](sort)order by constant expr bug (#13613 ) Issue Number: close (#13350)	2022-10-25 16:43:18 +08:00
zhengyu	b85c78ee00	[fix](regression) add 'if not exists' to 'create table' to support parallel test (#13576 ) (#13578 ) Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>	2022-10-25 16:37:07 +08:00
lihangyu	235c105554	[feature-array](array-type) Add array function array_enumerate (#13612 ) Add array function array_enumerate	2022-10-25 15:12:11 +08:00
Mingyu Chen	de5bc6a8a5	[fix](regression-test) set label for stream load (#13620 )	2022-10-25 14:13:24 +08:00
lsy3993	f802fc37ff	add date function 'last_day' (#13609 )	2022-10-25 13:46:16 +08:00
Yongqiang YANG	57f479d9d2	[fix](test) let cases use their own table name (#13602 )	2022-10-25 13:45:01 +08:00
Gabriel	7fe7c01125	[Bug](decimal) Fix incorrect result for decimal multiply (#13591 ) Fix incorrect result for decimal multiply	2022-10-25 12:08:49 +08:00
starocean999	7faad9f004	[FIX](agg)fix group by constant child expr bug (#13485 )	2022-10-24 16:32:36 +08:00
starocean999	40e122e5ef	[fix](join)the build and probe expr should be calculated before converting input block to nullable (#13436 ) * [fix](join)the build and probe expr should be calculated before converting input block to nullable * remove_nullable can be called on const column	2022-10-24 14:50:06 +08:00
xy720	177e82bdab	[Enhancement](array-type) Add type derivation for array functions (#13534 ) Until now, type derivation was not supported for array function arguments, so the cases below return wrong values or even cause a BE core dump. mysql> select array_union([1],[10000000]); +----------------------------------------+ \| array_union(ARRAY(1), ARRAY(10000000)) \| +----------------------------------------+ \| [1, -128] \| +----------------------------------------+ 1 row in set (0.03 sec) mysql> select array_union([NULL],[1]); ERROR 1105 (HY000): RpcException, msg: io.grpc.StatusRuntimeException: UNAVAILABLE: Network closed for unknown reason mysql> select array_union([],[1]); ERROR 1105 (HY000): RpcException, msg: io.grpc.StatusRuntimeException: UNAVAILABLE: Network closed for unknown reason This commit makes a small fix to derive the argument types of array functions: 1. For null type in arguments, cast the null type to boolean type, because null type should not be seen in BE. 2. For different types in arguments, cast all arguments to their compatible type.	2022-10-24 11:51:47 +08:00
luozenglin	e17c2416f0	[fix](join) fix be core dump when using right join with other join predicates (#13511 )	2022-10-24 10:35:07 +08:00
Mingyu Chen	4b5a2c1a65	[fix](export)(outfile) fix bug that export may fail when writing SUCCESS file (#13574 )	2022-10-23 13:02:49 +08:00
Gabriel	a7c221d04e	[Bug](sort) Fix bug in string sorter (#13548 )	2022-10-22 21:26:23 +08:00
Gabriel	c1cce29b20	[chore](regression) modify duplicate table name for regression cases (#13527 )	2022-10-21 21:37:13 +08:00
carlvinhust2012	a555f45834	[fix](array-type) fix the wrong result of array_join function (#13477 ) this pr is used to fix the wrong result of array_join function. before the change, the array_join function will return wrong result. MySQL [example_db]> select array_join(["", "1", "2"], ''); +--------------------------------------+ \| array_join(ARRAY('', '1', '2'), '') \| +--------------------------------------+ \| 1_2 \| +--------------------------------------+ 3.after the change, the array_join function will return correct result. MySQL [example_db]> select array_join(["", "1", "2"], ''); +--------------------------------------+ \| array_join(ARRAY('', '1', '2'), '') \| +--------------------------------------+ \| _1_2 \| +--------------------------------------+ Issue Number: #7570	2022-10-21 17:36:44 +08:00
Zhengguo Yang	3e92f742bf	[Bugfix](MV) Fix insert negative value to table with bitmap_union MV will cause count distinct result incorrect (#13507 )	2022-10-21 16:07:31 +08:00
Mingyu Chen	847b80ebfa	[test](jdbc) add jdbc and hive regression test (#13143 ) 1. Modify default behavior of `build.sh` The `BUILD_JAVA_UDF` is default ON, so that jvm is needed for compilation and runtime. 2. Add docker-compose for MySQL 5.7, PostgreSQL 14 and Hive 2 See `docker/thirdparties/docker-compose`. 3. Add some regression test cases for jdbc query on MySQL, PG and Hive Catalog The default is `false`, if set to true, you need first start docker for MySQL/PG/Hive. 4. Support `if not exists` and `if exists` for create/drop resource and create/drop encryptkey	2022-10-21 15:29:27 +08:00
Pxl	88ceace855	[Bug](predicate) fix core dump on bool type runtime filter (#13417 ) fix core dump on bool type runtime filter	2022-10-21 13:15:22 +08:00
zhangstar333	3ca8bfaf30	[Function](array) support array_difference function (#13440 )	2022-10-21 10:57:37 +08:00
Mingyu Chen	3e168c87c6	[improvement](regression-test) wait for publish timeout of stream load (#13531 )	2022-10-21 10:11:03 +08:00
Gabriel	9a3c1f0867	[Improvement](decimal) print decimal according to the real precision and scale (#13437 )	2022-10-21 10:00:01 +08:00
camby	1f7829e099	[Fix](array-type) bugfix for array column with delete condition (#13361 ) Fix for SQL with array column: delete from tbl where c_array is null; more info please refer to #13360 Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-10-21 09:29:02 +08:00
Xin Liao	27d84eafc5	[feature](alter) support rename column for table with unique column id (#13410 )	2022-10-21 08:45:34 +08:00
Kikyou1997	4ae777bfc5	[fix](Nereids) NPE caused by GroupExpression has null owner group when choosing best plan (#13252 )	2022-10-20 22:23:36 +08:00
ChPi	1e774036f1	[fix](function)fix be coredump when using json_object function (#13443 )	2022-10-20 17:32:37 +08:00
Mingyu Chen	32b1456b28	[feature-wip](array) remove array config and check array nested depth (#13428 ) 1. remove FE config `enable_array_type` 2. limit the nested depth of array in FE side. 3. Fix bug that when loading array from parquet, the decimal type is treated as bigint 4. Fix loading array from csv(vec-engine), handle null and "null" 5. Change the csv array loading behavior, if the array string format is invalid in csv, it will be converted to null. 6. Remove `check_array_format()`, because its logic is wrong and meaningless 7. Add stream load csv test cases and more parquet broker load tests	2022-10-20 15:52:31 +08:00
Gabriel	3c837a9bdd	[regression](load) modify variable definition (#13506 )	2022-10-20 14:07:53 +08:00
Dongyang Li	8637ac1ca3	[regression](framework)set random parallel_fragment_exec_instance_num… (#13383 ) Some problems have been found when parallel_fragment_exec_instance_num > 1. Set a random parallel_fragment_exec_instance_num value for each query to cover more situations.	2022-10-20 10:02:27 +08:00
xiaojunjie	4996eafe74	[bugfix](VecDateTimeValue) eat the value of microsecond in function from_date_format_str (#13446 ) * [bugfix](VecDateTimeValue) eat the value of microsecond in function from_date_format_str * add sql based regression test Co-authored-by: xiaojunjie <xiaojunjie@baidu.com>	2022-10-20 09:02:33 +08:00
camby	9ac4cfc9bb	[bugfix](array-type) ColumnDate lost is_date_type after cloned (#13420 ) Problem: the IColumn::is_date property is lost after ColumnDate::clone is called. Fix: after ColumnDate is created, also set IColumn::is_date. Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-10-19 21:29:36 +08:00
Gabriel	c4b5ba2a4f	[Regression](java-udf) Move source code used by Java UDF test case (#13476 )	2022-10-19 21:05:06 +08:00
Mingyu Chen	5423de68dd	[refactor](new-scan) remove old file scan node (#13433 ) All these files are not used anymore, can be removed.	2022-10-19 14:25:32 +08:00
luozenglin	c449028a5f	[fix](year) fix `year()` results are not as expected (#13426 ) fix `year()` results are not as expected	2022-10-19 11:28:00 +08:00
zy-kkk	8a068c8c92	[function](string_function) add new string function 'not_null_or_empty' (#13418 )	2022-10-19 11:10:37 +08:00


509 Commits