doris

Author	SHA1	Message	Date
minghong	eb65cc6954	[Fix](nereids) eliminate_outer_join regression case fix #24262	2023-09-14 18:22:17 +08:00
bobhan1	3ee89aea35	[Feature](merge-on-write)Support ignore mode for merge-on-write unique table (#21773 )	2023-09-14 18:03:51 +08:00
starocean999	d035a58374	[feature](nereids) support unnest subquery in LogicalOneRowRelation (#24355 ) select (select 1); before : ERROR 1105 (HY000): errCode = 2, detailMessage = Subquery is not supported in the select list. after: mysql> select (select 1); +---------------------------------------------------------------------+ \| (SCALARSUBQUERY) (LogicalOneRowRelation ( projects=[1 AS `1`#0] )) \| +---------------------------------------------------------------------+ \| 1 \| +---------------------------------------------------------------------+ 1 row in set (0.61 sec)	2023-09-14 17:22:08 +08:00
zclllyybb	4fbb25bc55	[Enhancement](function) Support date_trunc(date) and use it in auto partition (#24341 ) Support date_trunc(date) and use it in auto partition	2023-09-14 16:53:09 +08:00
zclllyybb	b6d7116dea	[fix](datetime) fix compare of DatetimeLiteral (#24343 ) fix compare of DatetimeLiteral	2023-09-14 16:51:50 +08:00
starocean999	ccba5a729a	[fix](planner)cast string to float like type should return NULL literal if it fails (#24222 )	2023-09-14 15:59:20 +08:00
amory	268c867679	[Improve](serde)replace function_cast from_string to serde (#24087 ) Now we can not support streamload with column which is map/array nested map/array serde can do this now , so we can replace it Notice. if item data in complex type data is empty we just return error, instead of makeup default value , because now we can not define right default for complex type	2023-09-14 13:53:16 +08:00
jakevin	d23d1870a2	[fix](Nereids): fix regression-test (#24329 )	2023-09-14 13:21:50 +08:00
zzzxl	ed108d48fa	[fix](invert index) fix query use char filter (#24268 )	2023-09-14 11:42:47 +08:00
morrySnow	46f5988245	[fix](Nereids) set operation children output order not same (#24060 ) we generate project for all set operation's children to ensure the order of all children are not changed. However, some rules, such as PushDownProjectThroughLimit could remove these projects involuntarily. When it happen, the column order is wrong and lead to BE core dump. This PR use a new variable in SetOperation to save the output order of children of set operation. Then the children's output order could be changed and never affect to SetOperation at all.	2023-09-14 11:09:58 +08:00
zhangstar333	9b7f041bea	[Bug](function) fix explode_json_array_int can't handle min/max values (#24284 ) the json str get value maybe beyond max/min of Int64, so add some check to limit the value, and return the max/min of Int64	2023-09-14 09:20:59 +08:00
jakevin	93a9f1007c	[fix](Nereids): fix regression test (#24336 ) fix failed regression test by #23842	2023-09-14 01:55:09 +08:00
qiye	11afd321cb	[fix](es catalog) fix issue with select and insert from es catalog core (#24318 ) Issue Number: close #24315 The root cause of this issue is that Elasticsearch's long type allows inserting floats and strings. Doris did not handle these cases when doing type conversion. The current strategy is to take the integer before the decimal point if a float or string is found.	2023-09-13 23:07:31 +08:00
Ashin Gau	d5b490b2e7	[test](regression) add file cache regression test (#24192 ) Add file cache regression test in tpch 1g on orc&parquet format. tpch will run 3 times: 1. running without file cache 2. running with file cache for the first time 3. running with file cache for the second time The file cache configuration is already added in `be/conf/be.conf` on the regression test environment, and the available capacity is 100MB. After running the tpch 1g test, the metrics introduced by https://github.com/apache/doris/pull/19177 is like: ``` doris_be_file_cache_normal_queue_curr_size{path="/mnt/datadisk1/gaoxin/file_cache"} 92808933 doris_be_file_cache_normal_queue_curr_elements{path="/mnt/datadisk1/gaoxin/file_cache"} 59 doris_be_file_cache_normal_queue_max_elements{path="/mnt/datadisk1/gaoxin/file_cache"} 102400 doris_be_file_cache_normal_queue_max_size{path="/mnt/datadisk1/gaoxin/file_cache"} 89128960 doris_be_file_cache_removed_elements{path="/mnt/datadisk1/gaoxin/file_cache"} 2132 doris_be_file_cache_segment_reader_cache_size{path="/mnt/datadisk1/gaoxin/file_cache"} 54 ```	2023-09-13 22:59:01 +08:00
Tiewei Fang	9847f7789f	[Feature](Export) `Export` sql supports to export data of `view` and `exrernal table` (#24070 ) Previously, EXPORT only supported the export of the olap table, This pr supports the export of view table and external table.	2023-09-13 22:55:19 +08:00
jakevin	d7e5f97b74	[feature](Nereids): eliminate AssertNumRows (#23842 )	2023-09-13 22:24:02 +08:00
Mryange	07dd6830e8	[pipelineX](refactor) add union node in pipelineX (#24286 )	2023-09-13 20:39:58 +08:00
starocean999	231038f050	[fix](planner)allow infer predicate for external table (#24227 ) CREATE EXTERNAL TABLE `dim_server` ( `col1` varchar(50) NOT NULL, `col2` varchar(50) NOT NULL ) create view ads_oreo_sid_report ( `col1` , `col2` ) AS select tmp.col1,tmp.col2 from ( select 'abc' as col1,'def' as col2 ) tmp inner join dim_server ds on tmp.col1 = ds.col1 and tmp.col2 = ds.col2; select * from ads_oreo_sid_report where col1='abc' and col2='def'; before this pr, col1='abc' and col2='def' can't be pushed to dim_server. now the 2 predicates can be pushed to odbc table.	2023-09-13 17:22:39 +08:00
谢健	335064f897	[feature](Nereids) add lambda argument and array_map function (#23598 ) add array_map function SELECT ARRAY_MAP(x->x+1, ARRAY(87, 33, -49)) +----------------------------------------------------------------------+ \| array_map([x] -> (x + 1), x#1 of array(87, 33, -49)) \| +----------------------------------------------------------------------+ \| [88, 34, -48] \| +----------------------------------------------------------------------+	2023-09-13 14:24:16 +08:00
daidai	e30c3f3a65	[fix](csv_reader)fix bug that Read garbled files caused be crash. (#24164 ) fix bug that read garbled files caused be crash.	2023-09-13 14:12:55 +08:00
daidai	ebe3749996	[fix](tvf)support s3,local compress_type and append regression test (#24055 ) support s3,local compress_type and append regression test.	2023-09-13 00:32:59 +08:00
Qi Chen	9df72a96f3	[Feature](multi-catalog) Support hadoop viewfs. (#24168 ) ### Feature Support hadoop viewfs. ### Test - Regression tests: - hive viewfs test. - tvf viewfs test. - Broker load with broker and with hdfs tests manually.	2023-09-13 00:20:12 +08:00
Mingyu Chen	c402d48f97	[fix](query-cache) fix query cache with empty set (#24147 ) If the query result set is empty, the query cache will not cache the result. This PR fix it.	2023-09-12 20:11:20 +08:00
zclllyybb	d3f1388717	[Feature](partitions) Support auto-partition (#24153 ) Co-authored-by: zhangstar333 <2561612514@qq.com>	2023-09-12 15:23:15 +08:00
TengJianPing	4bb9a12038	[function](bitmap) support bitmap_remove (#24190 )	2023-09-12 14:52:04 +08:00
bobhan1	6913d68ba0	[Enhancement](merge-on-write) use delete bitmap to mark delete for rows with delete sign when sequence column doesn't exist (#24011 )	2023-09-12 08:56:46 +08:00
Ashin Gau	6e28d878b5	[fix](hudi) compatible with hudi spark configuration and support skip merge (#24067 ) Fix three bugs: 1. Hudi slice maybe has log files only, so `new Path(filePath)` will throw errors. 2. Hive column names are lowercase only, so match column names in ignore-case-mode. 3. Compatible with [Spark Datasource Configs](https://hudi.apache.org/docs/configurations/#Read-Options), so users can add `hoodie.datasource.merge.type=skip_merge` in catalog properties to skip merge logs files.	2023-09-11 19:54:59 +08:00
minghong	115969c3fb	[opt](nereids) improve eliminate outerjoin in cascades (#24120 ) * eliminate outer join cascading	2023-09-11 19:42:05 +08:00
morrySnow	9c441a4a16	[feature](Nereids) support create table and ctas (#24150 ) Co-authored-by: sohardforaname <organic_chemistry@foxmail.com>	2023-09-11 12:37:58 +08:00
zhangstar333	cd13f9e8c6	[BUG](view) fix can't create view with lambda function (#23942 ) before the lambda function Expr not implement toSqlImpl() function. so it's call parent function, which is not suit for lambda function. and will be have error when create view.	2023-09-11 10:04:00 +08:00
amory	dcde83d6e6	[Improve](regresstests)add boundary regress tests for map & array #24133	2023-09-11 08:28:11 +08:00
zzzxl	71db844c64	[feature](invert index) add tokenizer CharFilter preprocessing (#24102 )	2023-09-10 23:08:28 +08:00
ZhenchaoXu	69f599bb53	[regression-test](fix)add test_ifnull. (#23956 )	2023-09-10 12:11:43 +08:00
Jerry Hu	93c1151f1a	[fix](join) incorrect result of mark join (#24112 )	2023-09-10 11:30:45 +08:00
Siyang Tang	232b58a27d	[fix](broker-load) make sequence column name case insensitive (#24071 )	2023-09-10 10:51:07 +08:00
Mingyu Chen	650af8f4df	[fix](test) fix broker load with default value test case (#24123 )	2023-09-10 10:28:22 +08:00
daidai	f9a75b5c4f	[feature](csv_serde)1.append csv serde for serialize to csv and deserialize from csv. 2.let csvReader use csv serde not text_converter. (#23352 ) 1. append csv serde for serialize to csv and deserialize from csv. 2. let csvReader use csv serde not text_converter.	2023-09-10 00:16:21 +08:00
starocean999	21e30d4374	[fix](planner)ctas's query part is not analyzed correctly (#24111 ) * [fix](planner)ctas's query part is not analyzed correctly	2023-09-09 20:55:09 +08:00
minghong	8c2a721873	[opt](nereids)push down filter through window #23935 select rank() over (partition by A, B) as r, sum(x) over(A, C) as s from T; A is a common partition key for all windowExpressions, that is A is intersection of {A,B} and {A, C} we could push filter A=1 through this window, since A is a common Partition key: select * from (select a, row_number() over (partition by a) from win) T where a=1; origin plan: ----filter((T.a = 1)) ----------PhysicalWindow ------------PhysicalQuickSort --------------PhysicalProject ------------------PhysicalOlapScan[win] transformed to ----PhysicalWindow ------PhysicalQuickSort --------PhysicalProject ----------filter((T.a = 1)) ------------PhysicalOlapScan[win] But C=1 can not be pushed through window.	2023-09-09 20:53:31 +08:00
Pxl	69868f18d6	[Bug](join) fix nested loop join some problems (#24034 )	2023-09-08 17:40:41 +08:00
jakevin	161520feb4	[feature](Nereids): enable convert CASE WHEN to IF (#24050 ) enable rule to convert CASE WHEN to IF.	2023-09-08 16:58:33 +08:00
meiyi	82dc970916	[feature](insert) Support group commit insert (#22829 )	2023-09-08 15:51:03 +08:00
jakevin	576855acb2	[fix](Nereids): fix regression-test (#24065 )	2023-09-08 14:14:48 +08:00
TengJianPing	b73f345479	[fix](intersect) fix wrong result of intersect node (#24044 ) Issue Number: close #24046	2023-09-08 10:27:37 +08:00
Tiewei Fang	a27349c83a	[fix](Export) Concatenation the outfile sql for Export (#23635 ) In the original logic, the `Export` statement generates `Selectstmt` for execution. But there is no way to make the `SelectStmt` use the new optimizer. Now, we change the `Export` statement to generate the `outfile SQL`, and then use the new optimizer to parse the SQL so that outfile can use the new optimizer.	2023-09-08 10:20:18 +08:00
zy-kkk	0bdd078b41	[fix](jdbc catalog) fixed the sqlserver jdbc url parm concatenation error (#23841 )	2023-09-08 09:58:20 +08:00
Jerry Hu	68acb8597b	[fix](nested_loop_join) null value should be output in semi-anti join (#23971 ) create table t1 (k1 bigint, k2 bigint) ENGINE=OLAP DUPLICATE KEY(k1, k2) COMMENT 'OLAP' DISTRIBUTED BY HASH(k2) BUCKETS 1 PROPERTIES ( "replication_allocation" = "tag.location.default: 1", "is_being_synced" = "false", "storage_format" = "V2", "light_schema_change" = "true", "disable_auto_compaction" = "false", "enable_single_replica_compaction" = "false" ); create table t3 (k1 bigint, k2 bigint) ENGINE=OLAP DUPLICATE KEY(k1, k2) COMMENT 'OLAP' DISTRIBUTED BY HASH(k2) BUCKETS 1 PROPERTIES ( "replication_allocation" = "tag.location.default: 1", "is_being_synced" = "false", "storage_format" = "V2", "light_schema_change" = "true", "disable_auto_compaction" = "false", "enable_single_replica_compaction" = "false" ); Data: insert into t1 values (1,null),(null,1),(1,2), (null,2),(1,3), (2,4), (2,5), (3,3), (3,4), (20,2), (22,3), (24,4),(null,null); insert into t3 values (1,null),(null,1),(1,4), (1,2), (null,3), (2,4), (3,7), (3,9),(null,null),(5,1); Query: select t1.* from t1 where not exists ( select k1 from t3 where t1.k2 < t3.k2 ); Result: Empty set Expect result: +------+------+ \| k1 \| k2 \| +------+------+ \| NULL \| NULL \| \| 1 \| NULL \| +------+------+	2023-09-08 09:28:55 +08:00
Pxl	ab7c2b9d22	[Bug](type) fix wildcard char's tostring get wrong result (#24041 ) fix wildcard char's tostring get wrong result	2023-09-07 20:25:38 +08:00
Mryange	20b3e5eafe	[feature](Datetime) add from_microsecond / from_millisecond function (#23902 )	2023-09-07 19:03:49 +08:00
zclllyybb	fdb7a44f57	Revert "[Feature](partitions) Support auto partition" (#24024 ) * Revert "[Feature](partitions) Support auto partition (#23236)" This reverts commit 6c544dd2011d731b8c9c51384c77bcf19c017981. * Update config.h	2023-09-07 17:08:26 +08:00

1 2 3 4 5 ...

1716 Commits