doris

Author	SHA1	Message	Date
LiBinfeng	8cac8df40c	[Fix](Planner) fix create view tosql not include partition (#22482 ) Problem: When create view with join in table partitions, an error would rise like "Unknown column" Example: CREATE VIEW my_view AS SELECT t1.* FROM t1 PARTITION(p1) JOIN t2 PARTITION(p2) ON t1.k1 = t2.k1; select * from my_view ==> errCode = 2, detailMessage = Unknown column 'k1' in 't2' Reason: When create view, we do tosql first in order to persistent view sql. And when doing tosql of table reference, partition key word was removed to keep neat of sql string. But here when we remove partition keyword it would regarded as an alias. So "PARTITION" keyword can not be removed. Solved: Add “PARTITION” keyword back to tosql string.	2023-08-02 20:04:59 +08:00
starocean999	527782f3d3	[fix](nereids)move RecomputeLogicalPropertiesProcessor rule before topn optimization (#22488 ) topn optimization will change MutableState. So need move RecomputeLogicalPropertiesProcessor rule before it	2023-08-02 17:36:56 +08:00
Mryange	ddd90855a9	[vectorized](udaf) java udaf support with map type (#22397 ) [vectorized](udaf) java udaf support with map type (#22397) * test * remove some unused * update * add case	2023-08-02 15:03:44 +08:00
jakevin	16461fdc1c	[feature](Nereids): pushdown COUNT through join (#22455 )	2023-08-02 14:55:25 +08:00
AKIRA	41f984bb39	[fix](fe) Fix stmt forward #22469 The call of String.format() contains orphan %s that will cause following error. Introduced from #21205	2023-08-02 10:34:04 +08:00
Chenyang Sun	19d1f49fbe	[improvement](compaction) compaction policy and options in the properties of a table (#22461 )	2023-08-01 22:02:23 +08:00
starocean999	809f67e478	[fix](nereids)fix bug of cast expr to decimalv3 without any check (#22466 )	2023-08-01 21:59:47 +08:00
slothever	94dee833cd	[fix](multi-catalog)fix compatible with hdfs HA empty prefix (#22424 )	2023-08-01 21:48:16 +08:00
qiye	b8399148ef	[fix](DOE) es catalog not working with pipeline,datetimev2, array and esquery (#22046 )	2023-08-01 21:45:16 +08:00
minghong	d5d82b7c31	[stats](nereids) fix bug for avg-size (#22421 )	2023-08-01 17:13:00 +08:00
谢健	d4a6ef3f8c	[fix](Nereids) fix test framework of hypergraph (#22434 )	2023-08-01 16:20:07 +08:00
jakevin	26737dddff	[feature](Nereids): pushdown MIN/MAX/SUM through join (#22264 ) * [minor](Nereids): add more comment to explain code * [feature](Nereids): pushdown MIN/MAX/SUM through join	2023-08-01 13:23:55 +08:00
yiguolei	a6e7e134a3	Revert "[fix](show-stmt) fix show create table missing storage_medium info (#21757 )" (#22443 ) This reverts commit ec72383d3372b519e7957f237fad456130230804. Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-08-01 12:00:34 +08:00
starocean999	450e0b1078	[fix](nereids) recompute logical properties in plan post process (#22356 ) join commute rule will swap the left and right child. This cause the change of logical properties. So we need recompute the logical properties in plan post process to get the correct result	2023-07-31 21:04:39 +08:00
yiguolei	bb67225143	[bugfix](profile summary) move detail info from summary to execution summary (#22425 ) * [bugfix](profile summary) move detail info from summary to execution summary --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-07-31 20:37:01 +08:00
zhengyu	ec72383d33	[fix](show-stmt) fix show create table missing storage_medium info (#21757 )	2023-07-31 19:26:21 +08:00
DeadlineFen	2a320ade82	[feature](property) Add table property "is_being_synced" (#22314 )	2023-07-31 18:14:13 +08:00
AKIRA	e72a012ada	[enhancement](stats) Retry when loading stats (#21849 )	2023-07-31 17:33:20 +08:00
AKIRA	afb6a57aa8	[enhancement](nereids) Improve stats preload performance (#21970 )	2023-07-31 17:32:01 +08:00
LiBinfeng	3a1d678ca9	[Fix](Planner) fix parse error of view with group_concat order by (#22196 ) Problem: When create view with projection group_concat(xxx, xxx order by orderkey). It will failed during second parse of inline view For example: it works when doing "SELECT id, group_concat(`name`, "," ORDER BY id) AS test_group_column FROM test GROUP BY id" but when create view it does not work "create view test_view as SELECT id, group_concat(`name`, "," ORDER BY id) AS test_group_column FROM test GROUP BY id" Reason: when creating view, we will doing parse again of view.toSql() to check whether it has some syntax error. And when doing toSql() to group_concat with order by, it add seperate ', ' between second parameter and order by. So when parsing again, it would failed because it is different semantic with original statement. group_concat(`name`, "," ORDER BY id) ==> group_concat(`name`, "," , ORDER BY id) Solved: Change toSql of group_concat and add order by statement analyze() of group_concat in Planner cause it would work if we get order by from view statement and do not analyze and binding slot reference to it	2023-07-31 17:20:23 +08:00
AKIRA	4c6458aa77	[enhancement](nereids) Execute sync analyze task with multi-thread (#22211 ) It was executed in sequentialy, which may cause a lot of time	2023-07-31 15:05:07 +08:00
谢健	8ccd8b4337	[fix](Nereids) fix ends calculation when there are constant project (#22265 )	2023-07-31 14:10:44 +08:00
zclllyybb	f2919567df	[feature](datetime) Support timezone when insert datetime value (#21898 )	2023-07-31 13:08:28 +08:00
wuwenchi	93a9cec406	[Improvement] Add iceberg metadata cache and support manifest file content cache (#22336 ) Cache the iceberg table. When accessing the same table, the metadata will only be loaded once. Cache the snapshot of the table to optimize the performance of the iceberg table function. Add cache support for iceberg's manifest file content a simple test from 2.0s to 0.8s before mysql> refresh table tb3; Query OK, 0 rows affected (0.03 sec) mysql> select * from tb3; +------+------+------+ \| id \| par \| data \| +------+------+------+ \| 1 \| a \| a \| \| 2 \| a \| b \| \| 3 \| a \| c \| .... \| 68 \| a \| a \| \| 69 \| a \| b \| \| 70 \| a \| c \| +------+------+------+ 70 rows in set (2.10 sec) mysql> select * from tb3; +------+------+------+ \| id \| par \| data \| +------+------+------+ \| 1 \| a \| a \| \| 2 \| a \| b \| \| 3 \| a \| c \| ... \| 68 \| a \| a \| \| 69 \| a \| b \| \| 70 \| a \| c \| +------+------+------+ 70 rows in set (2.00 sec) after mysql> refresh table tb3; Query OK, 0 rows affected (0.03 sec) mysql> select * from tb3; +------+------+------+ \| id \| par \| data \| +------+------+------+ \| 1 \| a \| a \| \| 2 \| a \| b \| ... \| 68 \| a \| a \| \| 69 \| a \| b \| \| 70 \| a \| c \| +------+------+------+ 70 rows in set (2.05 sec) mysql> select * from tb3; +------+------+------+ \| id \| par \| data \| +------+------+------+ \| 1 \| a \| a \| \| 2 \| a \| b \| \| 3 \| a \| c \| ... \| 68 \| a \| a \| \| 69 \| a \| b \| \| 70 \| a \| c \| +------+------+------+ 70 rows in set (0.80 sec)	2023-07-31 10:12:09 +08:00
Gabriel	ec0be8a037	[bug](decimal) change result type for decimalv2 computation (#22366 )	2023-07-31 10:00:34 +08:00
zhangdong	0e7f63f5f6	[fix](ipv6)Remove restrictions from IPv4 when add backend (#22323 ) When adding be, it is required to have only one colon, otherwise an error will be reported. However, ipv6 has many colons ``` String[] pair = hostPort.split(":"); if (pair.length != 2) { throw new AnalysisException("Invalid host port: " + hostPort); } ```	2023-07-30 22:47:24 +08:00
slothever	f87f29e1ab	[fix](multi-catalog)compatible with hdfs HA empty prefix (#22342 ) compatible with hdfs HA empty prefix for example: ’hdfs:///‘ will be replaced to ’hdfs://ha-nameservice/‘	2023-07-30 22:21:14 +08:00
AlexYue	06e4061b94	[enhance](ColdHeatSeparation) carry use path style info along with cold heat separation to support using minio (#22249 )	2023-07-30 21:03:33 +08:00
Jibing-Li	03761c37cd	[Improvement](multi catalog) Support Iceberg, Paimon and MaxCompute table in nereids. (#22338 )	2023-07-29 21:43:35 +08:00
Mryange	47c2cc5c74	[vectorized](udf) java udf support with return map type (#22300 )	2023-07-29 12:52:27 +08:00
Jack Drogon	ebd114b384	[enhancement](binlog) CreateTable inherit db binlog && Add some checks (#22293 )	2023-07-29 08:27:27 +08:00
daidai	ae8a26335c	[opt](hive)opt select count() stmt push down agg on parquet in hive . (#22115 ) Optimization "select count() from table" stmtement , push down "count" type to BE. support file type : parquet ，orc in hive . 1. 4kfiles , 60kwline num before: 1 min 37.70 sec after: 50.18 sec 2. 50files , 60kwline num before: 1.12 sec after: 0.82 sec	2023-07-29 00:31:01 +08:00
xzj7019	f7c106c709	[opt](nereids) enhance broadcast join cost calculation (#22092 ) Enhance broadcast join cost calculation, by considering both the build side effort from building bigger hash table, and more probe side effort from bigger cost of ProbeWhenBuildSideOutput and ProbeWhenSearchHashTable, if parallel_fragment_exec_instance_num is more than 1. Current solution gives a penalty factor on rightRowCount, and the factor is the total instance number to the power of 2. Penalty on outputRows is not taken currently and will be refined in next generation cost model. Also brings some update for shape checking: update original control variable in shape file parallel_fragment_exec_instance_num to parallel_pipeline_task_num, if pipeline is enabled. fix a be_number variable inactive issue.	2023-07-28 23:06:02 +08:00
HHoflittlefish777	05abfbc5ef	[improvement](regression-test) add compression algorithm regression test (#22303 )	2023-07-28 17:28:52 +08:00
Mryange	25f26198f4	[fix](executor) only mysql connect to set GlobalPipelineTask (#22205 )	2023-07-28 16:19:34 +08:00
starocean999	5a0ad09856	[fix](nereids) SubqueryToApply may lost conjunct (#22262 ) consider sql: ``` SELECT * FROM sub_query_correlated_subquery1 t1 WHERE coalesce(bitand( cast( (SELECT sum(k1) FROM sub_query_correlated_subquery3 ) AS int), cast(t1.k1 AS int)), coalesce(t1.k1, t1.k2)) is NULL ORDER BY t1.k1, t1.k2; ``` is Null conjunct is lost in SubqueryToApply rule. This pr fix it	2023-07-28 15:08:56 +08:00
谢健	80673406b1	[fix](Nereids) project hidden columns when show_hidden_columns is true (#22285 )	2023-07-28 15:08:18 +08:00
bobhan1	0c734a861e	[Enhancement](delete) eliminate reading the old values of non-key columns for delete stmt (#22270 )	2023-07-28 14:37:33 +08:00
AKIRA	9f565cf835	[fix](ut) fix ut of stats test #22325 After auto retry merged, it's hard to determine the execute times of doExecute method in compile time, and if the expected execute times in the expectation block is missed, unexpected invocation exception would be thrown, so just remove the expected execute times	2023-07-28 14:23:35 +08:00
zclllyybb	c2155678ca	[fix](functions) fix now(null) crash (#22321 ) before: BE crash now: mysql [test]>select now(null); +-----------+ \| now(NULL) \| +-----------+ \| NULL \| +-----------+ 1 row in set (0.06 sec)	2023-07-28 14:07:56 +08:00
zhangstar333	1c6246f7ee	[improve](agg) support distinct agg node (#22169 ) select c_name from customer union select c_name from customer this sql used agg node to get distinct row of c_name, so it's no need to wait for inserted all data to hash map, could output the data which it's inserted into hash map successed.	2023-07-28 13:54:10 +08:00
zclllyybb	ad080c691f	[chore](log)Move non-user-friendly error message to be.WARNING (#22315 ) Move non-user-friendly error message to be.WARNING	2023-07-28 13:15:25 +08:00
YueW	7be349a10b	[opt](inverted index) add session variable enable_inverted_index_query to control whether query with inverted index (#22255 )	2023-07-28 12:43:26 +08:00
morrySnow	5da5fac37a	[refactor](Nereids) add result sink node (#22254 ) use ResultSink as query root node to let plan of query statement has the same pattern with insert statement	2023-07-28 11:31:09 +08:00
catpineapple	e87174dd6b	[feature](planner) modify multi partition prefix value (#22098 ) modify multi partition prefix value: 'p_'	2023-07-28 10:21:32 +08:00
morrySnow	bfa7f8df6d	[fix](Nereids) parse logical binary stack overflow (#22308 ) 1. not use recursive parse to avoid stack overflow 2. To create a balanced tree instead of left deep tree TODO: add expr_depth_limit to Nereids' parser	2023-07-28 09:48:17 +08:00
Mingyu Chen	00863f25e9	[improvement](profile) add table name for file scan node (#22299 ) ``` VFILE_SCAN_NODE(region) (id=0):(Active: 3.537us, % non-child: 0.00%) - RuntimeFilters: : - UseSpecificThreadToken: False - AcquireRuntimeFilterTime: 501ns - AllocateResourceTime: 105.598us ```	2023-07-27 23:54:31 +08:00
Mingyu Chen	442ae632e3	[fix](fs-cache) add 'scheme://authority' to fs cache key (#22263 ) This file system cache key should contains `scheme://authority`, eg: `hdfs//nameservices1`. Or it will encounter error: ``` Wrong FS: hdfs//abc/xxxx, expected: hdfs://def ```	2023-07-27 23:53:54 +08:00
xzj7019	f7d5453be8	[fix](nereids) fix cte bucket shuffle path (#22311 )	2023-07-27 22:44:51 +08:00
yujun	461c4dfaae	[fix](tablet clone) fix single replica load failed during migration (#22077 )	2023-07-27 20:38:03 +08:00

1 2 3 4 5 ...

5415 Commits