doris

Author	SHA1	Message	Date
Jibing-Li	03761c37cd	[Improvement](multi catalog) Support Iceberg, Paimon and MaxCompute table in nereids. (#22338 )	2023-07-29 21:43:35 +08:00
Jack Drogon	ebd114b384	[enhancement](binlog) CreateTable inherit db binlog && Add some checks (#22293 )	2023-07-29 08:27:27 +08:00
daidai	ae8a26335c	[opt](hive)opt select count() stmt push down agg on parquet in hive . (#22115 ) Optimization "select count() from table" stmtement , push down "count" type to BE. support file type : parquet ，orc in hive . 1. 4kfiles , 60kwline num before: 1 min 37.70 sec after: 50.18 sec 2. 50files , 60kwline num before: 1.12 sec after: 0.82 sec	2023-07-29 00:31:01 +08:00
xzj7019	f7c106c709	[opt](nereids) enhance broadcast join cost calculation (#22092 ) Enhance broadcast join cost calculation, by considering both the build side effort from building bigger hash table, and more probe side effort from bigger cost of ProbeWhenBuildSideOutput and ProbeWhenSearchHashTable, if parallel_fragment_exec_instance_num is more than 1. Current solution gives a penalty factor on rightRowCount, and the factor is the total instance number to the power of 2. Penalty on outputRows is not taken currently and will be refined in next generation cost model. Also brings some update for shape checking: update original control variable in shape file parallel_fragment_exec_instance_num to parallel_pipeline_task_num, if pipeline is enabled. fix a be_number variable inactive issue.	2023-07-28 23:06:02 +08:00
HHoflittlefish777	05abfbc5ef	[improvement](regression-test) add compression algorithm regression test (#22303 )	2023-07-28 17:28:52 +08:00
Mryange	25f26198f4	[fix](executor) only mysql connect to set GlobalPipelineTask (#22205 )	2023-07-28 16:19:34 +08:00
starocean999	5a0ad09856	[fix](nereids) SubqueryToApply may lost conjunct (#22262 ) consider sql: ``` SELECT * FROM sub_query_correlated_subquery1 t1 WHERE coalesce(bitand( cast( (SELECT sum(k1) FROM sub_query_correlated_subquery3 ) AS int), cast(t1.k1 AS int)), coalesce(t1.k1, t1.k2)) is NULL ORDER BY t1.k1, t1.k2; ``` is Null conjunct is lost in SubqueryToApply rule. This pr fix it	2023-07-28 15:08:56 +08:00
谢健	80673406b1	[fix](Nereids) project hidden columns when show_hidden_columns is true (#22285 )	2023-07-28 15:08:18 +08:00
bobhan1	0c734a861e	[Enhancement](delete) eliminate reading the old values of non-key columns for delete stmt (#22270 )	2023-07-28 14:37:33 +08:00
AKIRA	9f565cf835	[fix](ut) fix ut of stats test #22325 After auto retry merged, it's hard to determine the execute times of doExecute method in compile time, and if the expected execute times in the expectation block is missed, unexpected invocation exception would be thrown, so just remove the expected execute times	2023-07-28 14:23:35 +08:00
zclllyybb	c2155678ca	[fix](functions) fix now(null) crash (#22321 ) before: BE crash now: mysql [test]>select now(null); +-----------+ \| now(NULL) \| +-----------+ \| NULL \| +-----------+ 1 row in set (0.06 sec)	2023-07-28 14:07:56 +08:00
zhangstar333	1c6246f7ee	[improve](agg) support distinct agg node (#22169 ) select c_name from customer union select c_name from customer this sql used agg node to get distinct row of c_name, so it's no need to wait for inserted all data to hash map, could output the data which it's inserted into hash map successed.	2023-07-28 13:54:10 +08:00
YueW	7be349a10b	[opt](inverted index) add session variable enable_inverted_index_query to control whether query with inverted index (#22255 )	2023-07-28 12:43:26 +08:00
morrySnow	5da5fac37a	[refactor](Nereids) add result sink node (#22254 ) use ResultSink as query root node to let plan of query statement has the same pattern with insert statement	2023-07-28 11:31:09 +08:00
catpineapple	e87174dd6b	[feature](planner) modify multi partition prefix value (#22098 ) modify multi partition prefix value: 'p_'	2023-07-28 10:21:32 +08:00
morrySnow	bfa7f8df6d	[fix](Nereids) parse logical binary stack overflow (#22308 ) 1. not use recursive parse to avoid stack overflow 2. To create a balanced tree instead of left deep tree TODO: add expr_depth_limit to Nereids' parser	2023-07-28 09:48:17 +08:00
Mingyu Chen	00863f25e9	[improvement](profile) add table name for file scan node (#22299 ) ``` VFILE_SCAN_NODE(region) (id=0):(Active: 3.537us, % non-child: 0.00%) - RuntimeFilters: : - UseSpecificThreadToken: False - AcquireRuntimeFilterTime: 501ns - AllocateResourceTime: 105.598us ```	2023-07-27 23:54:31 +08:00
Mingyu Chen	442ae632e3	[fix](fs-cache) add 'scheme://authority' to fs cache key (#22263 ) This file system cache key should contains `scheme://authority`, eg: `hdfs//nameservices1`. Or it will encounter error: ``` Wrong FS: hdfs//abc/xxxx, expected: hdfs://def ```	2023-07-27 23:53:54 +08:00
xzj7019	f7d5453be8	[fix](nereids) fix cte bucket shuffle path (#22311 )	2023-07-27 22:44:51 +08:00
yujun	461c4dfaae	[fix](tablet clone) fix single replica load failed during migration (#22077 )	2023-07-27 20:38:03 +08:00
YueW	e39d234db9	[opt](inverted index) add more check for create inverted index (#22297 )	2023-07-27 20:33:24 +08:00
谢健	716d58f5ff	[fix](Nereids) decimal divide should not return null if numerator is zero (#22309 )	2023-07-27 20:23:04 +08:00
Jack Drogon	816fd50d1d	[Enhancement](binlog) Add binlog enable diable check in BinlogManager (#22173 ) Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>	2023-07-27 20:16:21 +08:00
Jibing-Li	a87d34b19b	[Fix](multi catalog statistics)Improve external table statistics collection (#22224 ) Improve external table statistics collection, including log, observability and fix some bugs. 1. Add Running state for statistics job. 2. Add progress for show analyze job. (n/m tasks finished, n/m task failed and so on) 3. Add analyze time cost for show analyze task. 4. Make task failure message more clear. 5. Synchronize the job status updating code in updateTaskStatus. 6. Fix NPE in HMSAnalyzeTask. (Avoid refreshing statistics cache if the collection sql failed) 7. Return error message for with sync collection while timeout. 8. Log level improvement 9. Fix misuse of logCreateAnalysisJob for tasks.	2023-07-27 20:01:14 +08:00
starocean999	2c849c619d	[fix](nereids) only allow inner join in dphyper join reorder (#22307 ) current dphyper join reorder hasn't consider the join conjunct referencing only one side of the child. This is common case in outer join conjunct. So we need disable outer join reorder in dphyper until this problem is addressed.	2023-07-27 19:46:37 +08:00
morrySnow	ae5e39ad26	[opt](Nereids) add double signature back for round like function (#22284 ) add double signature back for round like function	2023-07-27 19:10:43 +08:00
Pxl	87b9425772	[Bug](materialized-view) fix where clause not analyzed after fe restart (#22268 ) fix where clause not analyzed after fe restart	2023-07-27 18:34:44 +08:00
AKIRA	b51fcbd9c7	[opt](stats) Scale replica of stats table to 3 when it's possible (#22227 ) So that we could improve the availability of stats.	2023-07-27 17:36:54 +08:00
lsy3993	4f6a3c5bf0	[feature](catalog) support clob type in oracle jdbc catalog (#21532 )	2023-07-27 15:49:15 +08:00
Gabriel	e78afedd0a	[minor](refactor) refine function logics (#22280 )	2023-07-27 15:09:23 +08:00
zhangstar333	ddfdf62993	[opt](planner) support to parse scientific notation(aEb) (#22248 )	2023-07-27 13:31:37 +08:00
starocean999	a630f127ce	[fix](planner) fix bug of push down conjuncts through agg (#22202 ) should use both contains and comeFrom method to check if the conjunct can be pushed down throgh agg node	2023-07-27 13:20:50 +08:00
starocean999	8b51bfa384	[fix](planner) fix bug of unexpected nest loop join (#22236 ) use isLiteral instead of isConstant to check if the expr is a literal. This prevent the unexpected nest loop join, see the test case for detail	2023-07-27 13:20:29 +08:00
wuwenchi	41a230b721	[fix] iceberg catalog to specify the version and time (#22209 ) problem: 1. create a iceberg_type catalog: 2. use iceberg catalog to specify verison ``` mysql> show catalog iceberg; +----------------------+--------------------------+ \| Key \| Value \| +----------------------+--------------------------+ \| type \| iceberg \| \| iceberg.catalog.type \| hms \| \| hive.metastore.uris \| thrift://127.0.0.1:9083 \| \| hadoop.username \| hadoop \| \| create_time \| 2023-07-25 16:51:00.522 \| +----------------------+--------------------------+ 5 rows in set (0.02 sec) mysql> select * from iceberg.iceberg_db.tb1 FOR VERSION AS OF 8783036402036752909; ERROR 5090 (42000): errCode = 2, detailMessage = Only iceberg/hudi external table supports time travel in current version ``` change: Add `ICEBERG_EXTERNAL_TABLE` type for specify the version and time	2023-07-27 12:04:41 +08:00
zy-kkk	619a2857e1	[improvement](jdbc catalog) improve mysql jdbc catalog read bytea`s types & else improve (#22233 )	2023-07-27 10:18:37 +08:00
Jack Drogon	052a416d49	[Enhencement](binlog) db enable binlog (#22256 ) * Improve db update binlog properties (binlog.enable = "true") with check all table enable binlog * Add more test_alter_database_property regression test	2023-07-27 10:03:51 +08:00
Gabriel	341c45974c	[round](decimalv2) round precise decimalv2 value (#22258 )	2023-07-27 10:00:36 +08:00
Xinyi Zou	163a38a527	[opt](Nereids) support sql cache (#22144 ) 1. let Nereids support sql cache 2. let legacy planner's sql cache supports union all	2023-07-27 09:57:31 +08:00
Jack Drogon	82fe78ce84	Update table binlog config disable failure when db binlog is enable && (#22253 ) modify table binlog more than one property Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>	2023-07-27 09:54:24 +08:00
zhangstar333	fb41265c27	[opt](Nereids) add boolean type signature for sum aggregate function (#21959 )	2023-07-27 09:41:19 +08:00
xiongjx	331ae6d2cb	[fix](Unique-Key) fix version upgrade caused MOR to become MOW (#22243 )	2023-07-26 19:38:24 +08:00
minghong	490ad14720	[stats](nereids)in predicate range adjust (#22156 ) 1. refactor in-predicate filter estimation example: A in (1, 2, 3, 4) after in-preidcate filter, A.stats.max<=4 and A.stats.min>=1 2. maintain minExpr and maxExpr in in-predicate stats derive	2023-07-26 19:10:04 +08:00
AKIRA	fdb26a524a	[fix](compile) Fix FE compile error (#22261 ) Fix FE compile error	2023-07-26 18:39:21 +08:00
zhannngchen	4e57f45d8e	[fix](partial-update) sequence column not updated if using function_column.sequence_type (#22250 )	2023-07-26 18:22:43 +08:00
meiyi	9a07ae890a	[fix](point query) Fix ArrayIndexOutOfBoundsException if close a prepare stmt (#22237 )	2023-07-26 18:22:07 +08:00
morrySnow	14dcc53135	[fix](Nereids) cast time should turn nullable on all valid types (#22242 ) valid types to cast to time/timev2: - TINYINT - SMALLINT - INT - BIGINT - LARGEINT - FLOAT - DOUBLE - CHAR - VARCHAR - STRING	2023-07-26 17:56:19 +08:00
bobhan1	be69025878	[opt](Nereids) add partial update support for delete stmt (#22184 ) Currently, the new optimizer don't consider anything about partial update. This PR add the ability to convert a delete statement to a partial update insert statement for merge-on-write unique table	2023-07-26 17:34:31 +08:00
AKIRA	582acad8a1	[feature](stats) Enable period time with cron expr (#22095 ) Support such grammar ANALYZE TABLE test WITH CRON "* * * * * ?" Such job would be scheduled as the cron expr specifie, but natively support minute-level schedule only	2023-07-26 17:25:57 +08:00
AKIRA	964ac4e601	[opt](nereids) Retry when async analyze task failed (#21889 ) Retry at most 5 times when async analyze task execution failed	2023-07-26 17:16:56 +08:00
Jack Drogon	af20d0c521	[fix](binlog) Fix BinlogUtils getExpiredMs overflow (#22174 )	2023-07-26 15:15:34 +08:00

1 2 3 4 5 ...

4173 Commits