doris

Author	SHA1	Message	Date
Guangdong Liu	42b3dd35bb	[regression test](broker load) add case for without filepath (#27658 )	2023-12-07 10:15:37 +08:00
wuwenchi	54d062ddee	[feature](stream load) (step one)Add arrow data type for stream load (#26709 ) By using the Arrow data format, we can reduce the streamload of data transferred and improve the data import performance	2023-12-06 23:29:46 +08:00
yiguolei	4a4d137402	[feature](workloadgroup) support nereids internal query and all dml query (#28054 ) support nereids internal query to bind a workload group support insert into select bind workload group support create table as select bind workload group change token wait timeout to be query timeout or queue timeout query queue should not bind to pipeline engine, it could be used every where.	2023-12-06 21:07:55 +08:00
bobhan1	00c8bab84d	[feature](merge-on-write) enable merge-on-write by default (#27188 )	2023-12-06 21:06:58 +08:00
Nitin-Kashyap	0ff5a1cc25	[fix](doc) spell error and aligned with code (#27609 )	2023-12-06 20:58:39 +08:00
Xiangyu Wang	ec08850c08	[Config](multi-catalog) Enable query hive views as default. (#27906 ) Remove EXPERIMENTAL tag for enable_query_hive_views and set enable_query_hive_views to true as default. This feature has been used on our cluster which has more then a hundred thousands of tables for several months, i think it is fine to enable it as default.	2023-12-06 20:46:09 +08:00
Pxl	299fcc443e	[Bug](agg-state) fix stream load failed on agg-state column (#28050 )	2023-12-06 20:41:29 +08:00
seawinde	ffd7023987	[feature](nereids) Support to get partition related table from mv and check the query operator (#28064 ) Function 1: check the select query plan is contain the stmt as following or not SELECT [hint_statement, ...] [ALL \| DISTINCT \| DISTINCTROW \| ALL EXCEPT ( col_name1 [, col_name2, col_name3, ...] )] elect_expr [, select_expr ...] [FROM table_references PARTITION partition_list] [TABLET tabletid_list] [TABLESAMPLE sample_value [ROWS \| PERCENT] [REPEATABLE pos_seek]] [WHERE where_condition] [GROUP BY [GROUPING SETS \| ROLLUP \| CUBE] {col_name \| expr \| position}] [HAVING where_condition] [ORDER BY {col_name \| expr \| position} [ASC \| DESC], ...] [LIMIT {[offset,] row_count \| row_count OFFSET offset}] [INTO OUTFILE 'file_name'] if analyzedPlan contains the stmt as following [PARTITION partition_list] [TABLET tabletid_list] or [TABLESAMPLE sample_value [ROWS \| PERCENT] [REPEATABLE pos_seek]] this method will return true. Function 2: Get related base table info which materialized view plan column reference, input param plan should be rewritten plan that sub query should be eliminated	2023-12-06 19:15:21 +08:00
谢健	ddb6eb5ad7	[feature](Nereids) add command for updating mv with partitions (#28060 )	2023-12-06 17:45:09 +08:00
morrySnow	1aa1b2f607	[opt](Nereids) add switch to control whether use pipeline in DML (#28037 ) to turn on pipeline for DML in Nerieds, please: set enable_nereids_dml_with_pipeline = true;	2023-12-06 17:06:11 +08:00
Gabriel	28817990b7	[pipelineX](improvement) enable local shuffle by default (#28046 )	2023-12-06 16:39:48 +08:00
Yulei-Yang	fa5096f510	[feature](analyze_cmd) add show-tablets-belong stmt for analyzing a batch of tablet-ids (#27994 )	2023-12-06 15:59:00 +08:00
zhiqiang	994c5c6f6e	[chore](log) Add log to trace query cancel #28020	2023-12-06 15:51:21 +08:00
jakevin	e791e31b7f	[test](Nereids): add regression test eliminate/infer rules (#27985 )	2023-12-06 14:21:06 +08:00
amory	393c491820	[FIX](map/struct)fix map/struct literal from fe (#28026 )	2023-12-06 13:56:56 +08:00
yiguolei	0a22d969e1	[refactor](queryqueue) using a priority queue in query queue in order to implement priority management in the future (#27969 )	2023-12-06 13:49:11 +08:00
minghong	a0fee4c96e	[fix](nereids) runtime filter prune skip filter with invisible column (#28010 ) if a conjunct only contains invisible column, this conjunct should not be used in runtime filter pruner	2023-12-06 12:42:40 +08:00
yiguolei	24fdb7ad4e	[refactor](unusedcode) remove internalquery since it is useless (#28039 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-12-06 12:30:49 +08:00
Calvin Kirs	cbf1f8620a	[Feature](job)support cancel task and fix log invalid (#27703 ) - Running task can be show and fix cancel fail - When the insert task scheduling cycle is reached, if there are still tasks running, the scheduling of this task will be canceled at this time. - refactor job status changes SQL - Fix timer job window error - Support cancel task	2023-12-06 10:44:09 +08:00
Gabriel	1be513b927	[pipelineX](local shuffle) Fix local shuffle for colocate/bucket join (#28032 )	2023-12-06 10:02:36 +08:00
slothever	e431c2b980	[Improvement](multi-catalog)make location easier to modified, decoupling all storage with single location class (#27874 ) decoupling all storage with single location class	2023-12-06 00:13:54 +08:00
AKIRA	7f1b558011	[fix](stats) truncate min/max if too long (#27955 ) For some string value the max/min might be a very long string which might take too many memory of FE, so we truncate to 1024 chars if it's too long	2023-12-05 20:40:38 +08:00
zzzxl	05adbfdb3d	[feature](inverted index) match_phrase_prefix feature added (#27404 ) select count() from test_index_match_phrase_prefix where request match_phrase_prefix 'xxx';	2023-12-05 20:15:13 +08:00
morrySnow	e79422addc	[refactor](Nereids) compatible with all ability legacy planner (#27947 ) refactor: 1. split InsertIntoTableCommand into three sub command - InsertIntoTableCommand - InsertOverwriteTableCommand - BatchInsertIntoTableCommand feature: 1. support DEFAULT keywords in values list 2. support empty values list 3. support temporary partition 4. support insert into values in txn model fix: 1. should start transaction before release read lock on target table	2023-12-05 19:10:55 +08:00
yiguolei	8e161ad0f2	[debug](timeout) add some log to debug timeout== 0 (#28011 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-12-05 18:35:36 +08:00
zhangdong	6074cddcf8	[feature](mtmv)add Job and task tvf (#27967 ) add: select * from jobs("type"="mv"); select * from tasks("type"="mv"); select * from jobs("type"="insert"); select * from tasks("type"="insert"); add check priv for mv_infos("database"="xxx"); change JobType MTMV==>MV	2023-12-05 15:12:36 +08:00
Jibing-Li	02512cd0e2	[fix](stats)Drop stats or update updated rows after truncate table (#27931 ) 1. Also clear follower's stats cache when doing drop stats. 2. Drop stats when truncate a table.	2023-12-05 14:53:35 +08:00
Pxl	8a761dff84	[Bug](materialized-view) fix create mv failed on unique table (#27971 ) fix create mv failed on unique table	2023-12-05 14:53:09 +08:00
zclllyybb	c98b80ae6a	[Feature](functions) support ignore and nullable functions (#27848 ) support ignore and nullable functions	2023-12-05 14:09:32 +08:00
HappenLee	54fe1a166b	[Refactor](scan) refactor scan scheduler to improve performance (#27948 ) * [Refactor](scan) refactor scan scheduler to improve performance * fix pipeline x core	2023-12-05 13:03:16 +08:00
Xinyi Zou	fa0b495b33	[fix](cache)Fix partition cache support DATEV2 (#27978 )	2023-12-05 12:59:47 +08:00
TengJianPing	17016b9797	[improvement](decimal) use new way for decimal arithmetic precision promotion (#27787 ) * [DNM](decimal) use new way for decimal arithmetic precision promotion * [improvement](decimal) [DNM](decimal) use new way for decimal arithmetic precision promotion 1. [DNM](decimal) use new way for decimal arithmetic precision promotion 2. throw exception if it overflows for decimal arithmetics 3. throw exception if it overflows when casting among number types * fix compile error of gcc * improvement --------- Co-authored-by: morrySnow <morrysnow@126.com>	2023-12-05 12:54:40 +08:00
zhangstar333	ca6949ee3e	[Bug](partition) fix auto list partition erros of incorrect partition name (#27974 ) the partition name need limit it's length and can't have negative "-"	2023-12-05 12:54:06 +08:00
谢健	2f63999066	[fix](Nereids): Preserve `""` in single quote strings and `''` in double quote strings. (#27959 )	2023-12-05 12:30:03 +08:00
seawinde	da40e1c767	[feature](nereids) Matiarilzed view query rewrite util implementation (#27568 ) The basic util implementatation which is used by materialized view rewrite	2023-12-05 11:48:04 +08:00
谢健	26d642d5e9	[enhancement](Nereids) format some code in functional deps (#27797 )	2023-12-05 11:45:03 +08:00
谢健	4afe07e12c	[feature](Nereids): support drop constraint on table (#27944 )	2023-12-05 11:41:25 +08:00
yuxuan-luo	3412a022f4	[fix](restore) fix Restore from __keep_on_local__ throws null pointer… (#26943 ) Co-authored-by: walter <patricknicholas@foxmail.com> Co-authored-by: hugoluo <hugoluo@tencent.com> Co-authored-by: walter <patricknicholas@foxmail.com>	2023-12-05 10:55:28 +08:00
starocean999	3c97e69f3c	[fix](Nereids) should not push down project to the nullable side of outer join (#27912 )	2023-12-05 10:43:33 +08:00
Tiewei Fang	20d4d7eb2b	[fix](Hudi-catalog) fix hudi catalog code (#27963 ) In the original logic, `allfields.addall` will modify the objects in `hmsTable`.	2023-12-04 22:28:19 +08:00
谢健	8e2961858e	[enhancement](Nereids): extract group plan in struct info node (#27939 )	2023-12-04 19:46:40 +08:00
Xinyi Zou	b096062680	[feature-wip](arrow-flight)(step6) Support regression test (#27847 ) Design Documentation Linked to #25514 Regression test add a new group: arrow_flight_sql, ./run-regression-test.sh -g arrow_flight_sql to run regression-test, can use jdbc:arrow-flight-sql to run all Suites whose group contains arrow_flight_sql. ./run-regression-test.sh -g p0,arrow_flight_sql to run regression-test, can use jdbc:arrow-flight-sql to run all Suites whose group contains arrow_flight_sql, and use jdbc:mysql to run other Suites whose group contains p0 but does not contain arrow_flight_sql. Requires attention, the formats of jdbc:arrow-flight-sql and jdbc:mysql and mysql client query results are different, for example: Datatime field type: jdbc:mysql returns 2010-01-02T05:09:06, mysql client returns 2010-01-02 05:09:06, jdbc:arrow-flight-sql also returns 2010-01-02 05:09 :06. Array and Map field types: jdbc:mysql returns ["ab", "efg", null], {"f1": 1, "f2": "a"}, jdbc:arrow-flight-sql returns ["ab ","efg",null], {"f1":1,"f2":"a"}, which is missing spaces. Float field type: jdbc:mysql and mysql client returns 6.333, jdbc:arrow-flight-sql returns 6.333000183105469, in query_p0/subquery/test_subquery.groovy. If the query result is empty, jdbc:arrow-flight-sql returns empty and jdbc:mysql returns \N. use database; and query should be divided into two SQL executions as much as possible. otherwise the results may not be as expected. For example: USE information_schema; select cast ("0.0101031417" as datetime) The result is 2000-01-01 03:14:1 (constant fold), select cast ("0.0101031417" as datetime) The result is null (no constant fold), In addition, doris jdbc:arrow-flight-sql still has unfinished parts, such as: Unsupported data type: Decimal256. INVALID_ARGUMENT: [INTERNAL_ERROR]Fail to convert block data to arrow data, error: [E3] write_column_to_arrow with type Decimal256 Unsupported null value of map key. INVALID_ARGUMENT: [INTERNAL_ERROR]Fail to convert block data to arrow data, error: [E33] Can not write null value of map key to arrow. Unsupported data type: ARRAY<MAP<TEXT,TEXT>> jdbc:arrow-flight-sql not support connecting to specify DB name, such asjdbc:arrow-flight-sql://127.0.0.1:9090/{db_name}", In order to be compatible with regression-test, use db_nameis added before all SQLs whenjdbc:arrow-flight-sql` runs regression test. select timediff("2010-01-01 01:00:00", "2010-01-02 01:00:00");, error java.lang.NumberFormatException: For input string: "-24:00:00"	2023-12-04 19:23:56 +08:00
yiguolei	86c2b93e5b	[improvement](fixreplica) move to healthy replica when fix replica bad (#27934 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-12-04 16:04:27 +08:00
minghong	e80526ee3a	[opt](nereids)remove partition & histogram from col stats to reduce memory usage #27885	2023-12-04 14:52:05 +08:00
Pxl	2b715924c5	[Chore](function) set normal function use_default_implementation_for_constants to default (#27891 ) set normal function use_default_implementation_for_constants to default	2023-12-04 14:19:25 +08:00
zhangstar333	e62d19d90d	[improve](partition) support auto list partition with more columns (#27817 ) before the partition by column only have one column. now remove those limit, could have more columns.	2023-12-04 11:33:18 +08:00
xueweizhang	80f528bf26	[enhancement](backup-restore) add config for upload/download task num per be (#27772 ) set upload/download task num per be, and improve the overall speed of upload/download, enhance the performance of backup and recovery. --------- Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-12-04 11:19:45 +08:00
zxealous	f8bdbf67b4	[fix](deploy) K8s deploy manager cannot get group host info by endpoint (#27813 ) K8s deploy manager cannot get group host info by endpoint. If we get group host info by endpoint, there is no need to init statefulset.	2023-12-04 10:50:43 +08:00
starocean999	a62ab4049e	[fix](nereids)add HllUnion and BitmapUnion for pre agg match (#27548 )	2023-12-04 09:48:53 +08:00
minghong	f2cfc87aca	[fix](nereids) temporary partition is selected only if user manually specified (#27893 ) q1: "select * from ut_p temporary partitions(tp1) where val > 0" in q1, temporary partition tp1 is scaned q2: "select * from ut_p where val > 0" in q2, temporary partition tp1 is not scaned.	2023-12-04 09:44:27 +08:00

1 2 3 4 5 ...

6827 Commits