Commit Graph

18635 Commits

Author SHA1 Message Date
5d1f5968eb [fix](case)fix PolicyTest testMergeFilterNereidsPlanner failed (#34637)
The results of explain may have some order that is not fixed
2024-05-18 18:33:41 +08:00
0febfc10e4 [Fix](inverted index) fix wrong fs in inverted_index_file_writer (#34903) 2024-05-18 18:30:10 +08:00
dff6171546 [fix](auto inc) db_id and table_id should be int64_t instead of int32_t (#34912) 2024-05-18 18:29:59 +08:00
4b96f9834f [fix](move-memtable) change brpc connection type to single (#34883) 2024-05-18 18:29:20 +08:00
849eeb39e9 [fix](load) skip sending cancel rpc if VNodeChannel is not inited (#34897) 2024-05-18 18:29:10 +08:00
a07876e807 [fix](planner)correlated predicate should include isnull predicate (#34833) 2024-05-18 18:28:54 +08:00
8264078a9a [fix](nereids)4 phase agg may lost parameter in some case (#34816) 2024-05-18 18:28:41 +08:00
5719f6ff0c [fix](planner) fix date_xxx functions without complete function signature (#34761)
Problem:
When using current_date as input of functions like date_sub,
fold constant would failed cause of missing of function signature in Planner

Solved:
Add complete function signature of functions like date_sub
2024-05-18 18:26:38 +08:00
71caf88ec1 [opt](mtmv) Optimize the logic of slot mapping generate for performance (#34597)
Slot mapping is used for materialized view rewritting
given the relation mapping, the slot mapping is the same

Optimize the slot mapping genarate logic
Cache the slot mapping in materialization context by realation mapping key
2024-05-18 18:25:43 +08:00
Pxl
4a8df53553 [Chore](rollup) check duplicate column name when create table with rollup (#34827)
check duplicate column name when create table with rollup
2024-05-18 18:23:44 +08:00
1e53a2a81d [Improve](inverted index) improve query performance by not using output index result column (#34281) 2024-05-18 18:18:12 +08:00
6b1c441258 [fix](group_commit) Wal reader should check block length to avoid reading empty block (#34792) 2024-05-18 18:17:56 +08:00
38bac76b37 [opt](mtmv) Cache materialization check result for performance (#34301)
Need check materialization sql pattern in different abstract rule when rewrite by materialized view.
Such as the subClass of AbstractMaterializedViewJoinRule, MaterializedViewScanRule, AbstractMaterializedViewAggregateRule.
This check result can be cached when has checked, this can avoid unnecessary repeat check
2024-05-18 18:14:59 +08:00
30a036e7a4 [feature](mtmv) create mtmv support partitions rollup (#31812)
if create MTMV `date_trunc(`xxx`,'month')`
when related table is `range` partition,and have 3 partitions:
```
20200101-20200102
20200102-20200103
20200201-20200202
```
then MTMV will have 2 partitions:
```
20200101-20200201
20200201-20200301
```

when related table is `list` partition,and have 3 partitions:
```
(20200101,20200102)
(20200103)
(20200201)
```
then MTMV will have 2 partitions:
```
(20200101,20200102,20200103)
(20200201)
```
2024-05-18 18:14:48 +08:00
f7801948ad fix backup and restore failed between force_replication_allocation setted and not setted clusters (#34608) 2024-05-18 18:14:18 +08:00
6c515e0c76 [fix](group commit) Make compatibility issues on serializing and deserializing wal file more clear (#34793) 2024-05-18 18:12:43 +08:00
80dd027ce2 [opt](join) For left semi/anti join without mark join conjunct and without other conjucnts, stop probing after matching one row (#34703) 2024-05-18 18:08:50 +08:00
b6409f5584 [improvement](inverted index) Disable the use of skipping write index on load (#34719)
When `skip_write_index_on_load` is turned on, users will get an error when querying for the latest data(not compacted), giving them a bad experience. And we can use `inverted_index_ram_dir_enable = true` and `inverted_index_storage_format=V2` to reduce IO and CPU consumption. So we disable it now.

1. Disable setting `skip_write_index_on_load` to `true` in create table stmt.
2. Disable setting `skip_write_index_on_load` to `true` in alter table properties stmt. You can still alter `skip_write_index_on_load` to `false`.

Co-authored-by: Luennng <luennng@gmail.com>
2024-05-18 18:07:51 +08:00
1545d96617 [WIP](test) remove enable_nereids_planner in regression cases (part 4) (#34642)
before PR are
#34417
#34490
#34558
2024-05-18 18:07:39 +08:00
46bf43130f [test](case) error format case in test_query_json_object (#34722)
error format case in test_query_json_object
2024-05-18 18:07:23 +08:00
c71d0b6b22 [fix](Nereids) cast from json should always nullable (#34707) 2024-05-18 18:06:23 +08:00
5012ddd87a [fix](Nereids) fix sql cache return old value when truncate partition (#34698)
1. fix sql cache return old value when truncate partition
2. use expire_sql_cache_in_fe_second to control the expire time of the sql cache which in the NereidsSqlCacheManager
2024-05-18 18:05:31 +08:00
b3b848f862 [feature](Nereids): eliminate useless project (#34611) 2024-05-18 18:05:00 +08:00
e2614d453a [case](regression) Add hdfs backup restore case (#34716) 2024-05-18 18:03:05 +08:00
6f91e9cc4d [fix](test) fix s3 load test failed (#34671) 2024-05-18 18:02:31 +08:00
876248aa4e [fix](function) json_object can not input null value (#34591) 2024-05-18 18:00:48 +08:00
7e967e53b8 Fix failed p2 hive statistics case. (#34663) 2024-05-18 17:59:44 +08:00
691f3c5ee7 [Performance](Variant) Improve load performance for variant type (#33890)
1. remove phmap for padding rows
2. add SimpleFieldVisitorToScarlarType for short circuit type deducing
3. correct type coercion for conflict types bettween integers
4. improve nullable column performance
5. remove shared_ptr dependancy for DataType use TypeIndex instead
6. Optimization by caching the order of fields (which is almost always the same)
and a quick check to match the next expected field, instead of searching the hash table.

benchmark:
In clickbench data, load performance:
12m36.799s ->7m10.934s about 43% latency reduce

In variant_p2/performance.groovy:
3min44s20 -> 1min15s80 about 66% latency reducy
2024-05-18 17:58:33 +08:00
b76cfcd007 [refactor](mtmv) Materialization context and mtmv decoupling (#34093) (#34916)
Decoupling the MTMV from the materialization context.
Change MaterializationContext to abstract which is the materialization desc.
It now has AsyncMaterializationContext sub class, can also has other type of MaterializationContext such as
SyncMaterializationContext and so on.
2024-05-17 22:54:21 +08:00
385739564d [test](executor) Add workload group upgrade test #35007 2024-05-17 17:34:08 +08:00
2dc65ce356 2.1.3-rc09 2024-05-16 06:37:36 +08:00
4ba15b27ca simple fix for cancel core block dep is released when call 2024-05-16 06:32:21 +08:00
e4b9ded806 2.1.3-rc08 2024-05-15 23:06:57 +08:00
ec4414f383 revoke colum privs the second time failed 2024-05-15 23:06:51 +08:00
fa35e54350 [fix](nereids)LogicalPlanDeepCopier will lost some info when coping logical relation (#34933) 2024-05-15 22:40:32 +08:00
e74b17c761 [Fix](Row store) support decimal256 type (#34887) 2024-05-15 19:01:18 +08:00
475d42f23e remove is cloud mode from regression test 2024-05-15 17:00:43 +08:00
91a154988d [feat](Nereids): Reject Commutativity Swap for Nested Loop Joins Affecting Parallelism (#34639) (#34798) 2024-05-15 16:52:26 +08:00
ab9ff0447d [Fix](regression-test) Fix test_hive_write_type. (#34667)
Because #34397 changed error code of ARITHMETIC_OVERFLOW_ERRROR, so the error msg is not expected in the test.
2024-05-15 15:21:57 +08:00
1a24895257 [opt](routine-load) optimize routine load task thread pool and related param(#32282) (#34896) 2024-05-15 12:42:02 +08:00
2cbe6740a5 [fix](reader) avoid be coredump in block reader in abnormal situation (#34878) 2024-05-15 12:38:40 +08:00
3ead073905 [fix](arrow-flight-sql) Fix Arrow Flight bind wrong Host in Fqdn #34850 2024-05-15 12:38:40 +08:00
baf9a45e57 [fix](mtmv) check groupby in agg-bottom-plan when rewrite agg query by mv (#34274)
check groupby in agg-bottom-plan when rewrite and rollup agg query by mv
2024-05-15 12:38:40 +08:00
f9c42f34dd [fix](auth)Compatible with previously enabled ldap configuration (#34891) 2024-05-15 12:36:47 +08:00
1f0c45204b [fix](iceberg) read the primary key columns if hasing equality delete (#34884)
backport: #34835
2024-05-15 11:37:25 +08:00
02084fd91f [fix](iceberg_orc)Fixed the bug that the iceberg reader did not perform position delete when reading the orc file without a predicate. (#34814) (#34882)
bp #34814
2024-05-15 11:31:29 +08:00
e13ce905cf [Fix](hive-writer) Fix hive partition update file size and remove redundant column names. (#34651) (#34885)
Backport #34651.
2024-05-15 11:23:32 +08:00
30256195c3 fix check column privilege failed by hidden column (#34849)
fix check column privilege failed by hidden column: DORIS_DELETE_SIGN
2024-05-15 10:38:10 +08:00
5ce58ed773 [fix](nereids) runtime filter push-down-cte column mapping bug #34875 2024-05-15 10:31:39 +08:00
00ce05393a [fix](profile) Load profile need to be registered to get real-time profile #34852 2024-05-15 10:29:04 +08:00