fb5002d723
[fix](binlog) Fix NPE when recover binlogs ( #39909 ) ( #39910 )
...
Cherry-pick #39909
The field partition ID of DropPartitionInfo was added in PR:
apache/doris#37196 , the old version doesn't contain this field so it
will be null.
2024-08-26 20:17:26 +08:00
9412ba56ae
[fix](restore) Reset the db name of the materialized index stmt #39710 ( #39848 )
...
cherry pick from #39710 , #39855
2024-08-26 16:51:50 +08:00
e21d2b82d6
[chore](backup) Fix the db name of the restored view ( #38072 )
...
Cherry-pick #37412 , #39855
Previously, during restore, the database name in the CREATE VIEW
statement was not modified, causing the restored view to be unviewable
with the SHOW VIEW command. This PR retains the original cluster's
database name in the BackupMeta and manually replaces it with the new
cluster's database name in the CREATE VIEW statement during restore.
2024-08-26 14:12:47 +08:00
d87a220d2b
[opt](privilege) Grant check name ( #39597 ) ( #39856 )
...
pick https://github.com/apache/doris/pull/39597
2024-08-26 09:53:45 +08:00
9997911ec9
[feat](Nereids) support nereids hint position detaction ( #39113 ) ( #39417 )
...
cherry-pick: #39113
When use hint in wrong position or use unsupport hint, use channel(2) to
filter it out
## Proposed changes
Issue Number: close #xxx
<!--Describe your changes.-->
2024-08-24 23:59:54 +08:00
8e140727ae
[fix](udf) Fix exception thrown when replayDropFunction ( #39868 )
...
This PR completes #25965 to fix uncaught runtime exception when calling
`org.apache.doris.catalog.GlobalFunctionMgr#replayDropFunction`

2024-08-24 23:56:48 +08:00
2dea859bdb
[debug](rpc) debug rpc time consumption problem ( #39852 )
...
## Proposed changes
Issue Number: close #xxx
Add detail RPC time info for each channel, sorted by max rpc time of
channels:
```
DATA_STREAM_SINK_OPERATOR (id=1,dst_id=1):
- Partitioner: Crc32HashPartitioner(64)
- BlocksProduced: 74
- BrpcSendTime: 2.689us
- BrpcSendTime.Wait: 0ns
- BytesSent: 89.35 KB
- CloseTime: 680.152us
- CompressTime: 0ns
- ExecTime: 160.663ms
- InitTime: 263.608us
- InputRows: 32.512K (32512)
- LocalBytesSent: 0.00
- LocalSendTime: 0ns
- LocalSentRows: 0
- MemoryUsage:
- PeakMemoryUsage: 80.00 KB
- MergeBlockTime: 0ns
- OpenTime: 4.113ms
- OverallThroughput: 0.0 /sec
- PendingFinishDependency: 41.179ms
- RowsProduced: 32.512K (32512)
- RpcAvgTime: 11.850ms
- RpcCount: 10
- RpcMaxTime: 86.891ms
- RpcMinTime: 15.200ms
- RpcSumTime: 118.503ms
- SerializeBatchTime: 13.517ms
- SplitBlockDistributeByChannelTime: 38.923ms
- SplitBlockHashComputeTime: 2.659ms
- UncompressedRowBatchSize: 135.19 KB
- WaitForDependencyTime: 0ns
- WaitForRpcBufferQueue: 0ns
RpcInstanceDetails:
- Instance 85d4f75b72a9ea61: Count: 4, MaxTime: 36.238ms, MinTime: 12.107ms, AvgTime: 21.722ms, SumTime: 86.891ms
- Instance 85d4f75b72a9ea91: Count: 3, MaxTime: 11.107ms, MinTime: 2.431ms, AvgTime: 5.470ms, SumTime: 16.412ms
- Instance 85d4f75b72a9eac1: Count: 3, MaxTime: 7.554ms, MinTime: 3.160ms, AvgTime: 5.066ms, SumTime: 15.200m
```
2024-08-24 19:59:39 +08:00
14a2a66106
[fix](paimon) fix not able to read paimon data from hdfs with HA ( #39806 ) ( #39876 )
...
bp #39806
2024-08-24 17:51:15 +08:00
460605ae3c
[branch-2.1] pick some prs ( #39860 )
...
## Proposed changes
Issue Number: close #xxx
https://github.com/apache/doris/pull/38385 optimize parsing datetime
https://github.com/apache/doris/pull/38978 make stream load failure
message more clear and disable some error's stacktrace by default
https://github.com/apache/doris/pull/39255 fix random function coredump
https://github.com/apache/doris/pull/39324 fix function corr
inconsistency with doc
https://github.com/apache/doris/pull/39449 check auto partitoin nullity
when creating partition
https://github.com/apache/doris/pull/39695 make
DynamicPartitionScheduler immediately know interval's change
https://github.com/apache/doris/pull/39754 Add some partition expr check
on creating table
2024-08-24 17:26:42 +08:00
9640e2de44
[fix](catalog) refactor location path and support default fs #39116 ( #39203 )
2024-08-24 16:05:13 +08:00
de2e8f0ae6
[fix](ctas) fix NPE when ctas with old planner and varchar issue ( #39744 ) ( #39871 )
...
bp #39744
2024-08-24 09:24:47 +08:00
b9da934b16
[fix](hive) report error with escape char and null format ( #39700 ) ( #39869 )
...
bp #39700
Co-authored-by: Socrates <suxiaogang223@icloud.com >
2024-08-24 09:23:03 +08:00
00e4f343b0
[enhancement](delete) Add a hint msg for forbidden delete when MV or rollup exists ( #39505 ) ( #39857 )
...
## Proposed changes
When MV or Rollup exists, delete is forbidden on the base table
currently. Add a hint msg to indicate it.
2024-08-23 23:42:26 +08:00
9d597bde68
[fix](Nereids) producer to consumer should be multimap in cte ( #39850 ) ( #39867 )
2024-08-23 23:25:11 +08:00
508c7a7040
[fix](hive)Modify the Hive notification event processing method when using meta cache and add parameters to the Hive catalog. ( #39239 ) ( #39865 )
...
bp #39239
Co-authored-by: daidai <2017501503@qq.com >
2024-08-23 23:21:02 +08:00
19c51cc217
[fix](mtmv) use name instead of id in meta of MTMV ( #39355 ) ( #39748 )
...
pick from master #39355
2024-08-23 20:40:54 +08:00
6ceb574aa0
[branch-2.1]Pick IO limit/workload group usage table ( #39839 )
2024-08-23 18:51:47 +08:00
e716658fba
[branch-2.1](arrow-flight-sql) Fix exceed user property max connection cause Reach limit of connections ( #39836 )
...
pick #39127
pick #39802
2024-08-23 17:27:34 +08:00
424ad2384a
[opt](nereids) refine left semi/anti cost under short-cut opt ( #39636 )
...
## Proposed changes
pick from https://github.com/apache/doris/pull/37951
---------
Co-authored-by: xiongzhongjian <xiongzhongjian@selectdb.com >
2024-08-23 17:26:56 +08:00
8f15efdbb8
[cherry-pick](branch-2.1) fix delete random distributed tbl ( #39830 )
...
## Proposed changes
cherry-pick #37985
<!--Describe your changes.-->
2024-08-23 17:17:05 +08:00
7a7292ad5a
[branch-2.1][Refactor]use async to get be resource ( #38389 ) ( #39826 )
...
pick #38389
2024-08-23 17:16:19 +08:00
0934fbee7e
[improvement](query) prefer to chose tablet on alive disk #39467 ( #39654 )
...
cherry pick from #39467
2024-08-23 12:23:12 +08:00
6c10c47f79
[fix](fe) LIST partition table support modify default bucket num ( #39688 )
...
## Proposed changes
bp #39696
Issue Number: close #39684
```sql
CREATE TABLE `test1` (
`id1` VARCHAR(255) NULL COMMENT 'id1',
`id2` VARCHAR(255) NULL COMMENT 'id2',
`event_time` VARCHAR(255) NULL COMMENT '事件时间',
`event_date` VARCHAR(255) NULL COMMENT '事件日期',
`event_ts` VARCHAR(256) NULL COMMENT '事件发生时间戳(毫秒)',
`dt` VARCHAR(255) NOT NULL COMMENT '日期分区',
`hr` VARCHAR(255) NOT NULL COMMENT '小时分区'
) ENGINE = OLAP DUPLICATE KEY(`id1`) COMMENT 'xxx' PARTITION BY LIST(`dt`, `hr`) (
PARTITION p2024082021 VALUES IN (("2024-08-20", "21"))
) DISTRIBUTED BY HASH(`dt`, `hr`) BUCKETS 2 PROPERTIES (
"replication_allocation" = "tag.location.default: 1",
"min_load_replica_num" = "-1",
"is_being_synced" = "false",
"storage_medium" = "hdd",
"storage_format" = "V2",
"inverted_index_storage_format" = "V1",
"light_schema_change" = "true",
"disable_auto_compaction" = "false",
"enable_single_replica_compaction" = "false",
"group_commit_interval_ms" = "10000",
"group_commit_data_bytes" = "134217728"
);
```
1. 修改前表已有分区的bucket num为2

2. 修改List分区表bucket num(从2 -> 4)

3. 修改List分区后,添加的新分区bucket num为4

Co-authored-by: tongyang.han <tongyang.han@jiduauto.com >
2024-08-23 11:52:16 +08:00
eeacca9e8e
[cherry-pick](branch-2.1) fix hudi count bug ( #39785 )
...
bp #39656
2024-08-22 22:35:44 +08:00
29898d0dbc
[fix](downgrade) partition key should write upper case null type to image ( #39776 )
2024-08-22 18:42:08 +08:00
40500f2b10
[branch-2.1](thirdparty) upgrade arrow to 17.0.0 ( #39773 )
...
pick #38572
2024-08-22 18:35:32 +08:00
f553645a71
[fix](mtmv) transfer col in mysql varchar to text when create MTMV ( #37668 ) ( #39727 )
...
pick from master #37668
2024-08-22 15:20:59 +08:00
10f3e88f7a
[fix](nereids) fix distribution expr list ( #39435 )
...
pick from #39148
2024-08-22 15:19:51 +08:00
1c566253a8
[Pick][Improment]Query queued by be memory ( #37559 ) ( #39733 )
...
pick #37559
2024-08-22 15:14:47 +08:00
fd13962015
[chore](nereids) Added compatibility with mysql alias conflict ( #38104 ) ( #38440 )
...
throw table name/alias conflict exception to keep same behavior with mysql
for example:
```sql
select * from test.a b, test.b
```
error:
```
Not unique table/alias: 'b'
```
2024-08-22 14:37:49 +08:00
50f440e653
[chore](nereids) Added compatibility with mysql alias filter ( #39738 )
...
qt_filter_select4 """
select * from filter_alias_test.test b where filter_alias_test.b.id = 1;
"""
qt_filter_select5 """
select * from internal.filter_alias_test.test b where internal.filter_alias_test.b.id = 1;
"""
2024-08-22 14:36:14 +08:00
83acdc14f4
[fix](Nereids) should push project through limit after eliminate union node ( #39640 ) ( #39755 )
...
pick from master #39640
otherwise:
push limit through union could generate plan:
limit
+-- union
|-- limit
| +-- empty relation
+-- limit
+-- project
and then eliminate union will generate plan:
+-- limit
+- project
+-- limit
+-- project
it could not be processed by tranlator correctly
2024-08-22 13:57:39 +08:00
ca9e50e49d
[fix](Nereids) fix insert into table with null literal default value ( #39122 ) ( #39669 )
...
cherry-pick: #39122
Problem:
when use insert with default value null, it can not be insert
successfully
Solved:
when column is allow to be null, it can be null in create table with
null default value
## Proposed changes
Issue Number: close #xxx
<!--Describe your changes.-->
2024-08-22 10:37:50 +08:00
8f580b523f
[opt](nereids) support partitionTopn for multi window exprs ( #39687 )
...
## Proposed changes
pick from https://github.com/apache/doris/pull/38393
Co-authored-by: xiongzhongjian <xiongzhongjian@selectdb.com >
2024-08-22 10:34:36 +08:00
021982fc71
[fix](mtmv) Fix some pr to 21, prs are ( #39041 )( #38958 )( #39541 ) ( #39678 )
...
## Proposed changes
pr: https://github.com/apache/doris/pull/39041
commitId: 22562985
pr: https://github.com/apache/doris/pull/38958
commitId: c365cb64
pr: https://github.com/apache/doris/pull/39541
commitId: 89bb669c
2024-08-22 10:27:55 +08:00
d36ba97eae
[fix](statistics) correct update rows when doing multi-table load ( #39548 )
...
rows of only one table is updated correctly, need to merge all table
commit infos.
Signed-off-by: freemandealer <freeman.zhang1992@gmail.com >
2024-08-22 09:57:45 +08:00
ed9794a0fe
[Pick][Improment]publish workload to BE by tag ( #38486 ) ( #39730 )
...
A workload group's tag property may be three cases as below: 1 empty
string, null or '', it could be published to all BE. 2 a value match
some BE' location, then the workload group could only be published to
the BE with same tag.
3 not an empty string, but some invalid string which can not math any
BE's location, then it could not be published any BE.
## Proposed changes
Issue Number: close #xxx
<!--Describe your changes.-->
2024-08-22 00:48:16 +08:00
92671cbb73
[opt](Nereids) do not fallback if nereids failed because timeout ( #39499 ) ( #39718 )
...
pick from master #39499
since legacy planner will cost more time to plan, fallback will be worse
than throw exception directly
2024-08-22 00:45:23 +08:00
a55e109e97
[pick][Improment]Add schema table workload_group_privileges ( #38436 ) ( #39708 )
...
pick #38436
2024-08-22 00:44:43 +08:00
1e47d11560
[Improvement](runtime-filter) send RUNTIME_BLOOM_FILTER_MAX_SIZE to backends ( #39686 )
...
…ackends (#38972 )
## Proposed changes
pick from #38972
2024-08-22 00:37:25 +08:00
b5bdc325af
[fix](profile) task type not the same in observer and master ( #39245 ) ( #39698 )
...
pick #39245 to branch-2.1
2024-08-22 00:31:52 +08:00
76b7a5291a
[BUG] fix partition storage policy info lost ( #38700 ) ( #39677 )
...
## Proposed changes
cherry-pick from #38700
Issue Number: close #xxx
<!--Describe your changes.-->
---------
## Proposed changes
Issue Number: close #xxx
<!--Describe your changes.-->
Co-authored-by: garenshi <garenshi@tencent.com >
2024-08-22 00:23:19 +08:00
77270a0479
[enhance](mtmv)Improve the performance of obtaining partition/table v… ( #39478 )
...
…ersions (#39301 )
pick: https://github.com/apache/doris/pull/39301
2024-08-22 00:07:52 +08:00
b878f7f1a6
[fix](planner) binary predicate result should compare with 0 ( #39474 ) ( #39717 )
...
pick from master #39474
2024-08-21 20:47:14 +08:00
2890a98b18
[fix](mtmv) mtmv getPlanBySql should not reuse ctx's StatementContext ( #39690 ) ( #39719 )
...
pick from master #39690
2024-08-21 19:20:40 +08:00
3a59ee1c5d
[fix](auditlog)Record return row count in audit log for internal query. ( #39616 ) ( #39702 )
...
backport: https://github.com/apache/doris/pull/39616
2024-08-21 17:37:01 +08:00
1460878bdf
[fix](cluster key) forbid cluster key and remove case ( #39679 )
...
branch-2.1 does not support mow cluster key
2024-08-21 14:31:54 +08:00
ba3b56d269
[fix](nereids)prevent null pointer exception if datetime value overflows ( #39675 )
...
pick from master https://github.com/apache/doris/pull/39482
2024-08-21 14:17:34 +08:00
0bfcee1251
[opt](file-cache) support system table file_cache_statistics ( #39552 )
...
1. Add new system table: `file_cache_statistics`
This table is used for viewing metrics related to file cache on BE side
```
mysql> select * from information_schema.file_cache_statistics limit 10;
+-------+---------------+----------------------------+--------------------------------+--------------------+
| BE_ID | BE_IP | CACHE_PATH | METRIC_NAME | METRIC_VALUE |
+-------+---------------+----------------------------+--------------------------------+--------------------+
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ |
disposable_queue_curr_elements | 0 |
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ |
disposable_queue_curr_size | 0 |
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ |
disposable_queue_max_elements | 102400 |
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ |
disposable_queue_max_size | 21474836480 |
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ | hits_ratio |
0.8539634687001242 |
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ | hits_ratio_1h | 0
|
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ | hits_ratio_5m | 0
|
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ |
index_queue_curr_elements | 0 |
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ |
index_queue_curr_size | 0 |
| 10003 | 172.20.32.136 | /mnt/output/be/file_cache/ |
index_queue_max_elements | 102400 |
+-------+---------------+----------------------------+--------------------------------+--------------------+
```
It will show metrics of file caches on each BE.
2. Add new metrics `hits_ratio_1h` and `hits_ratio_5m` for file cache
This 2 metrics will show the hit ratio of file cache in recent 1 hour or
5 minutes.
So that we can know recent hit ratio instead of global historical hit
ratio.
2024-08-21 10:03:39 +08:00
bf26f49505
[bugfix](external)add check of engine and catalog types for 2.1 #39343 ( #39643 )
...
bp #39343
2024-08-21 09:50:17 +08:00