Commit Graph

7090 Commits

Author SHA1 Message Date
e2350403a6 [fix](plan) fix wrong result for random distributed agg table with all keys not null (#36271) 2024-06-18 11:25:31 +08:00
4ae8607b2e [fix](hudi) disable fs.impl.cache to avoid FE OOM (#36402) (#36403)
bp #36402
2024-06-17 22:20:23 +08:00
4008a04da7 [bugfix](paimon)Fix field case issues for 2.1 (#36288)
bp:  #36239
2024-06-17 18:38:00 +08:00
98fccb1809 [improvement](build index)Make build index and clone mutually exclusive and add timeout for index change job (#36293)
Currently the index change job and clone task can be executed at the
same time. If the clone task gets stuck at this point, it will cause the
index change job to get stuck as well and keep retrying. To solve this
problem, we can refer to alter job and make index change job exclusive
with clone task, and introduce the timeout to prevent infinite retries
of build index.

Add the following checks and status in FE.
1. Check if table is stable (build index is not allowed when clone is in
progress)
1.1. Tablet is HEALTHY.
1.2. Whether the tablet is included in the Tablet scheduler, if so, it
means the current tablet is doing clone.
2. When creating the index change job, set the timeout at the same time.

pick from master #35724
2024-06-16 09:34:32 +08:00
55b4cf1658 [fix](load) fix NPE in LoadManager#jobRemovedTrigger() (#36173) (#36337)
cherry-pick #36173
2024-06-15 23:06:31 +08:00
bfab7a2537 [fix](shuffle) fix tablets num calculation in shuffle condition (#36050) (#36339)
cherry-pick #36050
2024-06-15 23:06:00 +08:00
bfb41c15de [fix](statistics)Fix sync analyze job timeout block bug. (#36199)
Fix sync analyze job timeout block bug. When a task of a analyze job
timeout, it should throw an exception instead of finish silently.
2024-06-14 09:47:51 +08:00
a23aee2883 [fix](broker) fix no error url when broker data quality error (#35643) (#36089)
## Proposed changes

cherry-pick from #35643
2024-06-14 09:29:14 +08:00
e2f7e0da0a [Fix](nereids) fix merge aggregate rule, rules should not have mutable members (#36223)
cherry-pick #36145  to branch-2.1
2024-06-13 17:49:57 +08:00
d70751a808 [fix](planner)remove constant expr in window function's partition and order exprs (#36185)
pick from master https://github.com/apache/doris/pull/36184
2024-06-13 15:05:21 +08:00
e51cd58d6e [fix](clone) fix check replica failed due to replica had drop #35994 (#36219)
cherry pick from #35994
2024-06-13 13:39:09 +08:00
375770f2b4 [fix](hudi) move wrong members in HMSExternalTable (#36187)
Previously, there are 2 members: TableScanParams and IncrementalRelation
in HMSExternalTable.
These 2 members are for Hudi's incremental query, so their lifecycle
should be with query task,
should not be saved in HMSExternalTable.

This PR mainly changes:

- Add LogicalHudiScan and PhysicalHudiScan, extends from LogicalFileScan
and PhysicalFileScan.
- Move TableScanParams and IncrementalRelation from HMSExternalTable to
XXXHudiScan.
- Add or modify related Nereids rules
2024-06-13 11:50:40 +08:00
226775f059 [Feature](Point Query) fully support in nereids #35823 (#36205) 2024-06-13 08:37:31 +08:00
3a3c8cd9ee [cherry-pick](branch-2.1) fix inverted index format is lost during a schema change #36059 (#36100) 2024-06-12 23:06:51 +08:00
6d54527395 [fix](dynamic partition) fix dynamic partition thread met uncatch exception #35778 (#36166)
cherry pick from #35778
2024-06-12 22:16:51 +08:00
9708ca8fcb [Feature](Prepared Statment) Implement in nereids planner (#35318) (#36172) 2024-06-12 19:54:17 +08:00
0b28420e1c [pick](Variant) make remote schema fetch rpc timeout configurable (#35296) (#36174) 2024-06-12 19:51:53 +08:00
c78c7f6b45 [branch-2.1](test) fix some tests in external p0 (#36127)
Also move the analysis exception of "Not support insert with partition
spec in hive catalog."
from create sink phase to bind sink phase.
So that when `set enable_fallback_to_original_planner=false;`, the
return error will be correct.
2024-06-11 22:15:28 +08:00
acbfcf7ad9 [fix](Nereids) fix four phase aggregation compute wrong result (#36131)
cherry pick from #36128
2024-06-11 20:40:18 +08:00
d2a6911791 [opt](split) close the batch mode of file split in default (#36109)
bp: #36108
2024-06-11 19:19:09 +08:00
3b23eee37c Revert "[fix](auto-partition) fix auto partition load lost data in multi sender (#35287)" (#36098)
Reverts apache/doris#35630 because it brought some more damaging bugs.
we will fix it and merge in next version
2024-06-11 17:11:42 +08:00
afe2c57e05 [Fix](explain) fix tablet showing problem (#35830) (#36028)
cherry-pick: #35830
2024-06-11 10:55:35 +08:00
75a6f28f2e [cherry-pick]Add query type when report (#35918)
pick #34978
2024-06-11 10:51:59 +08:00
936bf65622 [fix](nereids)decimal and datetime literal comparison should compare datatype too (#36064)
pick from master #36055
2024-06-08 22:01:37 +08:00
9e972cb0b9 [bugfix](iceberg)Fix the datafile path error issue for 2.1 (#36066)
bp: #35957
2024-06-08 21:51:46 +08:00
075481faf1 [opt](Nereids) use date signature for date arithmetic as far as possible (#36060)
pick from master #35863
2024-06-08 09:05:34 +08:00
16fcdcd4b7 [fix](Nereids) not do distinct when aggregate with distinct project (#36057)
pick from master #35899
2024-06-08 09:04:56 +08:00
240d8938f8 [bugfix](iceberg)Fixed missing type of iceberg table for timetravel for 2.1 (#36048)
bp:  #36047
2024-06-07 21:13:56 +08:00
5148c3908e [fix](mtmv)fix mtmv show partition error when base table dropped (#35729) (#36051)
bp #35729
2024-06-07 21:09:41 +08:00
67f4d88988 [enhancement](Nereids) support 4 phases distinct aggregate with full distribution (#36016)
cherry pick from #35871
2024-06-07 21:08:33 +08:00
19bc98a11a [pick 2.1 ][fix ut][fix](inverted index) cloud mode supports lowercase (#32841) (#36034)
pick from master #32841 
Co-authored-by: zzzxl <33418555+zzzxl1993@users.noreply.github.com>
2024-06-07 17:08:29 +08:00
a518915626 [fix](pipeline) Do not push data in local exchange if eos (#35972) (#36010)
pick #35972 and #34536
2024-06-07 15:40:55 +08:00
9f3fe3e57c [fix](DDL) not set table type as default comment when create table (#36025)
pick from master #35855
2024-06-07 15:29:10 +08:00
f751ca4e04 [branch-2.1](functions) fix be crash for function random_bytes and mark_first/last_n (#36003)
pick #35884
2024-06-07 10:30:41 +08:00
c794ea18c8 [fix](multi-catalog)put java udf to custom lib (#35984)
bp #34990
2024-06-06 22:54:24 +08:00
9efc7b63ec [fix](mtmv)Mtmv support row column (#35860) (#35956)
pick from master: #35860
2024-06-06 22:53:08 +08:00
4d5db6fee3 [fix](proc) fix keyword case sensitive issue for stmt show_frontends_disks (#35919) 2024-06-06 19:55:24 +08:00
fedb7e15a0 [fix](statistics) NPE when drop partition during publish (pick #35475) (#35977)
skip null partition when get base tablets for each be (for further usage
in dedup updated row count in MV) This may cause publish fail

cherry pick master #35475
2024-06-06 16:47:30 +08:00
726377a65c [bugfix](hive)fix the error message when creating hive table for 2.1 (#35975)
bp #35833
2024-06-06 16:32:10 +08:00
cd808c3ea0 [fix](mtmv) Fix that the storage medium specified for the mtmv is SSD, but the partition storage medium for the mtmv is still HDD (#35644) (#35955)
pick from master:#35644
2024-06-06 15:36:49 +08:00
104fcea725 [fix](mtmv) mtmv disable dynamic partition (#35686) (#35949)
bp #35686

Co-authored-by: zhangdong <493738387@qq.com>
2024-06-06 14:43:40 +08:00
5cecbfc6ea [cherry-pick]Add workload metric query_be_memory (#35911) 2024-06-06 14:33:30 +08:00
b6ab0c4e59 [fix](multi-catalog)fix hive partition insert regression case (#35846)
from #35821
2024-06-06 11:35:04 +08:00
b5a35b9cef [FIX] Pick array inverted index bugfix (#35837)
here with some array with inverted index bugfix:
see also: 
https://github.com/apache/doris/pull/34766
https://github.com/apache/doris/pull/35086
https://github.com/apache/doris/pull/34683
https://github.com/apache/doris/pull/34076
2024-06-06 09:54:14 +08:00
bcde9c6524 [enhancement](nereids)eliminate repeat node if there is only 1 grouping set and no grouping scalar function (#35872) 2024-06-05 18:03:20 +08:00
af31e96c4e [fix](storage_policy) fix cannot cancel a partition's storage policy (#35890) 2024-06-05 15:53:26 +08:00
630fd06ccf Pick "[feature](Resource) Support to specify the root path for hdfs resource #32632" (#35848)
same as #32632
2024-06-05 08:59:20 +08:00
0585de12b5 [Improvement](coldhot) add statement to show objects which use storage policy (#35839) 2024-06-04 19:24:31 +08:00
5c8f87e01e [opt](log) refine the FE logger (#35679)
Previously, FE logs were written to files. The main FE logs include
fe.log, fe.warn.log, fe.audit.log, fe.out, and fe.gc.log.
In a K8s deployment environment, logs usually need to be output to
standard output, and then other components process the log stream.

This PR made the following changes:

1. Modified the log4j configuration template

- When started with `--daemon`, logs are still written to various files,
and the format remains unchanged.
- When started with `--console`, all logs are output to standard output
and marked with different prefixes:

		- `StdoutLogger`: logs for standard output
		- `StderrLogger`: logs for standard error output
		- `RuntimeLogger`: logs for fe.log or fe.warn.log
		- `AuditLogger:` logs for fe.audit.log
		- No prefix: logs for fe.gc.log

		Examples are as follows:

		```
RuntimeLogger 2024-06-03 14:54:51,229 INFO (binlog-gcer|62)
[BinlogManager.gc():359] begin gc binlog
		```

2. Added a new FE config: `enable_file_logger`

Defaults to true. Indicates that logs will be recorded to files
regardless of the startup method. For example, if it is started with
`--console`, the log will be output to both the file and the standard
output. If it is `false`, the log will not be recorded in the file
regardless of the startup method.

3. Optimized the log format of standard output

The byte streams of stdout and stderr are captured. The logs previously
outputted using `System.out` will be captured in fe.log for unified
management.
2024-06-04 18:20:30 +08:00
f94222a04e [fix](log) Support fe log rollover size strategy (#34446) 2024-06-04 18:18:16 +08:00