Commit Graph

7081 Commits

Author SHA1 Message Date
d70751a808 [fix](planner)remove constant expr in window function's partition and order exprs (#36185)
pick from master https://github.com/apache/doris/pull/36184
2024-06-13 15:05:21 +08:00
e51cd58d6e [fix](clone) fix check replica failed due to replica had drop #35994 (#36219)
cherry pick from #35994
2024-06-13 13:39:09 +08:00
375770f2b4 [fix](hudi) move wrong members in HMSExternalTable (#36187)
Previously, there are 2 members: TableScanParams and IncrementalRelation
in HMSExternalTable.
These 2 members are for Hudi's incremental query, so their lifecycle
should be with query task,
should not be saved in HMSExternalTable.

This PR mainly changes:

- Add LogicalHudiScan and PhysicalHudiScan, extends from LogicalFileScan
and PhysicalFileScan.
- Move TableScanParams and IncrementalRelation from HMSExternalTable to
XXXHudiScan.
- Add or modify related Nereids rules
2024-06-13 11:50:40 +08:00
226775f059 [Feature](Point Query) fully support in nereids #35823 (#36205) 2024-06-13 08:37:31 +08:00
3a3c8cd9ee [cherry-pick](branch-2.1) fix inverted index format is lost during a schema change #36059 (#36100) 2024-06-12 23:06:51 +08:00
6d54527395 [fix](dynamic partition) fix dynamic partition thread met uncatch exception #35778 (#36166)
cherry pick from #35778
2024-06-12 22:16:51 +08:00
9708ca8fcb [Feature](Prepared Statment) Implement in nereids planner (#35318) (#36172) 2024-06-12 19:54:17 +08:00
0b28420e1c [pick](Variant) make remote schema fetch rpc timeout configurable (#35296) (#36174) 2024-06-12 19:51:53 +08:00
c78c7f6b45 [branch-2.1](test) fix some tests in external p0 (#36127)
Also move the analysis exception of "Not support insert with partition
spec in hive catalog."
from create sink phase to bind sink phase.
So that when `set enable_fallback_to_original_planner=false;`, the
return error will be correct.
2024-06-11 22:15:28 +08:00
acbfcf7ad9 [fix](Nereids) fix four phase aggregation compute wrong result (#36131)
cherry pick from #36128
2024-06-11 20:40:18 +08:00
d2a6911791 [opt](split) close the batch mode of file split in default (#36109)
bp: #36108
2024-06-11 19:19:09 +08:00
3b23eee37c Revert "[fix](auto-partition) fix auto partition load lost data in multi sender (#35287)" (#36098)
Reverts apache/doris#35630 because it brought some more damaging bugs.
we will fix it and merge in next version
2024-06-11 17:11:42 +08:00
afe2c57e05 [Fix](explain) fix tablet showing problem (#35830) (#36028)
cherry-pick: #35830
2024-06-11 10:55:35 +08:00
75a6f28f2e [cherry-pick]Add query type when report (#35918)
pick #34978
2024-06-11 10:51:59 +08:00
936bf65622 [fix](nereids)decimal and datetime literal comparison should compare datatype too (#36064)
pick from master #36055
2024-06-08 22:01:37 +08:00
9e972cb0b9 [bugfix](iceberg)Fix the datafile path error issue for 2.1 (#36066)
bp: #35957
2024-06-08 21:51:46 +08:00
075481faf1 [opt](Nereids) use date signature for date arithmetic as far as possible (#36060)
pick from master #35863
2024-06-08 09:05:34 +08:00
16fcdcd4b7 [fix](Nereids) not do distinct when aggregate with distinct project (#36057)
pick from master #35899
2024-06-08 09:04:56 +08:00
240d8938f8 [bugfix](iceberg)Fixed missing type of iceberg table for timetravel for 2.1 (#36048)
bp:  #36047
2024-06-07 21:13:56 +08:00
5148c3908e [fix](mtmv)fix mtmv show partition error when base table dropped (#35729) (#36051)
bp #35729
2024-06-07 21:09:41 +08:00
67f4d88988 [enhancement](Nereids) support 4 phases distinct aggregate with full distribution (#36016)
cherry pick from #35871
2024-06-07 21:08:33 +08:00
19bc98a11a [pick 2.1 ][fix ut][fix](inverted index) cloud mode supports lowercase (#32841) (#36034)
pick from master #32841 
Co-authored-by: zzzxl <33418555+zzzxl1993@users.noreply.github.com>
2024-06-07 17:08:29 +08:00
a518915626 [fix](pipeline) Do not push data in local exchange if eos (#35972) (#36010)
pick #35972 and #34536
2024-06-07 15:40:55 +08:00
9f3fe3e57c [fix](DDL) not set table type as default comment when create table (#36025)
pick from master #35855
2024-06-07 15:29:10 +08:00
f751ca4e04 [branch-2.1](functions) fix be crash for function random_bytes and mark_first/last_n (#36003)
pick #35884
2024-06-07 10:30:41 +08:00
c794ea18c8 [fix](multi-catalog)put java udf to custom lib (#35984)
bp #34990
2024-06-06 22:54:24 +08:00
9efc7b63ec [fix](mtmv)Mtmv support row column (#35860) (#35956)
pick from master: #35860
2024-06-06 22:53:08 +08:00
4d5db6fee3 [fix](proc) fix keyword case sensitive issue for stmt show_frontends_disks (#35919) 2024-06-06 19:55:24 +08:00
fedb7e15a0 [fix](statistics) NPE when drop partition during publish (pick #35475) (#35977)
skip null partition when get base tablets for each be (for further usage
in dedup updated row count in MV) This may cause publish fail

cherry pick master #35475
2024-06-06 16:47:30 +08:00
726377a65c [bugfix](hive)fix the error message when creating hive table for 2.1 (#35975)
bp #35833
2024-06-06 16:32:10 +08:00
cd808c3ea0 [fix](mtmv) Fix that the storage medium specified for the mtmv is SSD, but the partition storage medium for the mtmv is still HDD (#35644) (#35955)
pick from master:#35644
2024-06-06 15:36:49 +08:00
104fcea725 [fix](mtmv) mtmv disable dynamic partition (#35686) (#35949)
bp #35686

Co-authored-by: zhangdong <493738387@qq.com>
2024-06-06 14:43:40 +08:00
5cecbfc6ea [cherry-pick]Add workload metric query_be_memory (#35911) 2024-06-06 14:33:30 +08:00
b6ab0c4e59 [fix](multi-catalog)fix hive partition insert regression case (#35846)
from #35821
2024-06-06 11:35:04 +08:00
b5a35b9cef [FIX] Pick array inverted index bugfix (#35837)
here with some array with inverted index bugfix:
see also: 
https://github.com/apache/doris/pull/34766
https://github.com/apache/doris/pull/35086
https://github.com/apache/doris/pull/34683
https://github.com/apache/doris/pull/34076
2024-06-06 09:54:14 +08:00
bcde9c6524 [enhancement](nereids)eliminate repeat node if there is only 1 grouping set and no grouping scalar function (#35872) 2024-06-05 18:03:20 +08:00
af31e96c4e [fix](storage_policy) fix cannot cancel a partition's storage policy (#35890) 2024-06-05 15:53:26 +08:00
630fd06ccf Pick "[feature](Resource) Support to specify the root path for hdfs resource #32632" (#35848)
same as #32632
2024-06-05 08:59:20 +08:00
0585de12b5 [Improvement](coldhot) add statement to show objects which use storage policy (#35839) 2024-06-04 19:24:31 +08:00
5c8f87e01e [opt](log) refine the FE logger (#35679)
Previously, FE logs were written to files. The main FE logs include
fe.log, fe.warn.log, fe.audit.log, fe.out, and fe.gc.log.
In a K8s deployment environment, logs usually need to be output to
standard output, and then other components process the log stream.

This PR made the following changes:

1. Modified the log4j configuration template

- When started with `--daemon`, logs are still written to various files,
and the format remains unchanged.
- When started with `--console`, all logs are output to standard output
and marked with different prefixes:

		- `StdoutLogger`: logs for standard output
		- `StderrLogger`: logs for standard error output
		- `RuntimeLogger`: logs for fe.log or fe.warn.log
		- `AuditLogger:` logs for fe.audit.log
		- No prefix: logs for fe.gc.log

		Examples are as follows:

		```
RuntimeLogger 2024-06-03 14:54:51,229 INFO (binlog-gcer|62)
[BinlogManager.gc():359] begin gc binlog
		```

2. Added a new FE config: `enable_file_logger`

Defaults to true. Indicates that logs will be recorded to files
regardless of the startup method. For example, if it is started with
`--console`, the log will be output to both the file and the standard
output. If it is `false`, the log will not be recorded in the file
regardless of the startup method.

3. Optimized the log format of standard output

The byte streams of stdout and stderr are captured. The logs previously
outputted using `System.out` will be captured in fe.log for unified
management.
2024-06-04 18:20:30 +08:00
f94222a04e [fix](log) Support fe log rollover size strategy (#34446) 2024-06-04 18:18:16 +08:00
db3bbc2437 [feature](merge-cloud) Change fe log rolling max size (#32777) 2024-06-04 18:17:33 +08:00
4b5163c905 [Feat](nereids) add transform rule MergePercentileToArray (#35809)
cherry-pick #34313 to branch-2.1

MergePercentileToArray is to perform a transformation in this case:
select ss_item_sk, percentile(ss_quantity,0.9), percentile(ss_quantity,0.6), percentile(ss_quantity,0.3)
from store_sales group by ss_item_sk;
==>
select ss_item_sk, percentile_array(ss_quantity,[0.3,0.6,0.9]) from store_sales group by ss_item_sk;
2024-06-04 17:50:36 +08:00
c23ab25474 [fix](nereids)keep equal predicate as join conjunct even if it can be fold to null literal (#35842)
pick from master https://github.com/apache/doris/pull/35811

## Proposed changes

Issue Number: close #xxx

<!--Describe your changes.-->
2024-06-04 14:46:58 +08:00
bc6b316e87 [chore](index) add config enable_create_bitmap_index_as_inverted_index default true #33434 (#35521) 2024-06-04 12:07:03 +08:00
fe1a4c4136 [Feature](IP) support ipv4/ipv6 with inverted index and conjuncts for query (#35734)
support data type ipv4/ipv6 with inverted index 
and then we can query like "> or < or >= or <= or in/not in " this
conjuncts expr for ip with inverted index speeding up
2024-06-03 23:24:03 +08:00
940e26f341 [feat](nereids) support Iceberg time travel syntax (#35812)
backport: #34681
Co-authored-by: Butao Zhang <zhangbutao@cmss.chinamobile.com>
2024-06-03 20:24:11 +08:00
958687f7d4 [fix](Nereids) unix_timestamp compute signature and fold const is wrong (#35727)
1. compute signature should call super#computeSignature first
2. fold const return type not changed after signature changed in #26827

we already have p0 for this case, but our regression framework has bug
that it report success when compare decimal type if real result lose
scale
2024-06-03 18:56:53 +08:00
3e096dda91 [Cherry-pick]Support kill query in be (#35794)
## Proposed changes
 pick #35602

```
mysql [information_schema]>kill query '2047df937c66704d-3ac4cfaf17f65eae';
Query OK, 0 rows affected (0.01 sec)


I20240603 15:21:50.373333 3355508 internal_service.cpp:592] Cancel query 2047df937c66704d-3ac4cfaf17f65eae, reason: USER_CANCEL

```
2024-06-03 15:39:30 +08:00
4f0365e0bf [fix](s3) move s3 providers to fe-common to be accessible for jni reader (#35779)
backport: #35690

`PropertyConverter.setS3FsAccess` has add customized s3 providers:
```
public static final List<String> AWS_CREDENTIALS_PROVIDERS = Arrays.asList(
            DataLakeAWSCredentialsProvider.class.getName(),
            TemporaryAWSCredentialsProvider.class.getName(),
            SimpleAWSCredentialsProvider.class.getName(),
            EnvironmentVariableCredentialsProvider.class.getName(),
            IAMInstanceCredentialsProvider.class.getName());
```
And these providers are set as configuration value of
`fs.s3a.aws.credentials.provider`, which will be used as configuration
to build s3 reader in JNI readers. However,
`DataLakeAWSCredentialsProvider` is in `fe-core`, that is not dependent
by JNI readers, so we have to move s3 providers to `fe-common'.
2024-06-03 14:04:39 +08:00