Commit Graph

18982 Commits

Author SHA1 Message Date
9efc7b63ec [fix](mtmv)Mtmv support row column (#35860) (#35956)
pick from master: #35860
2024-06-06 22:53:08 +08:00
fbc82e0253 [opt](log) refine the BE logger (#35942) (#35988)
bp #35942
2024-06-06 22:25:22 +08:00
af779f5cd8 Pick "[fix](gclog) Skip tablet dir without schema hash dir in path gc (#32793)" (#35978)
## Proposed changes
Pick "[fix](gclog) Skip tablet dir without schema hash dir in path gc
(#32793)"
2024-06-06 22:24:30 +08:00
5966354165 [FIX](cases)fix cases for test_ip_in_inverted_index (#35971)
bp #35881
2024-06-06 21:52:53 +08:00
c3a4d36036 [Pick 2.1](inverted index) fix tokenize function wrong result when params with space seperator (#35997)
Pick from (#32671)
2024-06-06 21:51:15 +08:00
4d5db6fee3 [fix](proc) fix keyword case sensitive issue for stmt show_frontends_disks (#35919) 2024-06-06 19:55:24 +08:00
a42b06a168 [branch-2.1][test](jdbc catalog) Change the db2 image address and repair test (#35967) 2024-06-06 17:21:40 +08:00
170e87a17a [branch-2.1](github) remove need 2 approval workflow (#35985)
branch-2.1 does not need this workflow
2024-06-06 17:06:49 +08:00
1d5b7cb559 [fix](branch-2.1)(jdbc catalog) fix mariadb test conf port (#35982) 2024-06-06 16:54:20 +08:00
fedb7e15a0 [fix](statistics) NPE when drop partition during publish (pick #35475) (#35977)
skip null partition when get base tablets for each be (for further usage
in dedup updated row count in MV) This may cause publish fail

cherry pick master #35475
2024-06-06 16:47:30 +08:00
85328d4f79 [branch-2.1](test) comment out avro cases (#35963)
Avro scanner is deprecated. Remove related test suits
2024-06-06 16:43:37 +08:00
726377a65c [bugfix](hive)fix the error message when creating hive table for 2.1 (#35975)
bp #35833
2024-06-06 16:32:10 +08:00
cd808c3ea0 [fix](mtmv) Fix that the storage medium specified for the mtmv is SSD, but the partition storage medium for the mtmv is still HDD (#35644) (#35955)
pick from master:#35644
2024-06-06 15:36:49 +08:00
104fcea725 [fix](mtmv) mtmv disable dynamic partition (#35686) (#35949)
bp #35686

Co-authored-by: zhangdong <493738387@qq.com>
2024-06-06 14:43:40 +08:00
5cecbfc6ea [cherry-pick]Add workload metric query_be_memory (#35911) 2024-06-06 14:33:30 +08:00
b6ab0c4e59 [fix](multi-catalog)fix hive partition insert regression case (#35846)
from #35821
2024-06-06 11:35:04 +08:00
b5a35b9cef [FIX] Pick array inverted index bugfix (#35837)
here with some array with inverted index bugfix:
see also: 
https://github.com/apache/doris/pull/34766
https://github.com/apache/doris/pull/35086
https://github.com/apache/doris/pull/34683
https://github.com/apache/doris/pull/34076
2024-06-06 09:54:14 +08:00
efe17245b0 [Fix](inverted index) fix index_id wrong size in V2 (#35909)
This pull request modifies the index_id type in inverted index storage
format v2 to int64_t. The index_id is now stored in the inverted index
file using 4 bytes.
2024-06-05 20:36:40 +08:00
bcde9c6524 [enhancement](nereids)eliminate repeat node if there is only 1 grouping set and no grouping scalar function (#35872) 2024-06-05 18:03:20 +08:00
af31e96c4e [fix](storage_policy) fix cannot cancel a partition's storage policy (#35890) 2024-06-05 15:53:26 +08:00
fdd87fe008 [enhancement](delete-pred) enable delete sub predicate v2 for compaction (#35859) (#35895)
## Proposed changes

This PR enable `delete sub predicate v2` for compaction, and legacy
version of delete predicate will be processed in the original way.
2024-06-05 12:05:21 +08:00
630fd06ccf Pick "[feature](Resource) Support to specify the root path for hdfs resource #32632" (#35848)
same as #32632
2024-06-05 08:59:20 +08:00
c2b830e1e7 Pick "[Fix](Tablet) Fix the issue of redundant loading of stale rowset (#35768)" (#35882) 2024-06-05 07:55:04 +08:00
5541fd11e9 [branch-2.1](partial update)add logs for partial update (#35416)
add logs for partial update

the master PR is #35802

If this is a relatively large or complex change, kick off the discussion
at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why
you chose the solution you did and what alternatives you considered,
etc...
2024-06-04 22:47:48 +08:00
0585de12b5 [Improvement](coldhot) add statement to show objects which use storage policy (#35839) 2024-06-04 19:24:31 +08:00
5c8f87e01e [opt](log) refine the FE logger (#35679)
Previously, FE logs were written to files. The main FE logs include
fe.log, fe.warn.log, fe.audit.log, fe.out, and fe.gc.log.
In a K8s deployment environment, logs usually need to be output to
standard output, and then other components process the log stream.

This PR made the following changes:

1. Modified the log4j configuration template

- When started with `--daemon`, logs are still written to various files,
and the format remains unchanged.
- When started with `--console`, all logs are output to standard output
and marked with different prefixes:

		- `StdoutLogger`: logs for standard output
		- `StderrLogger`: logs for standard error output
		- `RuntimeLogger`: logs for fe.log or fe.warn.log
		- `AuditLogger:` logs for fe.audit.log
		- No prefix: logs for fe.gc.log

		Examples are as follows:

		```
RuntimeLogger 2024-06-03 14:54:51,229 INFO (binlog-gcer|62)
[BinlogManager.gc():359] begin gc binlog
		```

2. Added a new FE config: `enable_file_logger`

Defaults to true. Indicates that logs will be recorded to files
regardless of the startup method. For example, if it is started with
`--console`, the log will be output to both the file and the standard
output. If it is `false`, the log will not be recorded in the file
regardless of the startup method.

3. Optimized the log format of standard output

The byte streams of stdout and stderr are captured. The logs previously
outputted using `System.out` will be captured in fe.log for unified
management.
2024-06-04 18:20:30 +08:00
f94222a04e [fix](log) Support fe log rollover size strategy (#34446) 2024-06-04 18:18:16 +08:00
db3bbc2437 [feature](merge-cloud) Change fe log rolling max size (#32777) 2024-06-04 18:17:33 +08:00
4b5163c905 [Feat](nereids) add transform rule MergePercentileToArray (#35809)
cherry-pick #34313 to branch-2.1

MergePercentileToArray is to perform a transformation in this case:
select ss_item_sk, percentile(ss_quantity,0.9), percentile(ss_quantity,0.6), percentile(ss_quantity,0.3)
from store_sales group by ss_item_sk;
==>
select ss_item_sk, percentile_array(ss_quantity,[0.3,0.6,0.9]) from store_sales group by ss_item_sk;
2024-06-04 17:50:36 +08:00
c23ab25474 [fix](nereids)keep equal predicate as join conjunct even if it can be fold to null literal (#35842)
pick from master https://github.com/apache/doris/pull/35811

## Proposed changes

Issue Number: close #xxx

<!--Describe your changes.-->
2024-06-04 14:46:58 +08:00
bc6b316e87 [chore](index) add config enable_create_bitmap_index_as_inverted_index default true #33434 (#35521) 2024-06-04 12:07:03 +08:00
398919d090 2.1.4-rc02 2024-06-03 23:24:33 +08:00
fe1a4c4136 [Feature](IP) support ipv4/ipv6 with inverted index and conjuncts for query (#35734)
support data type ipv4/ipv6 with inverted index 
and then we can query like "> or < or >= or <= or in/not in " this
conjuncts expr for ip with inverted index speeding up
2024-06-03 23:24:03 +08:00
ba0161c8b9 Revert "[feature-wip](Cloud) Introduce azure core C++ sdk (#35208)" (#35826)
## Proposed changes

Issue Number: close #xxx

<!--Describe your changes.-->

This reverts commit #35641 because of the compilation of such is not
successful on arm plateform.
2024-06-03 23:22:56 +08:00
940e26f341 [feat](nereids) support Iceberg time travel syntax (#35812)
backport: #34681
Co-authored-by: Butao Zhang <zhangbutao@cmss.chinamobile.com>
2024-06-03 20:24:11 +08:00
9cbf8cb3bd 2.1.4-rc01 2024-06-03 18:57:32 +08:00
958687f7d4 [fix](Nereids) unix_timestamp compute signature and fold const is wrong (#35727)
1. compute signature should call super#computeSignature first
2. fold const return type not changed after signature changed in #26827

we already have p0 for this case, but our regression framework has bug
that it report success when compare decimal type if real result lose
scale
2024-06-03 18:56:53 +08:00
f80b856405 [enhancement](oom) return error when bloom filter allocate memory failed (#35790)
## Proposed changes


1. return error when bloom filter allocate memory failed
2. return error when deserialize a block,  it may need a lot of memory.

---------

Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-06-03 18:22:11 +08:00
3e096dda91 [Cherry-pick]Support kill query in be (#35794)
## Proposed changes
 pick #35602

```
mysql [information_schema]>kill query '2047df937c66704d-3ac4cfaf17f65eae';
Query OK, 0 rows affected (0.01 sec)


I20240603 15:21:50.373333 3355508 internal_service.cpp:592] Cancel query 2047df937c66704d-3ac4cfaf17f65eae, reason: USER_CANCEL

```
2024-06-03 15:39:30 +08:00
4f0365e0bf [fix](s3) move s3 providers to fe-common to be accessible for jni reader (#35779)
backport: #35690

`PropertyConverter.setS3FsAccess` has add customized s3 providers:
```
public static final List<String> AWS_CREDENTIALS_PROVIDERS = Arrays.asList(
            DataLakeAWSCredentialsProvider.class.getName(),
            TemporaryAWSCredentialsProvider.class.getName(),
            SimpleAWSCredentialsProvider.class.getName(),
            EnvironmentVariableCredentialsProvider.class.getName(),
            IAMInstanceCredentialsProvider.class.getName());
```
And these providers are set as configuration value of
`fs.s3a.aws.credentials.provider`, which will be used as configuration
to build s3 reader in JNI readers. However,
`DataLakeAWSCredentialsProvider` is in `fe-core`, that is not dependent
by JNI readers, so we have to move s3 providers to `fe-common'.
2024-06-03 14:04:39 +08:00
d4956bfaf5 do not use path style to access s3 (#35788)
## Proposed changes
2024-06-03 13:57:13 +08:00
a086111a96 [improve](session) print more error msg when set a wrong session variable name (#35775)
## Proposed changes
when set a wrong session variable, eg:
mysql [(none)]>set enable_profileXXXXXXX=true;
ERROR 1228 (HY000): errCode = 2, detailMessage = Unknown system variable
'enable_profileXXXXXXX', the similar variables are {'enable_profile',
'enable_force_spill', 'enable_projection'}

<!--Describe your changes.-->
2024-06-03 12:45:58 +08:00
f25b7fb4eb [fix](binlog) Add is temp for UpsertRecord (#35774)
Cherry-pick #35636.

The ccr-syncer does not support syncing temporary partitions, so this PR
adds a field to record whether this upsert record comes from a temporary
partition.
2024-06-03 12:41:29 +08:00
c033c71aed [enhancement](cancel) add cancel query ability on be 2024-06-03 11:43:57 +08:00
f03cee5e30 [enhancement](oom) add exception in olap data convertor when memory is not enough to prevent oom (#35761)
Issue Number: close #xxx

<!--Describe your changes.-->

---------

Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-06-02 21:12:53 +08:00
e755d64e62 [feature](be jvm monitor)append enable_jvm_monitor in be.conf to control jvm monitor. (#35608) (#35764)
bp #35608

Co-authored-by: daidai <2017501503@qq.com>
2024-06-02 00:18:44 +08:00
72489a04c3 [cherry-pick](branch-2.1) remove some CHECKs in Tablet::revise_tablet_meta (#31268) (#34702)
## Proposed changes

Issue Number: close #xxx

cherry-pick #31268 

## Further comments

If this is a relatively large or complex change, kick off the discussion
at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why
you chose the solution you did and what alternatives you considered,
etc...
2024-06-02 00:15:31 +08:00
75fd8c4226 [fix](drop table) Improve error prompts when deleting materialized views (#35437) (#35763)
bp #35437

Co-authored-by: HB <hubiao01@corp.netease.com>
2024-06-01 23:48:52 +08:00
2675014491 [fix](s3) fix invalid s3 properties checking logic (#35757)
Introduced from #35747 
pick part of #35762
2024-06-01 23:30:22 +08:00
bc95902fed [fix](Nereids): Convert VarcharLiteral to StringLikeLiteral in Function Signatures (#35536)
## Proposed changes

This pull request updates the function signatures where VarcharLiteral
is currently used, replacing it with StringLikeLiteral. This change aims
to enhance flexibility and consistency across functions that handle
similar types of string data. By adopting StringLikeLiteral, we can
support a broader range of string-like types beyond the basic VARCHAR
type, facilitating more robust and versatile string handling
capabilities in our codebase. This update ensures better type
abstraction and promotes code reusability.

<!--Describe your changes.-->

## Further comments

If this is a relatively large or complex change, kick off the discussion
at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why
you chose the solution you did and what alternatives you considered,
etc...
2024-06-01 14:01:29 +08:00