Commit Graph

8906 Commits

Author SHA1 Message Date
ab3057b2d4 [Feat](nereids) support date function in partition prune (#38743) (#38898)
cherry-pick #38743 to branch-2.1
2024-08-06 09:13:13 +08:00
3b9394a8c7 [improvement](tablet scheduler) Adjust tablet sched priority to help load data succ #38528 (#38884)
cherry pick from #38528
2024-08-06 02:13:47 +08:00
9c020f9db1 [fix](fe) Fix the default value of ReplacePartitionClause.isStrictRange (#38688) (#38879)
2024-08-05 20:59:50 +08:00
ce75e6adfe [fix](group commit) Fix group commit debug log and improve performance (#38754) (#38841)
Pick https://github.com/apache/doris/pull/38754
2024-08-05 18:34:49 +08:00
0f0b0e9b37 [Feat](nereids) Support date_trunc function in partition prune (#38025) (#38849)
cherry-pick #38025 to branch-2.1
2024-08-05 18:29:10 +08:00
40567b5d69 [fix](nereids)support group_concat with distinct and order by (#38871)
pick from master https://github.com/apache/doris/pull/38080
2024-08-05 18:23:55 +08:00
bf1c7a1c15 [fix](clone) fix stale tablet report miss the new cloning replica #38695 (#38839)
cherry pick from #38695
2024-08-05 18:04:24 +08:00
994c56f914 [fix](txn) fix abortTxn by label does not acquire table write lock (#38777) (#38842)
pick https://github.com/apache/doris/pull/38777
2024-08-05 16:33:20 +08:00
65154f8abe [branch-2.1] (doris-future) Support auto partition name function (#38853)
cherry-pick https://github.com/apache/doris/pull/34258 to branch-2.1
2024-08-05 16:04:24 +08:00
5dfc5d2c77 [enhancement](querycancel) print detail message when query is cancelled (#38859)
---------

Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-08-05 14:47:03 +08:00
de9b9d6a39 [Fix](nereids) change char(0) to char(1), varchar(0) to varchar(65533) when create table (#38427) (#38530)
cherry-pick #38427 to branch-2.1

---------

Co-authored-by: morrySnow <101034200+morrySnow@users.noreply.github.com>
2024-08-05 09:18:18 +08:00
9430b27e68 [branch-2.1][improvement](jdbc catalog) improvement some jdbc catalog properties check order (#38770)
pick (#38439)

1. Move the execution of testJdbcConnection() to checkWhenCreating
instead of the constructor
2. Move the logic of renaming lower_case_table_names to
lower_case_meta_names to setDefaultPropsIfMissing
2024-08-05 09:14:04 +08:00
5d02c48715 [feature](hive)Support reading renamed Parquet Hive and Orc Hive tables. (#38432) (#38809)
bp #38432 

## Proposed changes
Add the `hive_parquet_use_column_names` and `hive_orc_use_column_names`
session variables so that Doris can read a Hive table after one of its
columns has been renamed.

These two session variables are modeled on `parquet_use_column_names`
and `orc_use_column_names` in the Trino Hive connector.

By default, these two session variables are true. When they are set to
false, reading orc/parquet will access the columns according to the
ordinal position in the Hive table definition.

For example:
```mysql
-- In Hive:
hive> create table tmp (a int , b string) stored as parquet;
hive> insert into table tmp values(1,"2");
hive> alter table tmp  change column  a new_a int;
hive> insert into table tmp values(2,"4");

-- In Doris:
mysql> set hive_parquet_use_column_names=true;
Query OK, 0 rows affected (0.00 sec)

mysql> select  * from tmp;
+-------+------+
| new_a | b    |
+-------+------+
|  NULL | 2    |
|     2 | 4    |
+-------+------+
2 rows in set (0.02 sec)

mysql> set hive_parquet_use_column_names=false;
Query OK, 0 rows affected (0.00 sec)

mysql> select  * from tmp;
+-------+------+
| new_a | b    |
+-------+------+
|     1 | 2    |
|     2 | 4    |
+-------+------+
2 rows in set (0.02 sec)
```

In Hive 3, `set parquet.column.index.access = true/false` and
`set orc.force.positional.evolution = true/false` control how the table
is read in the same way as these two session variables. However, for a
Parquet table in which a field inside a struct column has been renamed,
Hive and Doris behave differently.
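
The difference between the two modes can be illustrated with a minimal Python sketch. It is not the Doris reader; `read_row` and its parameters are hypothetical names used only to show name-based versus ordinal column resolution for the `tmp` table above.

```python
def read_row(file_columns, file_row, table_columns, use_column_names):
    """Map one row stored in a data file onto the current table schema."""
    if use_column_names:
        # Match by name: a table column missing from the file reads as NULL.
        by_name = dict(zip(file_columns, file_row))
        return [by_name.get(col) for col in table_columns]
    # Match by ordinal position in the table definition.
    return [file_row[i] if i < len(file_row) else None
            for i in range(len(table_columns))]


table_columns = ["new_a", "b"]      # schema after the rename
old_file = (["a", "b"], [1, "2"])   # written before the rename
new_file = (["new_a", "b"], [2, "4"])  # written after the rename

by_name = [read_row(c, r, table_columns, True) for c, r in (old_file, new_file)]
by_position = [read_row(c, r, table_columns, False) for c, r in (old_file, new_file)]
```

Here `by_name` corresponds to the first query result above (the old row has no `new_a` value), and `by_position` corresponds to the second.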
2024-08-05 09:06:49 +08:00
40767003c6 [Fix](ScanNode) Move the finalize phase of ScanNode to after the end of the Physical Translate phase (#38604)
bp: #37565

Currently, Doris first obtains splits and then performs projection.
After column pruning, it calls `updateRequiredSlots` to update the
scanRange information. However, the Trino connector's column pruning
pushdown needs to be completed before obtaining splits.

Therefore, we move the finalize phase of `ScanNode` to after the end of
the `Physical Translate` phase, so that `createScanRangeLocations` can
use the final columns, which have already been pruned.

2024-08-05 08:58:59 +08:00
f76397277e [fix](routine load) fix show routine load task result incorrect (#38523) (#38826)
pick (#38523)

Create a job:
```
CREATE ROUTINE LOAD testShow ON test_show_routine_load
COLUMNS TERMINATED BY ","
PROPERTIES
(
"max_batch_interval" = "5",
"max_batch_rows" = "300000",
"max_batch_size" = "209715200"
)
FROM KAFKA
(
"kafka_broker_list" = "127.0.0.1:19092",
"kafka_topic" = "test_show_routine_load",
"property.kafka_default_offsets" = "OFFSET_BEGINNING"
);
```
show routine load task:
```
SHOW ROUTINE LOAD TASK WHERE JobName = "testShow";
```
result:
```
ERROR 1105 (HY000): errCode = 2, detailMessage = The job named testshowdoes not exists or job state is stopped or cancelled
```

Fix: do not use the `toLowerCase` method on the job name.
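
A minimal Python sketch of why lowercasing breaks the lookup. The `jobs` map and `find_job` helper are hypothetical stand-ins, not Doris code; the sketch only assumes that job names are stored with their original case.

```python
# Hypothetical registry keyed by the job name exactly as it was created.
jobs = {"testShow": "RUNNING"}


def find_job(name, lowercase_first):
    """Look up a job state, optionally lowercasing the name first."""
    key = name.lower() if lowercase_first else name
    return jobs.get(key)


before_fix = find_job("testShow", lowercase_first=True)   # looks up "testshow"
after_fix = find_job("testShow", lowercase_first=False)   # looks up "testShow"
```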
2024-08-04 22:18:25 +08:00
79b07d0b8a [fix](routine load) fix enclose and escape can not set in routine load job (#38402) (#38825)
pick (#38402)
2024-08-04 22:17:12 +08:00
7c70f75198 [Fix](Load)Audit logs avoid recording certain sensitive information #38769 (#38784)
…

#38769
2024-08-04 10:53:03 +08:00
556f0fc784 [pick](json-keys) support json_keys function (#38631)
backport: https://github.com/apache/doris/pull/36411
2024-08-02 19:10:00 +08:00
2425730609 [enhance](auth)support cache ranger datamask and row filter (#37723) (#38575)
pick: https://github.com/apache/doris/pull/37723
2024-08-02 14:59:32 +08:00
f24d55fc94 [fix](syntax) multi statements must delim with semicolon (#38670) (#38753)
pick from master #38670
2024-08-02 14:49:51 +08:00
da7b2cf578 [refactor](catalog) set "use_meta_cache" default to true (#38244)(#38352)(#38619) (#38355)
bp #38244 #38352 #38619

---------

Co-authored-by: Yulei-Yang <yulei.yang0699@gmail.com>
2024-08-02 14:13:38 +08:00
d800434859 [Chore](Fe)Upgrade dependencies (#38509) (#38747)
## Proposed changes

upgrade spring-boot to 2.7.18
upgrade zookeeper to 3.9.2
upgrade jetty to 9.4.55.v20240627
upgrade ivy to 2.5.2
upgrade icu4j to 75.1
upgrade ini4j to 0.5.4

(cherry picked from commit 3f633c2018e86c6c842647262853d88ad63672bf)

pick #38509
2024-08-02 12:34:17 +08:00
f21d7e3833 [test](inverted index)Add cases for inverted index format v2 (#38132)(#38443) (#38222)
## Proposed changes

backport #38132 #38443
2024-08-02 12:04:26 +08:00
4f2ca43917 [minor](fe) simplfy some code in HMSExternalTable (#32344) (#38675)
bp #32344

Co-authored-by: DuRipeng <453243496@qq.com>
2024-08-02 11:27:10 +08:00
84d9b2fcf4 [pick](nestedtypes) support nested type with agg replace_if_not_null (#38719)
backport: https://github.com/apache/doris/pull/38304
2024-08-02 11:18:33 +08:00
e140613ae1 [fix](Nereids) remove db readlock before get table from db (#38660) (#38729)
pick from master #38660

An INSERT holds the read lock of the target table before planning. If
Nereids then needs the database read lock, a deadlock can occur, because
other statements take the database lock before the table lock.

For example:

insert: target table read lock -> database read lock
drop table: database write lock -> target table write lock
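
The lock-order inversion can be checked mechanically. The following Python sketch is an illustration only: `conflicting_lock_orders` is a hypothetical helper, and it ignores read/write modes by treating every lock as exclusive.

```python
from itertools import combinations


def conflicting_lock_orders(paths):
    """Return (path_a, path_b, lock_1, lock_2) tuples where path_a takes
    lock_1 before lock_2 and path_b takes them in the opposite order."""
    conflicts = set()
    for (name_a, order_a), (name_b, order_b) in combinations(paths.items(), 2):
        for first, second in combinations(order_a, 2):
            if (first in order_b and second in order_b
                    and order_b.index(second) < order_b.index(first)):
                conflicts.add((name_a, name_b, first, second))
    return conflicts


before_fix = conflicting_lock_orders({
    "insert": ["table", "database"],
    "drop table": ["database", "table"],
})
# With the database read lock removed from the insert path, no inversion remains.
after_fix = conflicting_lock_orders({
    "insert": ["table"],
    "drop table": ["database", "table"],
})
```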
2024-08-02 08:34:59 +08:00
555dccb1a4 [fix](bloom filter)Fix rename column with bloom filter (#38431) (#38662)
backport #38431
2024-08-01 19:01:14 +08:00
2562cf33a7 [fix](mtmv) Choose a valid partition column when there are both valid and invalid expressions (#38367) (#38684)
pick #38367
2024-08-01 19:00:28 +08:00
60091f072a [fix](auth)fix create table like need create_priv of existed table (#… (#38570)
…37879)

pick: https://github.com/apache/doris/pull/37879
2024-08-01 18:57:44 +08:00
a4e793752f [bugfix](iceberg)revert count(*) directly returned by fe for 2.1 (#38566) (#38655)
bp: #38566
2024-08-01 18:56:19 +08:00
e8690b62ee [fix](group commit) Pick add debug log show why group commit not work; delete wal when replay success (#38611) (#38659)
Pick https://github.com/apache/doris/pull/38611
2024-08-01 16:59:54 +08:00
cafcf7acc1 [cherry-pick](SSL) Fix ssl connection close 2.1 (#38587) (#38677)
## Proposed changes

Issue Number: close #38590 

When an SSL connection is closed, a specific packet is sent to signal
the close. The SSL engine is then shut down and outputs an empty
unwrapped result.

Therefore, handle this case explicitly to avoid a buffer overflow: break
out of the read flow and do the cleanup proactively.
2024-08-01 16:06:30 +08:00
b0943064e0 [fix](kerberos)fix and refactor ugi login for kerberos and simple authentication (#38607)
pick from  (#37301)
2024-08-01 14:01:32 +08:00
c310b92727 [bugfix](external)Modify the default value of pushdowncount for 2.1 (#37754) (#38591)
bp: #37754
2024-08-01 12:10:54 +08:00
057ee1905f [bugfix](hudi)add timetravel for nereids for 2.1 (#38324) (#38582)
## Proposed changes

bp #38324
2024-08-01 11:37:57 +08:00
41fa7bc9fd [bugfix](paimon)Fixed the reading of timestamp with time zone type data for 2.1 (#37716) (#38592)
bp: #37716
2024-08-01 10:23:06 +08:00
9f1e41c623 [Cherry-pick](branch-2.1) Pick "[Enhancement](audit log) Add print audit log sesssion variable #38419" (#38624)
Pick #38419
2024-08-01 10:19:19 +08:00
6bd93b119f [pick](cast)Feature cast complexttype2 json (#38632)
backport: https://github.com/apache/doris/pull/36548
2024-08-01 09:18:15 +08:00
098ac67a75 [fix](catalog)fix db name may be null in NotificationEvent (#38421) (#38596)
pick: https://github.com/apache/doris/pull/38421
2024-07-31 22:56:44 +08:00
b21b906306 [Fix](outfile) FE check the hdfs URI of outfile (#38602)
bp: #38203

1. Previously, if the root path of the HDFS URI started with two
slashes, the outfile export succeeded without errors, but the files were
not written to the expected path.
FE now removes the repeated '/' characters specified by users.

2. Move the test case for outfile HDFS from p2 to p0.
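
A minimal Python sketch of the kind of normalization described in item 1. The actual check lives in FE and may differ; `collapse_slashes` and the host and path below are made up for illustration.

```python
import re


def collapse_slashes(uri):
    """Collapse repeated '/' in the path part of a URI, keeping 'scheme://'."""
    scheme, sep, rest = uri.partition("://")
    if not sep:
        # No scheme: treat the whole string as a path.
        return re.sub(r"/{2,}", "/", uri)
    authority, slash, path = rest.partition("/")
    if not slash:
        # No path component at all, nothing to normalize.
        return uri
    cleaned = re.sub(r"/{2,}", "/", path).lstrip("/")
    return f"{scheme}://{authority}/{cleaned}"


normalized = collapse_slashes("hdfs://host:8020//tmp//outfile/exp_")
```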
2024-07-31 22:46:37 +08:00
ef8a1918c3 [case][fix](iceberg)move rest cases from p2 to p0 and fix iceberg version issue for 2.1 (#37898) (#38589)
bp: #37898
2024-07-31 22:41:56 +08:00
1c176db010 [bugfix](paimon)add support for 'in' and 'not in' for 2.1 #38390 (#38576)
## Proposed changes

bp: #38390
2024-07-31 22:38:27 +08:00
46dfb65b6b [fix](auth)fix show routine load db is null (#38574)
pick: https://github.com/apache/doris/pull/38365
2024-07-31 22:34:35 +08:00
96413e679d [branch-2.1](mtmv) Support read sync materialized view in async materialized view (#38462)
pick #37396

---------

Co-authored-by: liutang123 <liulijia@gmail.com>
2024-07-31 22:32:28 +08:00
002232b2da [improvement](statistics)Analyze all columns when partition first loaded. (#38606)
backport: https://github.com/apache/doris/pull/38540
2024-07-31 19:59:41 +08:00
a75511ae08 [Feature](inverted index) add no need read data optimize config (#38584)
pick from #36686
2024-07-31 19:39:17 +08:00
e26f2441bd [opt](Nereids): disable PRUNE_EMPTY_PARTITION rule in SqlTestBase.java #38246 (#38455)
## Proposed changes

#38246

Co-authored-by: zclllhhjj <zhaochangle@selectdb.com>
2024-07-31 14:34:35 +08:00
9d8b2e85ae [fix](partial-update) insert only without auto_inc column should not use partial update (#38229) (#38504)
cherry-pick #38229 to branch-2.1


2024-07-31 11:01:08 +08:00
9e696d72f1 [fix](nereids)check functionBuilders is not null before using it (#38535)
pick from master https://github.com/apache/doris/pull/38457
2024-07-31 11:00:39 +08:00
94111da2a9 [Fix](nereids) fix normalize repeat alias rewrite (#38166) (#38454)
cherry-pick #38166 to branch-2.1
2024-07-31 10:59:15 +08:00