Commit Graph

12542 Commits

Author SHA1 Message Date
28561f77e9 [fix](regression)fix test_hdfs_tvf regression_test out file : decimalv3 -> decimal (#22852) 2023-08-11 20:44:18 +08:00
84ee814bc3 [docs](docs) Update invalid pics of release note 1.1.0 and 2.0-beta (#22804) 2023-08-11 20:08:21 +08:00
130c47e669 [Fix](Nereids)add need forward for enable_nereids_dml and format some cases (#22888) 2023-08-11 19:35:29 +08:00
045843991a [Fix](Nereids) fix insert into table of random distribution for nereids (#22831)
Currently, inserting into a table with random distribution is not supported. This commit fixes that by setting the physical properties to Any.
2023-08-11 19:26:39 +08:00
a2fd488438 [chore](Nereids): polish StatsCalculatorTest (#22884) 2023-08-11 18:08:18 +08:00
7ac4df67ab [Fix](regression)Fix test_mysql_jdbc_catalog_nereids p2 test case (#22870)
Fix test_mysql_jdbc_catalog_nereids p2 test case.
2023-08-11 17:57:48 +08:00
a089fe3e43 [Improve](jni-avro)Reduce the volume of the avro-scanner-jar package (#22276)
The avro-scanner-jar package is reduced from 204M to 160M.

The Hadoop-related dependencies in the original avro pom were packaged directly into the jar, resulting in a jar size of about 200M. Since the Hadoop jars are already available in the BE lib, they can be referenced from there directly.
2023-08-11 17:26:14 +08:00
db69457576 [fix](avro)Fix S3 TVF avro format reading failure (#22199)
This PR fixes two issues:

1. When using the S3 TVF to query files in AVRO format, a change to `TFileType` turned the originally queried `FILE_S3` into `FILE_LOCAL`, causing the query to fail.
2. The parameters `s3.virtual.key` and `s3.virtual.bucket` are both removed. A new `S3Utils` is added in jni-avro to parse the bucket and key of S3.
The purpose of this change is mainly to unify the S3 parameters.
2023-08-11 17:22:48 +08:00
72e264dd59 [fix](executor)fix error when FixedContainer with null (#22850) 2023-08-11 17:20:50 +08:00
4f7c6aa27f [fix](case) update nereids_delete_using to pass run without load data (#22853)
Co-authored-by: stephen <hello-stephen@qq.com>
2023-08-11 17:16:29 +08:00
3e169511e3 [test](jdbc_mysql)update test_jdbc_query_mysql regression test result #22866 2023-08-11 17:15:14 +08:00
548226acfc [fix](planner)shouldn't change the child type to assignmentCompatibleType if it's INVALID_TYPE (#22841)
If the child type is changed to INVALID_TYPE, the later getBuiltinFunction call will fail.
2023-08-11 17:14:49 +08:00
bcac160013 [fix](broadcast shuffle) fix wrong result of broadcast shuffle (#22847)
When the data stream sender does a broadcast shuffle, it accumulates blocks up to the batch size and then sends them to the destinations. For local receivers, however, it sends ONLY the current block, which causes data loss.

This issue was introduced by #22218.

If #22218 is picked to the 2.0 branch, this PR also needs to be picked.
2023-08-11 17:01:11 +08:00
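The data loss described in bcac160013 can be illustrated with a toy Python model. This is a sketch, not Doris code: `BroadcastSender`, `batch_size`, and the plain lists standing in for remote and local receivers are all hypothetical. The sender accumulates blocks until the batch size is reached; in the buggy mode the local receiver gets only the current block, while in the fixed mode it gets the same accumulated batch as the remote destinations.

```python
class BroadcastSender:
    """Toy model of a sender that batches blocks before broadcasting."""

    def __init__(self, batch_size, remote, local, buggy=False):
        self.batch_size = batch_size  # rows to accumulate before sending
        self.remote = remote          # list standing in for remote destinations
        self.local = local            # list standing in for local receivers
        self.buggy = buggy            # True reproduces the described bug
        self.pending = []             # rows accumulated so far

    def send(self, block):
        self.pending.extend(block)
        if len(self.pending) < self.batch_size:
            return  # keep accumulating
        self.remote.extend(self.pending)
        # Bug: local receivers only get the current block, so rows that
        # were accumulated earlier are lost. Fix: send the whole batch.
        self.local.extend(block if self.buggy else self.pending)
        self.pending = []


def run(buggy):
    """Push four two-row blocks through a sender with batch_size=4."""
    remote, local = [], []
    sender = BroadcastSender(batch_size=4, remote=remote, local=local, buggy=buggy)
    for block in ([1, 2], [3, 4], [5, 6], [7, 8]):
        sender.send(block)
    return remote, local
```

With `run(buggy=True)` the local list ends up as `[3, 4, 7, 8]`, while `run(buggy=False)` delivers all eight rows to both lists.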
3e9ba632d7 [typo](docs) Add a guide to using SQL for the jdbc catalog (#22880) 2023-08-11 16:28:42 +08:00
0c38f42827 [fix](doc) Remove introduction to unstable features (#22832)
1. Remove introduction to unstable features
2. Rename some sub-titles to avoid mixed use of Chinese and English
2023-08-11 15:59:16 +08:00
f88f021e52 [fix](bug) Fix BE thread safe start and stop #22560 2023-08-11 15:34:10 +08:00
8c3b95c523 [Fix](multi-catalog) sync default catalog when forwarding query to master. (#22684)
Assume there is a hive catalog named hive_ctl, a hive db named db1, and a table named tbl1. If we connect to a slave FE and execute the following commands:

1. `switch hive_ctl`
2. `show partitions from db1.tbl1`

Then we will get an error like this:
```
MySQL [(none)]> show partitions from db1.tbl1;
ERROR 1049 (42000): errCode = 2, detailMessage = Unknown database 'default_cluster:db1'
```

The reason is that the slave FE forwards the `ShowPartitionStmt` to the master FE but does not sync the default catalog information, so the parser cannot find the db and throws this exception. This is just one case; other similar cases fail too.
2023-08-11 14:59:04 +08:00
72837a3ab4 [enhancement](Nereids): Plan equals() hashcode() don't need LogicalProprties (#22774)
- deepEquals doesn't need to compare LogicalProperties
- Plan equals() and hashCode() don't need LogicalProperties
2023-08-11 14:53:47 +08:00
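The idea in 72837a3ab4, equality and hashing that skip a derived property, can be shown with a small Python analogue. This is a sketch only, not the Nereids Java code: the `Plan` class and its fields here are hypothetical, and Python's `dataclasses` stands in for hand-written `equals()` and `hashCode()`.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Plan:
    """Hypothetical plan node; not the Nereids Plan class."""
    operator: str
    children: tuple = ()
    # Derived data: compare=False leaves it out of __eq__, and hash=False
    # leaves it out of the generated __hash__.
    logical_properties: dict = field(
        default_factory=dict, compare=False, hash=False
    )


# Two nodes that differ only in their derived properties.
a = Plan("scan", (), {"output": ["c1"]})
b = Plan("scan", (), {"output": ["c1", "c2"]})
same = (a == b) and (hash(a) == hash(b))
```

Here `same` is `True`, because only `operator` and `children` take part in the comparison and the hash.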
209f36f1bf [fix](multi-catalog)fix jdbc loader (#22814) 2023-08-11 14:36:19 +08:00
94a7b44540 [Improvement](log) add config to controll compression of fe log & fe audit log (#22865)
The FE log is large on a busy Doris cluster, so preserving historical logs costs a lot of disk space.
Enabling compression is a good way to save space,
and a gzip-compressed text file can be viewed without a separate decompression step.
2023-08-11 14:08:08 +08:00
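The point in 94a7b44540 that a gzip-compressed text log stays readable can be shown with Python's standard `gzip` module, which streams lines from a `.gz` file without unpacking it to disk first. This is a minimal sketch; the file name `fe.log.gz` and the log lines are made-up samples, not Doris defaults.

```python
import gzip
import os
import tempfile


def grep_gzip_log(path, needle):
    """Return the lines containing `needle` from a gzip-compressed text file."""
    with gzip.open(path, "rt", encoding="utf-8") as f:
        return [line.rstrip("\n") for line in f if needle in line]


# Build a small compressed log to search (sample content only).
log_path = os.path.join(tempfile.mkdtemp(), "fe.log.gz")
with gzip.open(log_path, "wt", encoding="utf-8") as f:
    f.write("INFO start\nWARN slow query\nINFO done\n")

matches = grep_gzip_log(log_path, "WARN")
```

On the command line, tools such as `zcat`, `zless`, and `zgrep` serve the same purpose.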
f2075d0a81 [Fix](multi-catalog) Fix decimal precision issue in regression test result. (#22819)
Fix decimal precision issue in regression test result.
2023-08-11 13:49:30 +08:00
080d613238 [enhancement](Nereids): speed up rewrite() (#22846)
- use Set<Integer> instead of Set<String> to speedup `contains`
- remove `getValidRules` and use `if` in `for` to avoid `toImmutableList`
2023-08-11 13:04:30 +08:00
caf496a67e [Chore](RoutineLoad)Change max_batch_interval minimum limit from 5 to 1 (#22858) 2023-08-11 12:02:20 +08:00
b9b9071c9b [improvement](create partition) create partition require quorum replicas succ (#22554) 2023-08-11 11:59:05 +08:00
e17779f193 [Dependency](fe)Upgrade dependency version (#22496)
Upgrade guava to 32.1.2-jre
Set ck dependency scope to provided
Upgrade okio to 3.4.0
Upgrade snake yaml to 1.33
Upgrade aws-java-sdk to 1.12.519
Upgrade hadoop to 3.3.6
2023-08-11 10:54:37 +08:00
9e45b49f16 [regression] disable p0 case window_function 0810 (#22822) 2023-08-11 10:32:13 +08:00
13cc7a31ab [fix](bug) Fix page handle safe exit #22849 2023-08-11 09:55:19 +08:00
be5d11e0ec [fix](inverted index) check all pushdown for match predicate (#22836) 2023-08-11 09:02:15 +08:00
0aa00026bb [fix](autoinc) ignore column property isAutoInc() for create table as select ... statement(#22827) 2023-08-10 23:25:54 +08:00
9dc0f80386 [log](tablet clone) add decommission replica log (#22799) 2023-08-10 21:41:45 +08:00
a99211d818 [test](ctas) add some ut for testing varchar length in ctas (#22817)
1. If derived from an origin column, e.g. `create table tbl1 as select col1 from tbl2`, the length will be the same as the origin column's.
2. If derived from a function, e.g. `create table tbl1 as select func(col1) from tbl2`, the length will be 65533.
3. If derived from a constant value, e.g. `create table tbl1 as select "abc" from tbl2`, the length will be 65533.
2023-08-10 20:48:12 +08:00
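The three varchar-length rules listed in a99211d818 can be written down as a small lookup. This is an illustrative Python sketch, not Doris code: the helper `ctas_varchar_length` and its `source_kind` labels are hypothetical, and only the value 65533 comes from the commit message.

```python
MAX_VARCHAR_LENGTH = 65533  # length quoted in the commit message


def ctas_varchar_length(source_kind, origin_length=None):
    """Hypothetical helper encoding the three rules from the commit message.

    source_kind: "column", "function", or "constant".
    origin_length: declared length of the origin column ("column" only).
    """
    if source_kind == "column":
        if origin_length is None:
            raise ValueError("origin_length is required for a column source")
        return origin_length  # rule 1: same as the origin column
    if source_kind in ("function", "constant"):
        return MAX_VARCHAR_LENGTH  # rules 2 and 3
    raise ValueError(f"unknown source kind: {source_kind}")
```

For example, `ctas_varchar_length("column", 20)` returns 20, while both `"function"` and `"constant"` sources return 65533.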
71807ceb5f [Enhancement](tvf) Table value function support reading local file (#17404)
I tested the local TVF with TPC-H queries. First, generate a `lineitem` dataset with 6001215 rows and load it into the `lineitem` table with:
```
insert into lineitem select c11, c1, c4, c2, c3, c5, c6, c7, c8, c9, c10, c12, c13, c14, c15, c16 
from local(
        "file_path" = "tools/tpch-tools/bin/tpch-data/lineitem.tbl.1", 
        "backend_id" = "10003", 
        "format" = "csv", 
        "column_separator" = "|"
);
```
Then run the `q1` and `q16` TPC-H queries; the query results are correct.

The local TVF can also be used to analyze the BE's log directly, like:

```
mysql> select * from local(
        "file_path" = "log/be.out",
        "backend_id" = "10006",
        "format" = "csv")
       where c1 like "%start_time%" limit 10;
+--------------------------------------------------------+
| c1                                                     |
+--------------------------------------------------------+
| start time: 2023年 08月 07日 星期一 23:20:32 CST       |
| start time: 2023年 08月 07日 星期一 23:32:10 CST       |
| start time: 2023年 08月 08日 星期二 00:20:50 CST       |
| start time: 2023年 08月 08日 星期二 00:29:15 CST       |
+--------------------------------------------------------+
```
2023-08-10 20:07:42 +08:00
879024a3a2 disable costmodelV2 (#22830) 2023-08-10 19:22:24 +08:00
0230e3100b [case](fix) test_big_kv_map, add sync after streamload (#22802)
test_big_kv_map
2023-08-10 19:02:21 +08:00
8e5b4005dc [enhancement](data type) add use_mysql_bigint_for_largeint config Tell Doris to use bigint when returning largeint type to mysql jdbc (#22835) 2023-08-10 18:53:31 +08:00
4360b8520e [fix](executor)Fix query hang when query bitmap Orth intersect #22828 2023-08-10 18:50:33 +08:00
10d4124c82 [Chore](cmake) Refactor thirdparty.cmake by add_thirdparty function (#22777)
* Refactor thirdparty.cmake by add_thirdparty function

Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>

* Refactor thirdparty.cmake

- Rename add_library NOADD to NOTADD
- Fix tcmalloc not add to COMMON_THIRDPARTY
- Fix libintl in OS_MACOSX

Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>

---------

Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2023-08-10 18:39:09 +08:00
a1223218f3 [pipeline](exec) Support shared scan in jdbc and odbc scan node (#22826)
Support shared scan in jdbc and odbc scan node to improve exec performance
2023-08-10 18:34:45 +08:00
4a6f63e210 [fix](case) add order by in test_javaudaf_return_map (#22824)
add order by in test_javaudaf_return_map
2023-08-10 18:24:41 +08:00
be1e0dcd27 [new-feature](complex-type) support read nested parquet and orc file with complex type (#22793) 2023-08-10 18:23:07 +08:00
94c9dce308 [fix](iceberg) fix iceberg's filter expr to filter file (#22740)
Fix iceberg's filter expr used to filter files, and add a count of the number of partitions read.
2023-08-10 18:20:57 +08:00
Pxl
56392e21ae [Bug](decimalv3) fix decimalv3 keyrange set wrong number #22818 2023-08-10 18:15:40 +08:00
d8d3e72534 [enhance](S3FIleWriter) Add md5 check for small file not suitable for multi part upload (#22296) 2023-08-10 18:11:07 +08:00
221e860cb7 [Feature](Routine Load)Support Partial Update (#22785) 2023-08-10 17:41:53 +08:00
df26fb2de4 [fix][alter table property] fix alter table property failed (#22791) 2023-08-10 17:12:42 +08:00
58e7952eea [refactor](load) use memtable writer in memtable memory limiter (#22780) 2023-08-10 17:08:47 +08:00
1c22742a14 [fix](compaction) remove check rowset overlapping in base compaction (#22599) 2023-08-10 16:36:33 +08:00
fd0c161081 [enhance](ColdHeatSeparation) forbid change storage policy to another one with different storage resource (#22519) 2023-08-10 16:32:09 +08:00
50fbe31f93 [fix](tablet report) fix not add replicas when a backend re join the cluster after changing its ip or port (#22700) 2023-08-10 15:29:28 +08:00
de5603da6b [regresstion][external]fix jdbc cases fail external 0809 (#22761)
fix jdbc cases fail external 0809
2023-08-10 15:23:30 +08:00