Commit Graph

11297 Commits

Author SHA1 Message Date
fe18cfa2fb [improvement](pg jdbc)Support for automatically obtaining the precision of the postgresql timestamp type (#20909) 2023-06-16 23:41:09 +08:00
367f64e7bd [improvement](jdbc) support insert autoinc and default value column to mysql (#20765)
In JdbcMysqlClient, I've added methods to retrieve auto-increment and default value columns from MySQL. These columns are then mapped into Doris metadata to make them visible to users.

When handling the InsertStmt into an execution plan, Doris used to automatically fill in NULL or default values for columns not specified in the InsertStmt. However, in the JDBC catalog, we don't need Doris to handle these unspecified columns, so I've made changes to skip them directly.

For the insert prepared statement required for writing, our previous behavior was to obtain all columns for placeholders. So, the change I made is to pass in the columns processed by the execution plan during the sink task generation stage for dynamic generation.
2023-06-16 23:38:11 +08:00
e834637a5b [improvement](ck jdbc) Support for automatically getting the precision of clickhouse's datetime64 type (#20887) 2023-06-16 23:37:30 +08:00
bf197ee8d2 [opt](nereids) adjust cost model for BroadCastJoin and PartitionJoin (#20713)
we add penalty for broadcast join (bc for brief in the following).
the intuition of penalty is as follow:
1. if the build side is very small (< 1M), we prefer bc, and set `penalty=1`, which means no penalty
2. if build side is more than 1M, we consider the ratio of the probe row count to the build row count. the less the ratio is, the higher penalty is.

this pr has positive impact on tpch queries. Only q3 is changed. in out test (tpch 1T, 3BE) q3 improved from 5.1sec to 2.5 sec.
this pr has positive impact on tpcds queries. test on tpcds sf100 (3BE), cold run improve from 163 sec to 156 sec, hot run improves from 155 sec to 149 sec
2023-06-16 22:49:04 +08:00
bb4f10b457 [fix](Nereids) lost having when analyze sort-having-agg (#20914) 2023-06-16 22:13:28 +08:00
7ee744ff5a [opt](Nereids) add more unexpected expression check (#20901)
1. check sub-query after rewrite, should not meet any sub-query there
2. check below expression on specific plan
  - Aggreagate
    - TableGeneratingFunction
  - Filter
    - AggregateFunction
    - GroupingScalarFunction
    - TableGeneratingFunction
    - WindowExpression
  - Generate
    - AggregateFunction
    - GroupingScalarFunction
    - WindowExpression
  - Having
    - TableGeneratingFunction
    - WindowExpression
  - Join
    - AggregateFunction
    - GroupingScalarFunction
    - TableGeneratingFunction
    - WindowExpression
  - Project
    - TableGeneratingFunction
  - Sort
    - AggregateFunction
    - GroupingScalarFunction
    - TableGeneratingFunction
    - WindowExpression
  - Window
    - GroupingScalarFunction
    - TableGeneratingFunction
2023-06-16 22:12:39 +08:00
483d96368c [doc](insert-overwrite) add related doc (#20657) 2023-06-16 21:11:04 +08:00
ab32299ba4 [feature](nereids) Support multi target rf #20714
Support multi target runtime filter, mainly for set operation, such as union/intersect/except.
2023-06-16 20:26:00 +08:00
1cc611a913 [fix](match) fix regression case test_index_match_select and test_index_match_phrase (#20860)
1. add more checks for match expression in nereids:
  - match expression only support in filter
  - match expression left child and right child must all be string type
  - left child for match expression must be sloftRef, right child for match expression must be Literal

2. to fix regression case test_index_match_select and test_index_match_phrase
2023-06-16 20:18:29 +08:00
5dc0f90c7f [opt](Nereids) revert convert IN with 2 options to OR expression rule (#20894)
revert this rule because it has negative effect on predicate push-down-to-storage-layer
2023-06-16 19:11:37 +08:00
6cde7bc8ad [feature](Nereids) just reserve logical expression in memo after do dphyp (#20843)
After DPHyp, clear all physicalExpression and other,
just keep logicalExpression as original plan as input of cascades optimize.
2023-06-16 18:12:39 +08:00
baed26acc6 [fix](merge-on-write) fix publish timeout (#20891) 2023-06-16 18:11:41 +08:00
536bf56a35 [fix](planner) strip trailing zeros when cast string literal to decimal literal (#20743) 2023-06-16 17:08:36 +08:00
2e295a1ee9 [Enhancement](http) unify http auth config (#20864) 2023-06-16 16:55:46 +08:00
22aee21208 [fix](nereids)disable tpcds query59 plan shape check #20886
ds59 plan shape is not stable.
we will fix it in future
2023-06-16 16:50:40 +08:00
e63739e729 [fix](nereids) add regression test for stats analyze and fix some bugs (#20865)
1. Add regression test case for analyze to make sure show/drop/analyze stats would work as expected
2. Remove useless code, which would block the clean for expired stats
3. Fix bug of DropStats,  before this PR drop the whole table stats would casuse a NPE exception when parsing stmt
2023-06-16 16:43:49 +08:00
0bd9aecfcc [Fix](planner&Nereids) fold constant invalid yyyy-mm-31 to the last day of month incorrectly (#20685)
currently, expression: cast('20230631' as date) will be evaluate to 2023-06-30 incorrectly, and '20230632' will be null, we fix the bug and evaluate all the invalid date to null.
2023-06-16 16:23:02 +08:00
5573858cb4 [Enhance](regression-test) use another db than test_query_db for nereids_p0 (#19467)
replace test_query_db to nereids_test_query_db in nereids_p0 directory
split test_join into five files to run faster.
2023-06-16 16:11:14 +08:00
c3b9e99350 [fix](regress-test)update config for disable_nested_complex_type (#20735) 2023-06-16 15:51:41 +08:00
97135a1cbb [Feature] (json)add json_contains function (#20824) 2023-06-16 15:10:12 +08:00
ccfd6f1d23 [fix](nereids) only setHasColocatePlanNode when currentFragment's data partition is not RANDOM (#20885) 2023-06-16 14:54:22 +08:00
f741129ffe [Fix](regression-test) fix incorrect explain requirement when globally enable nereids (#20809) 2023-06-16 14:51:22 +08:00
731ce5802e [Test][Framework] add enableCacheData option for test framework (#20874) 2023-06-16 14:12:21 +08:00
1a62c79970 [fix](replica) do not delete the only one replica (#20872) 2023-06-16 13:57:18 +08:00
f80bf3c1bc [fix](compaction) fix time series compaction policy (#20837) 2023-06-16 13:25:55 +08:00
b7a50a09fe [Opt](orc-reader) Optimize orc reader by dict filtering. (#20806)
Optimize orc reader by dict filtering.  It is similar with #17594.
Test result
**ssb-flat-100**: (3 nodes)
| Query        | before opt           | after opt  |
| ------------- |:-------------:| ---------:|
Q1.1 | 1.239 | 1.145
Q1.2 | 1.254 | 1.128
Q1.3 | 1.931 | 1.644
Q2.1 | 1.359 | 1.006
Q2.2 | 1.229 | 0.674
Q2.3 | 0.934 | 0.427
Q3.1 | 2.226 | 1.712
Q3.2 | 2.042 | 1.562
Q3.3 | 1.631 | 1.021
Q3.4 | 1.618 | 0.732
Q4.1 | 2.294 | 1.858
Q4.2 | 2.511 | 1.961
Q4.3 | 1.736 | 1.446
total | 22.004 | 16.316
2023-06-16 13:11:37 +08:00
722839e118 [Fix](multi-catalog) Fix hive transaction table regression test by adding hive-docker missing configurations. (#20832)
Fix hive transaction table regression test test_transactional_hive by adding hive-docker missing configurations of #20679. Hive need to be set these configurations to do compaction.
2023-06-16 13:08:24 +08:00
1ec7f0e50a [fix](memory) memory management thread exits gracefully
memory_maintenance_thread, memory_gc_thread, load_channel_tracker_refresh_thread, memory_tracker_profile_refresh_thread
2023-06-16 11:40:24 +08:00
05e5ebfdeb [typo][docs] Fix a parameter error (#20841) 2023-06-16 11:37:41 +08:00
9d41edd9eb [Feature](binlog) Add binlog gc && Auth master_token (#20854)
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2023-06-16 11:25:11 +08:00
a52fc3eb33 [chore](workflow) Improve the cache hit ratio in BE UT (Clang) (#20871)
In workflow BE UT (Clang), we set up the ldb_toolchain before we build the UT. During generating the toolchain tools, the script modifies the RPATH and dynamic linker of the executables which making the mtime of compilers change in every build.

By default, Ccache computes the hash of the compilers' mtime and size to check the compilers whether they are consistent with the ones which were used to generate the caches. If the compilers change, the caches can not be hit. We should change the default behavior of Ccache by setting the configuration `compiler_check (CCACHE_COMPILERCHECK)` to increase the cache hit ratio in the workflow.

Reference: https://ccache.dev/manual/latest.html#_configuration_options
2023-06-16 11:18:21 +08:00
Pxl
db1da6b787 [Chore](pipeline) add some profile log when pipeline canceled (#20825)
add some profile log when pipeline canceled
2023-06-16 10:54:54 +08:00
7f413f13c4 [fix](nereids)use "analyze table XXX with sync" in regression cases #20856
replace analyze table XXX by analyze table XXX with sync in regression test cases
2023-06-16 10:39:37 +08:00
420603317b [improve](match) Improve performance for match query without inverted index (#20815) 2023-06-16 10:35:38 +08:00
179b933ccc [test](regression) update some case in p2 (#20683) 2023-06-16 10:09:22 +08:00
c3e6db827c [typo][docs] remove unuse config mysql_service_nio_enabled (#20862) 2023-06-16 09:58:33 +08:00
40050a46ca [fix](java-udf)Fix cannot download pentaho-aggdesigner-algorithm jar in java udf and format it (#20013) 2023-06-16 09:40:48 +08:00
9daa277b0d [typo](doc) Fix the description of the alter-table-partition document (#20800) 2023-06-15 21:27:59 +08:00
71aba93e19 [typo](doc) spark connector add new param and a way to write bitmap and hll col (#20850) 2023-06-15 21:27:37 +08:00
835a1e8af9 [typo](doc)delete document optimization (#20748) 2023-06-15 21:22:43 +08:00
6d4c16b497 [typo](doc) Fix formatting errors in CREATE-TABLE documents due to line indentation (#20839) 2023-06-15 21:22:01 +08:00
b38417358b [chore](compile) Remove fs/benchmark/fs_benchmark_tool.cpp in IO_FILES (#20845) 2023-06-15 21:16:59 +08:00
f1af09ef87 [Enhancement](merge-on-write) parallel calculate delete bitmap when tablet has multi segments (#20706) 2023-06-15 21:11:39 +08:00
e4e4f11b14 [typo](docs) Optimize the JDBC Catalog document (#20861) 2023-06-15 21:09:22 +08:00
21bb7fea07 [minor](Nereids): convert if condition into a check (#20855) 2023-06-15 20:10:56 +08:00
Pxl
4bfceb7acb [Bug](pipeline) fix pipeline task call finish_p_dependency more than once (#20851)
fix pipeline task call finish_p_dependency more than once

When pipeline task meet eos->PENDING_FINISH->CANCELED, this task will call finish_p_dependency twice.
2023-06-15 19:21:40 +08:00
11e164e7a5 [typo](doc) Added a description of inconsistent results (#20849) 2023-06-15 17:17:24 +08:00
bb5d36c5cb [Log](load) change VLOG to INFO when write replica failing (#20783) 2023-06-15 16:11:14 +08:00
37f8334811 [Bug](GA)fix the wrong job dependencies (#20846) 2023-06-15 15:57:57 +08:00
Pxl
b6835840f7 [Bug](table-function) return InvalidArgument when explode_split meet empty delimiter (#20795)
return InvalidArgument when explode_split meet empty delimiter
2023-06-15 15:17:22 +08:00