Commit Graph

7633 Commits

Author SHA1 Message Date
cadbbdd2c0 [fix](config) for compatibility issue of log dir config (#34734)
* [fix](config) for compatibility issue of log dir config

* 1
2024-05-12 09:44:50 +08:00
e23a89f0da fix compile error 2024-05-11 15:36:06 +08:00
14d0433a76 [improve](path gc) Execute path gc interval adaptive to disk size (#34538) 2024-05-11 15:04:35 +08:00
719e50f353 [fix](json function) fix failed when json_exists_path use not null input (#34289) 2024-05-11 15:04:35 +08:00
Pxl
1ff4dc8f85 [Bug](runtime-filter) fix coredump won change_null_to_true when argument column is not null… (#34602)
fix coredump won change_null_to_true when argument column is not nullable
2024-05-11 15:04:35 +08:00
659f7ef2af [refactor](Nereids)refactor runtime filter generator (#34275) (#34696)
this pr revert the modification on clucene introduced by pr #34682
2024-05-11 12:04:41 +08:00
8c237e82a3 [Bug](exec) fix intersections/differences bug (#34675) 2024-05-11 11:45:31 +08:00
58c19e33b3 [fix](round) Fix incorrect decimal scale inference in round functions (#34471)
* FIX NEEDED

* FORMAT

* FORMAT

* FIX TEST
2024-05-11 11:42:12 +08:00
0a79c547ff [Refactor](Sink) Remove is_append mode in table sink (#34684)
Remove the is_append mode from the sink component due to the following reasons:
1. The performance improvement from this mode is relatively minor, approximately 10%, as demonstrated in previous benchmarks.
2. The mode complicates maintenance. It requires a separate data writing path to avoid copying, which increases complexity and poses a risk of potential data loss.

I've already test the compability with previous version
2024-05-11 11:20:10 +08:00
7ba66c5890 [branch-2.1](routine-load) do not schedule task when there is no data (#34654) 2024-05-11 11:01:18 +08:00
dd1b54cf62 [pick](nereids)Runtime filter pushdown refactor for branch-2.1 (#34682)
* [refactor](Nereids)refactor runtime filter generator (#34275)

1. unify the process of generating rf for hash join and for nested loop join
2. fix some bugs in generating rf
3. remove some duplicated check

(cherry picked from commit 07267faac0d9c6ef3bb1fd4ee101b4c761c8a2f2)

* [refactor](nereids) do not deny a runtime filter by removing an entry in aliasMap (#34559)

in current version, there are 2 approaches to verify whether a join condition can be used to generate a runtime filter, they are
1. remove the output slot from aliasMap
2. pushDownVisitor.visit(...) return false
the 1st approach has some drawbacks, we prefer to the 2ed approach.
In this pr, all the cases are handled by the 2ed approach, and remove the related code for the 1st approach.

(cherry picked from commit a29082bf31e66efa2df193b38347e610f2bf7464)

* rebase
2024-05-11 09:44:24 +08:00
Pxl
e2ea54c0a7 [Improvement](sink) remove unused check on string's write_column_to_mysql (#34491)
remove unused check on string's write_column_to_mysql
2024-05-10 22:13:05 +08:00
aa684d85d7 [Bug](Variant) fix rapidjson::Allocator may cause mem allocate issue when build with DENABLE_CLANG_COVERAGE (#34150) 2024-05-10 22:12:00 +08:00
853dbdcb00 [Feature](PreparedStatement) implement general server side prepared (#33807) 2024-05-10 22:10:11 +08:00
5271042a7d [bugfix](gccompile) fix gcc compile error (#34546)
Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-05-10 22:06:00 +08:00
95593f8e19 [config](move-memtable) support HTTP header load_stream_per_node in stream load (#34444) 2024-05-10 22:05:35 +08:00
093fe354c8 [Improve](cache) Estimated column reader memory to control segment cache (#34526) 2024-05-10 22:05:20 +08:00
435b4343d7 f 2024-05-10 16:08:52 +08:00
7b22c94255 [bugfix](compile error) unused variables compile error (#34509)
* [bugfix](compile error) unused variables compile error

* f

---------

Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-05-10 14:46:19 +08:00
082216496e [opt](inverted index) opt for log output when matching without an index (#34024)
Generates a large volume of log output when scanning large amounts of data
2024-05-10 14:45:05 +08:00
25ae7cd65f [bug](ipv6) the ipv6 type should be uint128_t (#34121)
the ipv6 type should be uint128_t, and max value is ffff:ffff:ffff:ffff:ffff:ffff:ffff:ffff
if use int128_t type, it's will be min value.
2024-05-10 14:43:46 +08:00
e2fc231b7b [refactor](move-memtable) simplify LoadStreamStub::open (#34488) 2024-05-10 14:43:31 +08:00
32cbd4a583 [chore](status) unify error code between thrift,pb, status.h (#34397)
Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-05-10 14:41:01 +08:00
9b712b03b4 [FIX]fix is_ip_address_in_range func with const param (#34266) 2024-05-10 14:37:20 +08:00
520774a24b [fix](serde) fix ipv4/v6 serde functions for arrow, orc, parquet format (#34042)
this PR is from @sjyango work in #32326,
wants merge #32326 into master branch, but it's draft and not maintain long time. so have this new PR.
Co-authored-by: sjyango <sjyang2022@zju.edu.cn>
2024-05-10 14:37:04 +08:00
Pxl
804586b342 [Improvement](sort) insert data by batch on VSortedRunMerger::get_next (#34363)
insert data by batch on VSortedRunMerger::get_next
2024-05-10 14:36:53 +08:00
cc00666be6 [opt](inverted index) add inlist condition handling to compound (#34134)
1. Previously, the compound did not support the inlist condition, which could impact performance if an inverted index was created.
2024-05-10 14:35:47 +08:00
6c30913a31 [bugfix](memtracker) should count cancelled query in wg memory (#34443)
Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-05-10 14:35:28 +08:00
cbe8e5c010 [opt](join) For a mark join without other conjuncts, stop probing after matching one row (#34581) 2024-05-10 13:45:34 +08:00
22da62a6d5 [fix](shuffle) compensate num rows filtered in ExchangeNode (#34605) (#34623) 2024-05-10 09:27:31 +08:00
7a40f2a547 [branch-2.1](resource)fix check available fail when s3 aws_token is set and reset as, sk faild on be. (#34219) 2024-05-09 19:06:14 +08:00
06612b9be3 [fix](local exchange) fix bug of local exchange free blocks (#34531) 2024-05-08 16:26:51 +08:00
e085f75a43 [opt](file-scanner) print current path when encountering error (#34365) (#34523)
bp #34365
2024-05-08 14:49:03 +08:00
05260d6d47 [pipeline](fix) Logging blocking dependency if task is not cancelled #34502 2024-05-08 10:34:26 +08:00
1acd8e9fcb [fix](spill) incorrect result of hash join (#34450) 2024-05-08 10:06:32 +08:00
8b6d6245fa [fix](inverted index) the rowset may be deleted and become nullptr. #33878 (#34464) 2024-05-08 09:11:22 +08:00
39fdc9ba0c [refactor](executor)Rename workload schedule policy #34497 2024-05-08 08:35:20 +08:00
dec5f0ca98 Revert "[fix](profile) Fix reporting the profile while building the p… (#34498)
* Revert "[fix](profile) Fix reporting the profile while building the pipeline profile. (#34215)"

This reverts commit eb0d963389e1b7d150cbc18c927091648e0a60f7.

* Revert "[feature](profile) sort pipelineX task by total time #34053"

This reverts commit 67b394f2b0dddab3801d2faa82a91c52ef875e76.
2024-05-07 22:58:50 +08:00
ac56255f82 [opt](inverted index) the "unicode" tokenizer can be configured to disable stop words. (#34467) 2024-05-07 18:23:43 +08:00
4be589951b Revert "Revert "[fix](csv-reader) fix column split error when there is escape character (#34364)""
This reverts commit d127d67ebe989484bbdf340a4de5b79ded56eecc.
2024-05-07 18:03:56 +08:00
561c6a752d [Bug](RegressionTest) fix regresstion test failed (#34466) 2024-05-07 16:53:05 +08:00
98cd066368 [bug](not in) if not in (null) could eos early (#34411) 2024-05-07 13:55:21 +08:00
d127d67ebe Revert "[fix](csv-reader) fix column split error when there is escape character (#34364)"
This reverts commit 971e10a9db782c9986b20e1209468e4d7aeedf71.
2024-05-07 13:36:11 +08:00
9d0d7293f0 [fix](json) fix be crash while load json data (#34283) 2024-05-07 07:42:53 +08:00
971e10a9db [fix](csv-reader) fix column split error when there is escape character (#34364) 2024-05-07 07:38:35 +08:00
8fdfbcb3c4 Revert "[Opt](func) opt the percentile func performance (#34373) (#34416)"
This reverts commit 509ae425e416b4779ae94eab9c2b21f9850e03c3.
2024-05-07 07:23:48 +08:00
e19d57261c [improvement](spill) improve cancel (#34451)
* [improvement](spill) improve cancel

* fix
2024-05-07 00:07:20 +08:00
a81beb19c2 [fix](load) fix repeatedly open tablets_channel when tablets_channel already cancelled (#34442) 2024-05-06 23:15:33 +08:00
f7900b53ce [enhancement](function) floor/ceil/round/round_bankers can use column as scale argument (#34391) 2024-05-06 22:18:36 +08:00
c22f42121b [fix](compaction test) show single replica compaction status and fix test (#33076) (#34285) (#34438) 2024-05-06 21:00:34 +08:00