Commit Graph

8577 Commits

Author SHA1 Message Date
ebb21ef031 branch-2.1: [Fix](merge-on-write) Add defensive check before partial update #44687 (#45086)
pick https://github.com/apache/doris/pull/44687
2024-12-06 17:16:42 +08:00
08c8a416ea branch-2.1: [fix](runtime_profile) fix race condition in to_thrift #45047 (#45099)
Cherry-picked from #45047

Co-authored-by: Kaijie Chen <chenkaijie@selectdb.com>
2024-12-06 16:25:33 +08:00
53291bd040 branch-2.1: [fix](hudi) upgrade hudi to 0.15.0 (#44267)(#44995) (#45041)
cherry-pick pr: #44267 #44995
2024-12-05 23:18:37 -08:00
f0324e2a56 branch-2.1: [improvement](information_schema)Support show default value in information_schema. #44849 (#45080)
Cherry-picked from #44849

Co-authored-by: James <lijibing@selectdb.com>
2024-12-06 14:54:09 +08:00
94746b764d branch-2.1: [fix](memory) Fix UT ThreadMemTrackerMgrTest #44147 (#44936)
Cherry-picked from #44147

Co-authored-by: Xinyi Zou <zouxinyi@selectdb.com>
2024-12-06 10:54:12 +08:00
21e21f5e3b [opt](exec) Use PASSTHROUGH to improve the concurrency of the ADAPTIV… (#44971)
…E_PASSTHROUGH SINK. (#44925)
https://github.com/apache/doris/pull/44925
before
```
op -> local sink(1) -> local source (n)
```
now
```
op -> local passthrough(1) -> local passthrough(n) ->  local sink(n) -> local source (n)
```

profile
```
                  Pipeline  :  1(instance_num=3):
                      AGGREGATION_SINK_OPERATOR  (id=4  ,  nereids_id=255):
                          CROSS_JOIN_OPERATOR  (id=3  ,  nereids_id=245):
                              LOCAL_EXCHANGE_OPERATOR  (ADAPTIVE_PASSTHROUGH)  (id=-5):
                  Pipeline  :  2(instance_num=3):
                      LOCAL_EXCHANGE_SINK_OPERATOR  (ADAPTIVE_PASSTHROUGH)  (id=-5):
                          LOCAL_EXCHANGE_OPERATOR  (PASSTHROUGH)  (id=-6):
                  Pipeline  :  3(instance_num=1):
                      LOCAL_EXCHANGE_SINK_OPERATOR  (PASSTHROUGH)  (id=-6):
                          OLAP_SCAN_OPERATOR  (id=2.  nereids_id=234.  table  name  =  nums1(nums1)):
```
2024-12-06 09:57:31 +08:00
0c3bc1bcd0 [fix](auth)fix be enable http auth, some request link never return. (#44959) (#45063)
bp #44959

if you `enable_all_http_auth = true` in be.conf, then restart be, and
keep using `curl -u "xxxx:xxxx" http://127.0.0.1:8040/api/health` while
be is starting. You may encounter a situation where the link does not
return.
Reason:
When be is still starting, there is no information about fe master. When
you make an api request to be http port, be needs to request
authentication information from fe, which will cause it to request a
machine with empty ip and port 0. This rpc call will definitely fail
(this is not equivalent to a password error). After receiving this
failure, be does not `send_reply` to the api requester, so this api
request cannot be returned.
2024-12-06 09:39:13 +08:00
a98cb8c8bf branch-2.1: [fix](arrow-flight-sql) Fix query result is empty and not return query error message #45023 (#45053)
Cherry-picked from #45023

Co-authored-by: Xinyi Zou <zouxinyi@selectdb.com>
2024-12-06 09:27:35 +08:00
405b50b1b7 [Improvement](queue) Return value of concurrent queue should be proce… (#45032)
…… (#44986)

…ssed

Push items into concurrent queue will return false due to some
unexpected error (e.g. poor memory available).
2024-12-05 17:46:18 +08:00
c4bd0e8fa6 branch-2.1: [fix](memory) Fix compatibility with CgroupV2 #44579 (#44934)
Cherry-picked from #44579

Co-authored-by: Xinyi Zou <zouxinyi@selectdb.com>
2024-12-04 22:09:16 +08:00
dcf3eb3434 branch-2.1: [fix](agg) Fixed a core dump when using the IPv6 type in array_agg. #44877 (#44918)
Cherry-picked from #44877

Co-authored-by: Mryange <yanxuecheng@selectdb.com>
2024-12-04 22:05:00 +08:00
6714936f8b [pick](branch-2.1) pick #39962 #40304 (#44931) 2024-12-04 17:56:58 +08:00
02fdf5307c [pick](branch-2.1) pick #42059 (#44938) 2024-12-04 17:49:08 +08:00
00c7394813 branch-2.1: [fix](scanner) Delete meaningless finish dependency in schema scanner #44915 (#44963)
Cherry-picked from #44915

Co-authored-by: Gabriel <liwenqiang@selectdb.com>
2024-12-04 13:16:08 +08:00
Pxl
0b681d3d51 [Bug](runtime-filter) fix publish not inited rf when broadcast join meet wake up by downsteam #44823 (#44859)
pick from #44823
2024-12-02 21:32:38 +08:00
7278a86c20 [profile](pipeline) Add key metrics for pipeline initialization (#35073) (#44738)
pick #35073
2024-12-02 16:02:19 +08:00
5f952cf6ed branch-2.1: [fix](iceberg)Bring field_id with parquet files And fix map type's key optional #44470 (#44828)
Cherry-picked from #44470

Co-authored-by: wuwenchi <wuwenchi@selectdb.com>
2024-12-02 10:24:07 +08:00
6726c9bf2f [improvement](compaction) reduce tablet skip compaction time (#44273) (#44791)
pick master #44273

The time for tablet skip compaction is 120 seconds, which is too long.
In the scenario of high-frequency import (mow), it leads to a high
compaction score. Therefore, reducing the skip time to 10 seconds is
necessary.
2024-12-02 10:07:17 +08:00
4b15b1f263 [fix](orc) check all the cases before build_search_argument (#44615) (#44801)
cherry-pick #44615

Co-authored-by: Socrates <suyiteng@selectdb.com>
2024-11-30 09:17:56 +08:00
c3707dbdee branch-2.1: [Enhancement](function) Print exact column name when execute non_nullable failed #44679 (#44742)
Cherry-picked from #44679

Co-authored-by: zclllhhjj <zhaochangle@selectdb.com>
2024-11-29 20:52:19 +08:00
412cec5bad branch-2.1: [fix](clone) Fix wrong clone file path #44746 (#44773)
Cherry-picked from #44746

Co-authored-by: walter <maochuan@selectdb.com>
2024-11-29 14:09:16 +08:00
82c7a9d15a [Fix](Variant) create table should not automatically add variant to key (#44736)
#36609
2024-11-29 09:34:43 +08:00
Pxl
bfd51f07e6 revert #44408 on branch-2.1 (#44757) 2024-11-29 08:28:46 +08:00
Pxl
846fe83152 [Chore](runtime-filter) add rpc error msg to RuntimeFilterContext (#43517) (#44622) (#44719)
pick from #43517
2024-11-28 16:46:27 +08:00
e7520ae6cf branch-2.1: [fix](hyperscan) Fix hyper scan fall back to re2 #44547 (#44653)
Cherry-picked from #44547

Co-authored-by: zhiqiang <hezhiqiang@selectdb.com>
2024-11-28 16:00:43 +08:00
Pxl
ddaaac5b44 [Bug](runtime-filter) pick #44408 #43078 (#44686)
pick #44408 #43078
2024-11-28 12:23:38 +08:00
88cfaedb16 [opt](paimon)Optimize the storage location of the serialized paimon table for 2.1 (#44274) (#44660)
bp: #44274
2024-11-27 20:35:35 +08:00
66cb95ffed branch-2.1: [fix](inverted index) Content Check for Tokenize Function Parser #44465 (#44555)
Cherry-picked from #44465

Co-authored-by: zzzxl <yangsiyu@selectdb.com>
2024-11-27 11:53:36 +08:00
Pxl
7464f461bc [Bug](runtime-filter) avoid ignore rf multiple times (#44408) (#44625)
avoid ignore rf multiple times

When an instance is wake_up_by_downstream, it will destroy the logic of
`first closing the instance that should build hash table and then
closing other instances`, instance that should build hash table during
the process of inserting data into rf, this instance may find that rf is
ignored, causing dcheck to fail.

```cpp
 F20241121 12:01:02.245405 9832 runtime_filter.cpp:395] Check failed: !is_ignored()

3# raise at ../sysdeps/posix/raise.c:27
4# abort at ./stdlib/abort.c:81
5# 0x00005603CFDF778D in /mnt/ssd01/doris-branch40preview/NEREIDS_ASAN/be/lib/doris_be
6# 0x00005603CFDE9DCA in /mnt/ssd01/doris-branch40preview/NEREIDS_ASAN/be/lib/doris_be
7# google::LogMessage::SendToLog() in /mnt/ssd01/doris-branch40preview/NEREIDS_ASAN/be/lib/doris_be
8# google::LogMessage::Flush() in /mnt/ssd01/doris-branch40preview/NEREIDS_ASAN/be/lib/doris_be
9# google::LogMessageFatal::~LogMessageFatal() in /mnt/ssd01/doris-branch40preview/NEREIDS_ASAN/be/lib/doris_be
10# doris::RuntimePredicateWrapper::insert_fixed_len(COW<doris::vectorized::IColumn>::immutable_ptr<doris::vectorized::IColumn> const&, unsigned long) at /home/zcp/repo_center/doris_branch-3.0/doris/be/src/exprs/runtime_filter.cpp:395
11# doris::RuntimePredicateWrapper::insert_batch(COW<doris::vectorized::IColumn>::immutable_ptr<doris::vectorized::IColumn> const&, unsigned long) at /home/zcp/repo_center/doris_branch-3.0/doris/be/src/exprs/runtime_filter.cpp:431
12# doris::IRuntimeFilter::insert_batch(COW<doris::vectorized::IColumn>::immutable_ptr<doris::vectorized::IColumn>, unsigned long) at /home/zcp/repo_center/doris_branch-3.0/doris/be/src/exprs/runtime_filter.cpp:992
13# doris::VRuntimeFilterSlots::insert(doris::vectorized::Block const*) at /home/zcp/repo_center/doris_branch-3.0/doris/be/src/exprs/runtime_filter_slots.h:146
14# doris::pipeline::HashJoinBuildSinkLocalState::close(doris::RuntimeState*, doris::Status) in /mnt/ssd01/doris-branch40preview/NEREIDS_ASAN/be/lib/doris_be
15# doris::pipeline::DataSinkOperatorXBase::close(doris::RuntimeState*, doris::Status) in /mnt/ssd01/doris-branch40preview/NEREIDS_ASAN/be/lib/doris_be
16# doris::pipeline::PipelineTask::close(doris::Status) at /home/zcp/repo_center/doris_branch-3.0/doris/be/src/pipeline/pipeline_task.cpp:487
```

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2024-11-27 10:51:02 +08:00
43ebfa9031 [branch-2.1] pick some workload group pr (#44634)
pick #42053 #43942
2024-11-27 09:42:41 +08:00
f88a8169dc [opt](Variant) avoid unnecessary mem for variant extracted columns (#… (#44585)
cherry-pick from #43567
2024-11-26 19:30:06 +08:00
071a2106a3 branch-2.1: [opt](brpc) Make the timeout of brpc checking configurable #44481 (#44602)
Cherry-picked from #44481

Co-authored-by: Jerry Hu <hushenggang@selectdb.com>
2024-11-26 19:26:39 +08:00
9bc547578c [opt](scan) Release instances of Segment to avoid consuming a large amount of memory in ParallelScannerBuilder (#44189) (#44608) 2024-11-26 19:25:58 +08:00
bdca9cce3e [branch-2.1]add internal workload group (#42006) (#44592)
Add an internal workload group when Doris started, currently it mainly
used to manage compaction workload cpu usage.
pick #42006
2024-11-26 17:09:55 +08:00
550aa9c3ae [cherry-pick](branch-2.1) Pick "[Fix](full compaction) Full compaction should not do ordered data compaction (#44359)" (#44529) 2024-11-26 12:21:30 +08:00
cf2ee23e68 Revert "[fix] (compaction) fix compaction score in time series policy (#40242)" (#44527)
revert #40242
2024-11-26 12:18:59 +08:00
5eb433e7aa [branch-2.1] pick some pr from master (#44556)
pick #40468 #40535  #40763 #40855
2024-11-25 20:43:07 +08:00
a233343fe3 branch-2.1: [fix](jvm)fix jvm metrics memory leak. #44311 (#44399)
Cherry-picked from #44311

Co-authored-by: daidai <changyuwei@selectdb.com>
2024-11-23 03:30:28 +08:00
6678088767 [cherry-pick](branch-21) reuse the join block to reduce malloc memory (#43738) (#44349)
cherry-pick from master https://github.com/apache/doris/pull/43738/
2024-11-23 03:29:52 +08:00
92a0919e9c branch-2.1: [Bug](function)fix json_object function check null nums error as return bool #44321 (#44381)
Cherry-picked from #44321

Co-authored-by: zhangstar333 <zhangsida@selectdb.com>
2024-11-22 23:51:33 +08:00
d21940ee1c branch-2.1: [fix](move-memtable) immediately return error when close wait failed #44344 (#44387)
Cherry-picked from #44344

Co-authored-by: Kaijie Chen <chenkaijie@selectdb.com>
2024-11-22 22:56:51 +08:00
dceaf97381 branch-2.1: [enhance](orc) Optimize ORC Predicate Pushdown for OR-connected Predicate #43255 (#44438)
Cherry-picked from #43255

Co-authored-by: Socrates <suyiteng@selectdb.com>
2024-11-22 22:52:53 +08:00
702abbff0f [Opt](orc)Optimize the merge io when orc reader read multiple tiny stripes. (#42004) (#44239)
bp #42004

Co-authored-by: kaka11chen <kaka11.chen@gmail.com>
2024-11-22 11:01:41 +08:00
346b89e683 [improve](routine load) adjust default values to make routine load more convenient to use (#42491) (#44377)
pick (#42491)

For a routine load job, it will be divided into many tasks, each of
which is a transaction. Currently, the default time
consumed(max_batch_interval) is 10 seconds. The benefits of increasing
this value are:
1. Larger batch consumption can lead to better performance.
2. Reducing the number of transactions can alleviate the pressure of
compaction and the conflicts of concurrent transaction submissions.

related doc: https://github.com/apache/doris-website/pull/1236/files
2024-11-21 23:05:11 +08:00
9664b50eb6 [improve](load) do not block delta writer if memtable memory is low (#42649) (#44305)
backport #42649
2024-11-21 11:17:35 +08:00
fb163b55c2 branch-2.1: [Fix](merge-on-write) Fix MergeIndexDeleteBitmapCalculator::calculate_one() coredump #44284 (#44330)
Cherry-picked from #44284

Co-authored-by: bobhan1 <baohan@selectdb.com>
2024-11-20 21:07:43 +08:00
dc67086d97 [fix](scan) Avoid memory allocated by buffered_reader from being traced (#41921) (#44253)
Use OwnedSlice to replace `char*` in BufferedReader

## Proposed changes

pick #41921
2024-11-20 10:37:06 +08:00
610054c77b [cherry-pick](branch-21) fix exchange of tablet shuffle send block error (#44102) (#44230)
cherry-pick from master (#44102)
2024-11-19 17:31:06 +08:00
433c1bc9ff [cherry-pick](branch-21) replace the LOG(FATAL) to throw Exception in query execute layer (#38144) (#44183)
cherry-pick from master https://github.com/apache/doris/pull/38144
2024-11-19 17:28:20 +08:00
c9801f7a38 branch-2.1: [Bug](function) fix cut_ipv6 function error about modify the input column data #43921 (#44180)
Cherry-picked from #43921

Co-authored-by: zhangstar333 <zhangsida@selectdb.com>
2024-11-19 17:27:29 +08:00