Commit Graph

22051 Commits

Author SHA1 Message Date
89c4d12ddf branch-2.1: [fix](audit) fix audit loader thread hang and label already exists issue #54031 (#54169)
Cherry-picked from #54031

Co-authored-by: camby <104178625@qq.com>
2025-08-01 09:13:12 +08:00
68ea4eaf99 Revert "branch-2.1: [fix](trash) Fix shadow variable causing garbage … (#54074) 2025-07-30 14:34:24 +08:00
c6b18e7075 branch-2.1: [fix](catalog) fix refresh logic with lower case table name(#52989) (#53991)
pick part of #52989
2025-07-30 10:12:10 +08:00
da917573f4 branch-2.1: [chore](dep)bump commons-validator from 1.7 to 1.9.0 (#51753) (#53056)
#51753
2025-07-30 10:10:02 +08:00
4aebd911ab branch-2.1: [fix](case)Use relative cooldown time instead of absolutetime in storage policy test case (#51572) (#53054)
#51572

(cherry picked from commit 2462b213756f8500a1331f8f0e0c9bf9ba8ee4b2)
2025-07-27 16:00:45 +08:00
a86a8f5e57 branch-2.1: [fix](case) fix group commit case #53292 (#53431)
Cherry-picked from #53292

Co-authored-by: meiyi <myimeiyi@gmail.com>
2025-07-27 15:59:20 +08:00
372627231c branch-2.1: [fix](csv reader) fix data loss when concurrency read using multi char line delimiter (#53374) (#53635)
pick (#53374)

Multiple concurrent split file locations will be determined in plan
phase, if the split point happens to be in the middle of the multi char
line delimiter:

- The previous concurrent will read the complete row1 and read a little
more to read the line delimiter.
- The latter concurrency will start reading from half of the multi char
line delimiter, and row2 is the first line of this concurrency, but the
first line in the middle range is always discarded, so row2 will be
lost.

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-27 07:57:26 +08:00
8869783458 branch-2.1: [fix](be) core dump because of invalid bitmap data #53088 (#53935)
Cherry-picked from #53088

Co-authored-by: JinYang <130516674+gohalo@users.noreply.github.com>
2025-07-27 07:54:23 +08:00
159975f237 branch-2.1: [chore](load) optimize show create load error message #53694 (#53731)
Cherry-picked from #53694

Co-authored-by: hui lai <laihui@selectdb.com>
2025-07-26 23:56:37 +08:00
fb73066ca0 branch-2.1: [fix](case) fix regression case create_commit_mtmv_many_tasks #53247 (#53304)
Cherry-picked from #53247

Co-authored-by: zhangdong <zhangdong@selectdb.com>
2025-07-26 23:51:15 +08:00
1eadb67c35 branch-2.1: [fix](nereids)'VALUES IN' keyword should be optional in list partition definition #52986 (#53243)
Cherry-picked from #52986

Co-authored-by: starocean999 <lichi@selectdb.com>
2025-07-26 23:38:53 +08:00
79592c7f79 branch-2.1: [fix](load) stream load should not overwrite existing error status in filtered rows check #53393 (#53696)
Cherry-picked from #53393

Co-authored-by: Kaijie Chen <chenkaijie@selectdb.com>
2025-07-26 23:32:47 +08:00
a936c5132d branch-2.1: [fix](nereids) fix rule count on index #53825 (#53878)
Cherry-picked from #53825

Co-authored-by: Sun Chenyang <sunchenyang@selectdb.com>
2025-07-26 23:24:27 +08:00
fa10cd05df branch-2.1: [fix](ci) fix compile error 'Unknown host repo.maven.apache.org' #53900 (#53906)
Cherry-picked from #53900

Co-authored-by: Dongyang Li <lidongyang@selectdb.com>
2025-07-26 23:23:11 +08:00
101ea8434b [Cherry-Pick](branch-2.1) Pick "[Fix](core) Fix null ptr introduced by #42949 (#46074)" (#53865) 2025-07-26 22:48:03 +08:00
a7f3c21983 branch-2.1: [opt](nereids) opt range inference for or expression when out of order #46303 (#53706)
picked from part of #46303
2025-07-25 16:49:06 +08:00
c1fa17af38 branch-2.1: [feature](group by)Support group by with order. (#53037) (#53840)
backport: https://github.com/apache/doris/pull/53037
2025-07-25 14:37:10 +08:00
77ff75b954 branch-2.1: [opt](nereids) use one fragment to execute point query #53541 (#53574)
Cherry-picked from #53541

Co-authored-by: 924060929 <lanhuajian@selectdb.com>
2025-07-25 14:02:06 +08:00
850360c1c4 branch-2.1: [fix](sql_select_limit) sql_select_limit should not affect DML #53379 (#53764)
cherry picked from #53379
2025-07-25 09:50:55 +08:00
ddb3b46fff branch-2.1: [fix](p2) throw Exception if hdfs is not enable (#52761) (#53811)
(cherry picked from PR #52761)
2025-07-25 09:50:10 +08:00
5873aa3083 branch-2.1:[fix](auth)fix when authentication, the permissions of multiple roles should be merged #52349 (#52947)
pick #52349
2025-07-24 17:06:32 +08:00
d316dbb3bd branch-2.1: [fix](catalog) fix deadlock of catalog and database(#53626) (#53629)
bp #53626
2025-07-23 10:00:51 +08:00
Pxl
5216f3bf60 [Bug](datev2) fix wrong result of predicate about cast(cast(datetimev2 as datev2) as datetimev2) #53485 (#53514)
pick fom #53485
2025-07-22 18:11:12 +08:00
Pxl
feae3a57ef [Chore](workflow) change workflow's thirdparty prebuild to 3.0 (#53638)
change workflow's thirdparty prebuild to 3.0
2025-07-22 18:00:10 +08:00
05e0ef1053 branch-2.1: [bugfix](regression) fix regression test result errors (#53458) 2025-07-21 17:07:05 +08:00
a7fef00334 [enhance](compaction) limit time series table max version using maximum of current backend (#53244) (#53561)
pick master #53244
2025-07-21 16:52:22 +08:00
b4df998c39 branch-2.1: [fix](case) fix some group commit case (#53295) 2025-07-21 16:51:32 +08:00
428d2e8a73 branch-2.1: [fix](regress) new thread should connect to cluster.jdbcUrl in docker regression suite #53234 (#53316)
cherry pick from #53234
2025-07-21 12:10:31 +08:00
fac42370c9 [branch-21](upgrate) fix upgrate error of branch-21 (#53353) 2025-07-21 12:08:48 +08:00
e8d18788f6 branch-2.1:[fix](function)fix month=0 of from_iso8601_date function. (#53050) (#53438)
bp #53050
Related PR: #40695
Problem Summary:

pr #40695 introduced the function `from_iso8601_date`, which parses the
string to get year、mouth、 day, and sets the date value through the
`set_time_unit` function.
Since `set_time_unit` lacks some judgment on mouth, it may get an
illegal date in the end, which may cause core in debug mode.

sql : `select from_iso8601_date('2023-00-01');`
```
F20250709 09:50:14.366984 3587796 vdatetime_value.h:1222] Check failed: date_v2_value_.month_ != 0
*** Check failure stack trace: ***
    @     0x559bd7050d96  google::LogMessage::SendToLog()
    @     0x559bd704d7e0  google::LogMessage::Flush()
    @     0x559bd70515d9  google::LogMessageFatal::~LogMessageFatal()
    @     0x559bc725a570  doris::DateV2Value<>::set_time_unit<>()
    @     0x559bc7257380  doris::vectorized::FromIso8601DateV2::execute()
    @     0x559bc7255a88  doris::vectorized::FunctionOtherTypesToDateType<>::execute_impl()
    @     0x559bc09e0781  doris::vectorized::DefaultExecutable::execute_impl()
    @     0x559bc423aa20  doris::vectorized::PreparedFunctionImpl::_execute_skipped_constant_deal()
    @     0x559bc4234938  doris::vectorized::PreparedFunctionImpl::execute_without_low_cardinality_columns()
    @     0x559bc4233f42  doris::vectorized::PreparedFunctionImpl::default_implementation_for_nulls()
    @     0x559bc423a773  doris::vectorized::PreparedFunctionImpl::_execute_skipped_constant_deal()
    @     0x559bc4234938  doris::vectorized::PreparedFunctionImpl::execute_without_low_cardinality_columns()
    @     0x559bc4234a57  doris::vectorized::PreparedFunctionImpl::execute()
```

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-18 14:43:11 +08:00
ac36df29d6 branch-2.1: [fix](p0)Fix prepare stmt case failure. (#53437)
backport: https://github.com/apache/doris/pull/53435
2025-07-17 17:25:15 +08:00
0f1f0cd4ca branch-2.1: [bugfix](load) Fixed import failure caused by the $. symbol (#53337)
Problem Summary:

Of course, master is normal.

1. Routine load task
```
CREATE ROUTINE LOAD kafka_load_task ON test_123
WITH APPEND
COLUMNS(ot,time=from_unixtime(`ot`),id,name,content),
PRECEDING FILTER ((`ot` > 0) AND (`id` != ''))
PROPERTIES
(
"max_error_number" = "0",
"max_filter_ratio" = "1.0",
"max_batch_interval" = "10",
"max_batch_rows" = "300000",
"max_batch_size" = "209715200",
"format" = "json",
"jsonpaths" = "[
        \"$.time\",
        \"$.id\",
        \"$.name\",
        \"$.\"
]",
"strip_outer_array" = "false",
"num_as_string" = "false",
"fuzzy_parse" = "false",
"strict_mode" = "false",
"timezone" = "Asia/Shanghai",
"exec_mem_limit" = "2147483648"
)
FROM KAFKA
(
"kafka_broker_list" = "127.0.0.1:9092",
"kafka_topic" = "test-topic",
"property.kafka_default_offsets" = "OFFSET_BEGINNING",
"property.group.id" = "test-groupid-1"
);
```
`I want to use the $. symbol to put the entire JSON string from Kafka
into the context field. In the version before the fix, an error would
occur: exception=[E-1721] Size of filter doesn't match size of column:
size=0, filter.size=1. The JSON data in Kafka has other fields besides
time, id, and name, but my requirement is only to get time, id, and
name.`

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-17 14:03:58 +08:00
4d59af5aa3 branch-2.1: [fix](job) fix show routine load job result incorrect after alter job property #53038 (#53094)
Cherry-picked from #53038

Co-authored-by: hui lai <laihui@selectdb.com>
2025-07-17 12:16:17 +08:00
982bc81744 branch-2.1: [fix](regression) Spark reads Doris data error #48150 (#53233)
Cherry-picked from #48150

Co-authored-by: caoliang-web <71004656+caoliang-web@users.noreply.github.com>
2025-07-17 12:08:35 +08:00
eb6a5f884e branch-2.1: [fix](Nereids) not process must shuffle when regulate cannot be banned agg #52827 (#53265)
picked from #52827
2025-07-17 11:34:05 +08:00
70b7a0d795 branch-2.1: [fix](regression)Fix test_analyze_mv case. (#53361)
backport: https://github.com/apache/doris/pull/53352
2025-07-17 10:17:48 +08:00
cc387f362f branch-2.1: [opt](identifer) let unicode format as a superset of latin format #48078 (#53264)
cherry-picked from #48078
2025-07-16 19:58:58 +08:00
83cdd44d6b branch-2.1: [fix](catalog) synchronize reset methods in catalog classes and remove Lombok annotations (#53168)
pick (#51787)
2025-07-16 12:28:50 +08:00
d543713ceb [fix & opt](orc) ORC-1525: Fix bad read in RleDecoderV2::readByte and Decompress zlib by libdeflate. (#53078)
### What problem does this PR solve?

Problem Summary:

### Release note

Cherry-pick #51775 

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-16 12:28:12 +08:00
140fb4c6e6 branch-2.1: [fix](iceberg) Fix transaction issues (#52716) (#53238)
bp #52716

Co-authored-by: wuwenchi <wuwenchi@selectdb.com>
Co-authored-by: wuwenchi.wwc <wuwenchi.wwc@oceanbase.com>
2025-07-16 11:46:36 +08:00
e2fb2566dd [fix](nereids)should distinguish hash and other conjuncts for outer join in MultiJoin (#53184)
pick from master https://github.com/apache/doris/pull/50378 and
https://github.com/apache/doris/pull/53051
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-16 09:30:04 +08:00
11142eee3a branch-2.1: [fix](httpserver) Fix lsan check error when stop libevent server #53081 (#53112)
Cherry-picked from #53081

Co-authored-by: Xin Liao <liaoxin@selectdb.com>
2025-07-16 09:28:11 +08:00
1e19a3d852 branch-2.1: [chore](information schema) rename routine_load_job system table to routine_load_jobs (#49515) (#53246)
pick (#49515)

### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-16 09:26:00 +08:00
4b261d43b6 [Enhancement](multi-catalog) Add PredicateFilterTime, DictFilterRewriteTime, LazyReadFilteredRows profile metrics in parquet orc profiles. (#52815)
### What problem does this PR solve?

Problem Summary:

### Release note

Cherry-pick #51248

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-16 09:07:38 +08:00
412e5746a7 [fix](test) Fix the test case results (#53158)
### What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

### Release note

None

### Check List (For Author)

- Test <!-- At least one of them must be included. -->
    - [ ] Regression test
    - [ ] Unit Test
    - [ ] Manual test (add detailed scripts or steps below)
    - [ ] No need to test or manual test. Explain why:
- [ ] This is a refactor/code format and no logic has been changed.
        - [ ] Previous test can cover this change.
        - [ ] No code files have been changed.
        - [ ] Other reason <!-- Add your reason?  -->

- Behavior changed:
    - [ ] No.
    - [ ] Yes. <!-- Explain the behavior change -->

- Does this need documentation?
    - [ ] No.
- [ ] Yes. <!-- Add document PR link here. eg:
https://github.com/apache/doris-website/pull/1214 -->

### Check List (For Reviewer who merge this PR)

- [ ] Confirm the release note
- [ ] Confirm test cases
- [ ] Confirm document
- [ ] Add branch pick label <!-- Add branch pick label that this PR
should merge into -->
2025-07-16 09:02:35 +08:00
f1b7acd098 branch-2.1: [fix](tvf) support compressed json file for tvf and refactor code (#51983) (#53063)
bp: #51983
2025-07-11 06:36:46 +08:00
2f7c15f38e branch-2.1: [Bug](function) fix bitmap_from_base64 function cause heap-buffer-overflow error #53018 (#53044)
Cherry-picked from #53018

Co-authored-by: zhangstar333 <zhangsida@selectdb.com>
2025-07-11 06:35:40 +08:00
03994b8d93 branch-2.1: [fix](iceberg)Fix the thread pool issue used for commit. #51508 (#51528)
Cherry-picked from #51508

---------

Co-authored-by: wuwenchi <wuwenchi@selectdb.com>
Co-authored-by: Mingyu Chen (Rayner) <yunyou@selectdb.com>
2025-07-11 06:31:40 +08:00
6e24d2c66e [chore](lsan) lsan add dl_find_object_update (#53049) 2025-07-10 16:01:14 +08:00
6e12dc4b06 [opt](auditlog) Use varchar(1024) for column frontend_ip of audit log table #52762 (#52985)
pick #52762
2025-07-10 08:47:30 +08:00