Commit Graph

8048 Commits

Author SHA1 Message Date
41fa7bc9fd [bugfix](paimon)Fixed the reading of timestamp with time zone type data for 2.1 (#37716) (#38592)
bp: #37716
2024-08-01 10:23:06 +08:00
184b8cbbe4 [pick](json)fix jsonb deseriaze (#38630)
## Proposed changes
backport: https://github.com/apache/doris/pull/37251
Issue Number: close #xxx

<!--Describe your changes.-->
2024-08-01 10:18:27 +08:00
66ebf709ba [Fix](inverted index) fix fast execute for not_in expr #37745 (#38594)
cherry pick from #37745
2024-07-31 19:58:12 +08:00
7730aa2170 [Fix](inverted index) fix wrong no need read data when same column in inverted index and like function #36687 (#38581)
cherry pick from #36687
2024-07-31 19:41:39 +08:00
a75511ae08 [Feature](inverted index) add no need read data optimize config (#38584)
pick from #36686
2024-07-31 19:39:17 +08:00
232ee74566 [Fix](inverted index) fix memory leak for index compaction (#38586)
Pick from (#36209)
2024-07-31 19:19:38 +08:00
aed0cc8ba0 [Fix](inverted index) remove duplicate stats of inverted_index_query_cache_miss #36707 (#38580)
cherry pick from #36707
2024-07-31 19:18:58 +08:00
7357d7bd3b [Update](inverted index) Add column name to debug point for "no need to read data" optimization #37649 (#38579)
cherry pick from #37649
2024-07-31 19:17:46 +08:00
3b234cfab6 [performance](exec) Performance problem create too many scanner task (#38460)
## Proposed changes

cherry pick the pr: #38430

<!--Describe your changes.-->
2024-07-31 14:34:01 +08:00
aa9bdd76d0 [Pick](Variant) pick some fix #38413 #38364 (#38512) 2024-07-31 11:03:31 +08:00
182bf4d323 [chore](fe) Returns dropped tables in GetMeta request (#38541)
Cherry-pick #38019
2024-07-31 10:57:00 +08:00
017dad8c54 [fix](type)support runtime predicate for time type (#38258) (#38465)
## Proposed changes
https://github.com/apache/doris/pull/38258
Issue Number: close #xxx

<!--Describe your changes.-->
2024-07-31 10:27:36 +08:00
715bcd13f1 [opt](mow) opt mow lookup with sequence column (#38287) (#38406) 2024-07-30 09:46:09 +08:00
cefee4dbc0 [Pick 2.1](clucene) update clucene version (#38496)
## Proposed changes

backport #38482
2024-07-30 09:40:04 +08:00
17d351af80 [fix](csv reader) fix csv parser incorrect if enclosing line_delimiter (#38347) (#38445)
Csv reader parse data incorrect when data enclosing line_delimiter, for
example, line_delimiter is \n and enclose is ', data as follows:
```
'aaaaaaaaaaaa
bbbb'
```
it will be parsed as two columns: `'aaaaaaaaaaaa` and `bbbb',` rather
than one column
```
'aaaaaaaaaaaa
bbbb'
```

The reason why this happened is csv reader will not reset result when
not match enclose in this `output_buf_read`, causing incorrect
truncation was made.

Co-authored-by: Xin Liao <liaoxinbit@126.com>
2024-07-29 14:55:45 +08:00
87cf2d1fb4 [fix](spill) Duplicate calls to Dependency::set_ready() in hash join(#37461) (#38399)
## Proposed changes

pick #37461
Duplicate calling the function `Dependency::set_ready()` will cause
pipeline tasks to be scheduled incorrectly.
2024-07-29 09:44:48 +08:00
e9f12fac47 [fix](load) fix no error url for stream load #38325 (#38417)
cherry pick from #38325
2024-07-28 19:06:57 +08:00
d8744cd3d0 [Opt](load) don't print stack when some errors occur for stream load #38332 (#38418)
cherry pick from #38332
2024-07-28 19:04:24 +08:00
c93f3bd24e [Improvement](bloom filter) Forbid small bloom filter (#38349) (#38392)
Bloom filter has a expected filter ratio when data is enough. This PR
forbid too small bloom filter which has a big bias for filter ratio.

pick #38349
2024-07-26 10:11:31 +08:00
0152a4e86f [config](be) add be config migration_lock_timeout_ms (#38000) (#38337)
backport #38000
2024-07-25 17:36:34 +08:00
91f0301b43 [fix](group commit) Pick some group commit pr (#38320)
Pick https://github.com/apache/doris/pull/38292,
https://github.com/apache/doris/pull/34021,
https://github.com/apache/doris/pull/38228, some modify of
https://github.com/apache/doris/pull/37260, some modify of
https://github.com/apache/doris/pull/37595
2024-07-25 17:32:44 +08:00
54787f51c7 [log](load) print reason when LoadPathMgr failed to delete file (#38235) (#38340)
backport #38235
2024-07-25 17:28:09 +08:00
4288f09e8c [fix](debug_point) Add debug point:reached_limit_early & return_empty_block (#38328)
## Proposed changes

pick #38127 and #37629
2024-07-25 14:46:40 +08:00
a8139d9f39 [Improvement](profile) Provide more info for schedule time (#38290) (#38338)
## Proposed changes

pick #38290

<!--Describe your changes.-->
2024-07-25 14:43:52 +08:00
e2bb86e7f8 [fix](inverted index) fixed in_list condition not indexed on pipelinex (#38178)
## Proposed changes

https://github.com/apache/doris/pull/36565
https://github.com/apache/doris/pull/37842
https://github.com/apache/doris/pull/37921
https://github.com/apache/doris/pull/37386

<!--Describe your changes.-->
2024-07-25 14:42:34 +08:00
a751372e76 [Feature](multi-catalog) Add memory tracker for orc reader/writer and arrow parquet writer。 (#37257)
## Proposed changes

backport #37234
2024-07-25 13:51:59 +08:00
73fc55b203 [Pick](Variant) fix some issue by RQG (#38336)
#38318 
#38291
2024-07-25 12:19:07 +08:00
70cde39fe0 [cherry-pick](branch-21) fix conv function get wrong result as parse overflow (#38001) (#38309)
## Proposed changes

cherry-pick from https://github.com/apache/doris/pull/38001

<!--Describe your changes.-->
2024-07-25 12:06:46 +08:00
e9052e2180 [cherry-pick](branch-21) fix mod function cause core dump (#37999) (#38308)
## Proposed changes
cherry-pick from master https://github.com/apache/doris/pull/37999

<!--Describe your changes.-->

## Proposed changes

Issue Number: close #xxx

<!--Describe your changes.-->
2024-07-25 12:06:21 +08:00
57864e8554 [cherry-pick](branch-21) fix collect_set function core dump without arena pool (#38234) (#38307)
## Proposed changes

cherry-pick from master #38234

<!--Describe your changes.-->
2024-07-25 12:05:52 +08:00
d19b3a5cb6 [fix](function) the bucket number arg of width_bucket should be a positive integer value (#37892) (#38295)
## Proposed changes

pick #37892
2024-07-25 11:56:47 +08:00
21b3fc3d1e [branch-2.1](function) fix coredump for MULTI_MATCH_ANY (#37959) (#38314)
pick https://github.com/apache/doris/pull/37959

[INVALID_ARGUMENT][E33] Compile regexp expression failed. got Embedded
start anchors not supported.. some expressions may be illegal
2024-07-25 11:34:22 +08:00
79a6496bb6 [branch-2.1](function) fix wrong result when convert_tz is out of bound (#37358) (#38313)
## Proposed changes

pick https://github.com/apache/doris/pull/37358

before:
```sql
mysql> select CONVERT_TZ(cast('0000-01-01 00:00:00.00001'  as DATETIMEV1), cast('Asia/Shanghai' as VARCHAR(65533)), cast('America/Los_Angeles' as VARCHAR(65533)));
+---------------------------------------------------------------------------------------------------------------------------------------------------+
| convert_tz(cast('0000-01-01 00:00:00.00001' as DATETIME), cast('Asia/Shanghai' as VARCHAR(65533)), cast('America/Los_Angeles' as VARCHAR(65533))) |
+---------------------------------------------------------------------------------------------------------------------------------------------------+
| q535-12-31 08:01:19                                                                                                                               |
+---------------------------------------------------------------------------------------------------------------------------------------------------+
1 row in set (0.12 sec)
```
now:
```sql
mysql> select CONVERT_TZ(cast('0000-01-01 00:00:00.00001'  as DATETIMEV1), cast('Asia/Shanghai' as VARCHAR(65533)), cast('America/Los_Angeles' as VARCHAR(65533)));
+---------------------------------------------------------------------------------------------------------------------------------------------------+
| convert_tz(cast('0000-01-01 00:00:00.00001' as DATETIME), cast('Asia/Shanghai' as VARCHAR(65533)), cast('America/Los_Angeles' as VARCHAR(65533))) |
+---------------------------------------------------------------------------------------------------------------------------------------------------+
| NULL                                                                                                                                              |
+---------------------------------------------------------------------------------------------------------------------------------------------------+
1 row in set (0.09 sec)
```
2024-07-25 11:32:44 +08:00
3ea26a8c95 [fix](external) record not found file number (#38253) (#38285)
bp #38253
2024-07-25 11:03:19 +08:00
9a40cd5f9f [fix](compaction) fix duplicate key in agg/mor table caused by ordered data compaction (#38224) (#38299)
pick master #38224
2024-07-24 19:03:40 +08:00
ef00dad680 [Fix](multi-catalog) Fix some undefined behaviors. (#38274)
## Proposed changes

backport #37845
2024-07-24 16:14:34 +08:00
c6cd1e2e3d [opt](inverted index) opt value extraction from column to string (#38179)
## Proposed changes

https://github.com/apache/doris/pull/37395
2024-07-24 15:38:41 +08:00
10c5c336d8 [branch-2.1](arrow-flight-sql) Add config arrow_flight_result_sink_buffer_size_rows (#38223)
pick #38221
2024-07-24 15:15:39 +08:00
bb2bc77717 [Pick 2.1](inverted index) skip index compaction for inverted index V2 (#38278)
Enabling index compaction in the inverted index V2 format currently
causes unexpected errors, especially in the case of tables with hybrid
indexes, such as BKD index and Fulltext index together.

backport #38209
2024-07-24 14:38:25 +08:00
792bd7c74a [cherry-pick](branch-2.1) Pick "[Enhancement](group commit)Optimize be select for group commit #35558" (#37830)
Pick #35558
2024-07-24 09:21:07 +08:00
Pxl
cf2120a44a [Chore](brpc) add gc for abafreelist to avoid eagain and set brpc tim… (#38177)
pick from #37888
2024-07-23 21:24:36 +08:00
193be20c86 [feature](csv)Supports reading CSV data using LF and CRLF as line separators. (#37687) (#38099)
bp #37687
2024-07-22 22:53:04 +08:00
aff3f292a7 [fix](pipeline) Fix blocked tasks if query is canceled before opening (#38200)
## Proposed changes

pick #38206

<!--Describe your changes.-->
2024-07-22 18:22:22 +08:00
73199122c2 [enhancement](compaction) Control the parallelism for urgent compacton tasks (#37782) (#38189)
## Proposed changes

For some urgent compaction tasks, their submittion should take
parallelism into account.

Currently, we apply the control policy for data loading in specific.
Other source of urgent tasks are considered as eager.
2024-07-22 17:22:53 +08:00
ce1c268743 [Fix](load) Fix uninitiated load_type (not a valid value for type) of StreamLoadContext when operate_txn_2pc #38033 (#38187)
cherry pick from #38033
2024-07-22 17:20:03 +08:00
2e79612320 [env](compile) compile failed caused by <immintrin.h> (#37068) (#38130)
pick from master #37068
2024-07-20 14:10:31 +08:00
7819c75e55 [fix](shuffle) Fix local exchange dependency blocking (#38160)
## Proposed changes

pick #38151

<!--Describe your changes.-->
2024-07-20 00:19:47 +08:00
947d7594c8 [fix](func)fix array_with_const with larger than max_array_size (#38152)
## Proposed changes
backport: https://github.com/apache/doris/pull/37495
Issue Number: close #xxx

<!--Describe your changes.-->
2024-07-20 00:19:00 +08:00
ffc0d6884d [Fix](load) Fix the channel leak when close wait has been cancelled #38031 (#38125)
cherry pick from #38031
2024-07-19 22:58:54 +08:00
d9fd419e47 [Fix](JsonReader) fix json with duplicate key entry may result out of bound exception (#38147)
#38146
2024-07-19 22:53:02 +08:00