Commit Graph

6023 Commits

Author SHA1 Message Date
8f15f9adf6 [test](case) add test case to improve code coverage (#25516)
[test](case) add test case to improve code coverage (#25516)
2023-11-01 12:51:12 +08:00
7914b7b19e [fix](seek) add SeekCount and SeekTime profile (#26009) 2023-11-01 11:29:29 +08:00
Pxl
696ecc8c83 [Chore](log) adjust error code on too many filtered rows (#26168) 2023-11-01 00:15:56 +08:00
9d83948987 [feature](pipelineX) Use dependency instead of block queue in the runtime filter (#26078) 2023-10-31 22:44:18 +08:00
8c454a3287 [bug](scanner) Fix scanner core dump (#26156) 2023-10-31 22:23:32 +08:00
aadd220eff [FIX](decimalv3) fix large int cast to decimalv3 #26159
fix large int cast to decimalv3 , before if we make data more than max int64 cast to decimalv3 will make result overflow and incorrect
such as

mysql> select CAST(12345678901234567890123456789012345678 AS DECIMALV3(38, 0));
+------------------------------------------------------------------+
| CAST(12345678901234567890123456789012345678 AS DECIMALV3(38, 0)) |
+------------------------------------------------------------------+
|                                             -4302749291975740594 |
+------------------------------------------------------------------+
2023-10-31 18:24:09 +08:00
b98744ae90 [Bug](iceberg)fix read partitioned iceberg without partition path (#25503)
Iceberg does not require partition values to exist on file paths, so we should get the partition value from `PartitionScanTask.partition`.
2023-10-31 18:09:53 +08:00
b137f03921 [feature](profile) add MaxActiveTime and add exectime in pipelineX (#26148) 2023-10-31 17:53:52 +08:00
973657d163 [fix](compile) be cannot compile on MacOS (#26155)
build on MacOS meet error: reference to 'detail' is ambiguous.
Because there is a detail namespace under std
2023-10-31 17:36:00 +08:00
b619982c3a [Enhancement](metrics) Avoid update_process_fd_num reporting no such file error indiscriminately. (#26013)
Avoid `update_process_fd_num` reporting `no such file error `indiscriminately.
2023-10-31 17:28:43 +08:00
ea1554374c [fix](multicast) fix DCHECK failure of block mem reuse for multicast (#26127)
* [fix](multicast) fix DCHECK failure of block mem reuse for multicast
2023-10-31 16:35:26 +08:00
0449a240f4 [Fix](from_unixtime) Keep consistent with MySQL & bug fix (#25966)
Bug fix: implicit convert from int32 -> int64 makes negative time stamp valid, so change signature to int64
Consistent: keep consistent with mysql.
2023-10-31 14:31:24 +08:00
7e6644b1ee [pipelineX](profile) Improve exchange sink profile (#26117) 2023-10-31 14:10:42 +08:00
3d295f0eb0 [Update](regression) add failure injection in inverted index writer (#26121)
* [Update](regression) add failure injection in inverted index writer
2023-10-31 14:09:20 +08:00
8f320944a8 [fix](move-memtable) fix DeltaWriterV2 profile use-after-free (#26110)
The sink who creates the delta writer may be closed while other sinks still using this delta writer.
The parent profile is deconstructed and when the last sink trying to update the profile, it will meet use-after-free.

To address this issue, we record the profile number in delta writer,
and the last sink who close the delta writer will create and update the profile.
2023-10-31 13:52:18 +08:00
462db3d5be [fix](compile) fix macos compile error (#25944) 2023-10-31 11:44:56 +08:00
3184060fa7 [chore](page) print the file path to the log when encountering corruption data pages (#26134) 2023-10-31 10:17:54 +08:00
745b22808f [FIX](decimalv3) fix string to decimalv3 with dot #26072 2023-10-31 09:17:55 +08:00
619f2bbbda [Fix](thrift api) column should be converted if const before serialize to arrow format (#26111)
```
select null;
```

The above sql from thrift api will cause crash in `write_column_to_arrow` since it's column const,
we should convert to full column
2023-10-30 19:56:19 +08:00
0a4863e859 fix mac compile (#25724) 2023-10-30 18:50:53 +08:00
4b5ca106ef [refine](profileV2) use task dependency in profile and print pipelinetask index (#26059) 2023-10-30 18:40:04 +08:00
6eb2cb2d48 [date](parser) Support DateV1 keyword (#25414) 2023-10-30 18:39:22 +08:00
6761dc4113 [coverage](test) improve test coverage (#26096)
improve test coverage
2023-10-30 18:01:55 +08:00
f883d1acfa [enhancement](regression) fault injection for segcompaction test (#25709)
1. generalized debug point facilities from docker suites for
   fault-injection/stubbing cases
2. add segcompaction fault-injection cases for demonstration
3. add -238 TOO_MANY_SEGMENTS fault-injection case for good
2023-10-30 17:36:17 +08:00
3a954cd1aa [fix](function)return NULL rather than 'null' if path not found (#25880)
fix json_extract not return NULL but null
2023-10-30 14:26:44 +08:00
f6a2faf967 [fix](partial update) fix a asan core, cause by release SegmentCacheHandle too early (#26042) 2023-10-30 11:16:48 +08:00
4d3dbf1b3b [fix](function) fix EXPLODE_JSON_ARRAY_STRING function (#25519) 2023-10-30 11:05:27 +08:00
Pxl
15ba886725 do not print stack when stream load catch failed status on thrift (#26062)
do not print stack when stream load catch failed status on thrift
2023-10-30 10:36:01 +08:00
e20cab64f4 [improvement](scan) avoid too many scanners for file scan node (#25727)
In previous, when using file scan node(eq, querying hive table), the max number of scanner for each scan node
will be the `doris_scanner_thread_pool_thread_num`(default is 48).
And if the query parallelism is N, the total number of scanner would be 48 * N, which is too many.

In this PR, I change the logic, the max number of scanner for each scan node
will be the `doris_scanner_thread_pool_thread_num / query parallelism`. So that the total number of scanners
will be up to `doris_scanner_thread_pool_thread_num`.

Reduce the number of scanner can significantly reduce the memory usage of query.
2023-10-29 17:41:31 +08:00
99b45e1938 [fix](Outfile) Export DateTimev2 type of doris to ORC's TimeStamp type (#25470)
Previously,doris's `DateTimev2` was exported to orc as a `String` type.
Now, export doris's `DateTimev2` to orc timestamp type.
2023-10-29 15:59:38 +08:00
6a85f46ff3 [refactor](move-memtable) rename open_stream_sink rpc to open_load_stream (#25883) 2023-10-29 10:07:14 +08:00
bd2f007d52 [fix](load) Fix not check if the tablet is nullptr (#26015) 2023-10-29 10:01:14 +08:00
47955f9109 [bug](intersect) fix intersect node nullable field should depend on _row_descriptor (#26038) 2023-10-28 22:39:36 +08:00
5e884bc065 [Fix](bitmap index) fix is_like_predicate for bitmap index evaluate (#25961) 2023-10-27 20:23:42 +08:00
606223ab62 Revert "[refactor](pipeline) simplify runtime state ctor (#25995)" (#26029)
This reverts commit a01922cdc55e2b3a63d9a9aafb38ac5ed64c6dd3.
2023-10-27 18:15:30 +08:00
340a37116f [cleanup](load) remove unused dynamic schema code (#25932) 2023-10-27 16:21:40 +08:00
a01922cdc5 [refactor](pipeline) simplify runtime state ctor (#25995) 2023-10-27 15:45:29 +08:00
a6d64c6b25 [refine](profilev2) format profilev2 (#25963) 2023-10-27 14:26:38 +08:00
4ee24d8cb1 [FIX](upgrade)fix upgrade for predict column delete collection type will make core #26006 2023-10-27 14:25:45 +08:00
46d40b1952 [refactor](executor)Remove empty group logic #26005 2023-10-27 14:24:41 +08:00
ae8e6fbe59 [Fix](inverted index) fix empty array index writer bug (#25984) 2023-10-27 11:27:44 +08:00
cedab51676 [enhancement](UT) add unit test cases about bitmap (#25867)
* [fix](bitmap) incorrect result of operator ==

* [enhancement](UT) add unit test cases about bitmap
2023-10-27 11:27:14 +08:00
bac523956f [Update](clucene) update clucene version to latest (#25981) 2023-10-27 10:42:56 +08:00
0eb86919cc [fix](block-convertor) clear filter bitmap per batch to get the correct filtered rows (#25989) 2023-10-27 10:40:07 +08:00
c86fad7cbd [Fix](orc-reader) Fix orc decimal128 scale issue. (#25977) 2023-10-26 08:50:18 -05:00
d9a3aa5c06 [pipelineX](improvement) improve local shuffle (#25964) 2023-10-26 21:42:43 +08:00
69015df767 [Fix](inverted index) reorder ConjunctionQuery deconstruct order (#25972) 2023-10-26 08:34:09 -05:00
2679fa4ea7 [improvement](tablet clone) furthur repair replicas should be check even if they are versions catchup (#25551) 2023-10-26 18:14:40 +08:00
c3527672a5 [refactor & pipelineX][pick fix] Pick fix of predicate pushdown to pipelineX (#25953)
Co-authored-by: JackDrogon <jack.xsuperman@gmail.com>
2023-10-26 18:04:43 +08:00
c1d64a7128 [Feature](datatype) Add IPv4/v6 data type for doris (#24965) 2023-10-26 17:33:28 +08:00