Commit Graph

14425 Commits

Author SHA1 Message Date
4b5ca106ef [refine](profileV2) use task dependency in profile and print pipelinetask index (#26059) 2023-10-30 18:40:04 +08:00
6eb2cb2d48 [date](parser) Support DateV1 keyword (#25414) 2023-10-30 18:39:22 +08:00
6761dc4113 [coverage](test) improve test coverage (#26096)
improve test coverage
2023-10-30 18:01:55 +08:00
f883d1acfa [enhancement](regression) fault injection for segcompaction test (#25709)
1. generalized debug point facilities from docker suites for
   fault-injection/stubbing cases
2. add segcompaction fault-injection cases for demonstration
3. add -238 TOO_MANY_SEGMENTS fault-injection case for good
2023-10-30 17:36:17 +08:00
844b7c8cba [enhancement](Nereids): check stats unreliable when deriving stats (#26103)
check stats unreliable when deriving stats
2023-10-30 17:21:42 +08:00
3a954cd1aa [fix](function)return NULL rather than 'null' if path not found (#25880)
Fix json_extract returning the string 'null' instead of NULL when the path is not found.
2023-10-30 14:26:44 +08:00
0d956e90cf [Performance](Nereids): optimize GroupExpressionMatching (#26084) 2023-10-30 14:05:25 +08:00
eb2cbae6e3 [Fix](MySqlLoad) Fix meaningless thread creation every time checkpoint mysql load (#26031)
Add a unified thread name setting method
2023-10-30 13:55:40 +08:00
bddb6b6ddc [test](jdbc) fix unstable group_concat distinct case (#26076)
`group_concat` with `distinct` may return unstable results,
so this removes `distinct` and adds `order by` to make the test case stable.
2023-10-30 12:46:11 +08:00
00c30f075f [fix](nereids)only push down subquery in non-window agg functions (#26034) 2023-10-30 11:32:10 +08:00
f6a2faf967 [fix](partial update) fix a asan core, cause by release SegmentCacheHandle too early (#26042) 2023-10-30 11:16:48 +08:00
4d3dbf1b3b [fix](function) fix EXPLODE_JSON_ARRAY_STRING function (#25519) 2023-10-30 11:05:27 +08:00
bb814ccff8 [fix](Nereids) bound error when in diff value of table name case config (#25957)
- when lower_case_table_names = 0, references to table names should be case sensitive
- when lower_case_table_names = 1 or 2, CTE names should be case insensitive
For more details, please read the Doris doc: https://doris.apache.org/docs/advanced/variables#supported-variables
2023-10-29 22:04:42 -05:00
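The case rules listed in the commit above can be sketched roughly as follows. This is a simplified illustration, not the Nereids binding code: the function name `names_match` is hypothetical, and it applies one comparison rule to any identifier, whereas the commit distinguishes table names (mode 0) from CTE names (modes 1 and 2).

```python
def names_match(reference: str, declared: str, lower_case_table_names: int) -> bool:
    """Hypothetical helper: compare a name reference against a declared name.

    Assumption: mode 0 compares names exactly, while modes 1 and 2
    compare them without regard to case, as the commit message describes.
    """
    if lower_case_table_names not in (0, 1, 2):
        raise ValueError("lower_case_table_names must be 0, 1, or 2")
    if lower_case_table_names == 0:
        # Case-sensitive comparison.
        return reference == declared
    # Case-insensitive comparison for modes 1 and 2.
    return reference.lower() == declared.lower()
```

Under these assumptions, `names_match("Orders", "orders", 0)` is false, while the same call with mode 1 or 2 is true.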
d47a946aab [fix](auth)Fix no priv to insert to mysql catalog (#25678)
Fix: a user could not insert into a MySQL catalog even with load permission.
2023-10-30 10:59:53 +08:00
Pxl
15ba886725 do not print stack when stream load catch failed status on thrift (#26062)
do not print stack when stream load catch failed status on thrift
2023-10-30 10:36:01 +08:00
576972ee79 [opt](tools) analyze with full in tools scripts #25873 2023-10-30 08:19:36 +08:00
28dce9f226 [opt](nereids) let DBA ignore some runtime filters (#25933)
example:
set ignore_runtime_filter_ids="3, 1";
after this setting, RF003 and RF001 will be ignored
2023-10-29 21:39:25 +08:00
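As a rough illustration of the example in the commit above, the sketch below maps the setting value "3, 1" to the filter names RF003 and RF001. The parsing rules, the three-digit zero padding, and both function names are assumptions inferred from that one example, not the actual Doris implementation.

```python
def parse_ignored_filter_ids(setting: str) -> set[int]:
    """Hypothetical parser for a comma-separated list of runtime filter ids."""
    ids = set()
    for token in setting.split(","):
        token = token.strip()
        if token:  # Skip empty tokens such as those from a trailing comma.
            ids.add(int(token))
    return ids


def filter_name(filter_id: int) -> str:
    """Assumed display format: 'RF' followed by a zero-padded three-digit id."""
    return f"RF{filter_id:03d}"


ignored = parse_ignored_filter_ids("3, 1")
ignored_names = sorted(filter_name(i) for i in ignored)
```

With the setting from the commit message, `ignored_names` holds the two names mentioned there, RF001 and RF003.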
e20cab64f4 [improvement](scan) avoid too many scanners for file scan node (#25727)
Previously, when using a file scan node (e.g., querying a Hive table), the maximum number of scanners for each scan node
was `doris_scanner_thread_pool_thread_num` (default 48).
With a query parallelism of N, the total number of scanners would be 48 * N, which is too many.

This PR changes the logic so that the maximum number of scanners for each scan node
is `doris_scanner_thread_pool_thread_num / query parallelism`. The total number of scanners
is therefore at most `doris_scanner_thread_pool_thread_num`.

Reducing the number of scanners can significantly reduce the memory usage of a query.
2023-10-29 17:41:31 +08:00
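The arithmetic in the commit above can be illustrated with a minimal sketch. Only the default of 48 and the division by query parallelism come from the commit message; the function names, the integer division, and the floor of one scanner per node are assumptions made for illustration.

```python
def max_scanners_per_node(thread_pool_size: int, parallelism: int) -> int:
    """Hypothetical per-scan-node scanner cap after the change."""
    if parallelism < 1:
        raise ValueError("parallelism must be at least 1")
    # Assumption: keep at least one scanner per node so a scan can progress.
    return max(1, thread_pool_size // parallelism)


def total_scanners_before(thread_pool_size: int, parallelism: int) -> int:
    """Old behavior: every scan node may use the full pool size."""
    return thread_pool_size * parallelism


def total_scanners_after(thread_pool_size: int, parallelism: int) -> int:
    """New behavior: the pool size is divided across the parallel scan nodes."""
    return max_scanners_per_node(thread_pool_size, parallelism) * parallelism
```

For a pool size of 48 and a parallelism of 8, the old logic allows 384 scanners in total, while the new logic allows 6 per node and 48 in total. Note that the assumed floor of one scanner per node means the total can exceed the pool size when the parallelism is larger than the pool size.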
99b45e1938 [fix](Outfile) Export DateTimev2 type of doris to ORC's TimeStamp type (#25470)
Previously, Doris's `DateTimev2` was exported to ORC as a `String` type.
Now, Doris's `DateTimev2` is exported as the ORC timestamp type.
2023-10-29 15:59:38 +08:00
1e5ed91e4d [fix](fe-meta) make FE meta be compatible with branch 2.0 (2.0.2.1 or later) (#26048)
In branch 2.0, we changed the read/write method of AnalysisManager
and renamed the image module to AnalysisMgrV2.
The same change is needed in the master branch so that users
can upgrade Doris from branch-2.0 to master.

After this PR, user can:

- upgrade from 2.0.x(or branch-2.0) to master
2023-10-29 14:45:21 +08:00
501c6096dd Revert "[Test](multi-catalog) Add tpcds sf100 hive shape. (#25639)" (#26069)
This reverts commit 3beba1764c01b6712b108556433c96429c59cc45.
2023-10-29 12:45:32 +08:00
6a85f46ff3 [refactor](move-memtable) rename open_stream_sink rpc to open_load_stream (#25883) 2023-10-29 10:07:14 +08:00
bd2f007d52 [fix](load) Fix not check if the tablet is nullptr (#26015) 2023-10-29 10:01:14 +08:00
47955f9109 [bug](intersect) fix intersect node nullable field should depend on _row_descriptor (#26038) 2023-10-28 22:39:36 +08:00
c32cd5b25a FE ssl certificates are of various formats #26039 #26044 2023-10-28 22:37:38 +08:00
517233c7fb [fix](pipeline) remove pipelinex required on branch-2.0 (#26053) 2023-10-28 16:21:41 +08:00
bcdd500426 [pipeline](branch-2.0) pr to branch-2.0 also require to run regressio… (#25942) 2023-10-27 10:18:25 -05:00
365fdd2f4d [feature](backup) add property to remove snapshot before creating repo (#25847)
Doris is not responsible for managing snapshots, but it needs to clear all
snapshots before doing backup/restore regression testing, so a property is
added to indicate that existing snapshots need to be cleared when creating a
repo.

In addition, a regression test case for backup/restore has been added.
2023-10-27 21:03:26 +08:00
c715facafa [fix](window_function) window function first_value/last_value should be always nullable (#26014) 2023-10-27 20:53:48 +08:00
5e884bc065 [Fix](bitmap index) fix is_like_predicate for bitmap index evaluate (#25961) 2023-10-27 20:23:42 +08:00
d953e5c8f4 [feature](Nereids): Convert topn(x, 1) to max(x) (#26004) 2023-10-27 18:45:21 +08:00
3beba1764c [Test](multi-catalog) Add tpcds sf100 hive shape. (#25639)
Add tpcds sf100 hive shapes.

Query64 is temporarily disabled because its shape differs from the EMR cluster's after collecting metadata with analyze table xxx.
The root cause still needs analysis; the query will be re-enabled in a future PR.
2023-10-27 18:39:29 +08:00
606223ab62 Revert "[refactor](pipeline) simplify runtime state ctor (#25995)" (#26029)
This reverts commit a01922cdc55e2b3a63d9a9aafb38ac5ed64c6dd3.
2023-10-27 18:15:30 +08:00
2464a22ac5 [fix](meta) fix follower sync image timeout after checkpoint. (#26003)
The image file of our cluster reached 2.3 GB. After a checkpoint, followers timed out while synchronizing the image, so the bdb directory kept growing.

related pr: #25768
2023-10-27 16:58:42 +08:00
165957658a [fix](Nereids) could not run multi group_concat distinct (#25851)
Queries with multiple group_concat distinct calls that take more than one parameter could not run.
This bug is not specific to group_concat, but group_concat is usually called with a literal
parameter, so it brought the problem to light.

The original logic assumed that only distinct aggregate functions with
zero or one parameter could run in multi-distinct mode. That is wrong:
any distinct aggregate function with at most one input slot
can be processed.

Consider this SQL:
```sql
SELECT
  group_concat(distinct c1, ','), group_concat(distinct c2, ',')
FROM t
GROUP BY c3
```
2023-10-27 03:42:00 -05:00
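The distinction between parameters and input slots in the commit above can be sketched as follows. This is an illustrative model only: the `Arg` type and both function names are hypothetical, and treating every non-literal argument as one input slot is an assumption, not the Nereids implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Arg:
    """Hypothetical model of one aggregate function argument."""
    text: str
    is_literal: bool


def input_slot_count(args: list[Arg]) -> int:
    # Assumption: each distinct non-literal argument is one input slot.
    return len({a.text for a in args if not a.is_literal})


def supports_multi_distinct(args: list[Arg]) -> bool:
    # Rule from the commit message: at most one input slot,
    # regardless of how many parameters the call has.
    return input_slot_count(args) <= 1


# group_concat(distinct c1, ','): two parameters, one input slot.
one_slot = [Arg("c1", False), Arg("','", True)]
# A call with two column arguments has two input slots.
two_slots = [Arg("c1", False), Arg("c2", False)]
```

Here `supports_multi_distinct(one_slot)` is true even though the call has two parameters, while `supports_multi_distinct(two_slots)` is false.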
11e04f76fb [Enhancement](submodule) print current submodule commit id when building (#26017) 2023-10-27 16:28:27 +08:00
340a37116f [cleanup](load) remove unused dynamic schema code (#25932) 2023-10-27 16:21:40 +08:00
a01922cdc5 [refactor](pipeline) simplify runtime state ctor (#25995) 2023-10-27 15:45:29 +08:00
cad3d8fb99 [doc](complex type) add docs for complex and nested types (#25865)
Add docs for complex and nested types.
2023-10-27 15:04:53 +08:00
d57fc263cf [fix](Nereids) support create dup table without key (#25954) 2023-10-27 01:57:39 -05:00
4aebe879a6 [feature](Nereids) support complex project in graph simplifier (#26002)
Reject the edge which has an alias when ordering edge
2023-10-27 14:38:54 +08:00
a6d64c6b25 [refine](profilev2) format profilev2 (#25963) 2023-10-27 14:26:38 +08:00
4ee24d8cb1 [FIX](upgrade)fix upgrade for predict column delete collection type will make core #26006 2023-10-27 14:25:45 +08:00
46d40b1952 [refactor](executor)Remove empty group logic #26005 2023-10-27 14:24:41 +08:00
e3dbbed47c [fix](Nereids) query failed when fold constant on be failed (#25982)
We translate the expression to a legacy one when folding constants on BE.
Sometimes we generate an invalid expression that cannot be translated,
so we should catch the translation exception to avoid failing the query.
2023-10-27 01:22:08 -05:00
deb1ad8264 [enhancement](Nereids) add mode no_backslash_escapes (#25822)
add mode [no_backslash_escapes](https://dev.mysql.com/doc/refman/8.0/en/sql-mode.html)
2023-10-27 00:43:32 -05:00
ae8e6fbe59 [Fix](inverted index) fix empty array index writer bug (#25984) 2023-10-27 11:27:44 +08:00
cedab51676 [enhancement](UT) add unit test cases about bitmap (#25867)
* [fix](bitmap) incorrect result of operator ==

* [enhancement](UT) add unit test cases about bitmap
2023-10-27 11:27:14 +08:00
6a62835f7e [fix](docker) Add docker ignore files and change docker default sourc… (#25696) 2023-10-27 11:23:21 +08:00
a3ac8b98b8 [fix](planner)nvl should match function signature in the same way as ifnull (#25849) 2023-10-27 11:13:30 +08:00