Commit Graph

5060 Commits

Author SHA1 Message Date
5f73668626 [log] add more error info for hdfs reader writer (#10475) 2022-06-29 12:02:27 +08:00
abd10f0f3e [feature-wip](multi-catalog) Impl FileScanNode in be (#10402)
Define a new file scanner node for hms table in be.
This file scanner node differs from the broker scan node as follows:
1. The broker scan node defines a src slot and a dest slot, so there are two memory copies: first from the file to the src slot,
    and then from the src slot to the dest slot. FileScanNode needs only a one-step memory copy, directly from the file to the dest slot.
2. The broker scan node reads all the fields in the file into the src slot, whereas FileScanNode reads only the needed fields.
3. The broker scan node converts values to string type for the src slot and then casts them to the dest slot type,
    whereas FileScanNode reads values in their final type.

For now FileScanNode is standalone code, but the file scan and broker scan will be unified in the future.
2022-06-29 11:04:01 +08:00
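The difference between the two scan paths described in the commit above can be illustrated with a minimal Python sketch. This is not Doris code: `broker_style_scan`, `file_style_scan`, the row and schema shapes, and the copy counter are all hypothetical and exist only to make the "two copies versus one copy" point concrete.

```python
from typing import Callable, Dict, Tuple

# Hypothetical schema: destination column name -> converter to its final type.
Schema = Dict[str, Callable[[str], object]]


def broker_style_scan(raw_row: Dict[str, str], schema: Schema) -> Tuple[Dict[str, object], int]:
    """Two-step path: every field is copied into a string 'src' row, then cast into 'dest'."""
    # Copy 1: all fields in the file go to the src row as strings.
    src = {name: str(value) for name, value in raw_row.items()}
    # Copy 2: only the destination columns are cast from src to their final type.
    dest = {name: cast(src[name]) for name, cast in schema.items()}
    return dest, len(src) + len(dest)


def file_style_scan(raw_row: Dict[str, str], schema: Schema) -> Tuple[Dict[str, object], int]:
    """One-step path: only the needed fields are read, directly in their final type."""
    dest = {name: cast(raw_row[name]) for name, cast in schema.items()}
    return dest, len(dest)


raw_row = {"id": "7", "name": "a", "unused": "x"}
schema: Schema = {"id": int, "name": str}
broker_dest, broker_copies = broker_style_scan(raw_row, schema)
file_dest, file_copies = file_style_scan(raw_row, schema)
```

Both functions produce the same destination row, but in this toy model the broker-style path touches five field values while the file-style path touches two.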
9aa800141d [fix](ut)(nereids) the check bound function lacks recursive processing (#10357) 2022-06-29 10:40:13 +08:00
8cbdbb5658 [Enhancement] a better vec version for count_zero_num (#10472) 2022-06-29 10:26:42 +08:00
deeb3028ad [Enhancement] [Memory] [Vectorized] Stress test and optimize memory allocation (#9581)
* vec stress test, Allocator introduce chunkallocator

* fix comment
2022-06-29 02:57:51 +08:00
0a36c34326 [feature](nereids) costAndEnforcerJob interim solution (#10468)
To complete the SSB test, temporarily add an implementation of costAndEnforcerJob that creates an OptimizeGroupjob for every child of the group.
2022-06-28 18:45:23 +08:00
f5936aa7ce [enhancement](Nereids): add more implmentation rules. (#10335)
Add more implementation rules.

Currently some `logical` and `physical` operators differ. This change adjusts the code so that they match.

Implementation
- Sort: only heap sort
- Agg
- OlapScan
2022-06-28 17:08:33 +08:00
7898c818e9 Revert "[improvement]Do not lazily read dict encoded columns (#10420)" (#10466)
Reason:
1. Performance degradation in some queries
2. Coredump bug: #10419

This reverts commit 904e7576797c796b809823647a769bc1d4569115.
2022-06-28 15:43:48 +08:00
17eb8c00d3 [feature] add table valued function framework and numbers table valued function (#10214) 2022-06-28 14:01:57 +08:00
2f30c7cf03 [doc] Update the URL download link address in Quick Start (#10457) 2022-06-28 13:53:07 +08:00
1f2bf39140 [feature-wip](multi-catalog) get catalog name from TableName (#10435) 2022-06-28 10:42:37 +08:00
498a80547c [fix](fe-ut) fix fe ut and build.sh bug (#10432) 2022-06-27 19:01:05 +08:00
904e757679 [improvement]Do not lazily read dict encoded columns (#10420) 2022-06-26 22:08:48 +08:00
ca94867b4e [Feature-wip] add date v2 type (#9916) 2022-06-26 16:07:56 +08:00
HB
f6ef1aad5c [fix](backup) fix mkdir failed (#10422) (#10423) 2022-06-26 09:55:48 +08:00
4408231765 [fix](random-distribution) Make aggregate keys table with replace type columns and unique keys table can only have hash distribution to make data computing correctly (#10414) 2022-06-26 09:52:16 +08:00
79ad05eec6 [fix](doe) fix doe on es v8 (#10391)
Doris on ES 8 does not work because of the change to mapping types. Using types has been discouraged since ES 7,
and support for types was removed in ES 8.

1. /_mapping no longer supports include_type_name
2. /_search no longer supports specifying a type
2022-06-26 09:51:29 +08:00
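The two incompatibilities listed in the commit above can be sketched as version-dependent request paths. This is a minimal Python illustration, not the actual Doris-on-ES client: `mapping_path` and `search_path` are hypothetical helpers, and sending `include_type_name=true` on ES 7 is an assumption about how a client might keep the typed response shape.

```python
def mapping_path(index: str, es_major: int) -> str:
    """Build the mapping request path for a given Elasticsearch major version."""
    if es_major == 7:
        # Assumption: on ES 7 the client opts in to the typed response shape.
        return f"/{index}/_mapping?include_type_name=true"
    # ES 8 removed include_type_name; this sketch also omits it for older versions.
    return f"/{index}/_mapping"


def search_path(index: str, doc_type: str, es_major: int) -> str:
    """Build the search request path, dropping the type where it is unsupported."""
    if es_major >= 8 or not doc_type:
        # ES 8 removed typed endpoints, so only the index may appear in the path.
        return f"/{index}/_search"
    return f"/{index}/{doc_type}/_search"
```

For example, `search_path("logs", "doc", 8)` yields a typeless path, while the same call with major version 6 keeps the type segment.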
fba0051ed3 [doc] Fix Slack invitation link on README.md not working (#10406) 2022-06-26 01:23:07 +08:00
a0e330a156 [website] add website external resource (#10416) 2022-06-26 01:22:14 +08:00
Pxl
4750e94746 set default do not build benchmark-tool && and use lld/gold (#10215) 2022-06-25 22:31:11 +08:00
69134fb3f2 [docs] Add regression test for weekday function (#10253) 2022-06-25 22:29:32 +08:00
eebfbd0c91 Revert "[fix](vectorized) Support outer join for vectorized exec engine (#10323)" (#10424)
This reverts commit 2cc670dba697a330358ae7d485d856e4b457c679.
2022-06-25 22:18:08 +08:00
7fe4b20da3 [feature-wip](multi-catalog) refactor catalog interface (#10320) 2022-06-25 21:51:54 +08:00
f12b22a51e [Bug][Vectorized] Fix cord dump of BloomFilter not support DATE type (#10417) 2022-06-25 21:29:32 +08:00
4ca257a1cd [improvement] Modify the default value of doris_scan_range_max_mb (#10232)
* [improvement] Modify the default value of doris_scan_range_max_mb

* fix regression-test
2022-06-25 19:48:49 +08:00
14a9a676e7 [BUG] fix DCHECK failed (#10396) 2022-06-25 17:08:40 +08:00
eb25df5a2c [fix] (mem tracker) Fix inaccurate mem tracker leads to load OOM (#10409)
* fix load tracker

* fix comment
2022-06-25 14:13:02 +08:00
8abd00dcd5 [feature-wip](multi-catalog) Add catalog name to information schema. (#10349)
The information schema database needs to show the catalog name now that multi-catalog is supported.
This is step 1: add the catalog name to the schemata table.
2022-06-25 11:53:04 +08:00
3757bd521a [feature](nereids)Convert the expression from nereids to stale expr. (#10343)
Add ExpressionConverter.java to convert the expression from nereids to stale expression
2022-06-25 11:16:52 +08:00
7921320124 [fix]Make sure only call once set_dict_encoding_type for each ColumnReader (#10389) 2022-06-25 04:31:19 +08:00
df908873bb [improvement]Use std::iota to set values of _block_rowids in SegmentIterator::_read_columns_by_index (#10386) 2022-06-25 04:30:23 +08:00
89860fd0e3 [opt] delete the redundant parameter of _execute_non_nullable (#10173)
1. This PR deletes the redundant parameter of _execute_non_nullable.
2. This modification does not affect the function "element_at".
2022-06-24 19:22:50 +08:00
476be35961 [TYPO] fix typo 'destory' -> 'destroy' (#10373) 2022-06-24 19:11:28 +08:00
e82d8433be [feature](nereids) Integrate nereids into current SQL process framework (#10304)
To integrate the nereids optimizer, with its new SQLParser and Planner, into the existing SQL process framework, I abstracted an interface named "Planner" and let both the nereids planner and the stale optimizer's planner implement it. To distinguish it
from the original Planner, I renamed the original Planner to OriginalPlanner.

As we don't want to impact the existing logic too much, I defined a LogicalPlanAdapter to adapt the logicalPlan that is the output of the new parser to the existing code.

Besides, as the MySQL protocol supports sending multiple statements in one packet, I added the Nereids#SparseSQL method to handle this properly.
2022-06-24 18:26:26 +08:00
8f2b2b4457 add dynamic partition DataProperty (#10338) 2022-06-24 16:57:56 +08:00
8a49c7ef04 [chore] Rename Doris binary output format 2022-06-24 15:30:05 +08:00
9036f93df4 Revert "[improvement](function) optimize substr performance (#10169)" (#10390)
This reverts commit 2335d233f1f52eb64a380b4c9959becdf182b71b.
2022-06-24 14:38:52 +08:00
516f5b1789 [feature-wip](multi-catalog) support to switch catalog (#10381)
Add `switch catalog` stmt with privilege check
2022-06-24 10:42:13 +08:00
f15d84335c [websit][doc]Modify image path (#10361) 2022-06-24 09:12:20 +08:00
ad8da109c3 [community] update PMC & Committer list (#10360) 2022-06-24 09:11:49 +08:00
0cbf39d879 [doc] Fix typo (#10333) 2022-06-24 09:10:59 +08:00
2cc670dba6 [fix](vectorized) Support outer join for vectorized exec engine (#10323)
In a vectorized scenario, the query plan will generate a new tuple for the join node.
This tuple mainly describes the output schema of the join node.
Adding this tuple mainly solves the problem that the input schema of the join node is different from the output schema.
For example:
1. The case where columns on the null-producing side of an outer join are converted to nullable.
2. The projection of the outer tuple.
2022-06-24 08:59:30 +08:00
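The first case in the commit above, where the join node's output schema differs from its input schema, can be sketched as follows. This is an illustrative Python model only: `Column` and `join_output_schema` are hypothetical names, not part of the vectorized engine.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Column:
    name: str
    nullable: bool


def join_output_schema(left: List[Column], right: List[Column], join_type: str) -> List[Column]:
    """Return the output columns of a join, marking the null-producing side as nullable."""
    # A right or full outer join can emit rows with no left match, so left columns may be NULL.
    left_nullable = join_type in ("right_outer", "full_outer")
    # A left or full outer join can emit rows with no right match, so right columns may be NULL.
    right_nullable = join_type in ("left_outer", "full_outer")
    out = [replace(c, nullable=c.nullable or left_nullable) for c in left]
    out += [replace(c, nullable=c.nullable or right_nullable) for c in right]
    return out


left_cols = [Column("a", False)]
right_cols = [Column("b", False)]
left_outer = join_output_schema(left_cols, right_cols, "left_outer")
```

Here the input column `b` is non-nullable, but in the left outer join's output tuple it becomes nullable, which is why a separate output tuple is needed.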
c288bb363a [f][docs] Restore images in new official website. (#10262) 2022-06-24 08:58:29 +08:00
1bd0d7ded5 [typo] Fix typos in comments (#10252) 2022-06-24 08:57:54 +08:00
2335d233f1 [improvement](function) optimize substr performance (#10169)
Optimize substr performance, for a speedup of about 1.5~2x.
2022-06-24 08:57:31 +08:00
e3d549cdfa [fix](proc) Fix bug that TrashProcDir cannot show backend trash detail normally (#10365) 2022-06-24 08:56:32 +08:00
d5be190cf9 [fix](planner)infer predicate rule infer binary pridicate when two slot do not equal by mistake (#10356)
The infer predicate rule mistakenly inferred a binary predicate when the two slots were not equal.
2022-06-24 08:56:12 +08:00
f06a06d623 [chore](fe)remove java doc period end check in checkstyle (#10329)
We do not generate real Javadoc. All Javadoc comments are only used to make the code logic easier to understand,
so the Javadoc style check should be loosened. Remove the period character check from the summary Javadoc check rule.
2022-06-24 08:55:53 +08:00
b1d9b54805 BetaRowsetReader::next_block does not return 0 rows before eof (#10367) 2022-06-24 07:22:45 +08:00
2e661ac63f [improvement]Support vectorized predicates for dict columns (#10370) 2022-06-24 07:21:26 +08:00