Commit Graph

6043 Commits

Author SHA1 Message Date
1d9d99c8ec [fix](Nereids)join output order need same with child plan node output when translate (#12130)
In BE, There is an implicit convention that HashJoinNode's left child's output Slot must before right child's output slot in intermediateTuple.

However, after we do commute rule on join plan in Nereids, this convention will be broken and cause core dump in BE.

There are two way to fix this problem:
1. add a project on join after we do commute
2. reorder output of join node when we do translate

Since we cannot translate project yet because BE projection support is on going(#11842). So we use second way to fix it now. After the project translation could work correctly, we should use the first way to fix it.
2022-08-29 15:32:55 +08:00
454e21daca [Docs](array type) Update array type documentation (#12097) 2022-08-29 15:03:26 +08:00
ed131b8eb0 [Bugfix](coredump) fix coredump cause by fmt::format param malformt (#12138)
fix coredump cause by fmt::format param malformt
2022-08-29 12:45:22 +08:00
af09c1f4eb [Improvement](window funnel) restrict timestamp to datetime type in window funnel (#12123) 2022-08-29 12:14:04 +08:00
957bf98784 [docs](remote-udaf) change title (#12133) 2022-08-29 12:11:48 +08:00
5f7d6e8f2b [Refactor](predicate) Unify Conditions and ColumnPredicate (#11985) 2022-08-29 12:11:22 +08:00
3ca6f34c87 [fix](view) Fix view not showing specific lengths for varchar type (#12107) 2022-08-29 12:09:48 +08:00
Pxl
7829c21b20 [Bug](lateral-view) fix some conjunct not work on lateral view #12105 2022-08-29 12:08:20 +08:00
fb7c42a4e3 [fix](fe) Fixed alterOp from HashSet to EnumSet (#12094)
Change the HashSet to EnumSet of the AlterOp's currentOps for better performance
2022-08-29 12:07:31 +08:00
62e3bd338e [refactor](BE) return error status when vslot_ref contains invalid slot_id (#12106)
In current implementation, we detect invalid slot at execute phase. At execute phase, it is hard to get useful information for further debug. This pr moves error detection ahead to prepare phase, so that we can log related tuple descriptors.
2022-08-29 12:07:08 +08:00
db07e51cd3 [refactor](status) Refactor status handling in agent task (#11940)
Refactor TaggableLogger
Refactor status handling in agent task:
Unify log format in TaskWorkerPool
Pass Status to the top caller, and replace some OLAPInternalError with more detailed error message Status
Premature return with the opposite condition to reduce indention
2022-08-29 12:06:01 +08:00
eb3e0b2f7d [test](Nereids): add more plan equals test for Nereids (#12127)
- add more plan equals test for Nereids
- fix join equals bugs
2022-08-29 11:46:30 +08:00
ac425d4bf3 [fix](remote)Fix bug for cache reader (#12104) 2022-08-29 11:28:17 +08:00
fe9767941d [fix](array-type) adjust enable_array_type config (#12071)
Problem:
1. `enable_array_type` is masterOnly;
2. dynamic open config only affect FE MASTER
`admin set frontend config("enable_array_type"="true");`
3. query in FE FOLLOWER will fail, because of `enable_array_type` is false in FE FOLLOWER
`select * from table_with_array `

Solution:
Only check `enable_array_type` while creating new tables with array column.

Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-08-29 11:10:52 +08:00
44c4a45f72 [fix](array-type) fix the wrong data when use stream load to import '\N' (#12102)
Co-authored-by: hucheng01 <hucheng01@baidu.com>
2022-08-29 09:53:37 +08:00
7fbcf3c8ba [api-change](http) change kill query http api by using query id (#12120)
Now user can cancel query id by http by following steps:

Get query id by trace id
cancel query by query id
The modified api has not been released yet.
2022-08-29 09:51:51 +08:00
dec576a991 [feature-wip](parquet-reader) generate null values and NullMap for parquet column (#12115)
Generate null values and NullMap for the nullable column by analyzing the definition levels.
2022-08-29 09:30:32 +08:00
f26c051835 [Typo](doc) Rewrite the Quick Start document (#11956) 2022-08-29 09:25:58 +08:00
acd7ab379d [feature](Nereids)add range partition prune (#11964)
1. Rewrite Filter(Project) to Project(Filter) to make sure when do partition prune the tree looks like this: Project(Filter(OlapScan)).
2. Enable the MergeConsecutiveProject MergeConsecutiveFilter rules.
3. prune range partition just like what Legacy Planner do.
2022-08-28 23:39:09 +08:00
6e6269c682 [Improvement](load) accelerate streamload and compaction (#12119)
* [Improvement](load) accelerate streamload and compaction
2022-08-28 23:10:47 +08:00
2f88dfd7c5 [regression-test](unique) Add tpch sf0.1 test under unique table with zstd (#12122) 2022-08-27 21:39:09 +08:00
5693fdaf9c fix configs default value in docs (#12047)
fix configs default value in docs
2022-08-27 10:08:54 +08:00
bd4f994c7e update document of alter table oartition (#12077)
update document of alter table oartition
2022-08-27 10:08:13 +08:00
8c5005f247 [typo](doc)Add the backup operation, using the following instructions from S3 (#12091)
Add the backup operation, using the following instructions from S3
2022-08-27 10:07:15 +08:00
a6e2e2f3bc [feature](remote)Add cache files cleaner for remote olap files (#11959) 2022-08-26 23:59:36 +08:00
0b5bb565a7 [feature-wip](parquet-reader) parquet dictionary decoder (#11981)
Parse parquet data with dictionary encoding.

Using the PLAIN_DICTIONARY enum value is deprecated in the Parquet 2.0 specification.
Prefer using RLE_DICTIONARY in a data page and PLAIN in a dictionary page for Parquet 2.0+ files.
refer: https://github.com/apache/parquet-format/blob/master/Encodings.md
2022-08-26 19:24:37 +08:00
818afa4c66 [fix](Nereids)fix some problem in CBO. (#12083)
This PR fix three problem in Nereids.
- Add selected index, partition and tablet Info in LogicalOlapScan
- with JoinReorderContext in new LogicalJoin
- fix compute data size when no column size info in StatsCaculator
2022-08-26 18:52:59 +08:00
f3f17eb222 [Bugfix](load) fix be will coredump when parsing malformed json file using simdjson (#12062)
* [Bugfix](load) fix be will coredump when parsing malformed json file using simdjson
2022-08-26 18:01:19 +08:00
fba2658a1d [fix](array-type) fix the be core dump when use collect_list result to insert (#12045)
Co-authored-by: hucheng01 <hucheng01@baidu.com>
2022-08-26 18:00:43 +08:00
2b3a5b5fdd [fix](array-type) add ARRAY_BOOLEAN support for lots of array functions #12079
Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-08-26 18:00:29 +08:00
f6023d8a5b [sample](flink-connector) add full db sync v2 (#12090)
* add db sync v2
2022-08-26 16:13:01 +08:00
4636d6c476 [enhancement](Nereids)merge consecutive LogicalLimit plan (#12096)
This rule aims to merge consecutive limits.

LogicalLimit(limit=10, offset=4)
+---LogicalLimit(limit=3, offset=5)

transformed to

LogicalLimit(limit=3, offset=5)

where

newLimit.limit    = min(topLimit.limit, bottomLimit.limit)
newLimit.offset = bottomLimit.offset
topLimit.offset  is ignored
2022-08-26 15:53:44 +08:00
Pxl
3af0745c8f [Bug](function) fix aggFnParams set not correct (#12006) 2022-08-26 14:29:56 +08:00
7b83308ffc [fix](planner) fix rewrite subquery error (#12075) 2022-08-26 14:29:14 +08:00
22157077e9 [fix](memtracker) Optimize the return msg of process memory limit exceed #12086
Return the real process memory information when the process exceeds mem limit
Optimize the memory exceed limit log printing logic
process tracker does not participate in process memory limit.
2022-08-26 14:28:46 +08:00
9caaa4bfbd [fix](memory) fix set disable_chunk_allocator_in_vec=false performance #12092 2022-08-26 14:28:12 +08:00
89d6f1231f [fix](projection)join node should always output at least one column (#12080) 2022-08-26 12:15:51 +08:00
ccff3f5711 [bugfix](light weight schema change) support delete condition in schema change (#11869)
* [bugfix](light weight schema change) support delete condition in schema change


Co-authored-by: yiguolei <yiguolei@gmail.com>
2022-08-26 11:45:55 +08:00
e478f096ed [typo](doc) fix some typo in docs #12054 2022-08-26 11:44:22 +08:00
82ca62dfcc [fix](memory) Fix disable_mem_pools to disable cache #12087 2022-08-26 11:43:19 +08:00
0f4a1e811b [Enhancement](table_function) table function node enhancement (#12038)
* table function node enhancement

* also avoid copy for non-vec table function node

* fix table function node output slots calculation while lateral view involves subquery

Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-08-26 10:37:15 +08:00
ba11d8dc67 [feature-wip](unique-key-merge-on-write) fix bugs on tablet clone #12067 2022-08-26 10:37:00 +08:00
de148465bc [feature-wip](unique-key-merge-on-write) unique key table with MOW supports update (#11882) 2022-08-26 09:44:37 +08:00
0c16740f5c [feature-wip](parquet-reader) parquert scanner can read data (#11970)
Co-authored-by: jinzhe <jinzhe@selectdb.com>
2022-08-26 09:43:46 +08:00
721d418a2f [feature-wip](unique-key-merge-on-write) fix that version is awlays 0 when update delete bitmap (#12044) 2022-08-26 09:41:55 +08:00
0bacbb95c2 [typo](doc)website sidebar Menu order (#12073)
menu order
2022-08-26 09:41:45 +08:00
e5bfbbe761 [feature-wip](unique-key-merge-on-write) support alter table column for MoW (#12052) 2022-08-26 09:40:11 +08:00
8deead66cd [feature-wip](unique-key-merge-on-write) add pk_uk correctness check case (#12060)
Co-authored-by: yixiutt <yixiu@selectdb.com>
2022-08-26 09:38:38 +08:00
17b809210a [Bug](runtime filter) fix bug for late-arrival runtime filters (#12049) 2022-08-26 09:13:10 +08:00
98d82915d3 [fix](bdbje) add reserved disk config to avoid too many reserved bdbje files (#12043) 2022-08-26 08:12:27 +08:00