0ead048b93
[Enhancement](column) remove ColumnString terminating zero and add a data_version for pblock ( #12456 )
...
1. remove ColumnString terminating zero
2. add a data_version for pblock
3. change EncryptionMode to enum class
2022-09-14 21:25:22 +08:00
c8e9a32bb2
[Function](cbrt)Add cbrt function for doris ( #12523 )
...
Add cbrt function for doris
2022-09-12 19:58:45 +08:00
09b45f2b71
[Function](ELT)Add elt function ( #12321 )
2022-09-07 15:21:08 +08:00
7929500608
[typo](docs)The table_function calling reset() function should set _eos to false #12323
2022-09-05 08:29:19 +08:00
62561834a8
[Feature](array-type) Support is-null-predicate for array type ( #12237 )
2022-09-03 11:37:57 +08:00
44c4a45f72
[fix](array-type) fix the wrong data when use stream load to import '\N' ( #12102 )
...
Co-authored-by: hucheng01 <hucheng01@baidu.com >
2022-08-29 09:53:37 +08:00
50ef6e35be
[enhancement](RowDescriptor) enhance tuple_idx check during runtime ( #11835 )
2022-08-17 17:50:48 +08:00
3a49156e30
[performance] (vectorization)optimize In Expr ( #11826 )
...
Co-authored-by: Wang Bo <wangbo36@meituan.com >
2022-08-17 10:46:37 +08:00
f39f57636b
[feature-wip](parquet-reader) update column read model and add page index ( #11601 )
2022-08-16 15:04:07 +08:00
01383c3217
[Enhancement](stream-load-json) using simdjson to parse json ( #11665 )
...
Currently we use rapidjson to parse json document, It's fast but not fast enough compare to simdjson.And I found that the simdjson has a parsing front-end called simdjson::ondemand which will parse json when accessing fields and could strip the field token from the original document, using this feature we could reduce the cost of string copy(eg. we convert everthing to a string literal in _write_data_to_column by sprintf, I saw a hotspot from the flamegrame in this function, using simdjson::to_json_string will strip the token(a string piece) which is std::string_view and this is exactly we need).And second in _set_column_value we could iterate through the json document by for (auto field: object_val) {xxx}, this is much faster than looking up a field by it's field name like objectValue.FindMember("k1").The third optimization is the at_pointer interface simdjson provided, this could directly get the json field from original document.
2022-08-16 14:49:50 +08:00
7d97aa194b
[feature-wip](datev2) Support to use datev2 as partition column ( #11618 )
2022-08-12 11:54:01 +08:00
ed7f7dead9
[Refactor](push-down predicate) Derive push-down predicate from vconjuncts ( #11468 )
...
* [Refactor](push-down predicate) Derive push-down predicate from vconjuncts
2022-08-08 19:19:26 +08:00
95753ec868
[feature](parquet-reader) add group filter util ( #11533 )
...
* [feature-wip](parquet-reader) add group filter util
Co-authored-by: jinzhe <jinzhe@selectdb.com >
2022-08-05 14:02:48 +08:00
ecbf87d77b
[bugfix](memtracker)fix exceed memory limit log ( #11485 )
2022-08-04 10:22:20 +08:00
b260a02215
[fix](be): fix stack overflow in unhex function ( #11204 )
...
* [fix](be): fix stack overflow in unhex function
2022-07-28 14:59:54 +08:00
72d2feae99
[feature-wip] Support all date functions for datev2/datetimev2 ( #11265 )
...
* [feature-wip] (datetimev2) support convert_tz function
* [feature-wip] Support all date functions for datev2/datetimev2
2022-07-28 08:18:59 +08:00
829d534e12
[Improvement] Replace switch with constexpr to boost date functions ( #11134 )
2022-07-23 22:58:59 +08:00
babab5d535
[feature-wip] support datetimev2 ( #11085 )
2022-07-23 16:07:59 +08:00
ad31b6c902
[bugfix and improvement]fix mem tracker for load and simplify some macros ( #11125 )
2022-07-22 21:59:36 +08:00
4960043f5e
[enhancement] Refactor to improve the usability of MemTracker (step2) ( #10823 )
2022-07-21 17:11:28 +08:00
56e036e68b
[feature-wip](multi-catalog) Support runtime filter for file scan node ( #11000 )
...
* [feature-wip](multi-catalog) Support runtime filter for file scan node
Co-authored-by: morningman <morningman@apache.org >
2022-07-20 12:36:57 +08:00
f6cb7a838b
[Optimize] Improve performance like/not like filter through pushdown function to storage engine ( #10355 )
...
* support like/not like conjuncts push down to storage engine
* vectorized engine support like/not like conjuncts push down to storage engine
* support both evaluate and evaluate_vec method in like predicate
* reuse remove_pushed_conjuncts and prevent logic error during move function conjuncts
* change #ifndef to pragma once as per comments
* change enable_function_pushdown default to false
Co-authored-by: heguangnan <heguangnan@bytedance.com >
2022-07-19 08:33:04 +08:00
6736e06679
[feature](udf) Vectorization support remote udaf #10683 ( #10685 )
2022-07-18 17:15:34 +08:00
0381cdc989
[Bug] fix core for min/max runtime filter ( #10899 )
2022-07-16 22:22:51 +08:00
ad4751972c
[feature-wip] Support in predicate for datev2 type ( #10810 )
2022-07-15 14:32:40 +08:00
3b46242483
[feature-wip] Optimize Decimal type ( #10794 )
...
* [feature-wip](decimalv3) support decimalv3
* [feature-wip] Optimize Decimal type
Co-authored-by: liaoxin <liaoxinbit@126.com >
2022-07-14 10:50:50 +08:00
4e9d5a7f7a
optimize substr performance and fix ASAN global buffer overflow ( #10442 )
...
* add volnitsky substr algorithm
* replace std::search with volnitsky search algorithm in StringSearch
* optimize substring for constant_substring_fn case
use long run length search for performance
2022-07-12 08:36:21 +08:00
cc279d09a1
[BUG] Wrong result when build size is beyond IN runtime filter threshold ( #10735 )
2022-07-11 12:19:38 +08:00
1f08f2d144
[Bug][Vectorized] Support array function in where pre in volap_scan_node ( #10467 )
...
Co-authored-by: lihaopeng <lihaopeng@baidu.com >
Support array function in where pre in volap_scan_node
2022-07-09 16:22:01 +08:00
c358a43f35
[feature-wip] support parquet predicate push down ( #10512 )
2022-07-08 23:11:25 +08:00
c583d3e27c
[fix][vectorized] Fix bug of VInPredicate on date type ( #10663 )
2022-07-07 22:15:33 +08:00
89e56ea67f
[refactor] remove alpha rowset related code and vectorized row batch related code ( #10584 )
2022-07-05 20:33:34 +08:00
c9f86bc7e2
[refactor] Refactoring Status static methods to format message using fmt( #9533 )
2022-07-02 18:58:23 +08:00
5dd43fff53
[improvement] alloc aligned memory for block bloom filter to work with jemalloc ( #10446 )
2022-07-01 16:05:48 +08:00
ca94867b4e
[Feature-wip] add date v2 type ( #9916 )
2022-06-26 16:07:56 +08:00
8a49c7ef04
[chore] Rename Doris binary output format
2022-06-24 15:30:05 +08:00
9036f93df4
Revert "[improvement](function) optimize substr performance ( #10169 )" ( #10390 )
...
This reverts commit 2335d233f1f52eb64a380b4c9959becdf182b71b.
2022-06-24 14:38:52 +08:00
2335d233f1
[improvement](function) optimize substr performance ( #10169 )
...
optimize substr performance about 1.5~2x speedup.
2022-06-24 08:57:31 +08:00
f5e5880fb6
[Improvement] make expression for template argument a constexpr ( #10268 )
2022-06-21 07:42:02 +08:00
588634ddf6
[feature] support runtime filter on vectorized engine ( #10103 )
2022-06-20 09:46:38 +08:00
60147ad7a5
[Improvement] build runtime filters asynchronously ( #10186 )
2022-06-17 11:09:13 +08:00
fd0bd395ac
[Enhancement] Remove some unused include ( #10035 )
2022-06-17 10:47:25 +08:00
44e979e43b
[Vectorized][Function] add orthogonal bitmap agg functions ( #10126 )
...
* [Vectorized][Function] add orthogonal bitmap agg functions
save some file about orthogonal bitmap function
add some file to rebase
update functions file
* refactor union_count function
refactor orthogonal union count functions
* remove bool is_variadic
2022-06-17 08:48:41 +08:00
28e8effc52
[Refactor] Refactor vectorized scan node ( #9968 )
2022-06-16 11:10:56 +08:00
39a2785ce2
[enhancement] support simd instructions on arm cpus through sse2neon ( #10068 )
...
* [enhancement] support simd instructions on arm cpus through sse2neon
2022-06-14 09:17:09 +08:00
bf8b4fb2d3
[Bugfix] be crash when executing sql contains bitmap_intersect function ( #9910 )
...
* fix bitmap serialize bug
* add regression test for bitmap seralize bugfix
* add missing regression test out file
* fix reggresion test failed issue
2022-06-09 08:45:46 +08:00
fc9afda97a
[enhancement][diagnostics] Add a diagnostic: detect unused includes ( #9117 )
2022-06-08 11:52:48 +08:00
35c3e4e33c
[Bug] runtime filter is not used as expected ( #10001 )
...
* [Bug] runtime filter is not used as expected
* update
2022-06-08 11:10:39 +08:00
49d4798276
[fix](function) fix bug in time_round function ( #9712 )
2022-06-06 08:58:22 +08:00
f377c26bf7
[refactor][be] Optimize headers ( #9708 )
2022-05-30 16:12:10 +08:00