doris

Author	SHA1	Message	Date
Mr.Hu	940efc6014	[Fix]Remove duplicated destructor function in MinMaxFuncBase (#8287 )	2022-03-01 18:38:09 +08:00
zhangstar333	2b9b0fc1ec	[Fix] Function percentile input null return null (#8238 )	2022-03-01 14:42:48 +08:00
Zhengguo Yang	757e35744d	[refactor] remove unused new_in_predicate code (#8263 ) remove unused code of new_in_predicate.h/cpp	2022-03-01 11:11:42 +08:00
Mingyu Chen	e77e2b0bf0	[improvement](lateral-view) Add number rows filtered in profile (#8251 ) Add `RowsFiltered` counter in TableFunctionNode profile. So that we can know the total number of rows that TableFunctionNode processed	2022-03-01 11:04:57 +08:00
HappenLee	8642fa38b9	[Bug] Double/Float % 0 should be NULL (#8230 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-02-25 11:03:42 +08:00
Pxl	4c5d7c27df	[Bug] group_concat(value,null) not return null	2022-02-25 11:03:23 +08:00
HappenLee	a6bc9cbe53	[Function] Refactor the function code of log (#8199 ) 1. Support return null when input is invalid 2. Del the unless code in vec function Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-02-24 11:06:58 +08:00
Zhengguo Yang	c47368f80c	[fix] (udf) fix check_fn and fn_call function name not same (#8132 )	2022-02-22 09:18:07 +08:00
Mingyu Chen	16020cbdf9	[fix](lateral-view) Fix bug that explode_json_array_string return unstable result (#8152 ) Co-authored-by: morningman <chenmingyu@baidu.com>	2022-02-21 09:38:36 +08:00
zhannngchen	826738d97f	[docs]Some doc improvements and typo fix (#8153 )	2022-02-21 09:36:01 +08:00
Zhengguo Yang	50864aca7d	[refactor] fix warings when compile with clang (#8069 )	2022-02-19 11:29:02 +08:00
Zhengguo Yang	7a73645eee	[refactor] remove some unused code (#8022 )	2022-02-12 15:17:28 +08:00
Zhengguo Yang	5029ef46c9	[fix] fix ltrim result may incorrect in some case (#7963 ) fix ltrim result may incorrect in some case according to https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html Built-in Function: int __builtin_cl/tz (unsigned int x) If x is 0, the result is undefined. So we handle the case of 0 separately this function return different between gcc and clang when x is 0	2022-02-09 13:06:37 +08:00
Pxl	0553ce2944	[feature](vectorization) support function topn && remove some unused code (#7793 )	2022-02-09 13:05:31 +08:00
Zhengguo Yang	f8d086d87f	[feature](rpc) (experimental)Support implement UDF through GRPC protocol. (#7519 ) Support implement UDF through GRPC protocol. This brings several benefits: 1. The udf implementation language is not limited to c++, users can use any familiar language to implement udf 2. UDF is decoupled from Doris, udf will not cause doris coredump, udf computing resources are separated from doris, and doris services are not affected But RPC's UDF has a fixed overhead, so its performance is much slower than C++ UDF, especially when the amount of data is large. Create function like ``` CREATE FUNCTION rpc_add(INT, INT) RETURNS INT PROPERTIES ( "SYMBOL"="add_int", "OBJECT_FILE"="127.0.0.1:9999", "TYPE"="RPC" ); ``` Function service need to implement `check_fn` and `fn_call` methods Note: THIS IS AN EXPERIMENTAL FEATURE, THE INTERFACE AND DATA STRUCTURE MAY BE CHANGED IN FUTURE !!!	2022-02-08 09:25:09 +08:00
Mingyu Chen	c0e59e59aa	[fix][refactor] fix bugs and refactor some code by lint (#7871 ) 1. Fix some `passedByValue` issues. 2. Fix some `dereferenceBeforeCheck` issues. 3. Fix some `uninitMemberVar` issues. 4. Fix some iterator `eraseDereference` issues. 5. Fix compile issue introduced from #7923 #7905 #7848	2022-02-01 14:31:14 +08:00
Pxl	2003da7cf9	[fix](ut) fix abs function ut (#7938 )	2022-01-31 14:58:29 +08:00
Pxl	3ee000c13c	[chore] support build with libc++ && add some build config (#7903 ) support LIBCPP/LDD/BUILD_META_TOOL for build.sh	2022-01-30 16:47:22 +08:00
924060929	c1fef37399	[improvement](runtime-filter) Support adaptive runtime filter(#7546 ) (#7645 ) Change 1: Support an adaptive runtime filter: IN_OR_BLOOM_FILTER The processing logic is If the number of rows in the right table < runtime_filter_max_in_num, then IN predicate will work If the number of rows in the right table >= runtime_filter_max_in_num, then Bloom filter can take effect Change 2: The default runtime filter is changed to filter: IN_OR_BLOOM_FILTER	2022-01-30 16:46:52 +08:00
Amos Bird	800a36343a	[chore] Prolog of hermetic build with GCC 11 and Clang 13. (#7712 ) Prepare to generate hermetic build using GCC 11 and Clang 13. The ideal toolchain would be ldb toolchain generated by [ldb_toolchain_gen.sh](https://github.com/amosbird/ldb_toolchain_gen/releases/download/v0.3/ldb_toolchain_gen.sh) To kick off a clang build, set `DORIS_TOOLCHAIN=clang` before running any build scripts.	2022-01-21 12:12:04 +08:00
HappenLee	e1d7233e9c	[feature](vectorization) Support Vectorized Exec Engine In Doris (#7785 ) # Proposed changes Issue Number: close #6238 Co-authored-by: HappenLee <happenlee@hotmail.com> Co-authored-by: stdpain <34912776+stdpain@users.noreply.github.com> Co-authored-by: Zhengguo Yang <yangzhgg@gmail.com> Co-authored-by: wangbo <506340561@qq.com> Co-authored-by: emmymiao87 <522274284@qq.com> Co-authored-by: Pxl <952130278@qq.com> Co-authored-by: zhangstar333 <87313068+zhangstar333@users.noreply.github.com> Co-authored-by: thinker <zchw100@qq.com> Co-authored-by: Zeno Yang <1521564989@qq.com> Co-authored-by: Wang Shuo <wangshuo128@gmail.com> Co-authored-by: zhoubintao <35688959+zbtzbtzbt@users.noreply.github.com> Co-authored-by: Gabriel <gabrielleebuaa@gmail.com> Co-authored-by: xinghuayu007 <1450306854@qq.com> Co-authored-by: weizuo93 <weizuo@apache.org> Co-authored-by: yiguolei <guoleiyi@tencent.com> Co-authored-by: anneji-dev <85534151+anneji-dev@users.noreply.github.com> Co-authored-by: awakeljw <993007281@qq.com> Co-authored-by: taberylyang <95272637+taberylyang@users.noreply.github.com> Co-authored-by: Cui Kaifeng <48012748+azurenake@users.noreply.github.com> ## Problem Summary: ### 1. Some code from clickhouse ClickHouse is an excellent implementation of the vectorized execution engine database, so here we have referenced and learned a lot from its excellent implementation in terms of data structure and function implementation. We are based on ClickHouse v19.16.2.2 and would like to thank the ClickHouse community and developers. The following comment has been added to the code from Clickhouse, eg: // This file is copied from // https://github.com/ClickHouse/ClickHouse/blob/master/src/Interpreters/AggregationCommon.h // and modified by Doris ### 2. Support exec node and query: * vaggregation_node * vanalytic_eval_node * vassert_num_rows_node * vblocking_join_node * vcross_join_node * vempty_set_node * ves_http_scan_node * vexcept_node * vexchange_node * vintersect_node * vmysql_scan_node * vodbc_scan_node * volap_scan_node * vrepeat_node * vschema_scan_node * vselect_node * vset_operation_node * vsort_node * vunion_node * vhash_join_node You can run exec engine of SSB/TPCH and 70% TPCDS stand query test set. ### 3. Data Model Vec Exec Engine Support Dup/Agg/Unq table, Support Block Reader Vectorized. Segment Vec is working in process. ### 4. How to use 1. Set the environment variable `set enable_vectorized_engine = true; `(required) 2. Set the environment variable `set batch_size = 4096; ` (recommended) ### 5. Some diff from origin exec engine https://github.com/doris-vectorized/doris-vectorized/issues/294 ## Checklist(Required) 1. Does it affect the original behavior: (No) 2. Has unit tests been added: (Yes) 3. Has document been added or modified: (No) 4. Does it need to update dependencies: (No) 5. Are there any changes that cannot be rolled back: (Yes)	2022-01-18 10:07:15 +08:00
Universe	5b0f11b665	[feature](mysql-compatibility)(function) add `WEEKDAY` function (#7673 ) `WEEKDAY` in MySQL: returns an index from 0 to 6 for Monday to Sunday. `DAYOFWEEK` in MySQL: returns an index from 1 to 7 for Sunday to Saturday. Doris only have `DAYOFWEEK` function, so I add `WEEKDAY` function. Thanks for the following materials: - https://github.com/apache/incubator-doris/pull/6982/files - https://www.bilibili.com/video/BV1V44y1Y7Ro	2022-01-16 10:39:21 +08:00
Mingyu Chen	5e1caea2b1	[fix](lateral-view) Fix some bugs about lateral view (#7721 ) 1. fix core dump when using multi explode_bitmap #7716 2. fix bug that json array extract by json path is wrong #7717 3. fix bug that after lateral view, the null value become non-null value #7718 4. fix bug that lateral view may return error: couldn't resolve slot descriptor 1. #7719 5. fix error result when using lateral view with where predicate #7720	2022-01-13 15:30:38 +08:00
924060929	563545475e	[Optimize](Runtime Filter) Support merge in runtime filter(#7546 ) (#7547 ) Support merge IN predicate when exist remote target(e.g. shuffle hash join). Remote the code that IN predicate implicit conversion to Bloom filter then exist remote target. Close related #7546	2022-01-06 19:08:35 +08:00
EmmyMiao87	46ca012e2b	[fix](bloom-filter) Fix error when handle empty string in bloom filter (#7448 )	2021-12-31 16:05:33 +08:00
weizuo93	7357089e4e	[fix] change percentile_approx return from nan to null (#7512 ) Change function percentile_approx return value from nan to null (like hive.) to ensure that return value of function percentile_approxcan be parsed by JDBC successfully. Co-authored-by: weizuo <weizuo@xiaomi.com>	2021-12-30 10:24:35 +08:00
Mingyu Chen	e93360791f	Revert "[improvement](planner) make BinaryPredicate do not cast date to datetime/varchar (#7045 )" (#7517 )	2021-12-28 23:05:27 +08:00
Pxl	9fb89004aa	[revert] part of "[improvement](planner) make BinaryPredicate do not cast date to datetime/varchar (#7045 )" (#7501 )	2021-12-28 15:07:10 +08:00
Zhengguo Yang	07e2acb2f3	[feature] Suport national secret (national commercial password) algorithm SM3/SM4 (#7464 ) SM3 is password hash algorithm SM4 is a block cipher used to replace DES / AES and other international algorithms.	2021-12-28 10:39:54 +08:00
zhangstar333	0c154733e0	[feature](function) support bitmap_union/intersect have more columns parameters (#7379 ) support multi bitmap parameter for all bitmap aggregation function	2021-12-26 11:03:20 +08:00
shee	3ba6dcf236	[fix](function) fix round function for inaccuracy (#7421 )	2021-12-24 21:23:11 +08:00
Pxl	ff5a0e98b0	[improvement](planner) make BinaryPredicate do not cast date to datetime/varchar (#7045 )	2021-12-24 21:22:43 +08:00
Mingyu Chen	0499b2211b	[feat](lateral-view) Support execution of lateral view stmt (#7255 ) 1. Add table function node 2. Add 3 table functions: explode_split, explode_bitmap and explode_json_array	2021-12-16 10:46:15 +08:00
Zhengguo Yang	62d12067aa	[feature](udf) make orthogonal bitmap udaf as build in functions (#7211 ) move orthogonal bitmap udaf as build in functions add three buildin bitmap functions: - orthogonal_bitmap_intersect - orthogonal_bitmap_intersect_count - orthogonal_bitmap_union_count	2021-12-07 09:57:26 +08:00
HappenLee	d3316ff567	[performance](function) Support SIMD function in some string function (#7236 ) Support SIMD function in some string function：lrtim，rtrim，trim，reverse，hex	2021-12-06 10:24:26 +08:00
Zhengguo Yang	d8ba6e3eb6	1. Fix an error when fetch string type field may cause malform packet error. (#7262 ) This is beacuse of an const MAX_PHYSICAL_PACKET_LENGTH in fe should be 2^24 -1, but it is set as 2^24 -2 by mistake. 2. Fix bitmap_to_string may failed when the result is large than 2G	2021-12-01 10:02:34 +08:00
Hao Tan	a1bf2878c0	[feat-opt](json-function) optimize get_json_xx function (#7157 ) Avoid repeated parsing json string is the first parameter of function is constant.	2021-11-26 10:12:55 +08:00
Zhengguo Yang	c9e578032b	optimize bitmap function count, use roaring cardinality method, this will more fast than current version (#7151 )	2021-11-24 14:42:48 +08:00
Pxl	a74fdf184c	[refactor](be) refactor predicate function creator (#7054 ) Refactor predicate function creator, make MinMaxFunction/HybridSet/BloomFilter use a unified interface through template to get function.	2021-11-24 10:39:29 +08:00
Zhengguo Yang	6c6380969b	[refactor] replace boost smart ptr with stl (#6856 ) 1. replace all boost::shared_ptr to std::shared_ptr 2. replace all boost::scopted_ptr to std::unique_ptr 3. replace all boost::scoped_array to std::unique<T[]> 4. replace all boost:thread to std::thread	2021-11-17 10:18:35 +08:00
pengxiangyu	632f8fcc75	[libhdfs] Add errno for hdfs writer. when no dir, hdfs writer open failed, the dir need to be created. (#7050 ) 1. Add errno message for hdfs writer failed. 2. When call openWrite for hdfs, the dir will be created when it doesn't exist,	2021-11-11 15:21:21 +08:00
Pxl	29ca77622f	[Refactor] Refactor part of RuntimeFilter's code (#6998 ) #6997	2021-11-07 17:40:45 +08:00
Xinyi Zou	e69249c082	sub_bitmap (#6977 ) Starting from the offset position, intercept the specified limit bitmap elements and return a bitmap subset. Types of chang	2021-11-06 13:31:03 +08:00
Zhengguo Yang	760fc02bfe	Added bprc stub cache check and reset api, used to test whether the bprc stub cache is available, and reset the bprc stub cache (#6916 ) Added bprc stub cache check and reset api, used to test whether the bprc stub cache is available, and reset the bprc stub cache add a config used for auto check and reset bprc stub	2021-11-05 09:45:37 +08:00
pengxiangyu	599ecb1f30	[Function] Add bitmap function bitmap_subset_limit (#6980 ) Add bitmap function bitmap_subset_limit. This function will return subset in specified index.	2021-11-04 12:14:47 +08:00
xy720	aeec9c45e6	[Function] Add bitmap-xor-count function for doris (#6982 ) Add bitmap-xor-count function for doris relate to #6875	2021-11-02 16:37:00 +08:00
zhangstar333	1ff3d708ca	[Function] add functions of bitmap_and/or_count (#6912 ) issue #6875 add bitmap_and_count/ bitmap_or_count	2021-11-01 14:00:07 +08:00
luozenglin	c7a3116f98	[Function] add bitmap function of bitmap_has_all (#6918 ) The 'bitmap_has_all' function returns true if the first bitmap contains all the elements of the second bitmap.	2021-11-01 12:50:47 +08:00
qiye	65ded82778	[Function] add BE bitmap function bitmap_subset_in_range (#6917 ) Add bitmap function bitmap_subset_in_range. This function will return subset in specified range (not include the range_end).	2021-11-01 11:05:19 +08:00
Pxl	28030294f7	[Feature] Support bitmap_and_not & bitmap_and_not_count (#6910 ) Support bitmap_and_not & bitmap_and_not_count.	2021-11-01 10:11:54 +08:00

1 2 3 4 5

242 Commits