doris

Author	SHA1	Message	Date
Xinyi Zou	e69249c082	sub_bitmap (#6977 ) Starting from the offset position, intercept the specified limit bitmap elements and return a bitmap subset. Types of chang	2021-11-06 13:31:03 +08:00
pengxiangyu	599ecb1f30	[Function] Add bitmap function bitmap_subset_limit (#6980 ) Add bitmap function bitmap_subset_limit. This function will return subset in specified index.	2021-11-04 12:14:47 +08:00
xy720	aeec9c45e6	[Function] Add bitmap-xor-count function for doris (#6982 ) Add bitmap-xor-count function for doris relate to #6875	2021-11-02 16:37:00 +08:00
zhangstar333	1ff3d708ca	[Function] add functions of bitmap_and/or_count (#6912 ) issue #6875 add bitmap_and_count/ bitmap_or_count	2021-11-01 14:00:07 +08:00
luozenglin	c7a3116f98	[Function] add bitmap function of bitmap_has_all (#6918 ) The 'bitmap_has_all' function returns true if the first bitmap contains all the elements of the second bitmap.	2021-11-01 12:50:47 +08:00
qiye	65ded82778	[Function] add BE bitmap function bitmap_subset_in_range (#6917 ) Add bitmap function bitmap_subset_in_range. This function will return subset in specified range (not include the range_end).	2021-11-01 11:05:19 +08:00
Pxl	28030294f7	[Feature] Support bitmap_and_not & bitmap_and_not_count (#6910 ) Support bitmap_and_not & bitmap_and_not_count.	2021-11-01 10:11:54 +08:00
zhuixun	a842d41b87	[Function] add BE bitmap function bitmap_max (#6942 ) Support bitmap_max.	2021-10-30 18:16:38 +08:00
EmmyMiao87	adb9b0d9c6	[Bug] Return 0 when hex(0) (#6837 )	2021-10-15 10:18:55 +08:00
tianhui5	58440b90f0	[Bug] Left() string function behaves not identically to the mysql implementation (#6811 ) See Fix #6810	2021-10-15 10:17:21 +08:00
zhoubintao	ad949c2f65	Optimize Hex and add related Doc (#6697 ) I tested hex in a for loop of 1000w (10 million) iterations with random numbers: the old hex averaged 4.92 s and the optimized hex averaged 0.46 s, which is nearly 10x faster.	2021-10-13 11:36:14 +08:00
Cui Kaifeng	020282e885	[Bug] Fix aes_decrypt to handle null input correctly. (#6636 )	2021-09-14 11:19:55 +08:00
qiye	225bdb1fda	[Bug] fix `replace` function bug (#6605 ) * fix replace function bug * fix replace docs	2021-09-14 09:59:13 +08:00
Pxl	577ff01094	[Bug][Function] Fix pad function wrong result when len.val==str_char_size (#6564 ) like #6563 and #6562	2021-09-07 11:55:49 +08:00
zhangstar333	7a15e583a7	[Feature]Support functions of json_array, json_object, json_quote (#6504 )	2021-09-02 09:59:02 +08:00
Hao Tan	66a7a4b294	[Feature] Support exact percentile aggregate function (#6410 ) Support to calculate the exact percentile value array of numeric column `col` at the given percentage(s).	2021-08-18 15:56:06 +08:00
caiconghui	285d44cd48	[BUG] Fix potential overflow exception when do money format for double (#6408 ) * [BUG] Fix potential overflow bug when do money format for double Co-authored-by: caiconghui <caiconghui@xiaomi.com>	2021-08-15 18:40:26 +08:00
caiconghui	2f5b06ae70	[Bug][Optimize] Fix race condition problem and optimize do_money_format function (#6350 ) * [Bug][Optimize] Fix race condition problem and optimize do_money_format function Co-authored-by: caiconghui <caiconghui@xiaomi.com>	2021-08-06 16:29:34 +08:00
stdpain	4c0fdd2800	[Bug] Fix core dump in BloomFilter while build Runtime Filter right table string column contains null (#6305 ) when right table has null value in string column, runtime filter may coredump ``` select count(*) from baseall t1 join test t2 where t1.k7 = t2.k7; ```	2021-07-26 09:41:41 +08:00
xinghuayu007	13ef2c9e1d	[Function][Enhance] lower/upper case transfer function vectorized (#6253 ) Currently, the function lower()/upper() can only handle one char at a time. A vectorized function has been implemented, it makes performance 2 times faster. Here is the performance test: The length of char: 26, test 100 times vectorized-function-cost: 99491 ns normal-function-cost: 134766 ns The length of char: 260, test 100 times vectorized-function-cost: 179341 ns normal-function-cost: 344995 ns	2021-07-26 09:38:07 +08:00
HappenLee	fae3eff2e6	[Bug] Fix the bug of cast string to datetime return not null (#6228 )	2021-07-17 10:55:08 +08:00
stdpain	bf5db6eefe	[BUG][Timeout][QueryLeak] Fixed memory not released in time (#6221 ) * Revert "[Optimize] Put _Tuple_ptrs into mempool when RowBatch is initialized (#6036)" This reverts commit f254870aeb18752a786586ef5d7ccf952b97f895. * [BUG][Timeout][QueryLeak] Fixed memory not released in time, Fix Core dump in bloomfilter	2021-07-16 12:32:10 +08:00
Mingyu Chen	c2695e9716	[Bug][RoutineLoad] Can not match whole json in routine load (#6213 ) Support using json path "$" to match the whole json in routine load Co-authored-by: chenmingyu <chenmingyu@baidu.com>	2021-07-16 09:21:27 +08:00
Zhengguo Yang	ed3ff470ce	[ARRAY] Support array type load and select not include access by index (#5980 ) This is part of the array type support and has not been fully completed. The following functions are implemented: 1. FE array type support and implementation of array function, with support for array syntax analysis and planning 2. Support importing array type data through insert into 3. Support selecting array type data 4. The array type is only supported on the value columns of the duplicate table. This PR merges some code from #4655 #4650 #4644 #4643 #4623 #2979	2021-07-13 14:02:39 +08:00
stdpain	290a844e04	[optimize] Optimize bloomfilter performance (#6180 ) Refactor the runtime filter bloomfilter and eliminate some virtual function calls, which obtained a performance improvement of about 5%. Introduce a block bloom filter; the AVX version obtained a 40% performance improvement. Before: bloomfilter size: default, about 2000w (20 million) items cost about 1s400ms. After: bloomfilter size: 524288, about 2000w (20 million) items cost about 400ms.	2021-07-10 10:12:12 +08:00
DinoZhang	c929a8935a	[Feature][Function] support bit_length function (#6140 ) support bit_length function like mysql	2021-07-08 09:40:30 +08:00
Zhengguo Yang	739c0268ff	[refactor] Remove decimal v1 related code from code base (#6079 ) Remove all DECIMAL V1 type code; this is a part of #6073	2021-07-07 10:26:32 +08:00
stdpain	149def9e42	[Feature] Support RuntimeFilter in Doris (BE Implement) (#6077 ) 1. support in/bloomfilter/minmax 2. support broadcast/shuffle/bucket shuffle/colocate join 3. opt memory use and cpu cache miss while build runtime filter 4. opt memory use in left semi join (works well on tpcds-95)	2021-07-04 20:59:05 +08:00
stdpain	bde60280b8	[Optimize] use string_view instead of std::string in string function (#6010 )	2021-06-16 09:40:13 +08:00
luozenglin	d33a6d1b98	[Function] Support date function: yearweek(), week(), makedate(). (#6000 )	2021-06-10 17:38:25 +08:00
stdpain	d790cc6a50	[BUG] Fixed the problem that substring function may access illegal address (#5952 )	2021-06-03 18:38:10 +08:00
Yingchun Lai	be733cfa9c	[Metrics] Add some large memtrackers' metric (#5614 ) MemTracker can provide memory consumption for us to find out which module consume more memory, but it's just a current value, this patch add metrics for some large memory consumers, then we can find out which module consume more memory in timeline, it would be useful to troubleshoot OOM problems and optimize configs.	2021-04-21 09:15:04 +08:00
Zhengguo Yang	40f53ac71f	fix bitmap unit test failed (#5610 )	2021-04-08 10:25:59 +08:00
Patrick	1e8c4584ab	[Function] Add BE udf bitmap_min (#2538 ) (#5581 ) this function will return the min result of the input bitmap .	2021-04-08 09:11:32 +08:00
stdpain	ad67dd34a0	update gcc to gcc 10 and support c++17 (#5394 ) * update gcc to gcc 10 and support c++17 update brpc to 0.9.7 update boost to 1.73 remove third-party boost 1.54 for mysql * update cmake version * ignore jdk version * remove unused patch * avoid use SYS_getrandom call	2021-03-25 09:30:38 +08:00
caiconghui	47d6b1ff0b	Fix ut failed for topn_function_test (#5449 ) Co-authored-by: caiconghui [蔡聪辉] <caiconghui@xiaomi.com>	2021-03-04 21:53:52 +08:00
924060929	9c8766356a	[Bug-Fix][Bitmap][Be] Resolve bitmap_not calculate wrong result(#5440 ) (#5441 ) bitmap_not calculates a wrong result (#5440). Execute the following SQL and expect the response '' ``` select bitmap_to_string(bitmap_not(bitmap_from_string('1'), bitmap_from_string('2,1'))); ``` Co-authored-by: lanhuajian <lanhuajian@sankuai.com>	2021-03-04 15:46:42 +08:00
stdpain	7eae3e280a	[optimization] use inline optimize ExprContext::get_value (#5385 )	2021-02-16 22:35:14 +08:00
HappenLee	a1808c1a71	[Function] Add BE udf bitmap_not (#5346 ) (#5357 ) this function will return the not result of inputs two bitmap.	2021-02-07 22:39:17 +08:00
Zhengguo Yang	93a4c7efc1	[LOG] Standardize the use of VLOG in code (#5264 ) At present, the application of vlog in the code is quite confusing. It is inherited from impala VLOG_XX format, and there is also VLOG(number) format. VLOG(number) format does not have a unified specification, so this pr standardizes the use of VLOG	2021-01-21 12:09:09 +08:00
HappenLee	f2cf8d2c5e	[Bug-Fix] Fix the bug of `PERCENTILE_APPROX` return error result `nan` and add `PERCENTILE_APPROX` UT (#5172 )	2021-01-03 15:45:22 +08:00
Yingchun Lai	11c0aafa5c	[UT] Speed up BE unit test (#5131 ) There are some long loops and sleeps in unit tests, so it costs a very long time to run all unit tests, especially in TSAN mode. This patch speeds up unit tests by shortening long loops and sleeps; in my environment all unit tests finished in 1 minute. It's useful for basic functional unit tests. You can switch this mode with a new environment variable 'DORIS_ALLOW_SLOW_TESTS'. For example, you can set: export DORIS_ALLOW_SLOW_TESTS=1 and you can also disable it by setting: export DORIS_ALLOW_SLOW_TESTS=0	2020-12-27 22:19:56 +08:00
Youngwb	650536d53e	[Feature] Add Topn udaf (#4803 ) For #4674 This is a udaf for approximate topn using the Space-Saving algorithm. At present, we can only calculate the frequent items and their frequencies in a certain column, based on which we can implement topN functions similar to those supported by Kylin in the future. I have also added a test to calculate the accuracy of this algorithm. The following is a rough running result. The total amount of data is 1 million lines and follows the Zipfian distribution. Element cardinality represents the data cardinality; 20X, 50X, ... mean that space_expand_rate is 20, 50, ..., which is used to set the number of counters in the Space-Saving algorithm ``` zf exponent = 0.5 Element cardinality 20X 50X 100X 1000 100% 100% 100% 10000 100% 100% 100% 100000 100% 100% 100% 500000 94% 98% 99% zf exponent = 0.6, 1 Element cardinality 20X 50X 100X 1000 100% 100% 100% 10000 100% 100% 100% 100000 100% 100% 100% 500000 100% 100% 100% ```	2020-12-16 21:58:34 +08:00
sduzh	6fedf5881b	[CodeFormat] Clang-format cpp sources (#4965 ) Clang-format all c++ source files.	2020-11-28 18:36:49 +08:00
Mingyu Chen	bfdb15c730	[Bug] Fix some date functions to make their result same as MySQL (#4786 ) dayofweek, dayofmonth, dayofyear, weekofyear, timediff Also fix ut compilation problem	2020-10-27 12:52:44 +08:00
Zhengguo Yang	09f97f8a05	[Refactor] Fixes some be typo part 2 (#4747 )	2020-10-20 09:28:57 +08:00
HappenLee	c00a5cb543	[Bug] Fix the core problem of function `split_part` and add the UT of core case (#4721 ) issue:#4720	2020-10-13 10:09:39 +08:00
Zhengguo Yang	75e0ba32a1	Fixes some be typo (#4714 )	2020-10-13 09:37:15 +08:00
Zhengguo Yang	98e71a8b9f	[Bug][Function] Fix rand() function return same value (#4709 ) fix rand function return same value when no parameter	2020-10-11 15:40:38 +08:00
ccoffline	f3cdf167d1	[Feature] Add time_round builtin functions (#4640 ) #4619 Add time_round functions that provides `time_floor` & `time_ceil` at each time unit. Fix two related bugs. - #4618 - Fix `struct TimeInterval` to use `int64_t` instead of `int32_t`, in case when the second diff overflow	2020-10-09 16:05:51 +08:00

105 Commits