doris

Author	SHA1	Message	Date
mch_ucchi	9f7386243f	[Fix](regression-test)fix some unfixed-answer test #17408	2023-03-04 12:13:41 +08:00
Pxl	ae689c3a0b	[Bug](regression-test) remove some exception title on regression case (#17374 ) remove some exception title on regression case	2023-03-03 14:18:10 +08:00
Gabriel	979cf42d7a	[Bug](decimalv3) Use correct decimal scale for function round (#17232 ) Co-authored-by: maochongxin <maochongxin@gmail.com>	2023-03-01 12:28:41 +08:00
zhangstar333	1dd2a41e38	[vectorized](bug) fix window function can't handle first row of beyond (#17084 ) Issue Number: close #16845	2023-02-28 17:30:23 +08:00
HappenLee	3e40467ce6	[Bug](vec) Fix chinese pinyin order by (#17152 ) bug: some chinese word not sort by pinyin in GBK coding CREATE TABLE `test_convert` ( `a` varchar(100) NULL ) ENGINE=OLAP DUPLICATE KEY(`a`) DISTRIBUTED BY HASH(`a`) BUCKETS 3 PROPERTIES ( "replication_allocation" = "tag.location.default: 1" ); insert into test_convert values("b"), ("a"), ("c"), ("睿"), ("多"), ("丝"); Query OK, 6 rows affected (0.03 sec) {'label':'insert_ca73a6acc2194d5b_888218a3949355a6', 'status':'VISIBLE', 'txnId':'18068'} mysql [test]>select * from test_convert; +------+ \| a \| +------+ \| a \| \| c \| \| 丝 \| \| b \| \| 多 \| \| 睿 \| +------+ 6 rows in set (0.01 sec) mysql [test]>select * from test_convert order by convert(a using gbk); +------+ \| a \| +------+ \| a \| \| b \| \| c \| \| 多 \| \| 丝 \| \| 睿 \| +------+ 6 rows in set (0.01 sec)	2023-02-28 14:29:56 +08:00
奕冷	c0360f80bb	[enhancement](aggregate-function) enhance aggregate funtion collect and add group_array aliases (#15339 ) Enhance aggregate function `collect_set` and `collect_list` to support optional `max_size` param, which enables to limit the number of elements in result array.	2023-02-27 14:22:30 +08:00
ZhaoChangle	b5d67781a2	[Fix](function)fix datatime-diff function's overflow (#16935 )	2023-02-24 20:06:06 +08:00
TengJianPing	707160ab73	[fix](regression) fix regression test case of convert function (#17107 )	2023-02-24 18:27:08 +08:00
TengJianPing	883f575cfe	[fix](string function) fix wrong usage of iconv_open (#17048 ) * [fix](string function) fix wrong usage of iconv_open Also add test case for function convert * fix test case	2023-02-24 09:13:10 +08:00
lihangyu	526a66e9fb	[Function](array-type) support array_apply (#17020 ) Filter array to match specific binary condition ``` mysql> select array_apply([1000000, 1000001, 1000002], '=', 1000002); +-------------------------------------------------------------+ \| array_apply(ARRAY(1000000, 1000001, 1000002), '=', 1000002) \| +-------------------------------------------------------------+ \| [1000002] \| +-------------------------------------------------------------+ ```	2023-02-23 17:38:16 +08:00
ElvinWei	f32cd2c123	[fix](statistics) fix a problem with histogram statistics collection parameters (#16918 ) 1. Fixed a problem with histogram statistics collection parameters. 2. Solved the problem that it takes a long time to collect histogram statistics. TODO: Optimize histogram statistics sampling method and make the sampling parameters effective. The problem is that the histogram function works as expected in the single-node test, but doesn't work in the multi-node test. In addition, the performance of the current support sampling to collect histogram is low, resulting in a large time consumption when collecting histogram information. Fixed the parameter issue and temporarily removed support for sampling to speed up the collection of histogram statistics. Will next support sampling to collect histogram information.	2023-02-20 16:33:18 +08:00
ZhaoChangle	d6a841409f	[Enhancement](func)Introduce non_nullable extraction function. #16621 Introduced a new function non_nullable to BE, which can extract concrete data column from a nullable column. If the input argument is already not a nullable column, raise an error.	2023-02-18 20:44:07 +08:00
HappenLee	de1337511c	[Bug](Datetime) Fix date time function mem use after free (#16814 )	2023-02-16 16:15:58 +08:00
abmdocrt	41947c73eb	[Feature](array-function) Support array functions for nested type datev2 and datetimev2 (#16382 )	2023-02-08 12:51:07 +08:00
luozenglin	289a4b2ea4	[fix](func) fix truncate float type result error (#16468 ) When the argument of truncate function is float type, it can match both truncate(DECIMALV3) and truncate(DOUBLE), if the match is truncate(DECIMALV3), the precision is lost when converting float to DECIMALV3(38, 0). Here I modify it to match truncate(DOUBLE) for now, maybe we still need to solve the problem of losing precision when converting float to DECIMALV3.	2023-02-08 08:57:43 +08:00
luozenglin	09870098af	[fix](func) fix core dump when the pattern of the regexp_extract_all function does not contain subpatterns (#16408 )	2023-02-05 01:16:54 +08:00
gnehil	ca7b2e27a8	[regression-test](function) add regression test for money_format with truncate (#16052 )	2023-02-04 23:10:01 +08:00
Gabriel	918004c016	[Bug](date) Fix BE crash caused by function `datediff` (#16397 ) * [Bug](date) Fix BE crash caused by function `datediff` * update	2023-02-04 18:43:23 +08:00
ElvinWei	f443ebfd9a	[Improvement](statistics) optimise histogram keyword (#16369 )	2023-02-03 23:02:41 +08:00
xy720	6294b29f0a	[chore](regression-test) Remove array config in regression test (#16376 ) The fe config "enable_array_type" is not used, this commit removes it from regression test.	2023-02-03 14:44:03 +08:00
yongkang.zhong	941e192019	[enhancement](test) add function case date_sub(datetime,INTERVAL dayofmonth(datetime)-1 DAY) (#16306 )	2023-02-02 09:56:01 +08:00
HaveAnOrangeCat	e3c8fffd99	[function](round) fix decimal scale for scale not specified (#15541 )	2023-02-01 14:58:48 +08:00
HappenLee	95d7c2de26	[Refactor](function) Rewrite the function elt (#16287 )	2023-02-01 11:17:06 +08:00
abmdocrt	ca7eb94f23	[improvement](agg-function) Increase the limit maximum number of agg function parameters (#15924 )	2023-01-31 21:03:50 +08:00
shee	6bebf92254	[fix][FE] fix be coredump when children of FunctionCallExpr is folded (#16064 ) Co-authored-by: shizhiqiang03 <shizhiqiang03@meituan.com> fix be coredump when children of FunctionCallExpr is folded	2023-01-30 15:25:00 +08:00
abmdocrt	eb7da1c0ee	[fix](datatype) fix some bugs about data type array datetimev2 and decimalv3 (#16132 )	2023-01-29 14:26:08 +08:00
lihangyu	578a855b3e	[Bug](topn-opt) filter condition for analytic info for two phase read opt (#16173 ) two phase read optimization should not be enabled when query has analytic info	2023-01-29 12:06:18 +08:00
AKIRA	b919cbe487	[ehancement](nereids) Enhancement for limit clause (#16114 ) support limit offset without order by. the legacy planner supoort this feature in PR #15218	2023-01-28 11:04:03 +08:00
abmdocrt	9ffd109b35	[fix](datetimev2) Fix BE datetimev2 type returning wrong result (#15885 )	2023-01-20 22:25:20 +08:00
Gabriel	d062ca2944	[refactor](vectorized) remove unnecessary vectorization check (#15984 )	2023-01-17 12:21:46 +08:00
xueweizhang	63d48564ed	[fix](datetimev2) fix datetimev2 error with T (#15915 ) Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-01-16 15:30:48 +08:00
Pxl	81bab55d43	[Bug](function) catch function calculation error on aggregate node to avoid core dump (#15903 )	2023-01-16 11:21:28 +08:00
abmdocrt	7441b4dc96	[Feature](function) Support width_bucket function (#14396 )	2023-01-12 13:59:21 +08:00
lihangyu	8c47c57264	[regression-test](array) fix abnormal test on function array_intersect (#15848 )	2023-01-12 13:57:11 +08:00
luozenglin	05f6e4c48a	[fix](predicate) fix be core dump caused by pushing down the double column predicate (#15693 )	2023-01-09 19:31:04 +08:00
ElvinWei	36590da24b	[fix](regression p0) add the alias function hist to histogram and fix p0 (#15708 ) add the alias function hist to histogram and fix p0	2023-01-08 11:31:23 +08:00
ElvinWei	76ad599fd7	[enhancement](histogram) optimise aggregate function histogram (#15317 ) This pr mainly to optimize the histogram(👉🏻 https://github.com/apache/doris/pull/14910) aggregation function. Including the following: 1. Support input parameters `sample_rate` and `max_bucket_num` 2. Add UT and regression test 3. Add documentation 4. Optimize function implementation logic Parameter description： - `sample_rate`：Optional. The proportion of sample data used to generate the histogram. The default is 0.2. - `max_bucket_num`：Optional. Limit the number of histogram buckets. The default value is 128. --- Example： ``` MySQL [test]> SELECT histogram(c_float) FROM histogram_test; +-------------------------------------------------------------------------------------------------------------------------------------+ \| histogram(`c_float`) \| +-------------------------------------------------------------------------------------------------------------------------------------+ \| {"sample_rate":0.2,"max_bucket_num":128,"bucket_num":3,"buckets":[{"lower":"0.1","upper":"0.1","count":1,"pre_sum":0,"ndv":1},...]} \| +-------------------------------------------------------------------------------------------------------------------------------------+ MySQL [test]> SELECT histogram(c_string, 0.5, 2) FROM histogram_test; +-------------------------------------------------------------------------------------------------------------------------------------+ \| histogram(`c_string`) \| +-------------------------------------------------------------------------------------------------------------------------------------+ \| {"sample_rate":0.5,"max_bucket_num":2,"bucket_num":2,"buckets":[{"lower":"str1","upper":"str7","count":4,"pre_sum":0,"ndv":3},...]} \| +-------------------------------------------------------------------------------------------------------------------------------------+ ``` Query result description： ``` { "sample_rate": 0.2, "max_bucket_num": 128, "bucket_num": 3, "buckets": [ { "lower": "0.1", "upper": "0.2", "count": 2, "pre_sum": 0, "ndv": 2 }, { "lower": "0.8", "upper": "0.9", "count": 2, "pre_sum": 2, "ndv": 2 }, { "lower": "1.0", "upper": "1.0", "count": 2, "pre_sum": 4, "ndv": 1 } ] } ``` Field description： - sample_rate：Rate of sampling - max_bucket_num：Limit the maximum number of buckets - bucket_num：The actual number of buckets - buckets：All buckets - lower：Upper bound of the bucket - upper：Lower bound of the bucket - count：The number of elements contained in the bucket - pre_sum：The total number of elements in the front bucket - ndv：The number of different values in the bucket > Total number of histogram elements = number of elements in the last bucket(count) + total number of elements in the previous bucket(pre_sum).	2023-01-07 00:50:32 +08:00
starocean999	a97f582b93	[fix](nereids) use DAYS as default unit for DATE_ADD and DATE_SUB function (#15559 )	2023-01-04 01:55:15 +08:00
Pxl	85fe9d2496	[Bug](filter) fix not in(null) return true (#15466 ) fix not in(null) return true	2023-01-03 21:14:50 +08:00
morrySnow	781fa17993	[fix](Nereids) round function return type should be double (#15502 )	2022-12-30 23:36:15 +08:00
starocean999	51b14c06d3	[enhancement](nereids) support approx_count_distinct function (#15406 )	2022-12-27 22:25:21 +08:00
morrySnow	6bec1ffc47	[feature](planner) remove restrict of offset without order by (#15218 ) Support SELECT * FROM tbl LIMIT 5, 3;	2022-12-26 09:37:41 +08:00
Gabriel	06f71f2bca	[pipeline](fix) Fix bugs to pass all regression cases (#15306 ) * [pipeline](fix) Fix bugs to pass all regression cases * update * update	2022-12-23 22:17:50 +08:00
HaveAnOrangeCat	df5969ab58	[Feature] Support function roundBankers (#15154 )	2022-12-22 22:53:09 +08:00
Gabriel	d38461616c	[Pipeline](error msg) format error message (#15247 )	2022-12-22 20:55:06 +08:00
TengJianPing	a447121fc3	[fix](scanner scheduler) fix coredump of ScannerScheduler::_scanner_scan (#15199 ) * [fix](scanner scheduler) fix coredump of ScannerScheduler::_scanner_scan * fix	2022-12-21 15:44:47 +08:00
starocean999	6712f1fc1d	[fix](Nereids) encryption function with 4 params should auto-complate last param with config (#15038 )	2022-12-20 13:55:54 +08:00
Gabriel	4dbe30d37b	[regression](vectorized) delete vectorized config in regression tests (#15126 )	2022-12-16 17:08:29 +08:00
Yulei-Yang	21c2e485ae	[improvment](function) add new function substring_index (#15024 )	2022-12-15 09:54:34 +08:00
Gabriel	1200b22fd2	[function](round) compute accurate round value by decimal (#14946 )	2022-12-13 09:53:43 +08:00

1 2 3 4

153 Commits