doris

Author	SHA1	Message	Date
ElvinWei	76ad599fd7	[enhancement](histogram) optimise aggregate function histogram (#15317 ) This PR mainly optimizes the histogram (👉🏻 https://github.com/apache/doris/pull/14910) aggregation function, including the following: 1. Support input parameters `sample_rate` and `max_bucket_num` 2. Add UT and regression test 3. Add documentation 4. Optimize function implementation logic Parameter description： - `sample_rate`：Optional. The proportion of sample data used to generate the histogram. The default is 0.2. - `max_bucket_num`：Optional. Limit the number of histogram buckets. The default value is 128. --- Example： ``` MySQL [test]> SELECT histogram(c_float) FROM histogram_test; +-------------------------------------------------------------------------------------------------------------------------------------+ \| histogram(`c_float`) \| +-------------------------------------------------------------------------------------------------------------------------------------+ \| {"sample_rate":0.2,"max_bucket_num":128,"bucket_num":3,"buckets":[{"lower":"0.1","upper":"0.1","count":1,"pre_sum":0,"ndv":1},...]} \| +-------------------------------------------------------------------------------------------------------------------------------------+ MySQL [test]> SELECT histogram(c_string, 0.5, 2) FROM histogram_test; +-------------------------------------------------------------------------------------------------------------------------------------+ \| histogram(`c_string`) \| +-------------------------------------------------------------------------------------------------------------------------------------+ \| {"sample_rate":0.5,"max_bucket_num":2,"bucket_num":2,"buckets":[{"lower":"str1","upper":"str7","count":4,"pre_sum":0,"ndv":3},...]} \| +-------------------------------------------------------------------------------------------------------------------------------------+ ``` Query result description： ``` { "sample_rate": 0.2, "max_bucket_num": 128, "bucket_num": 3, 
"buckets": [ { "lower": "0.1", "upper": "0.2", "count": 2, "pre_sum": 0, "ndv": 2 }, { "lower": "0.8", "upper": "0.9", "count": 2, "pre_sum": 2, "ndv": 2 }, { "lower": "1.0", "upper": "1.0", "count": 2, "pre_sum": 4, "ndv": 1 } ] } ``` Field description： - sample_rate：Rate of sampling - max_bucket_num：Limit on the maximum number of buckets - bucket_num：The actual number of buckets - buckets：All buckets - lower：Lower bound of the bucket - upper：Upper bound of the bucket - count：The number of elements contained in the bucket - pre_sum：The total number of elements in all preceding buckets - ndv：The number of distinct values in the bucket > Total number of histogram elements = number of elements in the last bucket (count) + total number of elements in the preceding buckets (pre_sum).	2023-01-07 00:50:32 +08:00
yongkang.zhong	a6773417ef	[Doc] Add sidebars for split_by_string function and delete split_by_char builtins code (#15679 )	2023-01-06 21:14:26 +08:00
Tiewei Fang	df2da89b89	[feature](multi-catalog) support postgresql jdbc catalog (#15570 ) support postgresql jdbc catalog	2023-01-06 11:00:59 +08:00
Stalary	e67ea1ddb7	[fix](doc): catalog use resource doc error (#15607 )	2023-01-04 19:53:25 +08:00
jiafeng.zhang	36e43c2677	fix 1.2.1 release notes (#15590 )	2023-01-04 13:26:54 +08:00
jiafeng.zhang	e5397efb67	[docs](releasenotes)release 1.2.1 (#15583 ) * release 1.2.1	2023-01-04 10:12:46 +08:00
Zhengguo Yang	caaae28b50	[docs](export) fix export data to object storage docs (#15563 )	2023-01-03 16:45:02 +08:00
gnehil	8a5f1351e2	[typo](doc) add be max jvm heap size config description (#15561 )	2023-01-03 16:04:18 +08:00
slothever	9ab663212b	[docs](muti-catalog) update external iceberg system doc (#15556 ) Tell users how to solve the problem "fail to read schema from table xx or Storage schema reading not supported" when Doris accesses the Hive metastore.	2023-01-03 13:58:16 +08:00
Mingyu Chen	5062c62ee1	[chore](script) add build-for-release.sh (#15545 )	2023-01-02 22:50:36 +08:00
zhangstar333	c47bdf6606	[vectorized](jdbc) fix external table of oracle have keyworld column (#15487 ) if column name is keyword of oracle, the query will report error	2022-12-31 12:48:26 +08:00
lsy3993	9bba2f4cde	[typo](docs) array function doc fix (#15449 )	2022-12-30 23:00:48 +08:00
jiafeng.zhang	aacd11336a	[typo](docs)update java udf demo (#15521 )	2022-12-30 21:12:34 +08:00
xu tao	8e58d92e77	[typo](docs) fix document info missing in SHOW-TABLETS.md (#15488 )	2022-12-30 18:39:21 +08:00
Hu Yanjun	084eec87ee	[docs](docs)update en docs (#15470 ) * Update basic-summary.md	2022-12-30 18:38:26 +08:00
Henry2SS	34d7eeb571	[doc](session variable) add doc content for adding variables called rewrite_or_to_in_predicate_threshold (#15513 ) Co-authored-by: wuhangze <wuhangze@jd.com>	2022-12-30 17:11:45 +08:00
Cary	08d4dcefff	[typo](doc)data partition doc including en and zh-CN #15379 Co-authored-by: Chen Jinquan 陈金泉 (690) <chenjinq@haier.com>	2022-12-30 15:38:25 +08:00
HappenLee	9a517d6a8f	[DataType](Deciamlv3) change the avg function scale of decimalv3 (#15445 )	2022-12-30 00:27:51 +08:00
xu tao	73f7ccb58f	[typo](docs) fix document display error in SHOW-ALTER.md and SHOW-PARTITION-ID.md and SHOW-PARTITIONS.md (#15453 )	2022-12-30 00:27:22 +08:00
luozenglin	e2603ca883	[fix](docs) fix some docs about stream load and select. (#15372 ) * [fix](docs) fix some docs about stream load and select. * update	2022-12-29 14:50:06 +08:00
Liqf	2ae28ea9dd	[typo](docs)fix-doc #15438	2022-12-29 14:19:24 +08:00
xu tao	4179ea31bd	[typo](docs) fix typo in SHOW-ALTER.md and SHOW-LOAD-WARNINGS.md (#15431 )	2022-12-29 14:19:05 +08:00
zy-kkk	298c0a2391	[typo](docs)fix be dynamic configuration doc #15443	2022-12-29 14:18:14 +08:00
zy-kkk	f5b4faf682	fix compile doc (#15454 )	2022-12-29 14:17:53 +08:00
AlexYue	ffef81a6ab	[feature](BE)pad missed version with empty rowset (#15030 ) If all replicas of one tablet are broken, user can use this http api to pad the missed version with empty rowset.	2022-12-29 11:20:44 +08:00
spaces-x	a22ee89431	[Enhancement](jemalloc):support heap dump by http request at runtime (#15429 )	2022-12-28 20:10:50 +08:00
Jet He	75aa00d3d0	[Feature](NGram BloomFilter Index) add new ngram bloom filter index to speed up like query (#11579 ) This PR implements a new bloom filter index, the NGram bloom filter index, which was proposed in #10733. The new index can greatly improve LIKE query performance; in some of our test cases the improvement is an order of magnitude. For usage, see the docs in this PR. The index depends on `enable_function_pushdown`, which must be set to `true` for the index to take effect on LIKE queries.	2022-12-28 18:01:50 +08:00
wudi	3aae27634a	[doc](flink-connector) update flink connector faq (#15405 )	2022-12-28 16:15:49 +08:00
pengxiangyu	8342691b62	[feature](remote)Add drop storage policy (#15364 ) * add drop storage policy * add drop storage policy * add drop storage policy * add drop storage policy	2022-12-28 16:04:30 +08:00
Mingyu Chen	28bb13a026	[feature](light-schema-change) enable light schema change by default (#15344 )	2022-12-28 09:29:26 +08:00
lsy3993	aad53d37c7	[typo](docs)fix doris docs 404 link (#15400 )	2022-12-27 22:57:40 +08:00
AlexYue	b3f77a2e00	[feature](Show) add one show type cast command (#15137 )	2022-12-27 14:19:04 +08:00
wudi	6d851b1fc9	[Doc](Flink) update flink connector doc add new version #15365 Co-authored-by: wudi <>	2022-12-27 14:15:49 +08:00
lsy3993	777b0b94bb	[typo](docs) fix wrong date format (#15363 ) fix wrong date format	2022-12-27 11:45:05 +08:00
Yulei-Yang	c3d0e2931a	[typo](docs) fix version tag for docs of s3 token (#15362 )	2022-12-26 19:23:43 +08:00
Mingyu Chen	8b6e4e74e7	[improvement](jdbc) add default jdbc driver's dir (#15346 ) Add a new config "jdbc_drivers_dir" for both FE and BE. User can put jdbc drivers' jar file in this dir, and only specify file name in "driver_url" properties when creating jdbc resource. And Doris will find jar files in this dir. Also modify the logic so that when the jdbc resource is modified, the corresponding jdbc table will get the latest properties.	2022-12-26 11:51:12 +08:00
Xin Liao	bf71943605	[feature](load) stream load trim double quotes for csv (#15241 )	2022-12-26 11:45:54 +08:00
Yulei-Yang	b7768a928d	[Improvement](S3) support access s3 via temporary security credentials (#15340 )	2022-12-26 00:31:55 +08:00
jiafeng.zhang	f821dbc9f2	[doc] enable_new_load_scan_node doc (#15347 )	2022-12-25 22:51:37 +08:00
Yulei-Yang	001153ab38	[Improvement](multi-catalog) support hive external tables which store data on tencent chdfs (#15297 ) * support reading hive tables which store data on tencent chdfs in multi-catalog	2022-12-25 21:57:18 +08:00
xu tao	fe9571c2fd	[typo](docs) fix typo in get-starting.md (#15345 )	2022-12-25 21:56:44 +08:00
Yulei-Yang	0cda82ad5a	[typo](docs) fix typo in tablet-repair-and-balance (#15341 )	2022-12-25 09:48:16 +08:00
starocean999	fd764b3ccd	[fix](fe)add session variable group_concat_max_len (#15254 )	2022-12-24 20:07:14 +08:00
Mingyu Chen	907cbcde69	[doc](compile) update docker compile image version (#15300 ) Add new docker compile image tag: apache/doris:build-env-for-1.2	2022-12-24 15:28:03 +08:00
zy-kkk	cf9217c0ca	[typo](docs)fix 404 err to Monitoring and alarming doc #15324	2022-12-23 22:15:54 +08:00
Hu Yanjun	ef3da105c9	[DOCS](refactor) refine en docs (#15244 ) * Update basic-summary.md * Update README.md	2022-12-23 16:47:51 +08:00
gnehil	00fd5b1b1c	[typo](doc) update Paxos spell mistake (#15171 )	2022-12-23 16:47:12 +08:00
Tiewei Fang	764b1db097	[fix](s3 outfile) Add the`use_path_style` parameter for s3 outfile (#15288 ) Currently, `outfile` does not support the `use_path_style` parameter and uses `virtual-host style` by default; however, some object storage services may only support the `use_path_style` access mode. This PR adds the `use_path_style` parameter for s3 outfile, so that different object storage services can use different access modes.	2022-12-23 16:22:06 +08:00
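The two access modes named in the `use_path_style` commit differ only in where the bucket name appears in the request URL. The sketch below illustrates that general S3 addressing convention; it is not Doris code, and the helper `s3_object_url` and the endpoint `s3.example.com` are hypothetical names chosen for illustration.

```python
def s3_object_url(endpoint: str, bucket: str, key: str, use_path_style: bool) -> str:
    """Hypothetical helper showing the two S3 addressing styles."""
    if use_path_style:
        # Path style: the bucket is the first path segment.
        return f"https://{endpoint}/{bucket}/{key}"
    # Virtual-host style: the bucket is a subdomain of the endpoint.
    return f"https://{bucket}.{endpoint}/{key}"


if __name__ == "__main__":
    print(s3_object_url("s3.example.com", "my-bucket", "out/result.csv", True))
    print(s3_object_url("s3.example.com", "my-bucket", "out/result.csv", False))
```

Object stores that cannot resolve per-bucket subdomains are the usual reason a path-style option is needed.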
Gabriel	cb295de981	[Bug](decimalv3) Fix wrong precision of DECIMALV3 (#15302 ) * [Bug](decimalv3) Fix wrong precision of DECIMALV3 * update	2022-12-23 14:11:08 +08:00
HaveAnOrangeCat	df5969ab58	[Feature] Support function roundBankers (#15154 )	2022-12-22 22:53:09 +08:00

1742 Commits