doris

Author	SHA1	Message	Date
caiconghui	7e30b28f3a	[Optimize] Speed up converting the data of other types to string in mysql_result_writer (#6384 ) Co-authored-by: caiconghui <caiconghui@xiaomi.com>	2021-08-24 22:30:58 +08:00
Mingyu Chen	fa382f8602	[Bug][MemLimit] Modify the memory limit of storage page cache (#6451 ) This CL mainly changes: 1. the `storage_page_cache_limit` is based on config `mem_limit` the default is 20% of `mem_limit`. 2. the `buffer_pool_limit` is based on config `mem_limit` the default is 20% of `mem_limit`. 3. the `buffer_pool_clean_pages_limit` is based on config `buffer_pool_limit` the default is 50% of `buffer_pool_limit` 4. Fix some show bugs of lru cache hit ratio and usage ratio 5. Fix a create view bug that `notEvalNondeterministicFunction` should be reset after analyze.	2021-08-19 14:16:53 +08:00
Hao Tan	66a7a4b294	[Feature] Support exact percentile aggregate function (#6410 ) Support to calculate the exact percentile value array of numeric column `col` at the given percentage(s).	2021-08-18 15:56:06 +08:00
Pxl	999eaeb276	fix `Wrong use on SCOPED_RAW_TIMER` (#6459 )	2021-08-18 09:06:18 +08:00
Zhengguo Yang	8738ce380b	Add long text type STRING, with a maximum length of 2GB. Usage is similar to varchar, and there is no guarantee for the performance of storing extremely long data (#6391 )	2021-08-18 09:05:40 +08:00
caiconghui	285d44cd48	[BUG] Fix potential overflow exception when do money format for double (#6408 ) * [BUG] Fix potential overflow bug when do money format for double Co-authored-by: caiconghui <caiconghui@xiaomi.com>	2021-08-15 18:40:26 +08:00
HappenLee	9216735cfa	[New Featrue] Support Vectorization Execution Engine Interface For Doris (#6329 ) 1. FE vectorized plan code 2. Function register vec function 3. Diff function nullable type 4. New thirdparty code and new thrift struct	2021-08-11 14:54:06 +08:00
Lijia Liu	f772649535	[Optimize] Optimize lock when check error storage (#6321 ) 1. `StorageEngine::_delete_tablets_on_unused_root_path` will try to obtain tablet shard write lock in `TabletManager` ``` StorageEngine::_delete_tablets_on_unused_root_path TabletManager::drop_tablets_on_error_root_path obtain each tablet shard's write lock ``` 2. `TabletManager::build_all_report_tablets_info` and other methods will obtain tablet shard read lock frequently. So, `StorageEngine::_delete_tablets_on_unused_root_path` will hold `_store_lock` for a long time. This will make it difficult for other threads to get write `_store_lock`, such as `StorageEngine::get_stores_for_create_tablet` `drop_tablets_on_error_root_path` is a small probability event, `TabletManager::drop_tablets_on_error_root_path` should return when its param `tablet_info_vec` is empty	2021-08-07 21:30:49 +08:00
caiconghui	2f5b06ae70	[Bug][Optimize] Fix race condition problem and optimize do_money_format function (#6350 ) * [Bug][Optimize] Fix race condition problem and optimize do_money_format function Co-authored-by: caiconghui <caiconghui@xiaomi.com>	2021-08-06 16:29:34 +08:00
caiconghui	d1007afe80	Use fmt and std::from_chars to make convert integer to string and convert string to integer more efficient (#6361 ) * [Optimize] optimize the speed of converting integer to string * Use fmt and std::from_chars to make convert integer to string and convert string to integer more efficient Co-authored-by: caiconghui <caiconghui@xiaomi.com>	2021-08-04 10:55:19 +08:00
HappenLee	6597a338dc	[Feature] Support config max length of zone map index (#6293 )	2021-07-30 09:23:11 +08:00
stdpain	776df2effc	[BUG][stack-buffer-overflow] fix overflow while calculate hash code in ArrayType and fix some warning	2021-07-27 13:41:00 +08:00
Mingyu Chen	f26e3408b2	[Profile] Support show load profile for broker load job (#6214 ) 1. Add new statement: `SHOW LOAD PROFILE "xxx";` 2. Improve the read performance of orc scanner	2021-07-27 13:37:34 +08:00
stdpain	4c0fdd2800	[Bug] Fix core dump in BloomFilter while build Runtime Filter right table string column contains null (#6305 ) when right table has null value in string column, runtime filter may coredump ``` select count(*) from baseall t1 join test t2 where t1.k7 = t2.k7; ```	2021-07-26 09:41:41 +08:00
xinghuayu007	13ef2c9e1d	[Function][Enhance] lower/upper case transfer function vectorized (#6253 ) Currently, the function lower()/upper() can only handle one char at a time. A vectorized function has been implemented, it makes performance 2 times faster. Here is the performance test: The length of char: 26, test 100 times vectorized-function-cost: 99491 ns normal-function-cost: 134766 ns The length of char: 260, test 100 times vectorized-function-cost: 179341 ns normal-function-cost: 344995 ns	2021-07-26 09:38:07 +08:00
HappenLee	02a00cdf35	[Bug] Fix the bug in `from_date_format_str` function (#6273 )	2021-07-21 12:31:37 +08:00
Mingyu Chen	327e31c227	[Feature] Support setting concurrency for thread pool token (#6237 ) Now we can submit a group of tasks using thread pool token, and limit the max concurrency of this task group	2021-07-21 12:30:43 +08:00
huangmengbin	2d78c31d49	[Enhance] improve performance of init_scan_key by sharing the schema (#6099 ) Co-authored-by: huangmengbin <huangmengbin@bytedance.com>	2021-07-21 10:50:31 +08:00
HappenLee	fae3eff2e6	[Bug] Fix the bug of cast string to datetime return not null (#6228 )	2021-07-17 10:55:08 +08:00
stdpain	bf5db6eefe	[BUG][Timeout][QueryLeak] Fixed memory not released in time (#6221 ) * Revert "[Optimize] Put _Tuple_ptrs into mempool when RowBatch is initialized (#6036)" This reverts commit f254870aeb18752a786586ef5d7ccf952b97f895. * [BUG][Timeout][QueryLeak] Fixed memory not released in time, Fix Core dump in bloomfilter	2021-07-16 12:32:10 +08:00
Mingyu Chen	c2695e9716	[Bug][RoutineLoad] Can not match whole json in routine load (#6213 ) Support using json path "$" to match the whole json in routine load Co-authored-by: chenmingyu <chenmingyu@baidu.com>	2021-07-16 09:21:27 +08:00
Mingyu Chen	68f988b78a	[Optimize] Use flat_hash_set to replace unorderd_set in InPredicate (#6216 ) Co-authored-by: chenmingyu <chenmingyu@baidu.com>	2021-07-15 11:15:11 +08:00
Zhengguo Yang	ed3ff470ce	[ARRAY] Support array type load and select not include access by index (#5980 ) This is part of the array type support and has not been fully completed. The following functions are implemented 1. fe array type support and implementation of array function, support array syntax analysis and planning 2. Support import array type data through insert into 3. Support select array type data 4. Only the array type is supported on the value lie of the duplicate table this pr merge some code from #4655 #4650 #4644 #4643 #4623 #2979	2021-07-13 14:02:39 +08:00
Zhengguo Yang	dbfe8e4753	[enhancement] Optimize load CSV file memory allocate (#6174 ) Optimize load CSV file memory allocate, avoid frequent allocation, may reduce the load time by 40%-50% when large column numbers	2021-07-12 09:58:45 +08:00
stdpain	290a844e04	[optimize] Optimize bloomfilter performance (#6180 ) refactor runtime filter bloomfilter and eliminate some virtual function calls which obtained a performance improvement of about 5% import block bloom filter, for avx version obtained 40% performance improvement before: bloomfilter size:default, about 2000W item cost about 1s400ms after: bloomfilter size:524288, about 2000W item cost about 400ms	2021-07-10 10:12:12 +08:00
DinoZhang	c929a8935a	[Feature][Function] support bit_length function (#6140 ) support bit_length function like mysql	2021-07-08 09:40:30 +08:00
Zhengguo Yang	739c0268ff	[refactor] Remove decimal v1 related code from code base (#6079 ) remove ALL DECIMAL V1 type code ， this is a part of #6073	2021-07-07 10:26:32 +08:00
stdpain	149def9e42	[Feature] Support RuntimeFilter in Doris (BE Implement) (#6077 ) 1. support in/bloomfilter/minmax 2. support broadcast/shuffle/bucket shuffle/colocate join 3. opt memory use and cpu cache miss while build runtime filter 4. opt memory use in left semi join (works well on tpcds-95)	2021-07-04 20:59:05 +08:00
luozenglin	6441a4c0ca	[Metrics] Add metrics for load average. (#6069 ) Added load average metrics to be for monitoring system load. See iseue #6068 for a detailed explanation.	2021-06-30 09:28:45 +08:00
Zhengguo Yang	fe65a623c1	Fix timeout error when delete condition contains invalid datetime format (#6030 ) * add date time format check in delete statment	2021-06-29 09:47:42 +08:00
stdpain	1999a0c26b	[optimization] open gcc strict-aliasing optimization (#6034 ) * open gcc strict-aliasing optimization * use -Werror=strick-alias	2021-06-18 11:39:24 +08:00
Mingyu Chen	5cfe081b05	[Bug] Remove duplicate memtracker (#6041 ) * [Enhanece] Remove duplicate memtracker This problem will cause frequent creation of memtracker and affect query concurrency.	2021-06-18 11:28:37 +08:00
Mingyu Chen	d57c2344e1	[MemTracker] Refactored the hierarchical structure of memtracker (#5956 ) To avoid showing too many memtracker on BE web pages. The MemTracker level now has 3 levels: OVERVIEW, TASK and VERBOSE. OVERVIEW Mainly used for main memory consumption module such as Query/Load/Metadata. TASK is mainly used to record the memory overhead of a single task such as a single query, load, and compaction task. VERBOSE is used for other more detailed memtrackers.	2021-06-16 09:44:24 +08:00
stdpain	bde60280b8	[Optimize] use string_view instead of std::string in string function (#6010 )	2021-06-16 09:40:13 +08:00
Yingchun Lai	6d6c3d9703	[Enhancement] Reduce memory consumption by releasing readers earier (#5811 ) We created multiple rowset readers to read data of one tablet, after one rowset reader has reached EOF, it can be released to reduce resource (typically memory) consumption. As the same, we can release segment reader when it reach EOF.	2021-06-16 09:37:50 +08:00
luozenglin	d33a6d1b98	[Function] Support date function: yearweek(), week(), makedate(). (#6000 )	2021-06-10 17:38:25 +08:00
Lijia Liu	4d64612b96	[ARRAY]Save array's size instead of offset. (#5983 ) * Save array's size instead of offset. * Optimize variable name * Fix comment	2021-06-10 12:32:58 +08:00
stdpain	d790cc6a50	[BUG] Fixed the problem that substring function may access illegal address (#5952 )	2021-06-03 18:38:10 +08:00
Mingyu Chen	81ecf3d097	[Bug] Rebuilt version graph of a tablet when there are too many orphan vertex (#5945 ) The version information of the tablet will be stored in the memory in an adjacency graph data structure. And as the new version is written and the old version is deleted, the data structure will begin to have empty vertex with no edge associations(orphan vertex). These orphan vertexs should be removed somehow.	2021-06-03 09:59:20 +08:00
xinghuayu007	63c99eb4cb	[Cache][Enhancement] Assure sql cache only one version (#5793 ) For PR #5792. This patch add a new param `cache type` to distinguish sql cache and partition cache. When update sql cache, we make assure one sql key only has one version cache.	2021-05-28 13:45:47 +08:00
Xinyi Zou	80f0b5fd1c	[BUG] Fix calculation error when the memory parameter is a float value percentage (#5916 ) When parsing memory parameters in `ParseUtil::parse_mem_spec`, convert the percentage to `double` instead of `int`. The currently affected parameters include `mem_limit` and `storage_page_cache_limit`	2021-05-27 22:06:50 +08:00
Zhengguo Yang	ba38973209	use virtual hosted-style request to access object store (#5894 ) * use virtual hosted-style access request object store	2021-05-27 15:52:07 +08:00
HappenLee	d0462f4383	[Bug] Fix Backend UT Problem (#5784 ) (#5785 ) 1. relocation R_X86_64_32 against `__gxx_personality_v0' can not be used when making a shared object; recompile with -fPIC 2. warning: the use of `tmpnam' is dangerous, better use `mkstemp' 3. Death tests use fork(), which is unsafe particularly in a threaded context. For this test, Google Test couldn't detect the number of threads.	2021-05-17 11:51:59 +08:00
stdpain	a359b1cb8b	[UT] fix ut failed in new_metrics_test (#5817 )	2021-05-17 11:51:22 +08:00
Zhengguo Yang	01a45e8691	add read buffer when use s3 reader (#5791 )	2021-05-17 11:46:38 +08:00
luozenglin	b686205b97	[Optimize] Reduce lock conflicts in ThreadResourceMgr of be (#5772 ) Removed some useless code that caused lock conflicts in ThreadResourceMgr of be.	2021-05-12 10:59:53 +08:00
Zhengguo Yang	98e80aa65e	[refactor] Replace boost::function with std::function (#5700 ) Replace boost::function with std::function	2021-05-09 22:00:48 +08:00
Mingyu Chen	11cce06962	[Feature] Support create history dynamic partition (#5703 ) 1. Add a new dynamic partition property `create_history_partition`. If set to true, Doris will create all partitions from `start` to `end`. 2. Add a new FE config `max_dynamic_partition_num` To limit the number of partitions created when creating one table.	2021-05-08 12:05:19 +08:00
weizuo93	e519a24c9a	dynamic adjust compaction policy (#5651 ) Co-authored-by: weizuo <weizuo@xiaomi.com>	2021-04-26 12:39:13 +08:00
qiye	de87f4ae84	[Feature] Add list partition support (#5529 ) Add list partition support	2021-04-24 17:42:27 +08:00

1 2 3 4 5 ...

483 Commits