doris

Author	SHA1	Message	Date
Gabriel	1d5ba9cbcc	[Improvement](like) Change `like` function to batch call (#13314 )	2022-10-16 16:18:22 +08:00
yiguolei	a5f3880649	[improvement](memory) disable page cache and chunk allocator, optimize memory allocate size (#13285 ) disable page cache by default disable chunk allocator by default not use chunk allocator for vectorized allocator by default add a new config memory_linear_growth_threshold = 128Mb, not allocate memory by RoundUpToPowerOf2 if the allocated size is larger than this threshold. This config is added to MemPool, ChunkAllocator, PodArray, Arena.	2022-10-15 17:27:17 +08:00
Gabriel	baf2689610	[Improvement](join) compute hash values by vectorized way (#13335 )	2022-10-13 16:04:58 +08:00
Gabriel	dfe308f501	[Improvement](join) refine prefetch strategy (#13286 )	2022-10-12 19:02:06 +08:00
Xinyi Zou	89b295c6cc	[enhancement](memory) Print memory usage log when memory allocation fails (#13301 )	2022-10-12 10:08:25 +08:00
Gabriel	1ba9e4b568	[Improvement](sort) Reuse memory in sort node (#12921 )	2022-09-28 09:44:35 +08:00
Yongqiang YANG	a7d42b5d81	[fix](streamload&sink) release and allocate memory in the same tracker (#12820 ) 1. HttpServer threads allocate bytebuffer and put them into streamload pipe, but scanner thread release them with query tracker. 2. We can assume brpc allocate memory in doris thread. Above problems leads to wrong result of memtracker.	2022-09-23 17:51:44 +08:00
Xinyi Zou	b41eaa5ac0	[fix](memtracker) Introduce orphan mem tracker to verify memory tracking accuracy (#12794 ) The mem hook consumes the orphan tracker by default. If the thread does not attach other trackers, by default all consumption will be passed to the process tracker through the orphan tracker. In real time, consumption of all other trackers + orphan tracker consumption = process tracker consumption. Ideally, all threads are expected to attach to the specified tracker, so that "all memory has its own ownership", and the consumption of the orphan mem tracker is close to 0, but greater than 0.	2022-09-21 15:47:10 +08:00
Jerry Hu	8f4bb0f804	[improvement](agg) iterate aggregation data in memory written order (#12704 ) Following the iteration order of the hash table will result in out-of-order access to aggregate states, which is very inefficient. Traversing aggregate states in memory write order can significantly improve memory read efficiency. Test hash table items count: 3.35M Before this optimization: insert keys into column takes 500ms With this optimization only takes 80ms	2022-09-21 14:58:50 +08:00
Gabriel	3cfaae0031	[Improvement](sort) Use heap sort to optimize sort node (#12700 )	2022-09-21 10:01:52 +08:00
Gabriel	c05d736331	[Improvement](sort) fallback to partial sort small block if topN is small (#12604 ) * [Improvement](sort) fallback to partial sort small block if topN is small	2022-09-16 10:20:17 +08:00
Pxl	0ead048b93	[Enhancement](column) remove ColumnString terminating zero and add a data_version for pblock (#12456 ) 1. remove ColumnString terminating zero 2. add a data_version for pblock 3. change EncryptionMode to enum class	2022-09-14 21:25:22 +08:00
Xinyi Zou	f72d2559cf	[fix](compile) Fix compile error '<unknown>' may be used uninitialized in PODArray::insert_prepare #12202	2022-08-31 09:12:28 +08:00
Xinyi Zou	8370115cf6	[enhancement](memtracker) Improve performance of tracking real physical memory of PODArray #12168	2022-08-30 10:22:12 +08:00
Xinyi Zou	9caaa4bfbd	[fix](memory) fix set disable_chunk_allocator_in_vec=false performance #12092	2022-08-26 14:28:12 +08:00
Xinyi Zou	1304a17600	[fix](memtracker) Improve performance of tracking real physical memory of PodArray #12021	2022-08-24 14:24:14 +08:00
Xinyi Zou	c124470408	[enhancement](memory) Fix too much cache leads to less memory available for queries (#11751 ) Disable Chunk Allocator in Vectorized Allocator, this will reduce memory cache. For high concurrent queries, using Chunk Allocator with vectorized Allocator can reduce the impact of gperftools tcmalloc central lock. Jemalloc or google tcmalloc have core cache, Chunk Allocator may no longer be needed after replacing gperftools tcmalloc.	2022-08-16 14:35:57 +08:00
Xinyi Zou	2a1803c646	[enhancement](memtracker) Optimize query memory accuracy (#11740 ) Currently, only the virtual memory used by the query can be tracked through the tcmalloc hook. When the memory is not fully used after the application, the recorded virtual memory will be larger than the physical memory. At present, it is mainly because PODArray does not memset 0 when applying for memory, and blocks applied for through PODArray in places such as VOlapScanNode::_free_blocks are usually used for memory reuse and cannot be fully used.	2022-08-16 14:23:28 +08:00
starocean999	092a394782	[improvement](agg)limit the output of agg node (#11461 ) * [improvement](agg)limit the output of agg node	2022-08-05 07:53:55 +08:00
Jerry Hu	b74f36e009	[improvement]Use phmap for aggregation with integer keys (#11175 )	2022-07-27 13:58:20 +08:00
Xinyi Zou	4960043f5e	[enhancement] Refactor to improve the usability of MemTracker (step2) (#10823 )	2022-07-21 17:11:28 +08:00
plat1ko	989e6d1cf9	[chore]fix clang compile error (#11021 )	2022-07-20 08:28:47 +08:00
Jerry Hu	fd2c374426	[fix]Empty string key in aggregation was output as NULL (#11011 )	2022-07-19 23:25:28 +08:00
Jerry Hu	899acb6564	[improvement][agg]import sub hashmap (#10937 )	2022-07-18 18:36:45 +08:00
Xinyi Zou	d9095922d9	[Enhancement] [Memory] add strict memory usage compile option STRICT_MEMORY_USE (#10936 ) In the strict memory usage mode of STRICT_MEMORY_USE=ON, when the capacity of the vectorized Hash Table is greater than 2G, it starts to grow when 75% of the capacity is satisfied, the memory usage of the vectorized Join becomes 50% of the previous value. STRICT_MEMORY_USE=ON` expects BE to use less memory, and gives priority to ensuring stability when the cluster memory is limited.	2022-07-18 16:16:43 +08:00
Jerry Hu	d1573e1a4a	[improvement]Use phmap for aggregation with serialized key (#10821 )	2022-07-14 11:26:09 +08:00
Kang	4e9d5a7f7a	optimize substr performance and fix ASAN global buffer overflow (#10442 ) * add volnitsky substr algorithm * replace std::search with volnitsky search algorithm in StringSearch * optimize substring for constant_substring_fn case use long run length search for performance	2022-07-12 08:36:21 +08:00
Jerry Hu	e293fbd277	[improvement]pre-serialize aggregation keys (#10700 )	2022-07-09 06:21:56 +08:00
camby	ec6620ae3e	[feature-wip](array-type) add function arrays_overlap (#10233 )	2022-06-30 08:12:29 +08:00
Xinyi Zou	deeb3028ad	[Enhancement] [Memory] [Vectorized] Stress test and optimize memory allocation (#9581 ) * vec stress test, Allocator introduce chunkallocator * fix comment	2022-06-29 02:57:51 +08:00
Mingyu Chen	9036f93df4	Revert "[improvement](function) optimize substr performance (#10169 )" (#10390 ) This reverts commit 2335d233f1f52eb64a380b4c9959becdf182b71b.	2022-06-24 14:38:52 +08:00
Kang	2335d233f1	[improvement](function) optimize substr performance (#10169 ) optimize substr performance about 1.5~2x speedup.	2022-06-24 08:57:31 +08:00
chenlinzhong	5974e452bc	[enhancement] CRC32 instructions compatible arm arch (#10261 ) The performance of some CPUs that do not implement CRC instructions is particularly poor	2022-06-20 17:49:06 +08:00
starocean999	1cca319d18	[fix](vectorized) intersect operator takes too long time to execute (#10183 ) * fix itersect operator takes too long time to execute * modify code based on review comments	2022-06-17 08:43:53 +08:00
Zhengguo Yang	39a2785ce2	[enhancement] support simd instructions on arm cpus through sse2neon (#10068 ) * [enhancement] support simd instructions on arm cpus through sse2neon	2022-06-14 09:17:09 +08:00
Pxl	f2aa5f32b8	[Feature] [Vectorized] Some pre-refactorings or interface additions for schema change (#9811 ) Some pre-refactorings or interface additions for schema change	2022-06-07 15:04:57 +08:00
Twice	f49284036e	[Enhancement] Refactor functions in int_exp by templates (#9939 )	2022-06-04 11:53:31 +08:00
HappenLee	5039ec4570	[vec][opt] opt hash join build resize hash table before insert data (#9735 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-05-23 15:13:57 +08:00
Adonis Ling	ec2cd0083a	[code format]Upgrade clang-format in BE Code Formatter from 8 to 13 (#9602 )	2022-05-17 19:28:15 +08:00
Adonis Ling	718a51a388	[refactor][style] Use clang-format to sort includes (#9483 )	2022-05-10 21:25:35 +08:00
chenlinzhong	c9961c9bb9	[style] clang-format all c++ code (#9305 ) - sh build-support/clang-format.sh to clang-format all c++ code	2022-04-29 16:14:22 +08:00
jacktengg	201cd207f9	[Enhancement][Vectorized] Improve hash table build efficiency (#9250 ) 1. MAP_POPULATE is missing for mmap in Allocator, because macro OS_LINUX is not defined in allocator.h; 2. MAP_POPULATE has no effect for mremap as for mmap, zero-fill enlarged memory range explicitly to pre-fault the pages	2022-04-29 14:26:33 +08:00
Xinyi Zou	26bc462e1c	[feature-wip] (memory tracker) (step5) Fix track bthread, fix track vectorized query (#9145 ) 1. fix track bthread - Bthread, a high performance M:N thread library used by brpc. In Doris, a brpc server response runs on one bthread, possibly on multiple pthreads. Currently, MemTracker consumption relies on pthread local variables (TLS). - This caused pthread TLS MemTracker confusion when switching pthread TLS MemTracker in brpc server response. So replacing pthread TLS with bthread TLS in the brpc server response saves the MemTracker. Ref: `731730da85/docs/en/server.md (bthread-local)` 2. fix track vectorized query - Added track mmap. Currently, mmap allocates memory in many places of the vectorized execution engine. - Refactored ThreadContext to avoid dependency conflicts and make it easier to debug. - Fix some bugs.	2022-04-27 20:34:02 +08:00
zbtzbtzbt	6ed59bb98b	[refactor](code_style) remove useless inline #8933 1.Member functions defined in a class are inline by default (implicitly), and do not need to be added 2.inline is a keyword used for implementation, which has no effect when placed before the function declaration	2022-04-10 18:29:55 +08:00
dataroaring	7fb4b6a6e2	[chore](tsan) add file mremap_fallback for tsan (#8665 )	2022-04-08 09:01:53 +08:00
HappenLee	92feb9c6c8	[fix] Fix error crc32 method to cal uint128 and int128 (#8577 )	2022-03-23 10:33:32 +08:00
Mingyu Chen	a76889b319	[improvement] Avoid print large string in error log (#8436 ) 1. Avoid print large string in error log If user load a unqualified large string, the all string will be saved in error log, so the error log is too big that can not be shown be using `show load warnings on "url"`. Err: `Got packet bigger than 'max_allowed_packet' bytes` 2. Remove duplicate help doc Do not allow doc with same title, or error thrown when starting FE: `java.lang.IllegalArgumentException: Multiple entries with same key:`	2022-03-11 17:23:47 +08:00
Pxl	cd8694e532	[feature][vectorized] support replace() (#8384 )	2022-03-08 18:57:12 +08:00
zbtzbtzbt	ada39dd9ad	[improvement][vec] better memequal impl to speed up string compare (#8229 ) like #8214 faster string compare operator in vec engine.	2022-03-01 11:25:12 +08:00
HappenLee	01fb25a498	[UT] Fix the UT of column_nullable_test (#8180 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-02-23 15:37:40 +08:00

1 2

59 Commits