doris

Author	SHA1	Message	Date
Adonis Ling	16a394da0e	[chore](build) Use include-what-you-use to optimize includes (PART III) (#18958) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-24 14:51:51 +08:00
yinzhijian	cbbda7857b	[feature-wip](parquet-orc) Support orc scanner in vectorized engine (#9541)	2022-05-26 21:39:12 +08:00
Zhengguo Yang	5a44eeaf62	[refactor] Unify all unit tests into one binary file (#8958) 1. Solves the earlier problems of the unit test files being too large (1.7G+) and the unit test link time being too long. 2. Unifies all unit tests into one file, significantly reducing unit test execution time to less than 3 minutes. 3. Temporarily disables stream_load_test.cpp, metrics_action_test.cpp, and load_channel_mgr_test.cpp because they re-implement part of the code and affect other tests.	2022-04-12 15:30:40 +08:00
sduzh	6fedf5881b	[CodeFormat] Clang-format cpp sources (#4965) Clang-format all C++ source files.	2020-11-28 18:36:49 +08:00
wangbo	8cd36f1c5d	[Spark Load] Support java version hyperloglog (#3320) Mainly used by the Spark Load process to calculate an approximate deduplication value and then serialize it to a Parquet file. Tries to keep the same calculation semantics as the BE's C++ version.	2020-06-21 09:37:05 +08:00
ZHAO Chun	8f016d3ab2	Make HLL be able to handle invalid data (#1908) In this change list: 1. Validate the HLL column when loading data; if the data is invalid, the row is filtered. 2. Treat HLL data of an invalid type as an empty HLL when serializing; with this change, all ingested data will be valid. 3. Treat nullptr or HLL data of an invalid type as an empty HLL when deserializing; with this change, dirty data can be handled normally. 4. Rename function empty_hll to hll_empty. 5. Disable memtable_flush_execute_test because it sometimes fails: when tearing down, some threads are not joined, and they access destroyed resources, which is invalid.	2019-09-29 10:55:23 +08:00
ZHAO Chun	93fe10a268	Reduce size of HyperLogLog struct (#1845) The HyperLogLog struct is currently so large that it causes rowsets to be too small when ingesting data. In this CL, registers in HyperLogLog are only created when they are needed. When ingesting data, it is the normal case that there are only a few values in one HyperLogLog.	2019-09-21 14:38:58 +08:00

7 Commits