doris

Files

ZenoYang 9d3f1dcf44 [improvement](vectorized) Deserialized elements of count distinct aggregation directly inserted into target hashset (#21888 )

The original logic is to first deserialize the ColumnString into a HashSet (insert the deserialized elements into the hashset), and then traverse all the HashSet elements into the target HashSet during the merge phase.
After optimization, when deserializing, elements are directly inserted into the target HashSet, thereby reducing unnecessary hashset insert overhead.

In one of our internal query tests, 30 hashsets were merged in second phase aggregation(the average cardinality is 1,400,000), and the cardinality after merging is 42,000,000. After optimization, the MergeTime dropped from 5s965ms to 3s375ms.

2023-08-02 21:19:56 +08:00

src

[improvement](vectorized) Deserialized elements of count distinct aggregation directly inserted into target hashset (#21888 )

2023-08-02 21:19:56 +08:00

test

[improvement](compaction) compaction policy and options in the properties of a table (#22461 )

2023-08-01 22:02:23 +08:00

CMakeLists.txt

[Feature](inverted index) add inverted index tool (#22207 )

2023-07-27 21:28:34 +08:00