Qi Chen
ef2fdb79bb
[Improvement](parquet-reader) Optimize and refactor parquet reader to improve performance. (#16818)
Optimize and refactor parquet reader to improve performance.
- Improve performance about 2x for small dictionary-encoded strings by using aligned copying.
- Refactor code to reduce conditional (`if`) checks.
- Don't call `skip(0)`.
- Don't read the page index when there is no filter condition.
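
The aligned-copy item can be illustrated with a minimal sketch. The commit message does not show the code, so everything here is an assumption: the names `PaddedDict`, `build_dict`, `decode`, and the 16-byte block width `kBlock` are hypothetical, and "aligned copying" is read as "copy a fixed-size block for short strings so the compiler can emit one fixed-width load/store instead of a variable-length `memcpy`". The trick only works when both the source and destination buffers carry enough padding to absorb the extra bytes.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <string>
#include <vector>

// Hypothetical block width: strings no longer than this take the fast path.
constexpr std::size_t kBlock = 16;

// Hypothetical dictionary layout: all entries back to back, plus kBlock
// bytes of trailing padding so a fixed-size read never runs off the end.
struct PaddedDict {
    std::vector<char> bytes;
    std::vector<std::size_t> offsets;
    std::vector<std::size_t> lengths;
};

PaddedDict build_dict(const std::vector<std::string>& words) {
    PaddedDict d;
    for (const std::string& w : words) {
        d.offsets.push_back(d.bytes.size());
        d.lengths.push_back(w.size());
        d.bytes.insert(d.bytes.end(), w.begin(), w.end());
    }
    d.bytes.resize(d.bytes.size() + kBlock, '\0');  // read padding
    return d;
}

// Concatenates dict[id] for every id. Short entries are copied as a whole
// kBlock-sized block; the bytes past the real length are overwritten by the
// next entry or fall into the output buffer's trailing padding.
std::string decode(const PaddedDict& d, const std::vector<std::uint32_t>& ids) {
    std::size_t total = 0;
    for (std::uint32_t id : ids) total += d.lengths[id];

    std::vector<char> out(total + kBlock);  // write padding
    std::size_t pos = 0;
    for (std::uint32_t id : ids) {
        const char* src = d.bytes.data() + d.offsets[id];
        const std::size_t len = d.lengths[id];
        if (len <= kBlock) {
            std::memcpy(out.data() + pos, src, kBlock);  // constant size
        } else {
            std::memcpy(out.data() + pos, src, len);     // general path
        }
        pos += len;
    }
    return std::string(out.data(), total);
}
```

For example, with a dictionary built from `"AIR"`, `"TRUCK"`, and `"REG AIR"`, decoding the ids `1, 0, 2, 0` yields the concatenation `TRUCKAIRREG AIRAIR`. Low-cardinality columns with short values, such as `lo_shipmode`, are the kind of data where a fixed-size copy is likely to matter most.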
**ssb-flat-100**: (single-machine, single-thread)
| Query | Before optimization | After optimization |
| ------------- |:-------------:| ---------:|
| SELECT count(lo_revenue) FROM lineorder_flat | 9.23 | 9.12 |
| SELECT count(lo_linenumber) FROM lineorder_flat | 4.50 | 4.36 |
| SELECT count(c_name) FROM lineorder_flat | 18.22 | 17.88 |
| **SELECT count(lo_shipmode) FROM lineorder_flat** | **10.09** | **6.15** |
2023-02-20 11:42:29 +08:00