Commit Graph

56 Commits

Author SHA1 Message Date
56ebbf8bc9 [chore](tools) fix load-clickbench-data script cannot be interrupted #17000 2023-02-22 19:34:40 +08:00
0b3e18d060 [chore](macOS) Support LLVM Clang 15 (#16991)
Remove the deprecated classes std::codecvt_utf8_utf16<char16_t> and std::wstring_convert.
Use libiconv to convert UTF-8 strings to UTF-16LE ones.
2023-02-22 15:04:48 +08:00
0950a08efd [chore](tools) Support starting multiple FEs on single node (#16787)
Introduce a tool to start multiple FEs on single node.

Use case:

```
$ ./multi-fe
./multi-fe start|stop|clean [OPTIONS ...]

    start -n <NUM> -l <LIBRARY_PATH> -p <BASE_PORT>

             Start the FE cluster.
      -n     The number of FEs.
      -l     The FE library path (default: doris/output/fe/lib)
      -p     The base port to generate all needed ports (default: 9030).

    stop     Stop the FE cluster.

    clean    Stop the data (rm -rf "$(pwd)"/fe*).
```
2023-02-21 10:55:36 +08:00
cf739e7496 [Enhancement](Stmt) Set insert_into timeout session variable separately (#16343) 2023-02-12 16:56:10 +08:00
1b2f882d24 [fix](terminal) remove echo database passwd (#15876)
* remove echo passwd
* add timer for data load
2023-01-16 22:00:52 +08:00
816e12db6a [Bench](mem) some benchmark over the query limit (#15408) 2022-12-28 09:29:53 +08:00
5c7964d396 [minor](tools) delete unused script (#14752) 2022-12-02 15:20:01 +08:00
5ca6596ca3 [fix](tpch-tools) disable join reorder for Q12 in TPC-H tools (#14728) 2022-12-01 20:55:17 +08:00
b12ac90d8f [tools](tpch) upgrade decimal type to decimalv3 (#14665) 2022-11-30 08:41:06 +08:00
0613ccda74 [feature](tools)profile viewer (#14429)
It is a painful work to read profile, especially there are multi-parallel instances.
This tool helps us to grasp the main information of profile in a graphical view.

The profile is represented by a tree.
Sql operation nodes contains operation type(join, scan...), its node id, its fragment id. The number on the arrow edge means how many rows output by child node. This tool will sum the output rows of the same node in multi-parallel instances, that is if there are 4 parallel instance, and each ScanNode on lineitem table output 10 rows, the label on the arrow beginning with ScanNode(lineitem) is 40.

Here is a demo for tpch Q2
tpch q2 profile viewer

Issue Number: close #xxx
2022-11-21 10:29:54 +08:00
1482ab32b6 [tools](tpch)fix invalid download url (#14329) 2022-11-19 13:29:33 +08:00
9b50888aaf [feature](Nereids) prune runtime filters which cannot reduce the tuple number of probe table (#13990)
1. add a post processor: runtime filter pruner 
Doris generates RFs (runtime filter) on Join node to reduce the probe table at scan stage. But some RFs have no effect, because its selectivity is 100%. This pr will remove them.
A RF is effective if
a. the build column value range covers part of that of probe column, OR
b. the build column ndv is less than that of probe column, OR
c. the build column's ColumnStats.selectivity < 1, OR
d. the build column is reduced by another RF, which satisfies above criterions.

2. explain graph
a. add RF info in Join and Scan node
b. add predicate count in Scan node

3. Rename session variable
rename `enable_remove_no_conjuncts_runtime_filter_policy` to `enable_runtime_filter_prune` 

4. fix min/max column stats derive bug
`select max(A) as X from T group by B`  
X.min is A.min, not A.max
2022-11-11 13:13:29 +08:00
75b6b267ea [opt](ssb) Add query hint for the SSB queries (#14089) 2022-11-09 08:37:31 +08:00
4bfa95f669 [enhancement](tools) opt tpch q21: change join order (#13699) 2022-10-27 16:55:23 +08:00
bd2b803019 [opt](tpch) opt the q9 of tpch to change instance to 8 (#13656) 2022-10-26 09:28:51 +08:00
20ade4ae96 [chore](macOS) Disable JAVA UDF temporarily (#13563)
Fail to start BE (ASAN) if it was built with JAVA UDF on macOS.
2022-10-22 01:05:45 +08:00
038ccab3c4 [benchmark](rewrite) add hints for q20 (#13561) 2022-10-21 22:35:50 +08:00
f0b608018b [config](tpch) Disable jemalloc and change the hint of tpch q22 (#13555) 2022-10-21 21:35:43 +08:00
c4b5ba2a4f [Regression](java-udf) Move source code used by Java UDF test case (#13476) 2022-10-19 21:05:06 +08:00
6ea9a65bb6 [Opt](vec) opt runtime filter for TPCH Q22 (#13339) 2022-10-17 10:30:07 +08:00
Pxl
632670a49c [Enhancement](function) refactor of date function (#13362)
refactor of date function
2022-10-16 14:31:26 +08:00
Pxl
5c68f69362 [improvement](config) set enable_local_exchange default value to true (#13292) 2022-10-12 09:07:24 +08:00
9c64fde8f5 [tools](banchmark) upgrade date type (#13197)
upgrade date type to datev2
2022-10-09 14:17:12 +08:00
3cc06820c4 [doc](performance) performance doc and script update (#12493) 2022-09-09 09:09:49 +08:00
9d165797c5 [ci](shellcheck)shellcheck include ssb-tools and tpch-tools (#12039) 2022-08-25 11:52:13 +08:00
9cceddaeb3 [tools](ssb and tpch)optimize tools (#11975) 2022-08-24 14:29:38 +08:00
e63c83e8e1 [fix](script) Support starting BE without Java environment (#11910) 2022-08-19 17:58:40 +08:00
a98d808080 [Chore](benchmark) Fix benchmark scripts, cover case that $PASSWORD not empty (#11486)
Fix benchmark scripts, cover case that $PASSWORD not empty
2022-08-19 15:40:18 +08:00
4fa53b4cdb [chore](workflow) Add shellcheck to check shell scripts (#11744) 2022-08-18 16:07:28 +08:00
52290fed90 [tools](tpch)update queries for better performance (#11523) 2022-08-05 14:04:26 +08:00
ef65fbb4f0 [tools](ssb-flat)update script for better performance (#11505) 2022-08-04 17:43:26 +08:00
cac0dfcd63 [tools] opt tpch q9 (#11092) 2022-07-22 08:38:43 +08:00
1ca00e0107 [tools] add clickbench tools (#11009)
* [tools] add clickbench tools

Co-authored-by: stephen <hello-stephen@qq.com>
2022-07-20 17:59:04 +08:00
02460ab4a9 change date to int, good for performance (#10889) 2022-07-16 22:35:43 +08:00
23dab4cbf9 [enhancement] (SSB & TPC-H) fix test scripts, cover having PASSWORD situation (#10885) 2022-07-15 20:07:47 +08:00
1dccfa3d84 [enhancement](nereids) make SSB works (#10659)
enhancement
- refactor compute output expression on root fragment in nereids planner
- refactor aggregate plan translator
- refactor aggregate disassemble rule
- slightly refactor sort plan translator
- add exchange node on the top of plan node tree if it is needed
- slightly refactor PhysicalPlanTranslator#translatePlan

fix
- slotDescriptor should not reuse between TupleDescriptors
- expression's nullable now works fine
- remove quotes when parse string literal
- set resolvedTupleExprs in SortNode to control output
- remove the extra column in sortTupleSlotExprs in SortInfo

known issues
- aggregate function must be the top expression in output expression (need project in ExecNode in BE)
- first phase aggregate could not convert to stream mode.
- OlapScanNode do not set data partition
- Sort could not process expression like 'order by a + 1' and SortInfo generated in a trick way and should be refactor when we want to support 'order by a + 1'
- column prune do not work as expected
2022-07-11 11:33:17 +08:00
61d7724ab3 [tpch] Change all replication_num to 1 (#10244) 2022-06-19 10:42:04 +08:00
ac2be958b3 [tpch tools]set exec_mem_limit=8G for tpch queries (#10119)
Co-authored-by: Jerry <root@localhost.localdomain>
2022-06-16 18:19:11 +08:00
15e1bb448f [test] tpch q3 rewrite, change join order, make lineitem on left side (#10055)
rewrite the sql in tpch test tools
2022-06-14 17:16:33 +08:00
47ca743957 [tpch] fix q17 rewrite error (#10030)
the current tpch q17 rewrite is not equivalent to original q17. it missed one condition
2022-06-10 09:17:10 +08:00
b15b1a9f07 [tools](tpc-h) Modify tpch tools (#9982)
1. Modify the create table stmt.
2. Modify part of queries.
2022-06-08 11:21:57 +08:00
7498233601 [tools][doc]add ssb flat tools, and update performance doc (#9827) 2022-06-01 08:09:55 +08:00
92c1c82422 [tools]add ssb flat test scripts (#9796)
Co-authored-by: stephen <hello-stephen@qq.com>
2022-05-28 17:28:03 +08:00
d916ba8352 [fix](tools) Access denied when creating tpch tables (#9003) 2022-04-14 11:45:53 +08:00
da87e0c4ee optimize create tpch table statments to achieve higher performance (#8683)
optimize create tpch table statments to achieve higher performance
2022-03-29 18:19:22 +08:00
b89e4c7bba [feature-wip](java-udf) support java UDF with fixed-length input and output (#8516)
This feature is propsoed in [DSIP-1](https://cwiki.apache.org/confluence/display/DORIS/DSIP-001%3A+Java+UDF). 
This PR support fixed-length input and output Java UDF. Phase I in DIP-1 is done after this PR.

To support Java UDF effeciently, I use no data copy in JNI call and all compute operations are off-heap in Java.
To achieve that, I use a UdfExecutor instead. 

For users, a UDF class must have a public evaluate method.
2022-03-23 10:32:50 +08:00
276792daeb [feature](benchmark) Add TPC-H benchmark tools (#8408) 2022-03-20 23:06:10 +08:00
6a89b893a3 [refactor](benchmark) Change SSB create table column to NOT NULL (#7898) 2022-01-28 22:26:55 +08:00
00fe9deaeb [Benchmark] Add star schema benchmark tools (#6925)
This CL mainly changes:

1. Add star schema benchmark tools in `tools/ssb-tools`, for user to easy load and test with SSB data set.
2. Disable the segment cache for some read scenario such as compaction and alter operation.(Fix #6924 )
3. Fix a bug that `max_segment_num_per_rowset` won't work(Fix #6926)
4. Enable `enable_batch_delete_by_default` by default.
2021-10-27 09:55:36 +08:00
d587440e40 [Best Practice] Add systemd service config file (#6353)
Add Systemd serivce config file, which can manage doris sevice's start and stop,
and automatically restart them when they are unscheduled failed.
2021-08-07 21:31:43 +08:00