Files
doris/tools/tpch-tools
AKIRA e32eba8fdf [refactor](stats) Persist status of analyze task to FE meta data (#20264)
1. In the past, we use a BE table named `analysis_jobs` to persist the status of analyze jobs/tasks, however there are many flaws such as, if BE crashed analyze job/task would failed however the status of analyze job/task couldn't get updated.
2. Support `DROP ANALYZE JOB [job_id]` to delete analyze job
3. Support `SHOW ANALYZE TASK STATUS [job_id] ` to  get the task status of specific job
4. Restrict the execute condition of auto analyze, only when  the  last execution of auto analyze job finished a while ago could be executed again
5. Support analyze whole DB
2023-06-02 12:33:31 +08:00
..

Usage

These scripts are used to make tpc-h test. follow the steps below:

1. build tpc-h dbgen tool.

./bin/build-tpch-dbgen.sh

2. generate tpc-h data. use -h for more infomations.

./bin/gen-tpch-data.sh -s 1

3. create tpc-h tables. modify conf/doris-cluster.conf to specify doris info, then run script below.

./bin/create-tpch-tables.sh

4. load tpc-h data. use -h for help.

./bin/load-tpch-data.sh

5. run tpc-h queries.

./bin/run-tpch-queries.sh

NOTICE: At present, Doris's query optimizer and statistical information functions are not complete, so we rewrite some queries in TPC-H to adapt to Doris' execution framework, but it does not affect the correctness of the results. The rewritten SQL is marked with "Modified" in the corresponding .sql file.

A new query optimizer will be released in subsequent releases.