doris

Author	SHA1	Message	Date
Gabriel	5c7964d396	[minor](tools) delete unused script (#14752 )	2022-12-02 15:20:01 +08:00
minghong	5ca6596ca3	[fix](tpch-tools) disable join reorder for Q12 in TPC-H tools (#14728 )	2022-12-01 20:55:17 +08:00
Gabriel	b12ac90d8f	[tools](tpch) upgrade decimal type to decimalv3 (#14665 )	2022-11-30 08:41:06 +08:00
minghong	0613ccda74	[feature](tools)profile viewer (#14429 ) It is a painful work to read profile, especially there are multi-parallel instances. This tool helps us to grasp the main information of profile in a graphical view. The profile is represented by a tree. Sql operation nodes contains operation type(join, scan...), its node id, its fragment id. The number on the arrow edge means how many rows output by child node. This tool will sum the output rows of the same node in multi-parallel instances, that is if there are 4 parallel instance, and each ScanNode on lineitem table output 10 rows, the label on the arrow beginning with ScanNode(lineitem) is 40. Here is a demo for tpch Q2 tpch q2 profile viewer Issue Number: close #xxx	2022-11-21 10:29:54 +08:00
Dongyang Li	1482ab32b6	[tools](tpch)fix invalid download url (#14329 )	2022-11-19 13:29:33 +08:00
minghong	9b50888aaf	[feature](Nereids) prune runtime filters which cannot reduce the tuple number of probe table (#13990 ) 1. add a post processor: runtime filter pruner Doris generates RFs (runtime filter) on Join node to reduce the probe table at scan stage. But some RFs have no effect, because its selectivity is 100%. This pr will remove them. A RF is effective if a. the build column value range covers part of that of probe column, OR b. the build column ndv is less than that of probe column, OR c. the build column's ColumnStats.selectivity < 1, OR d. the build column is reduced by another RF, which satisfies above criterions. 2. explain graph a. add RF info in Join and Scan node b. add predicate count in Scan node 3. Rename session variable rename `enable_remove_no_conjuncts_runtime_filter_policy` to `enable_runtime_filter_prune` 4. fix min/max column stats derive bug `select max(A) as X from T group by B` X.min is A.min, not A.max	2022-11-11 13:13:29 +08:00
HappenLee	75b6b267ea	[opt](ssb) Add query hint for the SSB queries (#14089 )	2022-11-09 08:37:31 +08:00
minghong	4bfa95f669	[enhancement](tools) opt tpch q21: change join order (#13699 )	2022-10-27 16:55:23 +08:00
HappenLee	bd2b803019	[opt](tpch) opt the q9 of tpch to change instance to 8 (#13656 )	2022-10-26 09:28:51 +08:00
Adonis Ling	20ade4ae96	[chore](macOS) Disable JAVA UDF temporarily (#13563 ) Fail to start BE (ASAN) if it was built with JAVA UDF on macOS.	2022-10-22 01:05:45 +08:00
Gabriel	038ccab3c4	[benchmark](rewrite) add hints for q20 (#13561 )	2022-10-21 22:35:50 +08:00
HappenLee	f0b608018b	[config](tpch) Disable jemalloc and change the hint of tpch q22 (#13555 )	2022-10-21 21:35:43 +08:00
Gabriel	c4b5ba2a4f	[Regression](java-udf) Move source code used by Java UDF test case (#13476 )	2022-10-19 21:05:06 +08:00
HappenLee	6ea9a65bb6	[Opt](vec) opt runtime filter for TPCH Q22 (#13339 )	2022-10-17 10:30:07 +08:00
Pxl	632670a49c	[Enhancement](function) refactor of date function (#13362 ) refactor of date function	2022-10-16 14:31:26 +08:00
Pxl	5c68f69362	[improvement](config) set enable_local_exchange default value to true (#13292 )	2022-10-12 09:07:24 +08:00
Gabriel	9c64fde8f5	[tools](banchmark) upgrade date type (#13197 ) upgrade date type to datev2	2022-10-09 14:17:12 +08:00
Dongyang Li	3cc06820c4	[doc](performance) performance doc and script update (#12493 )	2022-09-09 09:09:49 +08:00
Dongyang Li	9d165797c5	[ci](shellcheck)shellcheck include ssb-tools and tpch-tools (#12039 )	2022-08-25 11:52:13 +08:00
Dongyang Li	9cceddaeb3	[tools](ssb and tpch)optimize tools (#11975 )	2022-08-24 14:29:38 +08:00
Adonis Ling	e63c83e8e1	[fix](script) Support starting BE without Java environment (#11910 )	2022-08-19 17:58:40 +08:00
yew1eb	a98d808080	[Chore](benchmark) Fix benchmark scripts, cover case that $PASSWORD not empty (#11486 ) Fix benchmark scripts, cover case that $PASSWORD not empty	2022-08-19 15:40:18 +08:00
Adonis Ling	4fa53b4cdb	[chore](workflow) Add shellcheck to check shell scripts (#11744 )	2022-08-18 16:07:28 +08:00
Dongyang Li	52290fed90	[tools](tpch)update queries for better performance (#11523 )	2022-08-05 14:04:26 +08:00
Dongyang Li	ef65fbb4f0	[tools](ssb-flat)update script for better performance (#11505 )	2022-08-04 17:43:26 +08:00
Dongyang Li	cac0dfcd63	[tools] opt tpch q9 (#11092 )	2022-07-22 08:38:43 +08:00
Dongyang Li	1ca00e0107	[tools] add clickbench tools (#11009 ) * [tools] add clickbench tools Co-authored-by: stephen <hello-stephen@qq.com>	2022-07-20 17:59:04 +08:00
Dongyang Li	02460ab4a9	change date to int, good for performance (#10889 )	2022-07-16 22:35:43 +08:00
Henry2SS	23dab4cbf9	[enhancement] (SSB & TPC-H) fix test scripts, cover having PASSWORD situation (#10885 )	2022-07-15 20:07:47 +08:00
morrySnow	1dccfa3d84	[enhancement](nereids) make SSB works (#10659 ) enhancement - refactor compute output expression on root fragment in nereids planner - refactor aggregate plan translator - refactor aggregate disassemble rule - slightly refactor sort plan translator - add exchange node on the top of plan node tree if it is needed - slightly refactor PhysicalPlanTranslator#translatePlan fix - slotDescriptor should not reuse between TupleDescriptors - expression's nullable now works fine - remove quotes when parse string literal - set resolvedTupleExprs in SortNode to control output - remove the extra column in sortTupleSlotExprs in SortInfo known issues - aggregate function must be the top expression in output expression (need project in ExecNode in BE) - first phase aggregate could not convert to stream mode. - OlapScanNode do not set data partition - Sort could not process expression like 'order by a + 1' and SortInfo generated in a trick way and should be refactor when we want to support 'order by a + 1' - column prune do not work as expected	2022-07-11 11:33:17 +08:00
Gabriel	61d7724ab3	[tpch] Change all `replication_num` to 1 (#10244 )	2022-06-19 10:42:04 +08:00
Dongyang Li	ac2be958b3	[tpch tools]set exec_mem_limit=8G for tpch queries (#10119 ) Co-authored-by: Jerry <root@localhost.localdomain>	2022-06-16 18:19:11 +08:00
minghong	15e1bb448f	[test] tpch q3 rewrite, change join order, make lineitem on left side (#10055 ) rewrite the sql in tpch test tools	2022-06-14 17:16:33 +08:00
minghong	47ca743957	[tpch] fix q17 rewrite error (#10030 ) the current tpch q17 rewrite is not equivalent to original q17. it missed one condition	2022-06-10 09:17:10 +08:00
Mingyu Chen	b15b1a9f07	[tools](tpc-h) Modify tpch tools (#9982 ) 1. Modify the create table stmt. 2. Modify part of queries.	2022-06-08 11:21:57 +08:00
Dongyang Li	7498233601	[tools][doc]add ssb flat tools, and update performance doc (#9827 )	2022-06-01 08:09:55 +08:00
Dongyang Li	92c1c82422	[tools]add ssb flat test scripts (#9796 ) Co-authored-by: stephen <hello-stephen@qq.com>	2022-05-28 17:28:03 +08:00
ZenoYang	d916ba8352	[fix](tools) Access denied when creating tpch tables (#9003 )	2022-04-14 11:45:53 +08:00
minghong	da87e0c4ee	optimize create tpch table statments to achieve higher performance (#8683 ) optimize create tpch table statments to achieve higher performance	2022-03-29 18:19:22 +08:00
Gabriel	b89e4c7bba	[feature-wip](java-udf) support java UDF with fixed-length input and output (#8516 ) This feature is propsoed in [DSIP-1](https://cwiki.apache.org/confluence/display/DORIS/DSIP-001%3A+Java+UDF). This PR support fixed-length input and output Java UDF. Phase I in DIP-1 is done after this PR. To support Java UDF effeciently, I use no data copy in JNI call and all compute operations are off-heap in Java. To achieve that, I use a UdfExecutor instead. For users, a UDF class must have a public evaluate method.	2022-03-23 10:32:50 +08:00
Dongyang Li	276792daeb	[feature](benchmark) Add TPC-H benchmark tools (#8408 )	2022-03-20 23:06:10 +08:00
HappenLee	6a89b893a3	[refactor](benchmark) Change SSB create table column to NOT NULL (#7898 )	2022-01-28 22:26:55 +08:00
Mingyu Chen	00fe9deaeb	[Benchmark] Add star schema benchmark tools (#6925 ) This CL mainly changes: 1. Add star schema benchmark tools in `tools/ssb-tools`, for user to easy load and test with SSB data set. 2. Disable the segment cache for some read scenario such as compaction and alter operation.(Fix #6924 ) 3. Fix a bug that `max_segment_num_per_rowset` won't work(Fix #6926) 4. Enable `enable_batch_delete_by_default` by default.	2021-10-27 09:55:36 +08:00
CenterCode	d587440e40	[Best Practice] Add systemd service config file (#6353 ) Add Systemd serivce config file, which can manage doris sevice's start and stop, and automatically restart them when they are unscheduled failed.	2021-08-07 21:31:43 +08:00
caoyang10	0eb54007be	[Bug]External engines(e.g. ES) don't have segments, ignore those tables (#4671 ) It will crash when there are External engine tables in doris after executing "python show_segment_status.py" The main reason is external engine tables don't have any index and partitions. So it should be ignored.	2020-09-28 13:39:05 +08:00
Mingyu Chen	67b842ce04	[License] Organize and modify the license of the code (#4371 ) 1. Disable the MySQL client and LZO library by default when building the Doris. MySQL client library is used for MySQL external table feature. This feature will be replaced by the new ODBC external table soon. LZO library is used to compress/decompress data of some old data format of Doris, which is no longer used anymore. 2. Add missing license to some files. 3. For all non-Apache-License code, all are explained in NOTICE file and the corresponding license is declared. 4. Remove the js source code from webroot, it will be downloaded as thirdparty	2020-08-24 21:51:55 +08:00
Mingyu Chen	c6ac60bab9	[SegmentV2] Optimize the upgrade logic of SegmentV2 (#3340 ) This CL mainly made the following modifications: 1. Reorganized SegmentV2 upgrade document. 2. When the variable `use_v2_rollup` is set to true, the base rollup in v2 format is forcibly queried for verifying the data. 3. Fix a problem that there is no persistent storage format information in the schema change operation that performs v2 conversion. 4. Allow users to directly create v2 format tables.	2020-04-21 10:45:29 +08:00
LingBin	036ef5bcb9	Add a tool to show segment status (#2260 ) In order to be aware of the convert process from AlphaRowset to BetaRowset, we need a mechanism to know the process of convert.	2019-11-27 11:35:16 +08:00
kangpinghuang	1e2a4c3b9b	Fix tablet restore api in BE(#1623 ) (#1624 )	2019-08-13 09:34:24 +08:00
李超勇	1ba8a4ee4e	Transform row-oriented table to columnar-oriented table (#311 )	2018-11-16 16:03:56 +08:00

50 Commits