doris

Author	SHA1	Message	Date
Gabriel	851886cc18	[minor](datev2) remove datev2 because datev2 is used by default (#19777 )	2023-05-18 13:36:11 +08:00
zhangdong	b129c9901b	[improvement](FQDN)Change the implementation of fqdn (#19123 ) Main changes: 1. If fqdn is enabled in the configuration file, when fe starts, localAddr will obtain fqdn instead of IP, priority_ Networks will fail 2. The IP and host names of Backend and Front are combined into one field, host. When fqdn is enabled, it represents the host name, and when not enabled, it represents the IP address 3. The communication between clusters directly uses fqdn, and various Connection pool add authentication mechanisms to prevent the IP address of the domain name from changing and the connection between nodes from making errors 4. No longer requires polling to verify if the IP has changed, delete fqdnManager 5. Change the method of verifying the legitimacy of nodes between FEs from obtaining client IP to displaying the identity of the transmitting node itself in the HTTP request header or the message body of the throttle 6. When processing the heartbeat, if BE finds that the host stored by itself is inconsistent with the host stored by the master, after verifying the legitimacy of the host, it will change its own host instead of directly reporting an error 7. Simplify the generation logic of fe name Scope of influence: 1. Establishing communication connections between clusters 2. Determine whether it is the same node through attributes such as IP 3. Print Log 4. Information display 5. Address Splicing 6. k8s deployment 7. Upgrade compatibility Test plan: 1. Change the IP address of the node, while keeping the fqdn unchanged, change the IP addresses of fe and be, and verify whether the cluster can read and write data normally 2. Use the master code to generate metadata, and use the previous metadata on the current pr to verify whether it is compatible with the old version (upgrading is no longer supported if fqdn has been enabled before) 3. Deploy fe and be clusters using k8s to verify whether the cluster can read and write data normally 4. According to https://doris.apache.org/zh-CN/docs/dev/admin-manual/cluster-management/fqdn?_highlight=fqdn#%E6%97%A7%E9%9B%86%E7%BE%A4%E5%90%AF%E7%94%A8fqdn Upgrading old clusters 5. Use streamload to specify the fqdn of fe and be to import data separately 6. Use different users to start transactions and write data using insert statements	2023-05-11 00:44:48 +08:00
Gabriel	208d21b01d	[tools](tpch) use origin TPCH qurries (#19479 )	2023-05-10 14:29:45 +08:00
Dongyang Li	6c21df6324	[tools](tpch) run mode like clickbench (#19339 )	2023-05-06 23:33:26 +08:00
Pxl	ec517a53a8	[Chore](build) upgrade clang-format version to 16 && move thrift to fe-common (#19155 ) upgrade clang-format version to 16 move thrift to fe-common fix core dump on pipeline engine when operator canceled and not prepared	2023-04-28 14:14:51 +08:00
TengJianPing	34ce946f5b	[tools](profile) add script file to get all tree profiles off a query (#18587 ) Add a tool script that output query profiles of all fragment instances in tree form.	2023-04-22 22:10:57 +08:00
谢健	63994e351f	[fix](Nereids) extract complicated time string in evaluating cost model framework (#17864 ) 1. The time string in the profile can be "xx s xx ms". The framework should extract time with re package to support more complicated time string 2. Add stats for sortNode and AggNode in `withChildren`	2023-04-07 15:33:04 +08:00
WenYao	c3fe113894	rename PaloFe to DorisFE (#18167 )	2023-03-29 00:30:16 +08:00
AKIRA	34dc7e57c1	[ehancement](stats) Tune for stats framework (#18035 ) 1. Estimate timearithmeticexpr instead of setting Double.MAX Double.MIN directly 2. Enable histogram to derive stats 3. Loose the condition for histogram usage 4. Improve the accuracy for agg on TPC-H 1G greatly 5. Fix avg qerror calculation	2023-03-23 16:03:58 +08:00
AKIRA	f600f70619	[ehancement](fe) Tune for stats framework (#17860 )	2023-03-22 11:07:56 +08:00
AKIRA	bece027135	[ehancement](profile) Add HTTP interface for q-error (#17786 ) 1. Add Http interface for query q-error 2. Fix the selectivity calculation of inner join, it would always be 0 if there is only one join condition before	2023-03-16 12:19:23 +08:00
Dongyang Li	67b7128e8a	[tools](tpcds) fix bug of generating and loading data (#17835 ) --------- Co-authored-by: stephen <hello_stephen@@qq.com>	2023-03-16 11:59:39 +08:00
Gabriel	079e6a3e12	[regression-test](vectorized) remove unused vectorization flag (#17662 )	2023-03-15 17:59:22 +08:00
Dongyang Li	cd7e03575b	[tools](tpc-ds) add script tools to run tpc-ds conveniently (#17366 ) build-tpcds-tools.sh gen-tpcds-data.sh gen-tpcds-queries.sh create-tpcds-tables.sh load-tpcds-data.sh run-tpcds-queries.sh generate data and queries support specify SCALE, create table may need to be edited handly to specify BUCKETS or change int to bigint if SCALE is too big. --------- Co-authored-by: stephen <hello_stephen@@qq.com>	2023-03-03 08:24:07 +08:00
谢健	e3d7f7c8d8	[feature](Nereids) add test framework for cost model (#17071 ) add test-frame-work for cost model according paper Testing the Accuracy of Query Optimizers	2023-02-28 20:59:07 +08:00
ZhangJian He	8e179d3a54	[minor][typo] fix typo in load-clickbench-data script (#17133 )	2023-02-26 10:56:04 +08:00
Adonis Ling	e7f9819168	[chore](tools) Fix NoSuchMethodError while loading data by http requests (#17075 ) When we used the tool multi-fe to start multiple FEs cluster and loaded data by stream load way, the request failed. See the following log. The issue was caused by the netty libraries. There are multiple netty libraries in classpath and the FE used the newer version netty library which made these errors.	2023-02-25 12:28:35 +08:00
ZhangYu0123	56ebbf8bc9	[chore](tools) fix load-clickbench-data script cannot be interrupted #17000	2023-02-22 19:34:40 +08:00
Adonis Ling	0b3e18d060	[chore](macOS) Support LLVM Clang 15 (#16991 ) Remove the deprecated classes std::codecvt_utf8_utf16<char16_t> and std::wstring_convert. Use libiconv to convert UTF-8 strings to UTF-16LE ones.	2023-02-22 15:04:48 +08:00
Adonis Ling	0950a08efd	[chore](tools) Support starting multiple FEs on single node (#16787 ) Introduce a tool to start multiple FEs on single node. Use case: ``` $ ./multi-fe ./multi-fe start\|stop\|clean [OPTIONS ...] start -n <NUM> -l <LIBRARY_PATH> -p <BASE_PORT> Start the FE cluster. -n The number of FEs. -l The FE library path (default: doris/output/fe/lib) -p The base port to generate all needed ports (default: 9030). stop Stop the FE cluster. clean Stop the data (rm -rf "$(pwd)"/fe*). ```	2023-02-21 10:55:36 +08:00
奕冷	cf739e7496	[Enhancement](Stmt) Set insert_into timeout session variable separately (#16343 )	2023-02-12 16:56:10 +08:00
zbtzbtzbt	1b2f882d24	[fix](terminal) remove echo database passwd (#15876 ) * remove echo passwd * add timer for data load	2023-01-16 22:00:52 +08:00
HappenLee	816e12db6a	[Bench](mem) some benchmark over the query limit (#15408 )	2022-12-28 09:29:53 +08:00
Gabriel	5c7964d396	[minor](tools) delete unused script (#14752 )	2022-12-02 15:20:01 +08:00
minghong	5ca6596ca3	[fix](tpch-tools) disable join reorder for Q12 in TPC-H tools (#14728 )	2022-12-01 20:55:17 +08:00
Gabriel	b12ac90d8f	[tools](tpch) upgrade decimal type to decimalv3 (#14665 )	2022-11-30 08:41:06 +08:00
minghong	0613ccda74	[feature](tools)profile viewer (#14429 ) It is a painful work to read profile, especially there are multi-parallel instances. This tool helps us to grasp the main information of profile in a graphical view. The profile is represented by a tree. Sql operation nodes contains operation type(join, scan...), its node id, its fragment id. The number on the arrow edge means how many rows output by child node. This tool will sum the output rows of the same node in multi-parallel instances, that is if there are 4 parallel instance, and each ScanNode on lineitem table output 10 rows, the label on the arrow beginning with ScanNode(lineitem) is 40. Here is a demo for tpch Q2 tpch q2 profile viewer Issue Number: close #xxx	2022-11-21 10:29:54 +08:00
Dongyang Li	1482ab32b6	[tools](tpch)fix invalid download url (#14329 )	2022-11-19 13:29:33 +08:00
minghong	9b50888aaf	[feature](Nereids) prune runtime filters which cannot reduce the tuple number of probe table (#13990 ) 1. add a post processor: runtime filter pruner Doris generates RFs (runtime filter) on Join node to reduce the probe table at scan stage. But some RFs have no effect, because its selectivity is 100%. This pr will remove them. A RF is effective if a. the build column value range covers part of that of probe column, OR b. the build column ndv is less than that of probe column, OR c. the build column's ColumnStats.selectivity < 1, OR d. the build column is reduced by another RF, which satisfies above criterions. 2. explain graph a. add RF info in Join and Scan node b. add predicate count in Scan node 3. Rename session variable rename `enable_remove_no_conjuncts_runtime_filter_policy` to `enable_runtime_filter_prune` 4. fix min/max column stats derive bug `select max(A) as X from T group by B` X.min is A.min, not A.max	2022-11-11 13:13:29 +08:00
HappenLee	75b6b267ea	[opt](ssb) Add query hint for the SSB queries (#14089 )	2022-11-09 08:37:31 +08:00
minghong	4bfa95f669	[enhancement](tools) opt tpch q21: change join order (#13699 )	2022-10-27 16:55:23 +08:00
HappenLee	bd2b803019	[opt](tpch) opt the q9 of tpch to change instance to 8 (#13656 )	2022-10-26 09:28:51 +08:00
Adonis Ling	20ade4ae96	[chore](macOS) Disable JAVA UDF temporarily (#13563 ) Fail to start BE (ASAN) if it was built with JAVA UDF on macOS.	2022-10-22 01:05:45 +08:00
Gabriel	038ccab3c4	[benchmark](rewrite) add hints for q20 (#13561 )	2022-10-21 22:35:50 +08:00
HappenLee	f0b608018b	[config](tpch) Disable jemalloc and change the hint of tpch q22 (#13555 )	2022-10-21 21:35:43 +08:00
Gabriel	c4b5ba2a4f	[Regression](java-udf) Move source code used by Java UDF test case (#13476 )	2022-10-19 21:05:06 +08:00
HappenLee	6ea9a65bb6	[Opt](vec) opt runtime filter for TPCH Q22 (#13339 )	2022-10-17 10:30:07 +08:00
Pxl	632670a49c	[Enhancement](function) refactor of date function (#13362 ) refactor of date function	2022-10-16 14:31:26 +08:00
Pxl	5c68f69362	[improvement](config) set enable_local_exchange default value to true (#13292 )	2022-10-12 09:07:24 +08:00
Gabriel	9c64fde8f5	[tools](banchmark) upgrade date type (#13197 ) upgrade date type to datev2	2022-10-09 14:17:12 +08:00
Dongyang Li	3cc06820c4	[doc](performance) performance doc and script update (#12493 )	2022-09-09 09:09:49 +08:00
Dongyang Li	9d165797c5	[ci](shellcheck)shellcheck include ssb-tools and tpch-tools (#12039 )	2022-08-25 11:52:13 +08:00
Dongyang Li	9cceddaeb3	[tools](ssb and tpch)optimize tools (#11975 )	2022-08-24 14:29:38 +08:00
Adonis Ling	e63c83e8e1	[fix](script) Support starting BE without Java environment (#11910 )	2022-08-19 17:58:40 +08:00
yew1eb	a98d808080	[Chore](benchmark) Fix benchmark scripts, cover case that $PASSWORD not empty (#11486 ) Fix benchmark scripts, cover case that $PASSWORD not empty	2022-08-19 15:40:18 +08:00
Adonis Ling	4fa53b4cdb	[chore](workflow) Add shellcheck to check shell scripts (#11744 )	2022-08-18 16:07:28 +08:00
Dongyang Li	52290fed90	[tools](tpch)update queries for better performance (#11523 )	2022-08-05 14:04:26 +08:00
Dongyang Li	ef65fbb4f0	[tools](ssb-flat)update script for better performance (#11505 )	2022-08-04 17:43:26 +08:00
Dongyang Li	cac0dfcd63	[tools] opt tpch q9 (#11092 )	2022-07-22 08:38:43 +08:00
Dongyang Li	1ca00e0107	[tools] add clickbench tools (#11009 ) * [tools] add clickbench tools Co-authored-by: stephen <hello-stephen@qq.com>	2022-07-20 17:59:04 +08:00

1 2

73 Commits