Commit Graph

5948 Commits

Author SHA1 Message Date
68fae88b36 improve doc spark3.2.0 (#9445) 2022-05-08 06:39:58 +08:00
1746f61388 [refactor](test) Refactor FE unit test framework that starts a FE server. (#9388)
Currently, we use `UtFrameUtils` to start a FE server in the FE unit test. 
Each test class has to do some initialization and clean up stuff with the JUnit4
`@BeforeClass` and `@AfterClass` annotation. It's redundant and boring.
Besides, almost all the APIs in `UtFrameUtils` has a `ConnectContext` parameter, which is not easy to use.

This PR proposes to use an inherit-manner, i.e., wrap all the common logic in base class `TestWithFeService`,
leveraging the 
JUnit5 `@BeforeAll` and `@AfterAll` annotation to narrow down the setup and cleanup lifecycle to each test class instance.
At the same time, the derived concrete test class could directly use utility methods inherited from the base class,
without calling a util class and passing a `ConnectContext` argument.

`UtFrameUtils` and `DorisAssert`  are marked as deprecated. We could remove these two classes
if this refactor works well for a time.
2022-05-07 21:28:42 +08:00
fd11a6b493 [fix][feature](Function) fix return type && support hll_union_agg/group_concat agg to window function (#9119) 2022-05-07 20:44:04 +08:00
e5a88dd0a4 [fix](rewrite) The where condition cannot be pushed down because there is no derivation (#8980)
Fix a bug.
The where condition cannot be pushed down because there is no derivation

eg:
select * from tb1 left join tb2 on tb1.id = tb2.id where tb2.id = 1;

The correct case is that the condition of "=1" needs to be deduced to tb1.id,
but the current implementation does not do the deduction
2022-05-07 20:41:11 +08:00
4235db8902 [refactor] some code cleanup for min/max function. (#8874) 2022-05-07 20:39:44 +08:00
9bae0a61ed [fix]Stream load 307 temporary redirection authentication information is lost (#9363) 2022-05-07 19:22:45 +08:00
816aaa7fd1 [doc]Add create java udf document (#9430)
* Add create java udf document

* Add create java udf document
2022-05-07 19:20:56 +08:00
ca92297030 [FE Code Style] Fix wrong checkstyle action configuration (#9370) 2022-05-07 19:18:00 +08:00
b6a74cfea5 [Bug][CTAS] create table by partition list (#9412)
Co-authored-by: Rongqian Li <rongqian_li@idgcapital.com>
2022-05-07 19:17:39 +08:00
ba2cc98cc0 [doc]fix typo in en faq and format pattern (#9423) 2022-05-07 19:16:59 +08:00
a5a7d1276b [DOC]Improve fe idea dev (#9431) 2022-05-07 19:15:35 +08:00
53574ce0ea [Bug] (fix) DeltaWriter::mem_consumption() coredump (#9245) 2022-05-07 19:13:08 +08:00
882987c0e7 [doc]Add hll usage documentation (#9398)
Add hll usage documentation
2022-05-07 18:56:48 +08:00
49890ce9aa [BUG][Vectorized] fix replace_if_not_null in vectorized compaction (#9376) 2022-05-07 17:16:54 +08:00
Pxl
98bfeaf560 [Enhancement] [Vectorized] Refactor and optimize BinaryOperation (#9087) 2022-05-07 10:55:15 +08:00
2ccaa6338c [enhancement](load) optimize load string data and dict page write (#9123)
* [enhancement](load) optimize load string data and dict page write
2022-05-07 10:27:27 +08:00
6f0c8fb698 [Feature] CTAS support insert data (#9271) 2022-05-07 08:51:54 +08:00
659417c5c9 [doc]add error reason for storage medium (#9390) 2022-05-07 08:46:50 +08:00
22439cb6a6 [Improvement] [compaction]Enable vectorized compaction by default (#9383) 2022-05-07 08:46:35 +08:00
0d909b73a1 [doc] add github checks doc (#9387) 2022-05-07 08:44:23 +08:00
48966c9e7c [fix][doc]style issues (#9386) 2022-05-07 08:44:08 +08:00
dce18cb325 [doc] Add window functions sql help doc (#9393) 2022-05-07 08:43:51 +08:00
f5ef179f22 [fix][doc]Chinese and English homepage link error (#9397)
Chinese and English homepage link error
2022-05-07 08:43:34 +08:00
811f019e47 [performance][query]improve the performance of DISTINCT aggregation by using flat hash set replace unordered set (#9401)
Co-authored-by: shizhiqiang03 <shizhiqiang03@meituan.com>
2022-05-07 08:43:14 +08:00
b0afa850ca [doc]Fix some path error in regression-testing doc (#9413) 2022-05-07 08:40:23 +08:00
17b95b4366 [docs]docs ddl modify (#9415) 2022-05-07 08:39:47 +08:00
e7f12db06c [fixbug][compaction] update OLAP_ERR_CUMULATIVE_NO_SUITABLE_VERSION (#9410) 2022-05-07 08:39:20 +08:00
a9831f87f2 [refactor]refactor lazy materialized (#8834)
[refactor]refactor lazy materialized (#8834)
2022-05-06 19:16:35 +08:00
c1707ca388 [feature][datax]doriswriter support timeZone (#9327) 2022-05-06 18:39:10 +08:00
ce02c661e3 [WIP-feature](Optimizer) Nereids code base (#9392)
Nereids(new optimizer) code base

Nereids is new query planner for Doris. It include three main parts: parser, analyzer and optimizer.
The parser, generated by ANTLR4, transforms SQL into a logical plan with a tree structure. Analysis and optimization are performed on the logical plan of the tree structure. Each transformation is defined as a rule. The rule is applied to the logical plan using pattern matching. The implementation of the optimizer follows the approach in the Cascades paper.
2022-05-06 16:22:29 +08:00
edc833ab76 [Bug][stream-vec-load] Null data load do not skip the same place data (#9360)
Co-authored-by: lihaopeng <lihaopeng@baidu.com>
2022-05-06 16:07:45 +08:00
7af79e1df5 [Feature][dbt] add partition_type support (#9389) 2022-05-06 15:27:34 +08:00
86b7717fe2 [doc]Add binlog to load and import the doris table and the original table … (#9317) 2022-05-06 15:22:55 +08:00
e130d2f233 [fix][compaction] Rowset::end_version null pointer(#9379) 2022-05-06 14:40:08 +08:00
e3b90de2d5 remove file result writer from result sink (#9378) 2022-05-06 02:37:20 +08:00
a5f9031c89 [improvement](hive) Support hive with HA HDFS. Pass ha configuration through hive create table properties. (#9151)
Doris couldn't resolve the defaultFS of HDFS with HA configuration, so it could query hive table on HA HDFS.
This is because there's no way to send the HA configs to hive external table.

Describe the overview of changes.
Pass the ha configs to hive external table through create table properties.

Usage:
Example of creating hive table with ha configuration properties:

CREATE TABLE region (
r_regionkey integer NOT NULL,
r_name char(25) NOT NULL,
r_comment varchar(152)
) engine=hive properties
("database"="default",
"table"="region",
"hive.metastore.uris"="thrift://172.21.16.11:7004",
"dfs.nameservices"="hacluster",
"dfs.ha.namenodes.hacluster"="3,4",
"dfs.namenode.rpc-address.hacluster.3"="192.168.0.93:8020",
"dfs.namenode.rpc-address.hacluster.4"="172.21.16.11:8020", "dfs.client.failover.proxy.provider.hacluster"="org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider");
2022-05-05 23:43:11 +08:00
2970f9842e [doc] The description of the metadata design document is not clear (#9367) 2022-05-05 20:48:32 +08:00
e5d4cf01ed [fix](ut) fix a potential memory leak in BE ut (#9362) 2022-05-05 20:47:31 +08:00
ae0f6b17d8 [doc]fix geo funtion st_distance_sphere (#9341)
add ST_Distance_Sphere function parameter values range restrictions
2022-05-05 20:46:22 +08:00
cec09c18f9 [doc][community] add more collaborator and modify documents (#9320)
1. Add collaborator morrySnow
2. Add document: How To Be A Committer
3. Add guidance to install clang-format 11
2022-05-05 20:46:03 +08:00
e6c3557b1b [improvement](regression-test) Support suite plugin to add third-part… (#9294)
Support register suite plugin to add third-party function.

See 
1. register in: ${DORIS_HOME}/regression-test/plugins/plugin_example.groovy
2. usage: ${DORIS_HOME}/regression-test/suites/demo/test_plugin.groovy
3. doc: ${DORIS_HOME}/docs/zh-CN/developer-guide/regression-testing.md
2022-05-05 20:45:45 +08:00
6d1c300241 [improvement](odbc) support more ODBC Connection Parameters for odbc external table (#9198)
user can add more supported ODBC Connection Parameters to resource PROPERTIES or external table PROPERTIES
2022-05-05 20:45:13 +08:00
e222a50c42 [fix](backup) Remove colocate_with property when backing up a table (#9142)
We currently not support backup table with colocation property.
So that we have to remove colocate_with property from a table when backing up.
2022-05-05 20:44:27 +08:00
0604ecba17 [fixbug][metadata] catalog could not load from image (#9364)
* [fixbug][catalog] catalog could not load from image

* fix ut failed

Co-authored-by: yiguolei <yiguolei@gmail.com>
2022-05-05 09:58:01 +08:00
7d6645f8f1 [doc] update member.md (#9361) 2022-05-05 09:57:23 +08:00
512d8784d9 [improvement](load) Add http header size parameter (#9357)
Add the http header size parameter to avoid failure due to too many fields when users import using stream load.
The normal default is 8192, and 10K is given here.
2022-05-05 09:56:53 +08:00
ac5f4d6c7e [docs] Fix some typos in documentation. (#9356) 2022-05-05 09:56:02 +08:00
a33191e222 [fix](memtracker) DCHECK failed in vetorized exec engine fold constant execute (#9354) 2022-05-05 09:55:38 +08:00
832338c55e [improvement] set name for scanner threads and fix compile error in clang (#9336) 2022-05-05 09:53:43 +08:00
4ca3eb7fe2 [improvement](regression-test) support exclude suite/group/directory (#9096)
regression testing framework support skip some suite/group/directory
2022-05-05 09:50:07 +08:00