Commit Graph

1213 Commits

Author SHA1 Message Date
8a49c7ef04 [chore] Rename Doris binary output format 2022-06-24 15:30:05 +08:00
f15d84335c [website][doc] Modify image path (#10361) 2022-06-24 09:12:20 +08:00
ad8da109c3 [community] update PMC & Committer list (#10360) 2022-06-24 09:11:49 +08:00
0cbf39d879 [doc] Fix typo (#10333) 2022-06-24 09:10:59 +08:00
c288bb363a [f][docs] Restore images in new official website. (#10262) 2022-06-24 08:58:29 +08:00
573ad57467 [doc]Added Iceberg 0.13.2 version support (#10312)
Added 0.13.2 version support
2022-06-23 09:33:26 +08:00
98b3306e05 [docs]add key words for helps (#10263)
* add key words for helps

* add key words for helps

* add key words for helps
2022-06-22 14:41:15 +08:00
b913d59560 [docs] aes docs fix (#10251)
* fix aes docs

* update keywords inside aes.md

Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-06-22 14:40:40 +08:00
e49fa6075f [doc] fix wrong number (#10305)
Co-authored-by: stephen <hello-stephen@qq.com>
2022-06-22 08:58:08 +08:00
60d43dc730 [docs]update the correct link (#10318)
Co-authored-by: smallhibiscus <844981280>
2022-06-22 08:51:01 +08:00
d18808d0eb [docs] Add some user case to user list (#10256)
* [docs] Add user case to user list
2022-06-21 09:25:57 +08:00
75719bca92 [doc](website)Remove incubator prefix and add graduate note (#10257) 2022-06-20 17:48:29 +08:00
087fc596b1 [feature] add remote storage policy config for create table properties (#10159)
Add a remote storage policy config to the create table properties. It sets the storage policy for a table and its partitions in `CREATE TABLE` and `ALTER TABLE`.
This policy is used when a partition is migrated from local to remote storage.
Syntax:
1.
`CREATE TABLE TblPxy1
(...)
ENGINE=olap
DISTRIBUTED BY HASH (aa) BUCKETS 1
PROPERTIES(
    "remote_storage_policy" = "testPolicy3"
);`
2.
`ALTER TABLE TblPxy01 SET ("remote_storage_policy" = "testPolicy3");`
3.
`ALTER TABLE TblPxy01 MODIFY PARTITION p2 SET ("remote_storage_policy" = "testPolicy3");`
2022-06-20 12:42:23 +08:00
185de4dd43 [docs]update develop document (#10242) 2022-06-20 09:24:49 +08:00
9a1f1c3864 [improvement](variables) change session variable when set global variable (#10238)
Currently, setting a variable with the `global` keyword does not affect the
value of that variable in the current session, which often confuses users.

This CL mainly changes:

1. Change the session variable when the global variable is set
2022-06-20 09:05:50 +08:00
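The behavior described in commit 9a1f1c3864 can be illustrated with a small Python model. This is only a sketch of the semantics stated in the commit message, not the Doris FE implementation: the `Session` class and `set_variable` function are hypothetical, `exec_mem_limit` is used purely as an illustrative variable name, and the assumption that new sessions start from a copy of the global values is ours.

```python
class Session:
    """Hypothetical client session that holds its own copy of the variables."""

    def __init__(self, global_vars):
        # Assumption: a new session starts from a copy of the global values.
        self.vars = dict(global_vars)


def set_variable(name, value, session, global_vars, is_global=False):
    """Model of SET [GLOBAL] name = value after the change in this commit."""
    if is_global:
        # The global value is updated for sessions created later.
        global_vars[name] = value
    # After the change, the current session is updated in both cases.
    session.vars[name] = value


global_vars = {"exec_mem_limit": 2147483648}
current = Session(global_vars)
set_variable("exec_mem_limit", 4294967296, current, global_vars, is_global=True)
print(current.vars["exec_mem_limit"], global_vars["exec_mem_limit"])
```

Before the change, the `is_global` branch alone would have run, leaving `current.vars` at its old value until the user reconnected.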
e09066c7ee [Improvement] delete deprecated config in document and regression test (#10231) 2022-06-19 18:16:59 +08:00
8439adad05 [doc] update array functions docs' location (#10226)
Move the docs about array functions to the correct directory,
because the docs directory has already been refactored.

```
docs/en/sql-manual/sql-functions/array-functions/    ===>
docs/en/docs/sql-manual/sql-functions/array-functions

```
```
docs/zh-CN/sql-manual/sql-functions/array-functions/    ===>
docs/zh-CN/docs/sql-manual/sql-functions/array-functions/
```
2022-06-19 10:40:40 +08:00
1d3496c6ab [feature] support backup/restore connect to HDFS (#10081) 2022-06-19 10:26:20 +08:00
b7b78ae707 [style](fe)the last step of fe CheckStyle (#10134)
1. fix all checkstyle warning
2. change all checkstyle rules to error
3. remove some java doc rules
    a. RequireEmptyLineBeforeBlockTagGroup
    b. JavadocStyle
    c. JavadocParagraph
4. suppress some rules for old codes
    a. all java doc rules only affect on Nereids
    b. DeclarationOrder only affect on Nereids
    c. OverloadMethodsDeclarationOrder only affect on Nereids
    d. VariableDeclarationUsageDistance only affect on Nereids
    e. suppress OneTopLevelClass on org/apache/doris/load/loadv2/dpp/ColumnParser.java
    f. suppress OneTopLevelClass on org/apache/doris/load/loadv2/dpp/SparkRDDAggregator.java
    g. suppress LineLength on org/apache/doris/catalog/FunctionSet.java
    h. suppress LineLength on org/apache/doris/common/ErrorCode.java
2022-06-17 21:02:45 +08:00
fea815f290 [doc](website)Replace CDN files with local files (#10212)
Replace CDN files with local files
2022-06-17 20:58:56 +08:00
f35b235c3b [opt](compaction) optimize compaction in concurrent load (#10153)
Add some logic to optimize compaction:
1. Separate base and cumulative compaction, in case base compaction runs too long and
affects cumulative compaction.
2. Fix the level size in cumulative compaction so that files below 64M get the right level
size. When choosing rowsets to compact, the policy ignores big rowsets;
this reduces CPU usage by about 25% under high-frequency concurrent load.
3. Remove the skip window restriction so that a rowset can be compacted right after it is
generated, because rowsets will not be deleted after compaction. This greatly
reduces the compaction score under concurrent load.
4. Remove the version consistency check in can_do_compaction. Consecutive rowsets are
chosen for compaction, so this check is unnecessary.

With the logic above, the compaction score and CPU cost improve substantially
under concurrent load.

Co-authored-by: yixiutt <yixiu@selectdb.com>
2022-06-17 17:49:45 +08:00
5e47b03595 [feature-wip](array-type) Add array aggregation functions (#10108) 2022-06-17 11:07:49 +08:00
de86c0dd25 [doc](website)fix algolia search bug (#10196) 2022-06-17 08:51:28 +08:00
44e979e43b [Vectorized][Function] add orthogonal bitmap agg functions (#10126)
* [Vectorized][Function] add orthogonal bitmap agg functions
save some file about orthogonal bitmap function
add some file to rebase
update functions file

* refactor union_count function
refactor orthogonal union count functions

* remove bool is_variadic
2022-06-17 08:48:41 +08:00
f1c9105af1 [feature] Support hive on s3 (#10128)
Support querying Hive tables on S3. Pass the AK/SK, region, and S3 endpoint to the Hive table when creating the external table.

Example create table SQL:
```
CREATE TABLE `region_s3` (
`r_regionkey` integer NOT NULL,
`r_name` char(25) NOT NULL,
`r_comment` varchar(152) )
engine=hive
properties
("database"="default",
"table"="region_s3",
"hive.metastore.uris"="thrift://127.0.0.1:9083",
"AWS_ACCESS_KEY"="YOUR_ACCESS_KEY",
"AWS_SECRET_KEY"="YOUR_SECRET_KEY",
"AWS_ENDPOINT"="s3.us-east-1.amazonaws.com",
"AWS_REGION"="us-east-1");
```
2022-06-16 19:15:46 +08:00
41b693e1df [test] Add window cast bitmap digital_masking function regression test. (#9924) 2022-06-16 19:14:51 +08:00
dad953bc08 [doc](website)fix SSR bug and add algolia search (#10178)
* fix ssr bug and add algolia search
2022-06-16 14:25:46 +08:00
9217223cc5 [doc] update sequence en and zh-CN doc. (#10164)
* update sequence en and zh-CN doc.
2022-06-16 09:32:52 +08:00
dff1f09406 [doc](website)update Chinese home page text (#10168)
update Chinese home page text
2022-06-16 08:04:21 +08:00
ca88f258d9 [improvement] remove unused codes and docs for SHOW USER (#10107)
* remove unused codes and docs for `SHOW USER`
2022-06-15 21:49:08 +08:00
c4871fb306 [doc](website)remove translate warning from Chinese docs (#10157)
* modify home page text
2022-06-15 18:17:37 +08:00
4005b34a52 [doc] add tpc-h benchmark (#10150)
[doc] add tpc-h benchmark
2022-06-15 16:43:10 +08:00
606c32cc30 [doc](website)add translate warning in docs (#10152)
* fix docs bugs with sidebar can not display and some style problems
2022-06-15 14:51:53 +08:00
96b54dd1d5 [doc](website)modify home page text and navbar (#10148)
* fix docs bugs with sidebar can not display and some style problems
2022-06-15 12:21:40 +08:00
c4d0fba713 Add storage policy for remote storage migration (#9997) 2022-06-15 11:00:06 +08:00
4c24586865 [Vectorized][UDF] support java-udaf (#9930) 2022-06-15 10:53:44 +08:00
7ab64f9155 [doc][website]update home page content and add slack button (#10091)
* fix docs bugs with sidebar can not display and some style problems
2022-06-15 09:31:40 +08:00
34ea6ce850 [doc]Added be enable_stream_load_record configuration description (#10130) 2022-06-15 08:14:47 +08:00
be3aa2aa37 [enhancement](community): polish doc to reformat (#10137) 2022-06-15 08:14:13 +08:00
f7b5f36da4 [feature] Support read hive external table and outfile into HDFS that authenticated by kerberos (#9579)
At present, Doris can only access a Hadoop cluster with Kerberos authentication enabled through the broker; Doris BE itself
does not support access to Kerberos-authenticated HDFS files.

This PR aims to solve that problem.

When creating a Hive external table, users only need to specify the following properties to access HDFS data with Kerberos authentication enabled:

```sql
CREATE EXTERNAL TABLE t_hive (
k1 int NOT NULL COMMENT "",
k2 char(10) NOT NULL COMMENT "",
k3 datetime NOT NULL COMMENT "",
k5 varchar(20) NOT NULL COMMENT "",
k6 double NOT NULL COMMENT ""
) ENGINE=HIVE
COMMENT "HIVE"
PROPERTIES (
'hive.metastore.uris' = 'thrift://192.168.0.1:9083',
'database' = 'hive_db',
'table' = 'hive_table',
'dfs.nameservices'='hacluster',
'dfs.ha.namenodes.hacluster'='n1,n2',
'dfs.namenode.rpc-address.hacluster.n1'='192.168.0.1:8020',
'dfs.namenode.rpc-address.hacluster.n2'='192.168.0.2:8020',
'dfs.client.failover.proxy.provider.hacluster'='org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider',
'dfs.namenode.kerberos.principal'='hadoop/_HOST@REALM.COM',
'hadoop.security.authentication'='kerberos',
'hadoop.kerberos.principal'='doris_test@REALM.COM',
'hadoop.kerberos.keytab'='/path/to/doris_test.keytab'
);
```

To `select into outfile` to an HDFS cluster with Kerberos authentication enabled, refer to the following SQL statement:

```sql
select * from test into outfile "hdfs://tmp/outfile1" 
format as csv
properties
(
'fs.defaultFS'='hdfs://hacluster/',
'dfs.nameservices'='hacluster',
'dfs.ha.namenodes.hacluster'='n1,n2',
'dfs.namenode.rpc-address.hacluster.n1'='192.168.0.1:8020',
'dfs.namenode.rpc-address.hacluster.n2'='192.168.0.2:8020',
'dfs.client.failover.proxy.provider.hacluster'='org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider',
'dfs.namenode.kerberos.principal'='hadoop/_HOST@REALM.COM',
'hadoop.security.authentication'='kerberos',
'hadoop.kerberos.principal'='doris_test@REALM.COM',
'hadoop.kerberos.keytab'='/path/to/doris_test.keytab'
);
```
2022-06-14 20:07:03 +08:00
eb4d0f508a [doc] Add docs for SHOW TABLETS (#10105)
* add docs for SHOW TABLETS

* update

* add more examples for SHOW TABLETS

Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-06-14 15:29:46 +08:00
dc4761593b [docs] Add common error messages to doris backup (#10048) 2022-06-14 09:20:04 +08:00
d58e00c49c [fix](brpc) Embed serialized request into the attachment and transmit it through http brpc (#9803)
When the length of `Tuple/Block data` is greater than 2G, serialize the protoBuf request and embed the
`Tuple/Block data` into the controller attachment and transmit it through http brpc.

This is to avoid errors when the length of the protoBuf request exceeds 2G:
`Bad request, error_text=[E1003]Fail to compress request`.

In #7164, `Tuple/Block data` was put into attachment and sent via default `baidu_std brpc`,
but when the attachment exceeds 2G, it will be truncated. There is no 2G limit for sending via `http brpc`.

Also, #7921 considered putting `Tuple/Block data` into the attachment by default, as this theoretically
saves one serialization and improves performance. However, testing showed that performance did not improve,
while the memory peak increased due to an additional memory copy.
2022-06-13 20:41:48 +08:00
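The size-based choice described in commit d58e00c49c can be sketched as follows. This is a rough illustration only: the function name `choose_protocol` is hypothetical, "2G" is assumed to mean 2 GiB, and the assumption that smaller payloads stay on `baidu_std` is inferred from the commit message rather than taken from the Doris BE code.

```python
# Assumption: "2G" in the commit message is read as 2 GiB.
TWO_GIB = 2 * 1024 ** 3


def choose_protocol(data_len: int) -> str:
    """Pick a brpc protocol name for a Tuple/Block payload of data_len bytes.

    Payloads above the limit go over http brpc, which the commit message
    says has no 2G limit; everything else is assumed to use baidu_std.
    """
    return "http" if data_len > TWO_GIB else "baidu_std"


print(choose_protocol(64 * 1024 * 1024))
print(choose_protocol(3 * 1024 ** 3))
```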
6abc3da467 [doc] add description of [DEFAULT CURRENT_TIMESTAMP] (#10053) 2022-06-13 09:10:46 +08:00
27b3c76ce0 [fix]Document image path error (#10050)
Document image path error
2022-06-10 14:34:26 +08:00
4a474420c8 [feature](function) Add ntile function (#9867)
Add the ntile function.
For the non-vectorized engine, it is implemented as in Impala: ntile is rewritten to row_number and count.
For the vectorized engine, WindowFunctionNTile is implemented.
2022-06-10 10:32:40 +08:00
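To show why row_number and count are enough to compute ntile, as the rewrite in commit 4a474420c8 relies on, here is a minimal Python sketch of the usual NTILE semantics (the first `count % num_buckets` buckets receive one extra row). The function `ntile_bucket` is hypothetical and is not the expression that Doris or Impala actually generates.

```python
def ntile_bucket(row_number: int, count: int, num_buckets: int) -> int:
    """Return the 1-based bucket of a row, given its 1-based row_number
    within a partition of `count` rows split into `num_buckets` buckets."""
    small = count // num_buckets   # size of the smaller buckets
    extra = count % num_buckets    # number of buckets holding one extra row
    boundary = extra * (small + 1) # last row that falls in a larger bucket
    if row_number <= boundary:
        return (row_number - 1) // (small + 1) + 1
    # Rows past the boundary exist only when small >= 1, so this division is safe.
    return extra + (row_number - boundary - 1) // small + 1


# Ten rows split into three buckets: sizes 4, 3, 3.
print([ntile_bucket(i, 10, 3) for i in range(1, 11)])
```

When there are fewer rows than buckets, `small` is 0 and every row falls at or before `boundary`, so each row gets its own bucket.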
98c214e43a [doc] add create table statement for create-materialized-view doc (#10029) 2022-06-10 09:16:46 +08:00
1a31af0d85 [doc] Optimize cluster upgrade and binlog load document modification, restore adds common errors (#10027)
1.restore adds common errors
2.Optimize cluster upgrade documentation
3.Modify binlog documentation
2022-06-10 09:16:22 +08:00
a45bf53af3 [website][doc]fix docs bugs with sidebar can not display and some style problems (#10039)
* fix docs bugs with sidebar can not display and some style problems
2022-06-10 08:49:37 +08:00
6fab1cbf3c [feature-wip](array-type) Add array functions size and cardinality (#9921)
Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>
2022-06-09 15:03:03 +08:00