Commit Graph

3057 Commits

Author SHA1 Message Date
4870fd47fc [Docs] Fix README.md (#6084) 2021-06-24 09:44:14 +08:00
2998373354 [Bug] Fix bug that select into outfile in parquet format may cause NPE (#6054)
1. check the parquet schema property on FE side.
2. auto generate parquet schema if not specified.
2021-06-23 11:33:47 +08:00
c8899ee5bd [Build][ARM] Fix some compilation problems on ARM64 (#6076)
1. Disable libhdfs3 on ARM, because it doesn't support ARM now.
2. Add compilation doc for ARM64
2021-06-23 09:38:16 +08:00
72d1a3b39c fix spring boot web maximum upload file limit config (#6070)
Co-authored-by: zouxinyi <zouxinyi@baidu.com>
2021-06-22 10:46:26 +08:00
b9ad34736d [Feature] Support recording custom number of backup and restore task information (#5947)
* Record all backup jobs and support where clause
2021-06-22 09:19:54 +08:00
abcd56c6c8 [Enhance] Support show unrecoverable tablets (#6045)
* [Enhance] Support show unrecoverable tablets

The unrecoverable tablets are tablets for which none of the replicas is healthy.
We should be able to find these tablets so that manual intervention is possible.

And these tablets should not be added to the tablet scheduler.
2021-06-22 09:19:12 +08:00
68bab73c35 [Bug] Fix randomly selected storage path possibly staying the same for a long time (#6062)
random_shuffle generates the same random sequence when called multiple times.
Although we shuffle twice, when the size relationship between adjacent numbers
does not change, the result of the second shuffle does not change either.
2021-06-20 16:16:32 +08:00
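The failure mode described in this commit can be illustrated with a small sketch. It is written in Python rather than the C++ of the actual patch, and it assumes the repeated order comes from restarting the generator from the same state on every call. `pick_order_reseeded`, `pick_order_persistent`, and the path names are hypothetical and not taken from the Doris code.

```python
import random

# Hypothetical storage paths, for illustration only.
PATHS = ["/data1", "/data2", "/data3", "/data4", "/data5", "/data6"]


def pick_order_reseeded(paths, seed=0):
    """Shuffle with a generator rebuilt from the same seed on every call.

    Each call replays the same random sequence, so the order never changes.
    """
    rng = random.Random(seed)
    shuffled = list(paths)
    rng.shuffle(shuffled)
    return shuffled


# One generator kept alive across calls, so its state keeps advancing.
_shared_rng = random.Random(0)


def pick_order_persistent(paths):
    """Shuffle with the long-lived generator instead of a fresh one."""
    shuffled = list(paths)
    _shared_rng.shuffle(shuffled)
    return shuffled
```

Calling `pick_order_reseeded(PATHS)` repeatedly always yields the same order, while repeated calls to `pick_order_persistent(PATHS)` draw from an advancing state.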
882ebd3d7d [Bug] Fix show data bug (#6060) 2021-06-20 16:15:54 +08:00
5b2d07ca2f [Bug] Fix disk TotalUsedPct display error (#6059)
Fix TotalUsedPct display error
2021-06-20 16:15:39 +08:00
5dabf0bef5 [Alter] validate data file after alter operation success (#6022)
Co-authored-by: wangxixu <wangxixu@xiaomi.com>
2021-06-20 16:15:14 +08:00
1d796d9aa4 [Bug] Fix bug that routine load job may cause dead lock (#6058)
Make sure that the routine load job's lock is released after the txn is aborted.
2021-06-20 16:14:47 +08:00
fe0912f6e5 [SQL] Compatible with mysql nulls order by (#6043) 2021-06-20 16:12:52 +08:00
bf2423c91a [httpv2] Spring boot http upload file maximum limit parameterization (#6013)
Make spring.servlet.multipart.max-file-size and spring.servlet.multipart.max-request-size configurable.
2021-06-20 16:10:54 +08:00
9bc2df43a7 [Bug][Export] Fix bug that one extra record is shown by "show export limit n" (#6012) 2021-06-20 16:10:21 +08:00
4fe8bdfe1d [Doc] Update install-deploy.md (#5968)
Improve the doc to avoid errors when installing BE in a Hadoop cluster
2021-06-20 16:09:13 +08:00
bff6ede94e add data size field for partition cache (#6026)
Co-authored-by: wangxixu <wangxixu@xiaomi.com>
2021-06-18 11:40:00 +08:00
1999a0c26b [optimization] open gcc strict-aliasing optimization (#6034)
* open gcc strict-aliasing optimization

* use -Werror=strict-aliasing
2021-06-18 11:39:24 +08:00
48bd680068 update download info for boost and datatables (#6008) 2021-06-18 11:38:41 +08:00
5cfe081b05 [Bug] Remove duplicate memtracker (#6041)
* [Enhance] Remove duplicate memtracker

This problem will cause frequent creation of memtracker and affect query concurrency.
2021-06-18 11:28:37 +08:00
ff47dc750d [Bug] Fix thread safety issues and the missing error message when setting the status of a non-existent replica (#6019)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-18 10:50:47 +08:00
0ddd5da926 [DOC]Organize FE configuration file description (#5975)
* Organize FE configuration file description

* Delete redundant numbers

* Add two configuration parameters of spring boot upload file

* Add configuration instructions

* Fix typos

* Add English documentation of BE configuration

* Modify style

* Modify punctuation

* Correct the errors in the text

* Modify some ads and content issues
2021-06-18 09:22:29 +08:00
99d8110972 [Bug-fix] Fix wrong data distribution judgment (#6029)
* [Bug-fix] Fix wrong data distribution judgment

The Fragment where OlapScanNode is located has three possible data distributions:
1. UNPARTITIONED: The scan range of OlapScanNode contains only one instance (BE).
2. RANDOM: The OlapScanNode involves a multi-partition table.
3. HASH_PARTITIONED: The table involved is in a colocate group.

For a multi-partition table, although the data in each individual partition is distributed according to the bucketing column,
the same bucketing column values in different partitions are not necessarily on the same BE.
So the data distribution is RANDOM.

If Doris wrongly plans RANDOM as HASH_PARTITIONED, it produces an incorrect colocate agg node,
and the query result is incorrect.
2021-06-18 09:21:46 +08:00
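The three cases listed in this commit can be sketched as a small decision function. This is an illustration in Python, not the FE planner code: the function name, its parameters, and the order of the checks are assumptions. The commit message does not say how a single-partition table outside a colocate group is treated, so the sketch conservatively reports RANDOM for it.

```python
from enum import Enum


class DataPartition(Enum):
    UNPARTITIONED = "UNPARTITIONED"
    RANDOM = "RANDOM"
    HASH_PARTITIONED = "HASH_PARTITIONED"


def judge_distribution(num_scan_instances: int, in_colocate_group: bool) -> DataPartition:
    """Hypothetical helper mirroring the three cases from the commit message."""
    # Case 1: the scan range touches a single instance (BE).
    if num_scan_instances == 1:
        return DataPartition.UNPARTITIONED
    # Case 3: tables in a colocate group are reported as hash partitioned.
    if in_colocate_group:
        return DataPartition.HASH_PARTITIONED
    # Case 2: a multi-partition table outside a colocate group is RANDOM.
    # Assumption: any other non-colocated table is also reported as RANDOM,
    # so the sketch never claims a colocation it cannot justify.
    return DataPartition.RANDOM
```

The point of the fix is visible in the last branch: reporting HASH_PARTITIONED there would let the planner build a colocate agg node over data that is not actually colocated.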
9f52f4f9e5 fix stream load error msg missing (#6050)
Co-authored-by: weizuo <weizuo@xiaomi.com>
2021-06-18 09:21:12 +08:00
d7e62e361f [Bug] Fix that build thirdparty of parallel-hashmap-1.33 failed on ubuntu18.04 (#6033)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-17 14:45:34 +08:00
d57c2344e1 [MemTracker] Refactored the hierarchical structure of memtracker (#5956)
To avoid showing too many memtrackers on BE web pages,
MemTracker now has 3 levels: OVERVIEW, TASK and VERBOSE.

OVERVIEW is mainly used for the main memory-consuming modules such as Query/Load/Metadata.
TASK is mainly used to record the memory overhead of a single task such as a single query, load, or compaction task.
VERBOSE is used for other more detailed memtrackers.
2021-06-16 09:44:24 +08:00
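A minimal sketch of how such levels could limit what a web page shows. It is written in Python for illustration. The numeric values, the `visible_trackers` helper, and the sample tracker names are assumptions. Only the three level names come from the commit message.

```python
from enum import IntEnum


class MemTrackerLevel(IntEnum):
    # Level names are from the commit message; the numeric values are assumed.
    OVERVIEW = 0
    TASK = 1
    VERBOSE = 2


# Hypothetical trackers as (label, level) pairs.
TRACKERS = [
    ("Query", MemTrackerLevel.OVERVIEW),
    ("Load", MemTrackerLevel.OVERVIEW),
    ("query-1234", MemTrackerLevel.TASK),
    ("hash-table-build", MemTrackerLevel.VERBOSE),
]


def visible_trackers(trackers, max_level):
    """Return the labels of trackers at or below the requested detail level."""
    return [label for label, level in trackers if level <= max_level]
```

With `max_level=MemTrackerLevel.OVERVIEW` only the module-level trackers are listed. Raising the level to TASK or VERBOSE progressively reveals more detail.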
0145bdb1f0 [Doc] Fix a typo (#6025)
Fix a typo in udaf-orthogonal-bitmap-manual.md
2021-06-16 09:41:32 +08:00
d2c1cddd55 [Bug-fix] Avoid using 'QueryDetail' in planning stage (#6018)
QueryDetail is used to collect statistics about the current query.
This property is only set when the query starts to execute.
So in the query planning stage, using this attribute in the first query causes a 'NullPointerException'.
After that, this attribute retains the value of the previous query
until it is updated by the subsequent process.
Because the 'colocateagg' code uses this attribute incorrectly during planning,
it causes a 'NullPointerException' when clients like pymysql connect to Doris and send the first query.
Fixed #6017
2021-06-16 09:40:53 +08:00
800c2c41bd [Docs] update data-model-rollup.md create table ddl (#6014)
update data-model-rollup.md create table ddl
2021-06-16 09:40:38 +08:00
bde60280b8 [Optimize] use string_view instead of std::string in string function (#6010) 2021-06-16 09:40:13 +08:00
daf8ce29ca [Bug] Fix bucket shuffle bug when left table is without any data (#5965) 2021-06-16 09:39:31 +08:00
8b4721c941 [Bug] Fix kafka consumer reuse bug (#6007)
When judging whether a consumer can be reused, it is necessary to check whether the parameter contents are equal.
2021-06-16 09:39:05 +08:00
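The idea of comparing parameter contents can be sketched as follows. This is a Python illustration, not the BE code: `ConsumerParams`, its fields, and `can_reuse` are hypothetical names chosen for the example.

```python
from dataclasses import dataclass, field


@dataclass
class ConsumerParams:
    """Hypothetical description of what a consumer was created with."""
    brokers: str
    topic: str
    custom_properties: dict = field(default_factory=dict)


def can_reuse(existing: ConsumerParams, requested: ConsumerParams) -> bool:
    """Allow reuse only when every parameter value matches.

    The dataclass-generated __eq__ compares field contents, including the
    keys and values of custom_properties, rather than object identity.
    """
    return existing == requested
```

Two parameter objects built separately but holding the same values compare equal, while a difference in any property value blocks reuse.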
6d6c3d9703 [Enhancement] Reduce memory consumption by releasing readers earlier (#5811)
We create multiple rowset readers to read the data of one tablet.
After a rowset reader has reached EOF, it can be released to
reduce resource (typically memory) consumption.
Likewise, a segment reader can be released when it reaches EOF.
2021-06-16 09:37:50 +08:00
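A minimal sketch of releasing a reader as soon as it hits EOF, in Python for illustration. The `RowsetReader` class and `read_tablet` function are hypothetical. The sketch also reads the readers one after another, which is a simplification and not necessarily how the storage engine combines them.

```python
class RowsetReader:
    """Hypothetical reader holding resources until close() is called."""

    def __init__(self, name, rows):
        self.name = name
        self._rows = iter(rows)
        self.closed = False

    def next_row(self):
        # None signals EOF in this sketch, so rows themselves must not be None.
        return next(self._rows, None)

    def close(self):
        self._rows = None  # drop the buffered data
        self.closed = True


def read_tablet(readers):
    """Yield all rows, closing each reader immediately after its EOF."""
    for reader in readers:
        while (row := reader.next_row()) is not None:
            yield row
        # Release now instead of waiting for the whole tablet scan to finish.
        reader.close()
```

Because `close()` runs right after a reader is exhausted, at most the readers that still have data keep their resources while the scan continues.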
d0b60541af [Bug] Fix use of incorrect table name in expand star (#6003)
SelectStmt uses new TableName(null, tableRef.getAlias()) to expand the star expression. tableRef.getAlias() returns the full name, including the database name and table name.
Using it as the table name generates wrong SQL in CreateViewStmt.
This patch fixes the problem by using the correct database name and table name in the expand star method.
2021-06-15 14:18:00 +08:00
54c7d177f8 [Log] Fix a log issue in BDBJournalCursor (#6006) 2021-06-10 17:39:25 +08:00
d33a6d1b98 [Function] Support date function: yearweek(), week(), makedate(). (#6000) 2021-06-10 17:38:25 +08:00
80220af271 [Enhancement] Use Parallel Hash Map to Replace Unordered Map in Dict Encoding Map and Hyper Set (#5990)
Use parallel hash map to replace unordered map in the dict encoding map and hyper set to improve performance
2021-06-10 17:38:08 +08:00
206a711f9b [Bug] SimplifyInvalidDateBinaryPredicatesDateRule may cause invalid query plan (#5987)
1. "where 1k > to_date(now())" will return EMPTYSET in query plan.
2. DateLiteral should accept date string like "2021-6-1".
2021-06-10 17:37:26 +08:00
97ed59780d [Bug] Outer join dispose constexpr error in inlineview (#5986) 2021-06-10 17:36:29 +08:00
e7a7b8d2d1 [Bug] Fix bug that start time is null when SQL is forwarded to master (#5966) 2021-06-10 17:34:59 +08:00
6106cc7d96 [Doris On ES][Bug-Fix] split es publish_address if it has host (#5955) 2021-06-10 17:34:44 +08:00
e245aee33e [Feature] Select outfile support parquet format (#5938)
`Select outfile into` currently only supports exporting data in CSV format.
This patch extends the feature to support parquet format.

Usage:
LocalFile:
```
SELECT citycode FROM table1 INTO OUTFILE "file:///root/doris/" FORMAT AS PARQUET PROPERTIES 
("schema"="required,int32,siteid;", "parquet.compression"="snappy");
```

BrokerFile:
```
SELECT siteid FROM table1 INTO OUTFILE "hdfs://host/test_sql_prc_2019_02_19/" FORMAT AS PARQUET
PROPERTIES ( 
"broker.name" = "hdfs_broker",
"broker.hadoop.security.authentication" = "kerberos",
"broker.kerberos_principal" = "test",
"broker.kerberos_keytab_content" = "base64" ,
"schema"="required,int32,siteid;"
);
```

Field `schema` is required; it defines the schema of the parquet file.
Properties with the prefix `parquet.` are parquet file properties, such as compression, version, and enable_dictionary.
2021-06-10 17:34:01 +08:00
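To make the shape of the `schema` property concrete, here is a hedged Python sketch that splits a value such as `"required,int32,siteid;"` into its parts. The field order (repetition, type, column name) is read off the examples in this commit message, and treating `;` as a separator between several columns is an assumption. `parse_parquet_schema` is a hypothetical helper, not part of Doris.

```python
def parse_parquet_schema(schema: str):
    """Split a schema property string into one dict per column."""
    columns = []
    for entry in schema.split(";"):
        entry = entry.strip()
        if not entry:
            continue  # skip the empty piece after a trailing ';'
        parts = [part.strip() for part in entry.split(",")]
        if len(parts) != 3:
            raise ValueError(f"expected 'repetition,type,name', got {entry!r}")
        repetition, data_type, name = parts
        columns.append({"repetition": repetition, "type": data_type, "name": name})
    return columns
```

For the value used in the examples, the helper returns a single column description with repetition `required`, type `int32`, and name `siteid`.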
4d64612b96 [ARRAY]Save array's size instead of offset. (#5983)
* Save array's size instead of offset.

* Optimize variable name

* Fix comment
2021-06-10 12:32:58 +08:00
ad365b3b64 [Bug] Fix bug that cannot cancel alter table operation when table is unstable (#5998)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-09 15:00:17 +08:00
d9c128b744 [BrokerLoad] Support read properties for broker load when read data (#5845)
* [BrokerLoad] support read properties for broker load when read data

Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-09 14:59:55 +08:00
c95bebfa5c [Bug] Ignore drop table log when table has been dropped (#5973)
Although the table lock controls simultaneous modification of the table by different threads,
it cannot control a drop of the table by another thread,
for example, when a drop table and a table update occur at the same time:

1. get table object by thread 1
2. drop table by thread 2 with table lock
3. update table object by thread 1

The above sequence is possible.
In this case, step 3 actually operates on a table that no longer exists, which eventually causes a NullPointerException.

In fact, a modify-table log that comes after the drop table can be ignored, because it is meaningless to modify information on a table that no longer exists.

Fixed #5687
2021-06-09 13:00:16 +08:00
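The "ignore the log when the table is gone" behavior can be sketched as below. This is a Python illustration under assumptions: `Catalog`, `replay_modify_table`, and the dict-based table representation are hypothetical and do not mirror the FE classes.

```python
class Catalog:
    """Hypothetical in-memory catalog mapping table ids to property dicts."""

    def __init__(self):
        self.tables = {}

    def drop_table(self, table_id):
        self.tables.pop(table_id, None)

    def replay_modify_table(self, table_id, new_properties) -> bool:
        """Apply a modify-table log entry, skipping it if the table was dropped.

        Returns True when the entry was applied and False when it was ignored.
        """
        table = self.tables.get(table_id)
        if table is None:
            # The table no longer exists, so the modification is meaningless.
            return False
        table.update(new_properties)
        return True
```

Checking for a missing table first turns the race described above into a harmless no-op instead of a dereference of a missing object.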
2cc5f776ce Corrected a typo in the Chinese document (#5978) 2021-06-08 16:44:25 +08:00
af34f877e9 Update zh-CN ALTER TABLE.md (#5982)
Add a note that the distribution column cannot be modified
2021-06-08 16:44:13 +08:00
8ea0ae4792 [Docker] update docker hub url to official repo (#5972) 2021-06-06 22:08:21 +08:00
60062d97da [Enhance] convert byte size into a human-readable format (#5970) 2021-06-06 22:07:58 +08:00
04cc6eaadc [Log] Fix a mistake in DorisDynamicOutputFormat.java (#5963)
Fix a mistake in DorisDynamicOutputFormat.java
2021-06-06 22:06:57 +08:00