Commit Graph

3040 Commits

Author SHA1 Message Date
48bd680068 update download info for boost and datatables (#6008) 2021-06-18 11:38:41 +08:00
5cfe081b05 [Bug] Remove duplicate memtracker (#6041)
* [Enhanece] Remove duplicate memtracker

This problem will cause frequent creation of memtracker and affect query concurrency.
2021-06-18 11:28:37 +08:00
ff47dc750d [Bug] Fix problem for thread safety issues and setting the status of non-existent replica does not prompt any error message (#6019)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-18 10:50:47 +08:00
0ddd5da926 [DOC]Organize FE configuration file description (#5975)
* Organize FE configuration file description

Organize FE configuration file description

* Delete redundant numbers

Delete redundant numbers

* Add two configuration parameters of spring boot upload file

Add two configuration parameters of spring boot upload file

* Add configuration instructions

Add configuration instructions

* Fix typos

Fix typos

* Add English documentation of BE configuration

Add English documentation of BE configuration

* Modify style

Modify style

* Modify punctuation

Modify punctuation

* Correct the errors in the text

Correct the errors in the text

* Modify some ads and content issues

Modify some ads and content issues
2021-06-18 09:22:29 +08:00
99d8110972 [Bug-fix] Fix wrong data distribution judgment (#6029)
* [Bug-fix] Fix wrong data distribution judgment

The Fragment where OlapScanNode is located has three data distribution possibilities.
1. UNPARTITIONED: The scan range of OlapScanNode contains only one instance(BE)
2. RANDOM: Involving multi-partitioned tables in OlapScanNode.
3. HASH_PARTITIONED: The involving table is in the colocate group.

For a multi-partition table, although the data in each individual partition is distributed according to the bucketing column,
the same bucketing column between different partitions is not necessarily in the same be.
So the data distribution is RANDOM.

If Doris wrongly plan RANDOM as HASH_PARTITIONED, it will lead to the wrong colocate agg node.
The result of query is incorrect.
2021-06-18 09:21:46 +08:00
9f52f4f9e5 fix stream load error msg missing (#6050)
Co-authored-by: weizuo <weizuo@xiaomi.com>
2021-06-18 09:21:12 +08:00
d7e62e361f [Bug] Fix that build thirdparty of parallel-hashmap-1.33 failed on ubuntu18.04 (#6033)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-17 14:45:34 +08:00
d57c2344e1 [MemTracker] Refactored the hierarchical structure of memtracker (#5956)
To avoid showing too many memtracker on BE web pages.
The MemTracker level now has 3 levels: OVERVIEW, TASK and VERBOSE.

OVERVIEW Mainly used for main memory consumption module such as Query/Load/Metadata.
TASK is mainly used to record the memory overhead of a single task such as a single query, load, and compaction task.
VERBOSE is used for other more detailed memtrackers.
2021-06-16 09:44:24 +08:00
0145bdb1f0 [Doc] Fix a typo (#6025)
Fix a typo in udaf-orthogonal-bitmap-manual.md
2021-06-16 09:41:32 +08:00
d2c1cddd55 [Bug-fix] Avoid using 'QueryDetail' in planning stage (#6018)
QueryDetail is used to statistic the current query details.
This property will only be set when the query starts to execute.
So in the query planning stage, using this attribute in the first query will cause 'NullPointerException'.
After that, this attribute retains the value of the previous query
until it is updated by the subsequent process.
Because code of 'colocateagg' uses this attribute incorrectly in its planning,
it causes 'NullPointerException' when clients like pymysql connect to doris and send the first query.
Fixed #6017
2021-06-16 09:40:53 +08:00
800c2c41bd [Docs] update data-model-rollup.md create table ddl (#6014)
update data-model-rollup.md create table ddl
2021-06-16 09:40:38 +08:00
bde60280b8 [Optimize] use string_view instead of std::string in string function (#6010) 2021-06-16 09:40:13 +08:00
daf8ce29ca [Bug] Fix bucket shuffle bug when left table is without any data (#5965) 2021-06-16 09:39:31 +08:00
8b4721c941 [Bug] Fix kafka consumer reuse bug (#6007)
When judging whether consumer can be reused, it is necessary to judge whether the parameter content is equal.
2021-06-16 09:39:05 +08:00
6d6c3d9703 [Enhancement] Reduce memory consumption by releasing readers earier (#5811)
We created multiple rowset readers to read data of one tablet,
after one rowset reader has reached EOF, it can be released to
reduce resource (typically memory) consumption.
As the same, we can release segment reader when it reach EOF.
2021-06-16 09:37:50 +08:00
d0b60541af [Bug] fix use uncorrect table name in expand star (#6003)
SelectStmt use new TableName(null, tableRef.getAlias()) to expand star expression. tableRef.getAlias() is full name include database name and table name. 
Using it as table name will generate wrong sql in CreateViewStmt. 
This patch fix this problem and use correct database name and table name in expand star method.
2021-06-15 14:18:00 +08:00
54c7d177f8 [Log] Fix a log issue in BDBJournalCursor (#6006) 2021-06-10 17:39:25 +08:00
d33a6d1b98 [Function] Support date function: yearweek(), week(), makedate(). (#6000) 2021-06-10 17:38:25 +08:00
80220af271 [Enhancement] Use Parallel Hash Map Replace Unordered Map In Dict Encodeing Map And Hyper Set (#5990)
Use Parallel Hash Map Replace Unordered Map In Dict Encodeing Map And Hyper Set To Improve Ferformance
2021-06-10 17:38:08 +08:00
206a711f9b [Bug] SimplifyInvalidDateBinaryPredicatesDateRule may cause invalid query plan (#5987)
1. "where 1k > to_date(now())" will return EMPTYSET in query plan.
2. DateLiteral should accept date string like "2021-6-1".
2021-06-10 17:37:26 +08:00
97ed59780d [Bug] Outer join dispose constexpr error in inlineview (#5986) 2021-06-10 17:36:29 +08:00
e7a7b8d2d1 [Bug] Fix bug that start time is null when SQL is forward to master (#5966) 2021-06-10 17:34:59 +08:00
6106cc7d96 [Doris On ES][Bug-Fix] split es publish_address if it has host (#5955) 2021-06-10 17:34:44 +08:00
e245aee33e [Feature] Select outfile support parquet format (#5938)
`Select outfile into` currently only supports to export data with CSV format.
This patch extends the feature to supports parquet format.

Usage:
LocaFile:
```
SELECT citycode FROM table1 INTO OUTFILE "file:///root/doris/" FORMAT AS PARQUET PROPERTIES 
("schema"="required,int32,siteid;", "parquet.compression"="snappy");
```

BrokerFile:
```
SELECT siteid FROM table1 INTO OUTFILE "hdfs://host/test_sql_prc_2019_02_19/" FORMAT AS PARQUET
PROPERTIES ( 
"broker.name" = "hdfs_broker",
"broker.hadoop.security.authentication" = "kerberos",
"broker.kerberos_principal" = "test",
"broker.kerberos_keytab_content" = "base64" ,
"schema"="required,int32,siteid;"
);
```

Field `schema` is required, which defines the schema of a parquet file.
Prefix `parquet.` is the parquet file properties, like compression, version, enable_dictionary.
2021-06-10 17:34:01 +08:00
4d64612b96 [ARRAY]Save array's size instead of offset. (#5983)
* Save array's size instead of offset.

* Optimize variable name

* Fix comment
2021-06-10 12:32:58 +08:00
ad365b3b64 [Bug] Fix bug that cannot cancel alter table operation when table is unstable (#5998)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-09 15:00:17 +08:00
d9c128b744 [BrokerLoad] Support read properties for broker load when read data (#5845)
* [BrokerLoad] support read properties for broker load when read data

Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-06-09 14:59:55 +08:00
c95bebfa5c [Bug] Ignore drop table log when table has been dropped (#5973)
Although the table lock can control the simultaneous modification of the table by different threads.
But it cannot control the drop operation of the table by other threads.
For example, when drop table and table update occur at the same time.

get table object by thread 1
drop table by thread 2 with table lock
update table object by thread 1
The above process is possible.
At this time, step 3 actually operates a table that no longer exists, which will eventually cause NullPointerException.

In fact, the modified table log after the drop table can be ignored. The reason is that it is meaningless to modify information on a table that no longer exists.

Fixed #5687
2021-06-09 13:00:16 +08:00
2cc5f776ce Corrected a typo in the Chinese document (#5978) 2021-06-08 16:44:25 +08:00
af34f877e9 Update zh-CN ALTER TABLE.md (#5982)
Add distribution column cannot be modified
2021-06-08 16:44:13 +08:00
8ea0ae4792 [Docker] update docker hub url to official repo (#5972) 2021-06-06 22:08:21 +08:00
60062d97da [Enhance] convert byte size into a human-readable format (#5970) 2021-06-06 22:07:58 +08:00
04cc6eaadc [Log] Fix a mistake in DorisDynamicOutputFormat.java (#5963)
Fix a mistake DorisDynamicOutputFormat.java
2021-06-06 22:06:57 +08:00
4b23bca2be [Refactor] catch throwable instead of exception (#5942)
Co-authored-by: 孙忠健(20113660) <sunzj10@ke.com>
2021-06-06 22:06:02 +08:00
61af76b8fb [Log] fix log error when commit transaction in txn manager (#5937)
Co-authored-by: weizuo <weizuo@xiaomi.com>
2021-06-06 22:05:40 +08:00
f1e881e6f0 [Bug-fix] Show view stmt report error when view references invalid table (#5919) 2021-06-06 22:05:22 +08:00
a5c35eb20f [Bug] Fix the bug of null pointer exception of colocate join (#5961) 2021-06-04 10:19:58 +08:00
3ca6e386c7 [Function] Support Function current_timestamp(), curtime(), current_time() in FE. (#5946)
Support Function `current_timestamp()`, `curtime()`, `current_time()` in FE to do constant fold.
2021-06-03 18:39:19 +08:00
d790cc6a50 [BUG] Fixed the problem that substring function may access illegal address (#5952) 2021-06-03 18:38:10 +08:00
4ef1dbf394 [Bug] Fix lack of rdlock before rowset_with_max_version() in compaction log (#5953) 2021-06-03 10:01:35 +08:00
81ecf3d097 [Bug] Rebuilt version graph of a tablet when there are too many orphan vertex (#5945)
The version information of the tablet will be stored in the memory
in an adjacency graph data structure.
And as the new version is written and the old version is deleted,
the data structure will begin to have empty vertex with no edge associations(orphan vertex).

These orphan vertexs should be removed somehow.
2021-06-03 09:59:20 +08:00
9ade3bbd8a Add Doris error code table (#5935) 2021-06-02 09:59:41 +08:00
4c0a98e8bf [BE] Optimize version retrieval efficiency. (#5831)
* [FE] Optimize version retrieval efficiency in high-frequency import/compaction scenarios.

* Jump out of the loop when encountering the reverse edge.
2021-06-02 09:58:21 +08:00
8956641296 [Docs] update PMC Members & Committers (#5941)
* update PMC Members & Committers
2021-06-01 18:11:34 +08:00
8e4b601ff2 [Bug] Fit the bug of check Fragment whether is colocate / bucket shuffle join error (#5940) 2021-05-31 12:14:44 +08:00
cf2e0cf2c8 [Bug] Fix export job sometimes stuck in exporting state after timeout (#5932)
Fix #5931
The reason is that sometime the method coordinate.exec() is not call when the job is timeout,
so that the query profile in this coordinate is not be initialized,
which will cause an NPE error in the execution of ExportExportingTask.
2021-05-30 23:09:29 +08:00
ba868c610f [Optimize] Optimize some tablet scheduling logic (#5926)
1. The partitions set by the admin repair command are prioritized
   to ensure that the tablets of these partitions can be repaired as soon as possible.

2. Add an FE metric "query_begin" to monitor the number of queries submitted to the Doris.
2021-05-30 23:08:59 +08:00
0da59aab53 [Bug] Fix bug of multi load and some issue about httpv2 (#5848)
To be compatible with http v1 so that user don't need to change their code
2021-05-30 23:08:38 +08:00
63c99eb4cb [Cache][Enhancement] Assure sql cache only one version (#5793)
For PR #5792. This patch add a new param `cache type` to distinguish sql cache and partition cache.
When update sql cache,  we make assure one sql key only has one version cache.
2021-05-28 13:45:47 +08:00
c844e602a7 [BUG] Fix the bug of Desc Query cause Unknown error and some doc revise (#5921) 2021-05-28 11:21:31 +08:00