2766 Commits

Author SHA1 Message Date
43dd583cfc Fix dlopen failure by upgrading cmake (#5481)
* fix dlopen failure

* remove useless code
2021-03-08 09:02:53 +08:00
db2120a7f2 [Build][BE] Fix GLIBC_COMPATIBILITY failing to compile on CentOS 6 (#5472)
Add option to disable glibc_compatibility
2021-03-07 20:47:13 +08:00
35f5cb8e0c [Bug] Fix bug that BE failed to start when validating conf from be_custom.conf (#5465) 2021-03-07 17:37:14 +08:00
8855782aab [Doc] Fix page links (#5454) 2021-03-06 16:13:56 +08:00
ae364dea3d [BUG] Fix DCHECK failure when calling UDF (#5457) 2021-03-06 16:13:30 +08:00
d6ac8f4e35 Masking glibc symbols for better portability (#4180)
* Masking glibc symbols for better portability

* Remove redundant files
2021-03-05 13:15:55 +08:00
c95f00d508 [Bug] Fix bug that the image cannot be pulled after new FE nodes are added (#5418)
* [Bug] Fix bug that the image cannot be pulled after a new FE node is added

This is because httpv2 modified the response body of the "/info" api,
causing FE to fail to obtain info from this api.
And the system did not exit correctly.

This also causes the problem reported in issue #5292
2021-03-05 10:50:54 +08:00
c9ddd88e14 [Rewrite]Rewrite from_unixtime to reduce calling this function (#5444)
from_unixtime is a CPU-intensive function.
SQL: select field from table where from_unixtime(field) > '2021-03-02'.
If the table has one million rows, from_unixtime is called one million times,
which makes the query very slow.

In issue #5443, we rewrite from_unixtime into a timestamp comparison to reduce the number of calls.
This rewrite can bring a 2x query performance improvement.
2021-03-04 22:31:28 +08:00
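The idea behind the from_unixtime rewrite in #5444 can be illustrated with a minimal Python sketch. This is not the Doris planner code: the function names are hypothetical, the comparison is done on datetime values in UTC, and the cutoff is assumed to be midnight of 2021-03-02. It only shows that converting the constant once selects the same rows as converting every row.

```python
from datetime import datetime, timezone

CUTOFF = datetime(2021, 3, 2, tzinfo=timezone.utc)  # assumed UTC midnight


def filter_per_row(timestamps):
    """Convert every row to a datetime before comparing (one call per row)."""
    return [ts for ts in timestamps
            if datetime.fromtimestamp(ts, tz=timezone.utc) > CUTOFF]


def filter_rewritten(timestamps):
    """Convert the constant once, then compare raw integers."""
    cutoff_epoch = int(CUTOFF.timestamp())
    return [ts for ts in timestamps if ts > cutoff_epoch]


# Hypothetical epoch seconds around the cutoff.
rows = [1614556800, 1614643200, 1614643201, 1614729600]
print(filter_per_row(rows) == filter_rewritten(rows))
```

The rewritten filter performs a single conversion regardless of the row count, which is where the saving would come from.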
805f98e0f9 [Bug] Set dest tuple to null when src_tuple is NULL. (#5431) 2021-03-04 22:26:05 +08:00
77485521d3 [Enhancement] move FeMetaVersion.java from fe-common to fe-core #5426 (#5427)
Currently, FeMetaVersion.java is in fe-common, so users may forget to copy fe-common.jar when upgrading the service.
This is dangerous because the data may be corrupted and cannot be recovered.
2021-03-04 22:25:03 +08:00
4e1b6b3eef [ODBC] Report the column information when a type conversion fails in a query on an ODBC MySQL table (#5422)
When a type conversion fails in a query on an ODBC MySQL table, include the column information in the error message
2021-03-04 22:23:37 +08:00
8c34013cf6 [Enhance] Remove order by from insert into select (#5419)
remove unnecessary order by of insert into select stmt
2021-03-04 22:23:16 +08:00
bfce98f4b9 [Bug] The toString() method in PartitionInfo class. Property information strings are concatenated together (#5407) 2021-03-04 22:22:37 +08:00
c38a1c799f [Config] Support config validating when BE bootstrap and update BE's config by API (#5379)
Some invalid config values may cause BE to behave unexpectedly.
This patch adds config validation when BE bootstraps and when BE's config is updated by API,
so that invalid values are rejected.
This work completes PR #4423
2021-03-04 22:21:49 +08:00
bf086408d8 [Doc] Add query cache docs (#4479) 2021-03-04 22:21:08 +08:00
47d6b1ff0b Fix ut failed for topn_function_test (#5449)
Co-authored-by: caiconghui [蔡聪辉] <caiconghui@xiaomi.com>
2021-03-04 21:53:52 +08:00
5a6be411da [Ut] Add drop meta unit test and remove some redundant code in Alter (#5395)
* [Ut] Add drop meta unit test and remove some redundant code in Alter

Co-authored-by: caiconghui [蔡聪辉] <caiconghui@xiaomi.com>
2021-03-04 21:53:31 +08:00
39136011c2 [Spark-Doris-Connector][Bug-Fix] Resolve deserialize exception when Spark Doris Connector is in async deserialize mode (#5336)
Resolve deserialize exception when Spark Doris Connector is in async deserialize mode
Co-authored-by: lanhuajian <lanhuajian@sankuai.com>
2021-03-04 17:48:59 +08:00
9c8766356a [Bug-Fix][Bitmap][Be] Fix bitmap_not calculating a wrong result (#5440) (#5441)
bitmap_not calculates a wrong result (#5440)

Execute the following SQL; the expected response is ''
```
select bitmap_to_string(bitmap_not(bitmap_from_string('1'), bitmap_from_string('2,1'))); 
```

Co-authored-by: lanhuajian <lanhuajian@sankuai.com>
2021-03-04 15:46:42 +08:00
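The expected result in the bitmap_not commit above can be reproduced with a small Python sketch. It assumes that bitmap_not(a, b) keeps the values of a that are absent from b, which is consistent with the expected '' but is not taken from the BE source; Python sets stand in for bitmaps.

```python
def bitmap_from_string(text):
    """Parse a comma-separated string such as '2,1' into a set of ints."""
    return {int(part) for part in text.split(",") if part}


def bitmap_not(left, right):
    """Assumed semantics: values in `left` that are absent from `right`."""
    return left - right


def bitmap_to_string(bitmap):
    """Render a set as a sorted, comma-separated string."""
    return ",".join(str(value) for value in sorted(bitmap))


result = bitmap_to_string(
    bitmap_not(bitmap_from_string("1"), bitmap_from_string("2,1"))
)
print(repr(result))
```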
7a41629fbe [Audit] Support builtin load audit function to record successful bulk load job (#5183)
* [Audit] Support builtin load audit function to record successful bulk load job

Co-authored-by: caiconghui [蔡聪辉] <caiconghui@xiaomi.com>
2021-03-03 17:01:02 +08:00
422456c31a Add warn log when client report be state failed and refactor some report code (#5342)
There is some redundant code for reporting tasks, disks and tablets in BE, and when FE returns an error report message, no warn log shows that the report failed.

Co-authored-by: caiconghui [蔡聪辉] <caiconghui@xiaomi.com>
2021-03-03 17:00:21 +08:00
80d237510d [Doc] Modify dead link of doc (#5411)
Fix #5405

Co-authored-by: xxiao2018 <benghua3_1@sina.com>
2021-03-03 15:20:17 +08:00
6a00c68264 Fix a typo ehco -> echo (#5433) 2021-03-03 14:46:54 +08:00
e93a6da0e5 [Doc] correct format errors in English doc (#5321)
Fix some English doc format errors
2021-02-26 11:32:14 +08:00
fdd13ea79a [Broker] Fix broker load fail using SIMPLE auth after KERBEROS auth fail (#5412)
Co-authored-by: liwei5 <liwei5@vipkid.com.cn>
2021-02-25 09:53:50 +08:00
e5cc6de294 [SQL Planner] Column pruning under count star (#5410)
When a query contains the count(*) function, we only need to scan the smallest column.
For example:
Query: select count(*) from (select k1, k2, k3 from base) tmp;
Only k1, which is the smallest column, should be scanned.
The remaining columns (k2, k3) should be pruned.

This pr achieves this optimization of column pruning.
Fixed #5409
2021-02-25 09:52:36 +08:00
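The pruning rule from the count(*) commit above can be sketched in Python as follows. The column widths and the `columns_to_scan` helper are hypothetical and only illustrate the selection step; the real planner works on its own expression and slot structures.

```python
# Hypothetical per-row byte widths; real values depend on the column types.
COLUMN_WIDTHS = {"k1": 1, "k2": 4, "k3": 8}


def columns_to_scan(projected, needs_values):
    """Return the columns a scan must read.

    Under count(*) no column value is used, so one column is enough to
    count rows; pick the narrowest one.
    """
    if needs_values:
        return list(projected)
    return [min(projected, key=lambda name: COLUMN_WIDTHS[name])]


print(columns_to_scan(["k1", "k2", "k3"], needs_values=False))
```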
5781d67afe Fix file licences (#5414)
Add license to files
For Doris 0.14
2021-02-24 16:37:17 +08:00
577b62b3f9 [Internal][bug][doris-1091] Fix bug that compaction failed after deletion (#5413) 2021-02-24 13:22:55 +08:00
73b89b58f3 [Bug][Cache] Version should not be considered when find the latest partition (#5408)
When a table has multiple partitions, each partition has its own
version, and the version does not indicate whether the partition is newer or not.
If a partition with a large version is treated as the latest one,
queries can return incorrect results.
Suppose there are 2 partitions:
PartitionName | VisibleVersion |  VisibleVersionTime
p1            |            123 | 2021-02-17 23:31:32
p2            |             23 | 2021-02-22 11:39:19
Partition p1 will be considered the latest partition. If there is a
cache created before p2's last update time, the cache will hit and return a
wrong result.
2021-02-24 11:23:19 +08:00
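The two-partition example in the commit above can be replayed with a short Python sketch. The partition data comes from the commit message; the `cache_is_valid` helper and the cache timestamp are hypothetical and only show why ordering by version yields a stale hit while ordering by update time does not.

```python
from datetime import datetime

# The two partitions from the commit message.
partitions = [
    {"name": "p1", "version": 123, "time": datetime(2021, 2, 17, 23, 31, 32)},
    {"name": "p2", "version": 23, "time": datetime(2021, 2, 22, 11, 39, 19)},
]

latest_by_version = max(partitions, key=lambda p: p["version"])
latest_by_time = max(partitions, key=lambda p: p["time"])


def cache_is_valid(cache_time, latest_partition):
    """A cache entry is usable only if it is at least as new as the latest update."""
    return cache_time >= latest_partition["time"]


# Hypothetical cache built after p1's update but before p2's update.
cache_time = datetime(2021, 2, 20)

print(latest_by_version["name"], cache_is_valid(cache_time, latest_by_version))
print(latest_by_time["name"], cache_is_valid(cache_time, latest_by_time))
```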
13c7b18592 add check when set lowerBoundInclusive in operator < and <= (#5382)
Change-Id: I54c702900d68da21e3fa16e74d55d16c7365e195

Co-authored-by: qijianliang01 <qijianliang01@baidu.com>
2021-02-23 11:14:29 +08:00
bc10d44522 Upgrade jackson version (#5373)
Co-authored-by: jiangyan <jiangyan@sfmail.sf-express.com>
2021-02-23 10:43:25 +08:00
6dcc1b0a55 [Doris on ES] Fix query failed when ES field value is null (#5363)
* Update fe-idea-dev.md

use `brew install thrift@0.9` to install thrift 0.9.3.1
`brew edit thrift090 | head` shows thrift@0.9 uses thrift 0.9.3.1

* [Refactor] Remove the unnecessary if statement

Future<?> submit(Runnable task)
Submits a Runnable task for execution and returns a Future representing that task. The Future's get method will return null upon successful completion.

* Fix null type

* add comment

Co-authored-by: tanhao <tanhao.0902@bytedance.com>
2021-02-23 10:42:25 +08:00
8046172c31 Update CREATE TABLE.md (#5398)
add "```" to ## keyword section, to fit markdown syntax.
2021-02-22 16:08:18 +08:00
6ede4c6ec1 [Feature] Support backup,restore,load,export directly connect to s3 (#5399)
* [doris-1008] support backup and restore directly to cloud storage via aws s3 protocol

* [Internal][S3DirectAccess] Support backup,restore,load,export directly connect to s3
1. Support load and export data from/to s3 directly.
2. Add a config to auto convert broker access to s3 access when available

Change-Id: Iac96d4b3670776708bc96a119ff491db8cb4cde7

(cherry picked from commit 2f03832ca52221cc7436069b96c45c48c4bc7201)

* [Internal][S3DirectAccess] File path glob compatible with broker

Change-Id: Ie55e07a547aa22c6fa8d432ca926216c10384e68
(cherry picked from commit d4fb25544c0dc06d23e1ada571ec3f8edd4ba56f)

* [internal] [doris-1008] fix log4j class not found

Change-Id: I468176aca0d821383c74ee658d461aba9e7d5be3
(cherry picked from commit 029adaa9d6ded8503acbd6644c1519456f3db232)

* add poms

Co-authored-by: yangzhengguo01 <yangzhengguo01@baidu.com>
2021-02-22 16:07:56 +08:00
b098261253 docs(Doc): correct wrong num in create table help doc (#5365)
Co-authored-by: liuyuan <liuyuan.a@miaozhen.com>
2021-02-20 10:07:48 +08:00
aa4f46c349 [Enhancement]Use add method instead of put method to avoid query being stuck when event queue is full (#5310)
Co-authored-by: caiconghui [蔡聪辉] <caiconghui@xiaomi.com>
2021-02-18 10:36:55 +08:00
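The difference that #5310 relies on can be shown with a Python analogue. In Java, `BlockingQueue.put` waits while the queue is full, whereas `add` throws `IllegalStateException` immediately. The sketch below assumes the event queue is a bounded queue of that kind and uses `queue.Queue.put_nowait` as the non-blocking counterpart; the queue size and the `submit` helper are hypothetical.

```python
import queue

events = queue.Queue(maxsize=2)  # hypothetical tiny event queue


def submit(event):
    """Try to enqueue without blocking; report failure instead of waiting."""
    try:
        events.put_nowait(event)
        return True
    except queue.Full:
        return False


# The third submission is rejected immediately instead of stalling the caller.
results = [submit(n) for n in range(3)]
print(results)
```

With a blocking put, the third call would wait until a consumer drained the queue, which is the kind of stall the commit title describes.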
a7d16acaac [MySQL] Support for AuthSwitchRequest In HandShakePacket to better support MySQL 8.0 clients (#5386)
issue:5348
2021-02-16 22:47:53 +08:00
37c976b9af [Docs] Reorder docs index in sidebar (#5388)
reorder the docs sidebar
2021-02-16 22:35:35 +08:00
7eae3e280a [optimization] use inline optimize ExprContext::get_value (#5385) 2021-02-16 22:35:14 +08:00
99e1a97822 [Bug] Fix NPE when replaying modify table property (#5378) 2021-02-16 22:34:48 +08:00
204fa45a48 [FE][Bug] Fix overflow in RuntimeProfile.sortChildren. (#5377)
If the difference between the two times exceeds Integer.MAX_VALUE,
the compare method's return value overflows and an exception may be triggered when sorting the profile.
2021-02-16 22:34:27 +08:00
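The overflow described in #5377 can be simulated in Python. Python integers do not overflow, so the `to_int32` helper below (hypothetical) mimics a Java `(int)` cast. This is an illustration of the general pitfall, not the actual RuntimeProfile code.

```python
INT_MAX = 2**31 - 1


def to_int32(value):
    """Wrap an integer to 32 bits the way a Java (int) cast does."""
    return (value + 2**31) % 2**32 - 2**31


def compare_by_subtraction(a, b):
    """Buggy pattern: the sign flips when a - b does not fit in 32 bits."""
    return to_int32(a - b)


def compare_safely(a, b):
    """Return -1, 0 or 1 without a subtraction that can wrap."""
    return (a > b) - (a < b)


big, small = INT_MAX + 1, 0
print(compare_by_subtraction(big, small))  # negative although big > small
print(compare_safely(big, small))
```

A comparator whose sign is wrong for some pairs is inconsistent, and sort routines may reject it.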
bd72328177 [UT] Fix FE ut bugs (#5387)
This bug is introduced from #5356
2021-02-14 15:59:21 +08:00
b8612a4be5 [DOCS] add some missing documents (#5370) 2021-02-09 09:31:39 +08:00
51ccd44865 [Load Parallel][3/3] Support parallel delta writer (#5369)
In the previous broker load, multiple OlapTableSinks would send data to the same LoadChannel,
and because of the lock granularity problem, LoadChannel could only process these requests serially,
which made it impossible to make full use of cluster resources.

This CL modifies the related locks so that LoadChannel can process these requests in parallel.

In the test, with a data size of 20G, the load time for 334 million rows on 3 nodes
dropped from 9min to 5min, and with a concurrency of 2 it dropped further to 3min.

Also modify the profile of load job.
2021-02-07 22:42:18 +08:00
f12f32da11 [Bug] Support if not exists in create table like stmt (#5368)
Currently we support syntax `create table xx if not exists like xxx`, but `if not exists` does not work well.
2021-02-07 22:41:55 +08:00
462efeaf39 [Performance Optimization and Refactor] (#5358) (#5364)
1. Add BlockColumnPredicate to support OR and AND column predicates in RowBlockV2
2. Support evaluating vectorized delete predicates in the storage engine instead of in Reader for SegmentV2
2021-02-07 22:41:33 +08:00
2ec55bf19e [LOG] Change some log level (#5361)
Change some log level from error to warn
2021-02-07 22:41:13 +08:00
6b0521032d [Bug] Fix the problem of floating point precision when importing parquet data (#5360)
The double data "4206.9" in parquet is converted to decimal data "4206.8999" in Doris,
which is not right.
2021-02-07 22:40:51 +08:00
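One way such a discrepancy can arise is shown below with Python's `decimal` module. This is only a sketch: it assumes a decimal scale of 4 digits and that the faulty path truncated the binary value instead of rounding it, neither of which is confirmed by the commit message.

```python
from decimal import Decimal, ROUND_DOWN, ROUND_HALF_UP

exact = Decimal(4206.9)    # exact binary value of the double, slightly below 4206.9
scale = Decimal("0.0001")  # assumed decimal scale of 4 digits

truncated = exact.quantize(scale, rounding=ROUND_DOWN)
rounded = exact.quantize(scale, rounding=ROUND_HALF_UP)

print(truncated)
print(rounded)
```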
f77b8ce8e4 [UT] Modify to avoid some occasional UT failures (#5359)
After an Alter job finishes, the job's state is FINISHED, but the table's state
may not be NORMAL for a while.
We need to make sure that the table's state becomes NORMAL before continuing with the next UT.
2021-02-07 22:39:45 +08:00
a1808c1a71 [Function] Add BE udf bitmap_not (#5346) (#5357)
This function returns the result of the NOT operation on the two input bitmaps.
2021-02-07 22:39:17 +08:00