Commit Graph

1440 Commits

Author SHA1 Message Date
55ca52a42d [Bug] Fix bug that Drop olap table may introduce some problems when table's state is not normal (#5712)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-05-12 10:38:23 +08:00
c6bbc68e16 [Bug] Fix Backend edit log read bug (#5775)
This bug is introduced from #5722

Also fix a bug of creating dynamic history partitions, introduced from #5703
2021-05-10 10:27:27 +08:00
11cce06962 [Feature] Support create history dynamic partition (#5703)
1. Add a new dynamic partition property `create_history_partition`.
    If set to true, Doris will create all partitions from `start` to `end`.

2. Add a new FE config `max_dynamic_partition_num`
    To limit the number of partitions created when creating one table.
2021-05-08 12:05:19 +08:00
3fdfe0ba6f [Bug-fix] Export specified column (#5759)
The code logic error causes the user to specify the export column, which may not be effective.
The PR fix this problem.
2021-05-08 10:56:45 +08:00
efd51b47e5 [Bug] Fix some little bugs in FE (#5758)
1. Fix NPE in ReplicasProcNode when backend does not exist
2. Forbid the create table like statement to specify the view.
3. Check self ip when starting FE to see if it use the origin ip.
4. Modify the error msg of tablet sink to show more detail errors.
2021-05-08 10:56:10 +08:00
ab2f825a88 [Bug-fix] Pause routine load when data size exceeds quota (#5749)
In the previous code, the routine load task did not catch the exception of opening the transaction.
As a result, although the task cannot be executed,
no exceptions can be seen during show routine load, only the routine load job is stuck.

The PR catch the QuotaExceedException when opening a transaction.
If the routine load task cannot be executed due to the exhaustion of the quota,
the routine load will be paused and an error message will be presented to the user.

Similarly, other load method will also catch similar exceptions and cancel job.
2021-05-07 11:19:36 +08:00
8850cfe2ad [Compaction] Modify compaction logic (#5737)
1. Add /api/compaction/run_status to show the running compaction tasks.
2. Support do base and cumulative compaction for one tablet at same time.
3. Modify some log level.
4. Add a feedback document.
2021-05-07 11:18:47 +08:00
9f706848b9 [Bug] Fix somg bugs about Spark Load (#5701)
The distinct count result of bitmap/hll column may be incorrect in the spark load mode.

Fix some bugs in spark load to solve the above problem.
1. FE is big end but BE is little end. BitmapValues should be transfered to little end in FE's serialization
2. BitmapUnionAggregator/HllUnionAggregator ignore `null` value
3. Make sure encodeVarint64 in FE is consistent with BE

Co-authored-by: weixiang <weixiang06@meituan.com>
2021-05-07 11:18:23 +08:00
04d12fbb00 [Bug] Optimize querying from PostgreSQL and MySQL compatibility (#5741)
1. Use the param `UseDeclareFetch` to fix the memory consumption of PostgreSQL Driver
2. Fix the bug of mysql client 5.1 query failed Doris
2021-05-05 10:19:50 +08:00
eb1fdd019d [Bug-fix] Fix the replay error in stream load manager (#5722)
The previous replay logic does not record the size of the map, which eventually resulted in EOF when reading the log.
This pr replaces the replay logic directly with json.

At the same time, the replay logic of image is supplemented.
The pr ensure that the attributes 'lastStreamLoadTime' of backend can be correctly recorded in the image.
2021-05-05 10:18:51 +08:00
5bfae7674c Fix cancel load path variable LABEL_KEY (#5724) (#5725)
Co-authored-by: 迟成 <chicheng@meituan.com>
2021-04-30 10:14:51 +08:00
6ad1bf7d7e [Bug] Fix dead lock in olap scan node and refactor some code in FE profile (#5713)
* [Bug] Fix dead lock in olap scan node and refactor some code in FE profile

* Add some comment
2021-04-30 10:12:18 +08:00
f570e965f2 [Bug] The operator on single Sql has affected the session varaiables (#5620) 2021-04-30 10:09:41 +08:00
3f8834ac0e fix double slashes when join paths (#5734) 2021-04-30 09:55:52 +08:00
faf5adc4ee [UT] Fix ut failed for CreateTableTest (#5721)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-04-29 09:21:02 +08:00
9001fd28f4 support show stream load sql (#5488)
Co-authored-by: weizuo <weizuo@xiaomi.com>
2021-04-29 09:20:35 +08:00
c7af83b7c7 Support specify certain columns in export stmt (#5689)
Data export function, add certain columns that can be exported to the table.
Export stmt properties ("columns" = "k1, k2, k3");
2021-04-27 18:35:58 +08:00
827f5c0867 [Bug-fix] Ignore modified table log when table has been dropped (#5688)
Although the table lock can control the simultaneous modification of the table by different threads.
But it cannot control the drop operation of the table by other threads.
For example, when drop table and table update occur at the same time.

1. get table object by thread 1
2. drop table by thread 2 with table lock
3. update table object by thread 1

The above process is possible.
At this time, step 3 actually operates a table that no longer exists, which will eventually cause the wrong metadata to be recorded.

Fixed #5687
2021-04-27 09:40:17 +08:00
1783dbf340 [UT] Fix list partition UT failure (#5705) 2021-04-26 09:31:35 +08:00
de87f4ae84 [Feature] Add list partition support (#5529)
Add list partition support
2021-04-24 17:42:27 +08:00
b93e841688 [Optimize] Remove expired txns in batch to avoid holding lock for too long (#5675)
This CL mainly changes:

1.  Add a config to control the expire time of load job

    Add a new FE config "streaming_label_keep_max_second" to control
    the expire time of some high frequency load job such as INSERT and STREAM LOAD.

2. Remove expired txn in batch to avoid holding transaction lock for a long time
2021-04-23 09:47:30 +08:00
b12399657b [Bug] Fix StackOverFlow bug after rewriting the column descs of load stmt (#5656)
1. Fix a Self-referencing bug.
2. Also fix a display bug of SHOW BROKER.
2021-04-23 09:45:39 +08:00
8332581df8 [Optimize] Filter partitions by where header when generate stream load plan (#5667) 2021-04-21 16:56:17 +08:00
a4f8194111 [Audit][Stream Load] Support audit function for stream load (#5452)
Record finished stream load job (both successful job and failed job) into audit log
so that we can see when the stream load job was executed and check the details of stream load jobs.
2021-04-21 16:36:12 +08:00
b121ad6b95 [Refactor] Remove jprotobuf and use grpc client to connect brpc service (#5650) 2021-04-21 10:25:58 +08:00
bf4159c74d Remove unused keyword (#5676) 2021-04-20 09:11:45 +08:00
4313639157 [Metric] Add metrics for routine load (#5641)
* [Metric] Add metrics for routine load

Add following metrics for routine load:
doris_fe_job{job="load", type="ROUTINE_LOAD", state="NEED_SCHEDULE"} 0
doris_fe_job{job="load", type="ROUTINE_LOAD", state="RUNNING"} 1
doris_fe_job{job="load", type="ROUTINE_LOAD", state="PAUSED"} 0
doris_fe_job{job="load", type="ROUTINE_LOAD", state="STOPPED"} 0
doris_fe_job{job="load", type="ROUTINE_LOAD", state="CANCELLED"} 0

* change UTC
2021-04-19 09:26:58 +08:00
6be03f339c [Bug] Fix bug that tablets are not dropped when replacing tables (#5627)
When replacing table with swap = false, the origin table's tablets
should be removed from tablet inverted index.

Co-authored-by: xxiao2018 <benghua3_1@sina.com>
2021-04-19 09:26:19 +08:00
b6c0767754 [Bug] Fix alter table failed when none of new load jobs succeed on alter replica (#5617)
* [Bug] Fix alter table failed when none of new load jobs succeed on altering replica

Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-04-15 15:55:57 +08:00
9403157da4 [DOC] Add docs of warning of Docker env 1.3 of JDK 11 and some doc of FE config. (#5628) 2021-04-14 00:10:30 +08:00
75db273b93 [Doris On ES][WIP] Support external ES table with SSL secured and configurable node sniffing (#5325)
Support external ES  table with `SSL` secured and configurable node sniffing
2021-04-12 11:23:49 +08:00
a25e3afa5b [Colocate plan][Step1] Colocate join covers more situations (#5521)
The old colocate join can only cover the case where the child is hash or scan.
In fact, as long as the child's data distribution meets the requirements,
no matter what the plan node on the child node is, a colocate join can be performed.
2021-04-11 22:02:03 +08:00
9c7d8d2e98 [Bug] Fix bug that isPreAggregation is incorrectly set (#5608)
1. The MaterializedViewSelector should be reset for each scan node
2. On the BE side, columns with delete conditions must be added to the return column.
2021-04-09 14:13:06 +08:00
514d245a1f make like predicate operator public (#5552)
Co-authored-by: wangxixu <wangxixu@xiaomi.com>
2021-04-08 09:14:15 +08:00
3e34fe2529 [FE] [BUG] GroupingFunctionCallExpr: realChildren should be copied too. (#5584) 2021-04-08 09:11:11 +08:00
621c89f7b9 [Enhance] Filter all non-journal db when getting journal db names (#5605) 2021-04-08 09:09:56 +08:00
18c2553ef8 [FE][Bug] Update Spark version to fix a security issue (#5593)
Fix CVE-2020-9480: Apache Spark RCE vulnerability in
auth-enabled standalone master
https://spark.apache.org/security.html#CVE-2020-9480
2021-04-06 11:02:04 +08:00
5012fdc049 [FE][Fix]Update commons-collections to fix a security issue (#5595)
Fix CVE-2017-15708
https://www.cvedetails.com/cve/CVE-2017-15708/
2021-04-06 11:00:19 +08:00
68303ea7f3 [FE][Bug]Update log4j-web to fix a security issue (#5594)
Fix CVE-2017-5645

In Apache Log4j 2.x before 2.8.2, when using the
TCP socket server or UDP socket server to receive
serialized log events from another application, a
specially crafted binary payload can be sent that,
when deserialized, can execute arbitrary code.

https://www.cvedetails.com/cve/CVE-2017-5645/
2021-04-06 10:59:40 +08:00
05487e38ae [Bug] upgrade log4j version from 2.12.1 to 2.14.0 to fix performance issue in JDK11 using ZGC (#5591)
Co-authored-by: caiconghui <caiconghui@xiaomi.com>
2021-04-06 10:59:08 +08:00
dce66c7329 [bug-fix] Fix cluster property in hadoop/broker load (#5587)
The cluster property has been removed from load stmt which is incorrect. It caused user could not create hadoop/broker load.
Fix #5586
2021-04-02 11:54:46 +08:00
0490b156c8 [Bug] Fix bug that call frontend service failed when rpc_timeout is equal with publish_timeout (#5564)
* [Bug] Fix bug that call frontend service failed when rpc_timeout is equal with publish_timeout

*This PR is to fix bug that call frontend service failed when rpc_timeout is equal with publish_timeout and fix some small issues with log

Co-authored-by: caiconghui [蔡聪辉] <caiconghui@xiaomi.com>
2021-03-30 19:50:42 +08:00
ce015b364a [Enhance] Support to show more details for 'SHOW TABLES' (#5520)
Support to show more details for 'SHOW TABLES', it's useful for thirdparty monitoring systems.
2021-03-27 18:58:14 +08:00
ad67dd34a0 update gcc to gcc 10 and support c++17 (#5394)
* update gcc to gcc 10 and support c++17
    update brpc to 0.9.7
    update boost to 1.73
    remove third-party boost 1.54 for mysql

* update cmake version

* ignore jdk version

* remove unused patch

* avoid use SYS_getrandom call
2021-03-25 09:30:38 +08:00
fa9e5c18dc [FE] Improve StreamLoadPlanner error log. (#5549) 2021-03-24 09:33:08 +08:00
cef3cbc53a [Bug] Fix bug that the last column may be null when using multibytes separator (#5534) 2021-03-23 09:35:30 +08:00
ae3e9df8c7 (#5532) RollupJobV2 support identify cluster when replay (#5533) 2021-03-23 09:34:51 +08:00
3a96b6dfbd [Bug] Fix the bug of rowset file being deleted by mistake (#5541)
* [Bug] Fix the bug of rowset file being deleted by mistake
2021-03-23 09:33:36 +08:00
4085b194d6 [Bug] Fix multi broker load bug (#5543)
The storage type of broker desc should be set due to the type of load job
2021-03-22 09:53:17 +08:00
c97ddd7f35 [Enhance] Show brokers' hostname (#5522)
Show brokers' hostname in table to keep the same with show proc for backends and frontends.
2021-03-21 20:46:25 +08:00