Commit Graph

8276 Commits

Author SHA1 Message Date
d42409cc35 Fix short key not fill up all space (#1183) 2019-05-22 11:32:46 +08:00
5e245e031b Change source type to BATCH_LOAD_JOB (#1182) 2019-05-22 11:07:38 +08:00
592c2c24d9 Fix revoke files bug (#1181) 2019-05-22 11:06:31 +08:00
c5bf1a8da1 Fix prefix index comparison (#1180)
1. Upon prefix index comparison, it should only compare the fixed length of prefix index
2019-05-21 20:17:24 +08:00
b98b07efe2 Add persistence of load job v2 (#1176) 2019-05-21 19:30:26 +08:00
b132f4ac0c Add a configuration to force seek for block (#1179) 2019-05-21 14:53:37 +08:00
722a9e71c7 Optimize json functions (#1177)
1. get_json_xxx() now support using quoto to escape dot
2. Implement json_path_prepare() function to preprocess json_path

Performance of get_json_string() on 1000000 rows reduces from 2.27s to 0.27s
2019-05-21 09:13:12 +08:00
ff2746157e Remove log info from decimalv2_value to avoid performance degradation (#1175) 2019-05-20 14:26:14 +08:00
6528689205 Add samples about mini_load & insert (#997) 2019-05-20 12:26:49 +08:00
32c4a67f05 Fix thirdpary build error on Ubuntu 18.04 OS (#1174) 2019-05-20 12:24:35 +08:00
7f8a1bcdb6 Threadpool should be shutdown before join() (#1171) 2019-05-17 19:10:22 +08:00
9bf19f13bb Fix borker load submit job fail with null (#1172) 2019-05-17 19:09:44 +08:00
18e06d2e67 Fix position seek bug for varchar short key (#1167) 2019-05-16 17:29:43 +08:00
b2e63910a6 Fix bug that routine load task may be blocked due to premature deconstruction (#1166)
Data consumer group should wait all data consumers finished before return.
2019-05-16 16:15:00 +08:00
f985ea99fc Add support column reference in LOAD statement (#1162) 2019-05-15 20:26:10 +08:00
b24fab48cd Add some logs for compaction process (#1163) 2019-05-15 18:47:08 +08:00
afff3b3692 Modify dockerfile (#1164)
New docker image include new thirdparties
2019-05-15 18:35:47 +08:00
758adce761 Change the compaction thread number based on disks number (#1161)
Add 2 BE configs: base_compaction_num_threads_per_disk and cumulative_compaction_num_threads_per_disk to control the number of threads per disks.
2019-05-15 14:17:11 +08:00
47fb206cdf Skip tablet under compaction when choose candidate (#1160) 2019-05-15 13:05:32 +08:00
ad88741b69 Fix bug that bad tablet blocking compaction of other tablets (#1158)
A bad tablet is always be chosen to do compaction, and failed again
and again, which may block compaction of other tablets.
Add a BE config 'min_compaction_failure_interval_ms' to avoid choosing
bad tablet again at a certain interval, so that other tablets have
chance to do the compaction.

Also fix a bug that using avg() function on varchar column return
unexpected exception.
2019-05-15 12:44:38 +08:00
398055ef3e Add logic of cancel job (#1154) 2019-05-14 17:26:45 +08:00
910f16af81 Fix bug that using wrong capacity in trash sweep policy (#1156)
We should use total disk usage capacity instread of data used capacity,
otherwise, the config 'disk_capacity_insufficient_percentage' will not work.
2019-05-14 11:30:17 +08:00
76a8093c70 Add documentation for doris on es (#1151) 2019-05-13 21:58:05 +08:00
31f55532f7 Fix bug that cast decimal or bool to other type (#1152) 2019-05-13 17:41:10 +08:00
559db490e0 Fix bug that rename table to an existing rollup index name should not be allowed (#1150)
Also fix another bug that backup/restore job should add to task map after writing edit log.
ISSUE: #522
2019-05-13 17:10:51 +08:00
ffe3eaa1a7 Implement adddate, days_add and from_unixtime function in FE (#1149) 2019-05-13 16:59:52 +08:00
e7e92f0a89 Add build script for arrow (#1144) 2019-05-13 13:40:24 +08:00
02f36c23ed Set tablet as bad when loading index failed (#1146)
Bad tablet will be reported to FE and be handled

And add a config auto_recover_index_loading_failure to control the index loading failure processing
2019-05-13 10:22:04 +08:00
6117227754 FE support assigning constant conjunct and calculating expression (#1126) 2019-05-11 23:10:58 +08:00
15c9be4dfe Fix bug that balance task always choose high usage path (#1143) 2019-05-11 22:07:17 +08:00
ae18cebe0b Improve colocate table balance logic for backend added (#1139)
1. Improve colocate table balance logic for backend added
2. Add more comment
3. Break loop early
2019-05-11 21:49:51 +08:00
debb58c278 Add SHOW FUNCTION and update docs for UDF (#1140) 2019-05-11 21:46:37 +08:00
f499759f15 Revise the exception message (#1141) 2019-05-10 19:52:11 +08:00
4039985729 Fix some bugs about decommission (#1138)
1. Print the last few tablets of decommission backend in fe.log for debug.
2. OlapTableSink should get replica on alive Backends, not only available Backends.
3. When decommission multi Backends, we should drop the redundant replicas before creating a new one.
4. Replicas on decommissioning Backends should be not added to catalog again.
5. Decommissioning Backends should not be chosen as destination of tablet repairing.
2019-05-10 17:41:48 +08:00
79ab7f4413 Change label of broker load txn (#1134)
* Change label of broker load txn

1. put broker load label into txn label
2. fix the bug of `label is already used`
3. fix partition error of new broker load

* Fix count error in mini load and broker load

There are three params (num_rows_load_total, num_rows_load_filtered, num_rows_load_unselected) which are used to count dpp.norm.ALL and dpp.abnorm.ALL.
num_rows_load_total is the number rows of source file.
num_rows_load_unselected is the not satisfied (where conjuncts) rows of num_rows_load_total
num_rows_load_filtered is the rows (quality not good enough) of (num_rows_load_total-num_rows_load_unselected)
2019-05-10 16:53:46 +08:00
fdc0c40549 Fix malformed at AM bug (#1136) 2019-05-10 16:23:14 +08:00
e5a5201626 Update routine-load-manual.md (#1133)
edit  some descriptions about “max_error_number”
2019-05-10 14:38:28 +08:00
4aa41a4e3b Update admin_stmt.md (#1131) 2019-05-10 11:49:29 +08:00
6635c36cc5 Fix decommission BE bug (#1122)
The pre-check of replica num should be check database of current cluster
2019-05-09 18:53:10 +08:00
1eeb5ea891 Add str_to_date function in fe (#1118) 2019-05-09 17:20:44 +08:00
a8bf922542 Update README.md (#1119)
fix some words
2019-05-09 14:51:34 +08:00
93d2dd5f82 Fix bug of routine load job (#1116)
Fix null pointer exception when sending routine load task
2019-05-09 12:56:53 +08:00
77a1b31baa Add show load of loadv2 (#1113)
This change include the show load of loadv2 and some bug fix of loadv2.
Firstly, the show load will perform both load and loadv2 info. According to loadv2, the ETL progress of loadv2 is N/A during the period of loading.
Secondly, the loadv2 will be created when version of property is v2.
This is a temporary property which will not influence the old broker load.
After the loadv2 is finished, the default load will be changed to loadv2.
Finally, there are some bug in LoadingTaskPlanner fixed by this change.
2019-05-09 10:27:30 +08:00
7699c76df2 Fix Nullpointer exception encountered in transaction process (#1112)
* Fix Nullpointer exception encountered in transaction process
* Do not choose unavailable BE when repair tablet
2019-05-08 20:30:34 +08:00
965ecedd5d Fix compile bug (#1106) 2019-05-07 17:36:34 +08:00
a08170fd50 Enhance the usabilities (#1100)
* Enhence the usabilities

1. Add metrics to monitor transactions and steaming load process in BE.
2. Modify BE config 'result_buffer_cancelled_interval_time' to 300s.
3. Modify FE config 'enable_metric_calculator' to true.
4. Add more log for tracing broker load process.
5. Modify the query report process, to cancel query immediately if some instance failed.

* Fix bugs
1. Avoid NullPointer when enabling colocation join with broker load
2. Return immediately when pull load task coordinator execution failed
2019-05-07 15:55:04 +08:00
0c62cb888f Support negative keyword in Broker Load (#1101) 2019-05-06 22:15:27 +08:00
11be24df40 Add new scheduler of load in fe (#1076)
* Add new scheduler of load in fe

1. New scheduler only support the broker load now.
2. The stage of load consist of PENDING -> LOADING -> FINISHED
3. The LoadScheduler will divide job into a pending task. There are preparations that need to be done on pending task.
4. OnPendingTaskFinished will be invoked after pending task. It is used to submit the loading task which is created based on attachment of pending task.
5. OnLoadingTaskFinished will be invoked after loding task. It is used to record the commit info and commit txn when all of task has been finished.
.

* Combine pendingTask and loadingTask into loadTask

1. The load task callback include two methods: onTaskFinished, onTaskFailed

* Add txn callback of load job

1. isCommittting is true when beforeCommitted in txn
2. job could not be cancelled when isCommitting is true
3. job will be finished after txn is visible
4. old job will be cleaned when (CurrentTS - FinishedTs) / 1000 > Config.label_keep_seconds
5. LoadTimeoutChecker is performed to cancel timeout job
2019-05-06 13:49:06 +08:00
ba78adae94 Fix bugs when using function in both stream load request and routine load job (#1091) 2019-05-05 20:51:30 +08:00
53ae591183 Fix pending_delta has no pending_segment_group (#1089) 2019-05-05 19:53:45 +08:00