Commit Graph

18263 Commits

Author SHA1 Message Date
c86af45f4f fix 2020-05-10 19:16:15 +08:00
56db6e7a35 [Config] allow user to config BRPC socket_max_unwritten_bytes (#3488)
Add new BE config `brpc_socket_max_unwritten_bytes`
2020-05-10 17:56:14 +08:00
edb3ad696d [Deps] Remove redundant com.baidu:jprotobuf (#3322)
* exclude jprotobuf from jprotobuf-rpc-core
* add commons-io used in fe.
2020-05-10 17:10:46 +08:00
488aa22938 [Doc] Update plugin document (#3447) (#3505) 2020-05-09 19:19:38 +08:00
e69e199df1 fix concurrent_modify_bug 2020-05-09 11:46:04 +08:00
b62b310864 [Bug] Fix BE crash when input to hll_merge is null (#3521) 2020-05-09 11:01:48 +08:00
e2c3c84e8d [ut] disable backgrounp scan context gc to speed up unit test (#3524)
Each test case in ExternalScanContextMgrTest may cost 1 minitue
which is too long, we'd better disable backgrounp scan context
gc to speed up unit test.
2020-05-09 09:01:05 +08:00
a656a7ddd4 Support append_trailing_char_if_absent function (#3439) 2020-05-09 08:59:34 +08:00
ad42a1d494 fix 2020-05-08 14:16:56 +08:00
2586f09548 [Bug] Fix bug that SHOW DELETE not return Delete job info (#3515)
The callback added to the CallbackFactory should not be removed until the
transaction is aborted or visible. Otherwise, some callback method may failed
to be called.
2020-05-08 13:04:20 +08:00
8015d8ca6b [Script] Make build script portable on Mac OS (#3493)
1. options of command `mkdir` and `cp` must be put before arguments on Mac OS.
2. directory name `build` is conflict with file `BUILD` on Mac OS.
2020-05-08 12:55:34 +08:00
2f7d2c7e1a [BUG] Fix a bug that ignore_broken_disk may not work (#3486)
When BE sets `ignore_broken_disk` to true, it's expected that non-exist path in storage_root_path won't prevent BE from launching, but in 0.12 BE fails to launch in such scenario.

```
W0506 14:46:11.039953 17040 options.cpp:64] path can not be canonicalized. may be not exist. path=/data11/olap
W0506 14:46:11.040014 17040 options.cpp:141] failed to parse store path /data11/olap, res=-203
```
The reason is that #2861 adds a path existence check in `parse_root_path` which precedes the usage of `ignore_broken_disk` in the main method.
2020-05-08 12:53:44 +08:00
94b3a2bd50 [Bug] Fix string functions not support multibyte string (#3345)
Let string functions support utf8 encoding
2020-05-08 12:52:46 +08:00
f90da72078 [Planner]Enhance AssertNumRowsNode (#3485)
Enhance AssertNumRowsNode to support equal, less than, greater than,... assert conditions
2020-05-08 12:49:48 +08:00
45814c85ac [BugFix] Fix the bug that FE web can't show each fragment execute time percent (#3497)
like this:
          (Active: 14.133ms, non-child: 93.20%)
2020-05-08 12:48:05 +08:00
f591976976 [Doc] Fix the incorrect docs (#3501) 2020-05-08 12:47:00 +08:00
595f14498d fix 2020-05-08 12:10:34 +08:00
1b14dd4426 Refactor some redunant code && Replace some UT by UtFrameUtils
This CL have no logic change, just do some code refacotr and UT Change.
2020-05-08 12:07:33 +08:00
084515317f [Bug] Fix constant In Predicate result error (#3511)
`select 1 not in (2, NULL, 1);` should return `0`
2020-05-08 11:30:11 +08:00
bc42870936 Update UT compile path (#3498) (#3499)
Change UT path from be/build to be/ut_build.
In this way, normal build won't overwrite ut build directory.
2020-05-08 10:18:34 +08:00
1893cd7b5b Update travis.yml (#3512) 2020-05-07 23:10:36 +08:00
c85d847b1e [CompileBug] fix a compile error (#3502)
NodeChannel::mark_close() missing `return`
2020-05-07 23:01:46 +08:00
d68c1eb9b2 fix compile 2020-05-07 22:13:32 +08:00
d60bb81cb0 [SQL Function] Calculate 'case when expr' when possible (#3396)
Calculate 'case when expr' when possible
2020-05-07 22:04:09 +08:00
c74253c0dc for qa test6 2020-05-07 17:52:42 +08:00
94539e7120 Non blocking OlapTableSink (#3143)
ImplementaItion Notes
NodeChannel
_cur_batch -> _pending_batches: when _cur_batch is filled up, move it to _pending_batches.
add_row() just produce batches.
try_send_and_fetch_status() tries to consume one pending batch. If has in flight packet, skip send in this round.
So we can add one sender thread to be in charge of all node channels try_send.

IndexChannel
init(), open() stay the same.
Use for_each_node_channel() to expose the detailed changes of NodeChannel.(It's more easy to read & modify)
Sender thread
See func OlapTableSink::_send_batch_process()

Why use polling?
If we use wait/notify, it will notify when generate a new batch. We can't skip sending this batch, coz it won't notify the same batch again. So wait/notify can't avoid blocking simply.
So I choose polling.
It's wasting to continuously try_send(), but it's difficult to set the suitable polling interval. Thus, I add std::this_thread::yield() to give up the time slice, give priority to other process/threads (if there are other process/threads waiting in the queue).
2020-05-07 10:43:41 +08:00
9e8a060e5b Replace std::tr1::unordered_map with std::unordered_map (#3478) 2020-05-07 10:38:27 +08:00
36f2863574 fix mismatched tags (#3489)
RandomAccessFileOptions, WritableFileOptions, RandomRWFileOptions
defined as a struct but previously declared as a class; this is valid,
but will result in compile warning or error under clang compiler
2020-05-07 09:37:26 +08:00
ca36dc697f [Bug] Fix bug that push down logic error on semi join (#3481)
For SQL like:
```
select * from
join1 left semi join join2
on join1.id = join2.id and join2.id > 1;
```

the predicate `join2.id > 1` can not be pushed down to table join1.
2020-05-07 09:30:30 +08:00
7399997433 [Memory Engine] Add hash index implementation (#3462) 2020-05-06 23:37:25 +08:00
d64704599d [CodeRefactor] vector reserve before push_back (#3463)
1. reserve `SegmentWriter::_column_writers` before writing it
2. remove some condition branchs in SegmentWriter::init
3. fix hard-coded library names in build-thirdpary.sh
2020-05-06 17:16:37 +08:00
5e63629b8b [Decommission] Support NOT dropping BE after decommission (#3461)
Add a new config `drop_backend_after_decommission` in FE. if this config
is false, the BE will not be dropped after finishing decommission operation.

This new config is try to solve the problem described in ISSUE: #3460 .

TODO:
This method will generate a lot of data migration, so it is only a temporary solution.
After that, we should try to solve the problem of data balancing within the BE.

This CL also add the documents of FE and BE configuration.
These documents are incomplete and can be added later.
2020-05-06 17:14:24 +08:00
101628c813 [Bug] Fix bug of predicate pushdown logic (#3475)
When there is subquery in where clause, the query will be rewritten to join operation.
And some auxiliary binary predicates will be generated. These binary predicates
will not go through the ExprRewriteRule, so they are not normalized as
"column to the left and constant to the right" format.

We need to take this case into account so that the `canPushDownPredicate()` judgement
will not throw exception.
2020-05-06 15:15:37 +08:00
dafb356b42 [Bugfix] Fix navbar not showing on mobile clients(#3419) & image relative path problem (#3427) 2020-05-06 11:57:03 +08:00
caa7a07c70 [Query Plan]Support simple transitivity on join predicate pushdown (#3453)
Current implement is very simply and conservative, because our query planner is error-prone.

After we implement the new query planner, we could do this work by `Predicate Equivalence Class` and `PredicatePushDown` rule like presto.
2020-05-04 15:32:19 +08:00
c6822f513a [Code refactor] Remove unnecessary if condition (#3459)
if _opts.conditions is nullptr, the set cids would be empty.
2020-05-04 15:26:52 +08:00
101c7c161d [Bug] Fix bug that double unregister the resource pool in runtime state (#3458)
The resource pool in runtime state will be automatically unregistered
when deconstructing the RuntimeState. So no need to unregister it when
closing the plan fragment executor.
2020-05-04 14:48:57 +08:00
a5922051c9 [Fix] Fix bug that rowset meta is deleted after compaction (#3451)
* [Fix] Fix bug that rowset meta is deleted after compaction

After compaction, the tablet rowset meta will be modified by
adding to new output rowsets and deleting the old input rowsets.
The output version may equals to the input version.

So we should delete the "input" version from _rs_version_map
before adding the "output" version to _rs_version_map. Otherwise,
the new "output" version will be lost in _rs_version_map.
2020-05-04 09:45:25 +08:00
b58b1b3953 [metrics] Make DorisMetrics to be a real singleton (#3417) 2020-05-04 09:20:53 +08:00
d948af6a2f Fix build failure after binutils-dev 2.34 (#3449)
Doris uses some binutils private API, and binutils-dev 2.34 remove them.
This commit makes the code compatible with new versions.
2020-05-03 17:26:22 +08:00
da4d2d2699 [UT] Fix UT bug (#3456)
SSD cool downtime shouldn't be fix time in UT;
2020-05-03 16:24:08 +08:00
a1500eb544 Update doris-on-es.md (#3446) 2020-05-03 12:48:48 +08:00
2cb4027164 Update doris-on-es.md (#3441) 2020-05-03 12:48:19 +08:00
4737aff8fc [Memory Engine] Make Tablet extensible (#3431)
Adding a new storage engine, we need to make an extensible tablet interface, so olap/StorageEngine can support and manage new tablet types.

To start, this commit creates a class BaseTablet and make Tablet and new MemTablet inherit 
this base class, some common fields & methods are moved to BaseTablet class, which fields 
and methods belong to base/old class is not finalized yet, it will change as the project evolves.

Fix #3384
2020-05-01 21:21:09 +08:00
8e21827505 [Bug][AuditPlugin] Fix bug that length of query stmt in audit should less than limit. (#3444)
Use `String.getBytes().length` instead of `String.length()` to get the real byte length of a string.
Fix: #3443 

Also fix the typo of `max_filter_ratio` in audit log plugin.
2020-05-01 21:17:10 +08:00
54da5a491c Fix delete statement doc display not correctly (#3445) 2020-05-01 19:20:00 +08:00
7ef1e2ce5b [Bug] Fix bug that load data to wrong temp partitions (#3422)
When loading data without specifying partition, the data should only be loaded to
formal partitions, not including temp partitions;
2020-04-30 15:11:28 +08:00
d0fe7e4d94 [Profile] Make running profile clearer and more intuitive to improve usability (#3405)
This CL mainly made the following modifications:
    1. Delete Invalid MemoryUsed Counter and Add PeakMemUsage in each exec node and datastreamsender
    2. Add intent in child execnode profile,make it is easily to know the relationship between execnode
    3. Del _is_result_order we not support any more in olap_scan_node.h and olap_scan_node.cpp
    4. Add scan_disk method to olap_scanner to fix the counter _num_disks_accessed_counter
    5. Now we do not use buffer pool to read and write disk, so annotation eadio counter and 
    6. Delete the MemUsed counter in exec node.
2020-04-30 14:57:21 +08:00
c9ec4e8a73 [UT] Fix AlterTest UT failed (#3437) 2020-04-30 14:40:33 +08:00
a6c0d376dd [Thirdparty] Update ORC lib download address (#3440) 2020-04-30 14:16:44 +08:00