doris

Author	SHA1	Message	Date
EmmyMiao87	25e475898e	[Bug] Fix the error result when assert num rows node is used (#3436 ) The child.open() function is not called before this commit. If the assert num rows node has child which process data in open function, the assert num rows node will fetch no data from child. So the result will be empty(incorrect). This error only appear in inner subquery which has a aggregation function. For example: `select * from table where k1=(select k1 from (select avg(k1) from table) a);` The first level of subquery returns a non-scalar value, so the assert num rows node is needed. The second level of subquery has a aggregation function, so the child of assert node is aggregate node. However, if the open stage of the aggregate node is not called, the get next state of aggregate node will return empty set. So the result is wrong. Fixed #3435.	2020-04-30 14:15:50 +08:00
Mingyu Chen	74b987f053	[Bug] Fix bug that storage engine bg threads should start after env is ready	2020-04-29 11:21:19 +08:00
WingC	0430714ca9	Remove redundant call function _wait_in_flight_packet() (#3399 ) The function `_wait_in_flight_packet` has been called in `_send_cur_batch`. No need to call twice.	2020-04-27 20:45:25 +08:00
Mingyu Chen	9a934ec9f6	[Load] Add more info in SHOW LOAD result (#3391 ) Fix #3390 This CL add more info in `JobDetails` column of `SHOW LOAD` result for Broker Load Job. For example: ``` { "Unfinished backends": { "9c3441027ff948a0-8287923329a2b6a7": [10002] }, "All backends": { "9c3441027ff948a0-8287923329a2b6a7": [10002, 10004, 10006] }, "ScannedRows": 2390016, "TaskNumber": 1, "FileNumber": 1, "FileSize": 1073741824 } ``` 2 newly added keys: `Unfinished backends` indicates the BE which task on them are not finished. `All backends` indicates the BE which this job has tasks on it. One more thing, I pass the Backend Id along with the heartbeat msg from FE to BE, so that BE can know the Id of themselves.	2020-04-26 21:30:23 +08:00
Yingchun Lai	72f3082358	[Metrics] Add some metrics for container size in BE (#3246 ) We can observe the workload of BE, and also it's a way to check whether there is any problem in BE, like some container increase too large and lead to OOM. This patch add the following metrics: ``` Name Description rowset_count_generated_and_in_use The total count of rowset id generated and in use since BE last start unused_rowsets_count The total count of unused rowset waiting to be GC broker_count The total count of brokers in management data_stream_receiver_count The total count of data stream receivers in management fragment_endpoint_count The total count of fragment endpoints of data stream in management, should always equal to data_stream_receiver_count active_scan_context_count The total count of active scan contexts plan_fragment_count The total count of plan fragments in executing load_channel_count The total count of load channels in management result_buffer_block_count The total count of result buffer blocks for queries, each block has a limited queue size (default 1024) result_block_queue_count The total count of queues for fragments, each queue has a limited size (default 20, by config::max_memory_sink_batch_count) routine_load_task_count The total count of routine load tasks in executing small_file_cache_count The total count of cached small files' digest info stream_load_pipe_count The total count of stream load pipes, each pipe has a limited buffer size (default 1M) tablet_writer_count The total count of tablet writers brpc_endpoint_stub_count The total count of brpc endpoints ```	2020-04-25 16:13:39 +08:00
Yingchun Lai	37fccd53c4	[Tablet] A small refactor on class Tablet (#3339 ) There is no functional changes in this patch. Key refactor points are: - Remove meaningless return value of functions in class Tablet, and also some related functions in other classes - Allow RowsetGraph::capture_consistent_versions to pass a nullptr to the output parameter - Use CHECK instead of LOG(FATAL) to simplify code	2020-04-24 22:22:26 +08:00
HappenLee	4eb27bc7e3	[Profile] Make running profile clearer and more intuitive to improve usability (#3365 ) (#3383 ) This CL mainly made the following modifications: 1. Delete Invalid method in Running Profile Class. 2. Move Memlimit Counter from blockmgr to fragment and add PeakMemUsage Counter 3. Fix the bug of buffer pool memlimit counter 4. Call compute_time_in_profile() before pretty_print() to show the _local_time_percent without child running profile 5. Add TransferThread ThreadToken count in AveThreadToken Counter	2020-04-24 21:38:55 +08:00
yangzhg	a58bc1957e	Fix expect may produce incorrect values (#3381 )	2020-04-23 09:35:41 +08:00
HangyuanLiu	ad6698cd31	[Performance] Use Google/CCTZ to replace boost at timezone function (#3300 ) NOTICE: the thirdparty dependency need to upgrade to add libcctz.	2020-04-23 09:26:04 +08:00
Yingchun Lai	4a7a88ede1	[LSAN] Fix some memory leak detected by LSAN (#3326 )	2020-04-22 22:59:44 +08:00
Yingchun Lai	22e90f7260	[SegmentV2] Fix bloom filter bits buffer not initialize as 0 (#3372 )	2020-04-22 19:50:05 +08:00
jmk1011	5c53e0fee7	[UnitTest] Modify test to be compatible with coverage tool (#3366 ) C ++ R syntax is not compatible with coverage tools, so modify the syntax for test case.	2020-04-21 21:23:17 +08:00
Yunfeng,Wu	b60aabda11	[Doris On ES] Pushdown some castexpr predicate to ES (#3351 ) Process castexpr, such as: k (float) > 2.0, k(int) > 3.2, Doris On Es should ignore this doris native cast transformation for every row's col value, we push down this `cast semantic` to Elasticsearch. I believe in this `predicate` situation, would decrease the mount of data for transmission。 k1 is float: ```` k1 >= 5 ```` push-down filter: ``` {"range":{"k1":{"gte":"5.000000"}}} ``` k2 is int : ``` k2 > 3.2 ``` push-down filter: ``` {"range":{"k2":{"gte":"3.2"}}} ```	2020-04-21 08:34:20 +08:00
caiconghui	67b0da5652	Fix rowset_meta race condition for commit_txn in TxnManager (#3330 )	2020-04-18 18:38:48 +08:00
Yunfeng,Wu	0624f6b9eb	[Doris On ES]Add simple explain for EsTable (#3341 ) related issue: #3306 Note: this PR just remove the es_scan_node_test.cpp which is useless For the moment, just add a simple explain syntax for EsTable without translating the native predicates to ES queryDSL which is better to finished with moving the predicate translating from Doris BE to Doris FE, the whole work is still WIP.	2020-04-18 10:04:03 +08:00
caiconghui	224f5d8bad	[SegmentV1] Enable to read and write boolean type data (#3324 ) This PR is to enable to read and write boolean type data for segment v1	2020-04-16 23:39:08 +08:00
HuangWei	91438fcb40	[rowset id] Reduce memory of UniqueRowsetIdGenerator (#3316 )	2020-04-14 22:27:49 +08:00
令狐少侠	688927918c	[Doris on ES] Fix bug: when Doris and ES type not match (#3315 )	2020-04-14 20:15:13 +08:00
HuangWei	807499427c	unregister fragment mem tracker in close() (#3286 ) ref https://github.com/apache/incubator-doris/issues/3273 P.S. `614a76beea/be/src/runtime/plan_fragment_executor.cpp (L559-L562)` I think this piece of code is useless. This `_mem_tracker` in `PlanFragmentExecutor` is set as fragment_mem_tracker of `RuntimeState`. direct use We use it in these code, when rowbatch reset, mem tracker's consumption will be released. `7eab12a40e/be/src/exec/olap_rewrite_node.cpp (L57-L58)` `839ec45197/be/src/exec/olap_scan_node.cpp (L1217-L1218)` other usage e.g. `6c33f80544/be/src/exec/olap_scanner.cpp (L245)` won't consume the fragment mem tracker. We don't need to worry about the fragment mem tracker consumption is not zero when we want to destroy it. Or we can add a consumption check before we close the mem tracker?	2020-04-13 23:15:56 +08:00
Yunfeng,Wu	a467c6f81f	[ES Connector] Add field context for string field keyword type (#3305 ) This PR is just a transitional way，but it is better to move the predicates transformation from Doris BE to Doris BE, in this way, Doris BE is responsible for fetching data from ES. Add a `enable_keyword_sniff ` configuration item in creating External Elasticsearch Table ，it default to true , would to sniff the `keyword` type on the `text analyzed` Field and return the `json_path` which substitute the origin col name. ``` CREATE EXTERNAL TABLE `test` ( `k1` varchar(20) COMMENT "", `create_time` datetime COMMENT "" ) ENGINE=ELASTICSEARCH PROPERTIES ( "hosts" = "http://10.74.167.16:8200", "user" = "root", "password" = "root", "index" = "test", "type" = "doc", "enable_keyword_sniff" = "true" ); ``` note: `enable_keyword_sniff` default to "true" run this SQL： ``` select * from test where k1 = "wu yun feng" ``` Output predicate DSL： ``` {"term":{"k1.keyword":"wu yun feng"}} ``` and in this PR, I remove the elasticsearch version detected logic for now this is useless, maybe future is needed.	2020-04-13 23:07:33 +08:00
Lijia Liu	be090f5929	Use read lock when iterate tablet_map in TabletManager::start_trash_sweep (#3294 )	2020-04-13 11:18:33 +08:00
lichaoyong	3086790e06	Fix bug when use ZoneMap/BloomFiter on column with REPLACE/REPLACE_IF_NOT_NULL (#3288 ) Now, column with REPLACE/REPLACE_IF_NOT_NULL can be filtered by ZoneMap/BloomFilter when the rowset is base(version starts with zero). Always we think is an optimization. But when some case, it will occurs bug. create table test( k1 int, v1 int replace, v2 int sum ); If I have two records on different two versions 1 2 2 on version [0-10] 1 3 1 on version 11 If I perform a query select * from test where k1 = 1 and v1 = 3; The result will be 1 3 1, this is not right because of the first record is filtered. The right answer is 1 3 3, the v2 should be summed. Remove this optimization is necessity to make the result is right.	2020-04-10 10:22:21 +08:00
Yingchun Lai	f39c8b156d	[refactor] A small refactor on class DataDir (#3276 ) main refactor points are: - Use a single get_absolute_tablet_path function instead of 3 independent functions - Remove meaningless return value of register_tablet and deregister_tablet - Some typo and format	2020-04-10 00:32:22 +08:00
caiconghui	a5703ef114	[Performance] Support sharding txn_map_lock into more small map locks to make good performance for txn manage task (#3222 ) This PR is to enhance the performance for txn manage task, when there are so many txn in BE, the only one txn_map_lock and additional _txn_locks may cause poor performance, and now we remove the additional _txn_locks and split the txn_map_lock into many small locks.	2020-04-09 22:35:15 +08:00
HuangWei	e32ed28bf4	[Storage] Use getmntent_r() for thread-safe (#3284 )	2020-04-09 14:19:09 +08:00
Yunfeng,Wu	614a76beea	[Doris on ES] Support compound_and predicate push down to Elasticsearch (#3277 ) Relate Issue: https://github.com/apache/incubator-doris/issues/3248 SQL: ``` select * from test where (k2 = 6 and k3 = 1) or (k2 = 2 and k3 =3 and k4 = 'beijing'); ``` Output filter: ``` ((#k2:[6 TO 6] #k3:[1 TO 1]) (#(#k2:[2 TO 2] #k3:[3 TO 3]) #k4:beijing))~1 ``` SQL: ``` select * from test where (k2 = 6 or k3 = 7) or (k2 = 2 and k3 =3 and (k4 = 'beijing' or k4 = 'zhaochun')); ``` Output filter: ``` (k2:[6 TO 6] k3:[7 TO 7] (#(#k2:[2 TO 2] #k3:[3 TO 3]) #((k4:beijing k4:zhaochun)~1)))~1 ``` SQL: ``` select * from test where (k2 = 6 or k3 = 7) or (k2 = 2 and abs(k3) =3 and (k4 = 'beijing' or k4 = 'zhaochun')); ``` Output filter (`abs` can not be pushed down to es, so doris on es would not process this scenario ): ``` match_all ```	2020-04-08 21:09:39 +08:00
Dayue Gao	3557b12de5	[Bug] Avoid compacting recengly added rowset (#3271 ) This CL fixes #3270 by skipping recently added version when performing cumulative compaction. A new config named "cumulative_compaction_skip_window_seconds" is added to adjust the time window.	2020-04-08 18:58:12 +08:00
Yingchun Lai	8fc284d593	[config] Support to modify configs when BE is running without restarting (#3264 ) In the past, when we want to modify some BE configs, we have to modify be.conf and then restart BE. This patch provides a way to modify configs in the type of 'threshold', 'interval', 'enable flag' when BE is running without restarting it. You can update a single config once by BE's http API: `be_host:be_http_port/api/update_config?config_name=new_value`	2020-04-08 11:17:47 +08:00
Dayue Gao	d110629a5f	Optimize performance of TxnManager::build_expire_txn_map (#3269 ) It's not possible to insert duplicated transaction ids for a specific tablet, therefore we could use map<TabletInfo, vector<int64_t>> instead of map<TabletInfo, set<int64_t>> for expire_txn_map.	2020-04-07 23:54:05 +08:00
HuangWei	162b1c5d8b	[Storage] Open data dirs parallelly (#3260 )	2020-04-07 20:59:56 +08:00
Mingyu Chen	1ef4cb2d24	[Bug] Base compaction failed because of overlapping of input rowsets (#3262 ) When calculating the cumulative point at first time, we should stop increasing the cumulative point when we meet a rowset with overlap flag as OVERLAPPING, even if it has only one segments.	2020-04-07 11:26:57 +08:00
HuangWei	2ed184e06a	Add config: tablet writer open rpc timeout (#3258 )	2020-04-03 16:43:56 +08:00
lichaoyong	d2307c719c	Fix be unit test error (#3259 )	2020-04-03 15:02:49 +08:00
lichaoyong	a86161f6ce	[Bug]Fix compile error (#3257 )	2020-04-03 13:38:44 +08:00
lichaoyong	881661ac10	Fix spell error (#3255 )	2020-04-03 10:43:09 +08:00
yangzhg	63cee94c5c	Fix output results may incorrect when using intersect and except statements (#3228 ) output results may incorrect when using intersect and except statements	2020-04-01 20:58:43 +08:00
lichaoyong	6a9a62901f	Fix bug of memory limit when group by varchar columns. (#3242 ) select date_format(k10, '%Y%m%d') as myk10 from baseall group by myk10; The date_format function in query above will be stored in MemPool during the query execution. If the query handles millions of rows, it will consume much memory. Should clear the MemPool at interval.	2020-04-01 18:48:18 +08:00
Dayue Gao	8a2eb8fbcf	[Bug][segment_v2] Fix a bug that NullBitmapBuilder is not reset when data page doesn't have null (#3240 ) This CL fixes a bug that could cause wrong answer for beta rowset with nullable column. The root cause is that NullBitmapBuilder is not reset when the current page doesn't contain NULL, which leads to wrong null map to be written for the next page. Added a test case to reproduce the problem.	2020-04-01 18:39:04 +08:00
HuangWei	5f9359d618	Use SleepFor() instead of usleep() (#3211 )	2020-03-29 14:18:19 +08:00
Yingchun Lai	e4682398bd	[web] Dump configs on BE's website '/varz' (#3220 ) Dump configs on BE's website '/varz' Change NAVIGATION_BAR_PREFIX from 'Impala' to 'Doris' Format the related files by clang-format	2020-03-28 16:26:38 +08:00
HuangWei	0462607d8d	StorageEngine: unused_rowsets use unordered_multimap (#3207 )	2020-03-27 14:30:31 +08:00
Yingchun Lai	cc31bf9cf9	[rowset id] A little improvement of rowset id generator (#3203 ) The main optimization points: 1. Use std::unordered_set instead of std::set, and use RowsetId.hi as RowsetId's hash value. 2. Minimize the scope of SpinLock in UniqueRowsetIdGenerator. Profile comparation: * Run UniqueRowsetIdGeneratorTest.GenerateIdBenchmark 10 times old version \| new version 6s962ms \| 3s647ms 6s139ms \| 3s393ms 6s234ms \| 3s686ms 6s060ms \| 3s447ms 5s966ms \| 4s127ms 5s786ms \| 3s994ms 5s778ms \| 4s072ms 6s193ms \| 4s082ms 6s159ms \| 3s560ms 5s591ms \| 3s654ms	2020-03-26 20:24:26 +08:00
HangyuanLiu	a07fedd832	Fix unix_timestamp core where time less 1970 (#3198 )	2020-03-25 23:16:58 +08:00
Seaven	8426669472	[Plugin] Add BE plugin framework (#2348 ) (#2618 ) Support BE plugin framework, include: * update Plugin Manager, support Plugin find method * support Builtin-Plugin register method * plugin install/uninstall process * PluginLoader: * dynamic install and check Plugin .so file * dynamic uninstall and check Plugin status * PluginZip: * support plugin remote/local .zip file download and extract TODO: * We should support a PluginContext to transmit necessary system variable when the plugin's init/close method invoke * Add the entry which is BE dynamic Plugin install/uninstall process, include: * The FE send install/uninstall Plugin statement (RPC way) * The FE meta update request with Plugin list information * The FE operation request(update/query) with Plugin (maybe don't need) * Add the plugin status upload way * Load already install Plugin when BE start	2020-03-25 21:55:44 +08:00
Mingyu Chen	8aa8b8c96d	[Code Refactor] Using block manager to unify the data file access. (#3189 ) Earlier we introduced `BlockManager` to separate data access logic from underlying file read and write logic. This CL further unifies all `SegmentV2` data access to the `BlockManager`, removes the previous `FileManager` class, and move the file cache to the `FileBlockManager`. There are no logical changes to this CL. After this CL, all user table data is read through the `WritableBlock` and `ReadableBlock` returned by the `BlockManager`, and no file operations are performed directly.	2020-03-25 20:39:07 +08:00
lichaoyong	e20d905d70	Remove unused KUDU codes (#3175 ) KUDU table is no longer supported long time ago. Remove code related to it.	2020-03-24 13:54:05 +08:00
HangyuanLiu	d4c1938b5c	Open datetime min value limit (#3158 ) the min_value in olap/type.h of datetime is 0000-01-01 00:00:00, so we don't need restrict datetime min in tablet_sink	2020-03-24 10:52:57 +08:00
EmmyMiao87	dff3c0d57e	Revert "Remove deep copy when doing hash table EvalRow (#3171 )" (#3173 )	2020-03-23 15:29:46 +08:00
wyb	dd8d748c55	Remove deep copy when doing hash table EvalRow (#3171 ) remove varchar column deep copy in partitioned hash table EvalRow function	2020-03-21 09:52:49 +08:00
yangzhg	d29ed84b6a	[Bug] Fix bug that right semi/anti join is not right (#3167 ) This bug is introduced by PR: #3148. right semi/anti join can not use `insert_unique` in build phase of join.	2020-03-20 20:58:55 +08:00

1 2 3 4 5 ...

871 Commits