doris

Author	SHA1	Message	Date
Mingyu Chen	c440aa07d1	Revert "[Refactor] Refactor DeleteHandler and Cond module (#4925 )" (#5028 ) This reverts commit 9c9992e0aa28ee85364eebf86a6675f1073e08fb. Co-authored-by: morningman <chenmingyu@baidu.com>	2020-12-05 21:39:49 +08:00
Yingchun Lai	9c9992e0aa	[Refactor] Refactor DeleteHandler and Cond module (#4925 ) This patch mainly do the following refactors: - Use int64_t instead of int32_t for 'version' in DeleteHandler - Move some comments from .cpp to .h file, add some new comments in .h files, and also remove some meaningless comments - Use switch...case... instead of multiple if..else.. for DeleteConditionHandler::is_condition_value_valid - Use range loop to simplify code - Reduce some compare operations in Cond::del_eval - Improve some branch predictions in Reader - Fix and improve some unit tests	2020-12-04 12:13:30 +08:00
weizuo93	ec7e1c6b1b	[Refactor] Execute 'pick rowsets' before applying for permits for a compaction task (#4891 ) The current compaction mechanism is that there is a producer thread that has been producing compaction tasks, and the selected tablet must apply for `permits`. When a tablet could hold `permits`, compaction task for this tablet will be submitted to thread pool. We take compaction score as `permits` which is used for limiting memory consumption. However, `pick_rowset_to_compaction()` will be executed before the file merge in compaction thread, and the number of segment files that actually perform the merge operation is smaller than compaction score. In addition, it is also possible that compaction task exits directly because the tablet doesn't meet the requirements of compaction. This patch optimizes and refactors the code of compaction, so that we can execute 'pick rowsets' before applying for permits for a compaction task, calculate the number of segment files that actually participate in the merge operation, and take this number as `permits`.	2020-11-30 11:41:14 +08:00
sduzh	6fedf5881b	[CodeFormat] Clang-format cpp sources (#4965 ) Clang-format all c++ source files.	2020-11-28 18:36:49 +08:00
sduzh	10e1e29711	Remove header file common/names.h (#4945 )	2020-11-26 17:00:48 +08:00
Yingchun Lai	b7b1d5eb38	[Refactor] Short circuit return to avoid meaningless loop (#4933 )	2020-11-24 13:46:50 +08:00
weizuo93	6247408689	[Compact]Take tablet scan frequency into consider when selecting tablet for compaction (#4837 ) A large number of small segment files will lead to low efficiency for scan operations. Multiple small files can be merged into a large file by compaction operation. So we could take the tablet scan frequency into consideration when selecting an tablet for compaction and preferentially do compaction for those tablets which are scanned frequently during a latest period of time at the present. Using the compaction strategy of Kudu for reference, scan frequency can be calculated for tablet during a latest period of time and be taken into consideration when calculating compaction score.	2020-11-18 21:51:12 +08:00
Mingyu Chen	f239f44b37	[Compaction][Bug-Fix] Fix bug that meta lock need to be held when calculating compaction score (#4829 ) * [Compaction][Buf] Fix bug that meta lock need to be held when calucating compaction score * fix Co-authored-by: morningman <chenmingyu@baidu.com>	2020-11-05 20:29:01 +08:00
Zhengguo Yang	09f97f8a05	[Refactor] Fixes some be typo part 2 (#4747 )	2020-10-20 09:28:57 +08:00
HuangWei	e31b4a4561	[Bug] fix illegal defer in Tablet::rowset_with_max_version() (#4737 )	2020-10-17 13:44:15 +08:00
weizuo93	eba595583e	[Optimize] Optimize the execution model of compaction to limit memory consumption (#4670 ) Currently, there are M threads to do base compaction and N threads to do cumulative compaction for each disk. Too many compaction tasks may run out of memory, so the max concurrency of running compaction tasks is limited by semaphore. If the running threads cost too much memory, we can't defense it. In addition, reducing concurrency to avoid OOM will lead to some compaction tasks can't be executed in time and we may encounter more heavy compaction. Therefore, concurrency limitation is not enough. The strategy proposed in #3624 may be effective to solve the OOM. A CompactionPermitLimiter is used for compaction limitation, and use single-producer/multi-consumer model. Producer will try to generate compaction tasks and acquire `permits` for each task. The compaction task which can hold `permits` will be executed in thread pool and each finished task will release its `permits`. `permits` should be applied for before a compaction task can execute. When the sum of `permits` held by executing compaction tasks reaches a threshold, subsequent compaction task will be no longer allowed, until some `permits` are released. Tablet compaction score is used as `permits` of compaction task here. To some extent, memory consumption can be limited by setting appropriate `permits` threshold.	2020-10-11 11:39:25 +08:00
ZhangYu0123	4f7cfee908	[compaction][config] Change default config policy to size_based (#4599 ) (1) change default compaction config policy to size_based (2) change missed version check policy when delete stale rowsets	2020-09-16 15:04:06 +08:00
ZhangYu0123	d29bf30f74	[BUG] Fix stale path delete checking logic when current main path is missing. (#4549 ) Fix stale path delete checking logic. When current main path is version missing, then delete checking logic is always core dumped. So we fix the checking logic to tolerate current main version missing.	2020-09-08 18:52:53 +08:00
ZhangYu0123	c29d41f675	[BUG] Fix recover persistent stale rowsets bug from multi-single version rowsets in stale rowsets (#4513 ) (1) fix recover persistent stale rowsets bug from multi-single version rowset in stale rowsets (2) delete_expired_inc_rowsets check consistent version convert to [0, max_version]	2020-09-03 16:59:18 +08:00
Yingchun Lai	498b06fbe2	[Metrics] Support tablet level metrics (#4428 ) Sometimes we want to detect the hotspot of a cluster, for example, hot scanned tablet, hot wrote tablet, but we have no insight about tablets in the cluster. This patch introduce tablet level metrics to help to achieve this object, now support 4 metrics on tablets: `query_scan_bytes `, `query_scan_rows `, `flush_bytes `, `flush_count `. However, one BE may holds hundreds of thousands of tablets, so I add a parameter for the metrics HTTP request, and not return tablet level metrics by default.	2020-09-02 10:39:41 +08:00
ZhangYu0123	1d93ba027a	[Compaction] Compaction show policy type and disk format (#4466 ) Add more information in compaction show api 1、add cumulative policy type 2、format rowset total disk size	2020-08-30 21:09:47 +08:00
ZhangYu0123	123237afb7	[Compaction] Persistence stale rowsets meta (#4454 ) Persistence stale rowsets meta. When BE reboots, stale rowsets meta can resume and the stale version can also be readable before stale gc time. ISSUE: #4453	2020-08-30 21:05:48 +08:00
ZhangYu0123	a7422ee142	[UT][Bug-Fix] Resolve UT memory leak problem (#4406 ) Fix ut memory leak on Fix #4164	2020-08-21 10:41:54 +08:00
Mingyu Chen	4c571cb6f5	Revert "[Metrics] Support tablet level metrics (#4327 )" (#4397 ) This reverts commit 56260a65c87830ffe34109195ee4d6f1d543e630. Co-authored-by: morningman <chenmingyu@baidu.com>	2020-08-19 22:37:52 +08:00
ZhangYu0123	dc3ed1c525	[Compaction]Compaction rules optimization (#4212 ) Compaction rules optimization, the detail problem description and design to see #4164. This pr commits 2 functions: (1) add the cumulative policy configable, and implement original policy. (2) implement universal policy, the optimization version in #4164.	2020-08-19 09:34:13 +08:00
Yingchun Lai	56260a65c8	[Metrics] Support tablet level metrics (#4327 ) Sometimes we want to detect the hotspot of a cluster, for example, hot scanned tablet, hot wrote tablet, but we have no insight about tablets in the cluster. This patch introduce tablet level metrics to help to achieve this object, now support 4 metrics on tablets: `query_scan_bytes `, `query_scan_rows `, `flush_bytes `, `flush_count `. However, one BE may holds hundreds of thousands of tablets, so I add a parameter for the metrics HTTP request, and not return tablet level metrics by default.	2020-08-18 16:56:12 +08:00
Mingyu Chen	3359467b9a	[Tablet][Recovery] Support using empty tablet to repair the damaged or missing tablet (#4255 ) In some very special circumstances, such as code bugs, or human misoperation, etc., all replicas of some tablets may be lost. In this case, the data has been substantially lost. However, in some scenarios, the business still hopes to ensure that the query will not report errors even if there is data loss, and reduce the perception of the user layer. At this point, we can use the blank Tablet to fill the missing replica to ensure that the query can be executed normally. Add a new FE config `recover_with_empty_tablet`. default is false. true means to use empty tablet to fill the missing one. Also fix a bug in Fix #4274	2020-08-18 06:13:53 +00:00
ZhangYu0123	3372958a4c	[BUG] Fix calculation of cumulative point (#4259 ) Fix calculation of cumulative point. The problem is calculation of cumulative point is wrong when be restarts and there is delete rowset. also see #4258	2020-08-06 23:13:43 +08:00
ZhangYu0123	16c89c7d56	[BUG]Fix remove expired stale rowset path order error (#4214 ) Delete stale rowset path order error. This bug leads to stale rowsets version inconsistents. #4213	2020-08-01 17:44:39 +08:00
ZhangYu0123	03cf9b2a24	[Compaction] Add delayed deletion of rowsets function, fix -230 error. (#4039 ) Related issue #4017, main changes as follows: 1. Add expired_snapshot_rs_version_map，_expired_snapshot_rs_metas， 2. Add VersionedRowsetTracker record compacted path version 3. Record path version when rowsets compact 4. In gc process, add expired snapshot rowsets to unused set to remove.	2020-07-19 22:03:59 +08:00
Mingyu Chen	15d9e10a8b	[Bug] Fix bug that tablet meta lock twice (#4112 ) * [Bug] Fix bug that tablet meta lock twice The tablet meta load may already be hold before calling generate_tablet_meta_copy(), so we need provide a unlocked version of generate_tablet_meta_copy() * fix typo Co-authored-by: chenmingyu <chenmingyu@baidu.com>	2020-07-19 21:27:24 +08:00
caiconghui	2e460f581c	[Bug] Support get all rowset meta info in memory from tablet meta url (#4061 ) This PR is to fix bug that we cannot get the newest tablet meta info from tablet meta url.	2020-07-13 20:53:51 +08:00
Yingchun Lai	a16236f22f	[refactor] Remove useless return value of class RowsetGraph (#3977 )	2020-07-03 09:59:51 +08:00
lichaoyong	93a0b47d22	Revert "[Memory Engine] MemTablet creation and compatibility handling in BE (#3762 )" (#3931 ) This reverts commit ca96ea30560c9e9837c28cfd2cdd8ed24196f787.	2020-06-24 10:13:45 +08:00
Binglin Chang	ca96ea3056	[Memory Engine] MemTablet creation and compatibility handling in BE (#3762 )	2020-06-18 09:56:07 +08:00
Yingchun Lai	43d25afa2c	[compaction] Update cumulative point calculate algorithm (#3690 ) Current cumulative point calculate algorithm may skip singleton rowset when the rowset has only one segment and with NONOVERLAPPING flag. When a tablet is new created and cumulate many singleton rowsets, cumulative point will be calculated as the max version + 1, and then cumulative compaction couldn't pick any rowsets and compaction failed, and will lead the next base compaction on this tablet with all rowsets, which can also cause memory consume problem, suppose there are thousands of rowsets. All singleton rowsets must be newly wrote by delta writer and hasn't do any compaction, we should place cumulative point before any of these rowsets.	2020-05-30 10:34:53 +08:00
Binglin Chang	7524c5ef63	[Memory Engine] Add MemSubTablet, MemTablet, WriteTx, PartialRowBatch (#3637 )	2020-05-30 10:33:10 +08:00
Mingyu Chen	a5922051c9	[Fix] Fix bug that rowset meta is deleted after compaction (#3451 ) * [Fix] Fix bug that rowset meta is deleted after compaction After compaction, the tablet rowset meta will be modified by adding to new output rowsets and deleting the old input rowsets. The output version may equals to the input version. So we should delete the "input" version from _rs_version_map before adding the "output" version to _rs_version_map. Otherwise, the new "output" version will be lost in _rs_version_map.	2020-05-04 09:45:25 +08:00
Binglin Chang	4737aff8fc	[Memory Engine] Make Tablet extensible (#3431 ) Adding a new storage engine, we need to make an extensible tablet interface, so olap/StorageEngine can support and manage new tablet types. To start, this commit creates a class BaseTablet and make Tablet and new MemTablet inherit this base class, some common fields & methods are moved to BaseTablet class, which fields and methods belong to base/old class is not finalized yet, it will change as the project evolves. Fix #3384	2020-05-01 21:21:09 +08:00
Yingchun Lai	37fccd53c4	[Tablet] A small refactor on class Tablet (#3339 ) There is no functional changes in this patch. Key refactor points are: - Remove meaningless return value of functions in class Tablet, and also some related functions in other classes - Allow RowsetGraph::capture_consistent_versions to pass a nullptr to the output parameter - Use CHECK instead of LOG(FATAL) to simplify code	2020-04-24 22:22:26 +08:00
Dayue Gao	3557b12de5	[Bug] Avoid compacting recengly added rowset (#3271 ) This CL fixes #3270 by skipping recently added version when performing cumulative compaction. A new config named "cumulative_compaction_skip_window_seconds" is added to adjust the time window.	2020-04-08 18:58:12 +08:00
Mingyu Chen	1ef4cb2d24	[Bug] Base compaction failed because of overlapping of input rowsets (#3262 ) When calculating the cumulative point at first time, we should stop increasing the cumulative point when we meet a rowset with overlap flag as OVERLAPPING, even if it has only one segments.	2020-04-07 11:26:57 +08:00
Yingchun Lai	c08d6e4708	[tablet meta] Do some refactor on TabletMeta (#3136 ) remove some functions' return value which always return OLAP_SUCCESS optimize some loops	2020-03-20 15:03:22 +08:00
Mingyu Chen	42931d22cb	[Bug] tablet meta is not updated correctly after compaction (#3098 ) This CL try to fix a potential bug describe in ISSUE: #3097. But I'm not sure this is the root cause. Also remove lots of verbose log, and fix a memory leak.	2020-03-14 23:39:11 +08:00
kangkaisen	625411bd28	Doris support in memory olap table (#2847 )	2020-02-18 10:45:54 +08:00
LingBin	feef077520	Some refactors on `TabletManager` (#2918 ) 1. Add some comments to make the code easier to understand; 2. Make the metric `create_tablet_requests_failed` to be accurate; 3. Some internal methods use naked pointers directly instead of `shared_ptr`; 4. The `using` in `.h` files are contagious when included by other files, so we should only use it in `.cpp` files; 5. Some formatting changes: such as wrapping lines that are too long 6. Parameters that need to be modified, use pointers instead of references No functional changes in this patch.	2020-02-17 14:50:29 +08:00
LingBin	14c772013b	Fix removing tablet bug from partition_map in TabletManager (#2842 ) When using an iterator of _tablet_map.tablet_arr(`std::list`) to remove a tablet, we should first remove tablet from _partition_map to avoid the iterator becoming invalid.	2020-02-06 09:57:12 +08:00
LingBin	7c4149cf27	Improve comparison and printing of Version (#2796 ) * Improve comparison and printing of Version There are two members in `Version`:` first` and `second`. There are many places where we need to print one `Version` object and compare two `Version` objects, but in the current code, these two members are accessed directly, which makes the code very tedious. This patch mainly do: 1. Adds overloaded methods for `operator<<()` for `Version`, so we can directly print a Version object; 2. Adds the `cantains()` method to determine whether it is an containment relationship; 3. Uses `operator==()` to determine if two `Version` objects are equal. Because there are too many places need to be modified, there are still some naked codes left, which will be modified later. This patch also removes some necessary header file references. No functional changes in this patch.	2020-01-19 18:04:28 +08:00
Dayue Gao	4e2f01a9fa	[Compaction] Fix a bug that CumulativeCompaction compares time of different precision (#2693 ) time(NULL) returns second-resolution timestamp, however all compaction related time in Tablet are in millis-resolution. Therefore should use UnixMillis() instead.	2020-01-07 21:31:36 +08:00
lichaoyong	4c5b0b6dc9	Remove VersionHash used to comparison in BE (#2622 )	2019-12-31 19:38:45 +08:00
Mingyu Chen	1421a9be41	[Compaction] Support compact only one rowset (#2558 ) Support compaction operation to compact only one rowset. After the modification, the last rowset of the tablet will also be compacted. At the same time, we added a `segments_overlap_pb` field to the rowset meta. Used to describe whether the segment data in the rowset overlaps. This field is set by `rowset_writer`. Initially UNKNOWN for compatibility with existing data. In addition, the version hash of the rowset generated after compaction is directly set to the version hash of last rowset participating in compaction, to ensure that the tablet's version hash remains unchanged after compaction.	2019-12-27 10:08:41 +08:00
Mingyu Chen	222f8390c7	[Compaction] Fix the bug that cumulative point grows unreasonably (#2490 ) When there are to many segment in one rowset, which is larger than BE config 'max_cumulative_compaction_num_singleton_deltas', the cumulative compaction will not work and just increase the cumulative point, because there is only once rowset being selected. So when selecting rowset for cumulative compaction, we should meet 2 requirments before finishing the selection logic: 1. compaction score is larger than 'max_cumulative_compaction_num_singleton_deltas' 2. at least 2 rowsets are selected.	2019-12-18 12:59:17 +08:00
Mingyu Chen	e1ba0efbc7	Optimize compaction strategy of tablet on BE (#2473 ) The current compaction selection strategy and cumulative point update logic will cause the cumulative compaction to not work, and all compaction tasks will be completed only by the base compaction. This can cause a large number of data versions to pile up. In the current cumulative point update logic, when a cumulative cannot select enough number of rowsets, it will directly increase the cumulative point. Therefore, when the data version generates the same speed as the cumulative compaction polling, it will cause the cumulative point to continuously increase without triggering the cumulative compaction. The new strategy mainly modifies the update logic of cumulative point to ensure that the above problems do not occur. At the same time, the new strategy also takes into account the problem that compaction cannot be performed if cumulative points stagnate for a long time. Cumulative points will be forced to increase through threshold settings to ensure that compaction has a chance to execute. Also add a new HTTP API to view the compaction status of specified tablet. See `compaction-action.md` for details.	2019-12-17 10:30:43 +08:00
Lijia Liu	4d958ec7a1	Fix BE do_tablet_meta_checkpoint retain _meta_lock for a long time (#2430 ) Add a flag in RowsetMeta to record whether it has been deleted from rowset meta. Before this PR, 37156 rowsets only cost 1642 s. With this PR, 37319 rowsets just cost 1 s.	2019-12-12 23:21:43 +08:00
Mingyu Chen	c39d35df4c	Add tablet compaction score metrics (#2427 ) [Metric] Add tablet compaction score metrics Backend: Add metric "tablet_max_compaction_score" to monitor the current max compaction score of tablets on this Backend. This metric will be updated each time the compaction thread picking tablets to compact. Frontend: Add metric "tablet_max_compaction_score" for each Backend. These metrics will be updated when backends report tablet. And also add a calculated metric "max_tablet_compaction_core" to monitor the max compaction core of tablets on all Backends.	2019-12-12 17:46:59 +08:00

1 2

70 Commits