doris

Author	SHA1	Message	Date
Mingyu Chen	a5922051c9	[Fix] Fix bug that rowset meta is deleted after compaction (#3451 ) * [Fix] Fix bug that rowset meta is deleted after compaction After compaction, the tablet rowset meta will be modified by adding to new output rowsets and deleting the old input rowsets. The output version may equals to the input version. So we should delete the "input" version from _rs_version_map before adding the "output" version to _rs_version_map. Otherwise, the new "output" version will be lost in _rs_version_map.	2020-05-04 09:45:25 +08:00
Binglin Chang	4737aff8fc	[Memory Engine] Make Tablet extensible (#3431 ) Adding a new storage engine, we need to make an extensible tablet interface, so olap/StorageEngine can support and manage new tablet types. To start, this commit creates a class BaseTablet and make Tablet and new MemTablet inherit this base class, some common fields & methods are moved to BaseTablet class, which fields and methods belong to base/old class is not finalized yet, it will change as the project evolves. Fix #3384	2020-05-01 21:21:09 +08:00
Yingchun Lai	37fccd53c4	[Tablet] A small refactor on class Tablet (#3339 ) There is no functional changes in this patch. Key refactor points are: - Remove meaningless return value of functions in class Tablet, and also some related functions in other classes - Allow RowsetGraph::capture_consistent_versions to pass a nullptr to the output parameter - Use CHECK instead of LOG(FATAL) to simplify code	2020-04-24 22:22:26 +08:00
Dayue Gao	3557b12de5	[Bug] Avoid compacting recengly added rowset (#3271 ) This CL fixes #3270 by skipping recently added version when performing cumulative compaction. A new config named "cumulative_compaction_skip_window_seconds" is added to adjust the time window.	2020-04-08 18:58:12 +08:00
Mingyu Chen	1ef4cb2d24	[Bug] Base compaction failed because of overlapping of input rowsets (#3262 ) When calculating the cumulative point at first time, we should stop increasing the cumulative point when we meet a rowset with overlap flag as OVERLAPPING, even if it has only one segments.	2020-04-07 11:26:57 +08:00
Yingchun Lai	c08d6e4708	[tablet meta] Do some refactor on TabletMeta (#3136 ) remove some functions' return value which always return OLAP_SUCCESS optimize some loops	2020-03-20 15:03:22 +08:00
Mingyu Chen	42931d22cb	[Bug] tablet meta is not updated correctly after compaction (#3098 ) This CL try to fix a potential bug describe in ISSUE: #3097. But I'm not sure this is the root cause. Also remove lots of verbose log, and fix a memory leak.	2020-03-14 23:39:11 +08:00
kangkaisen	625411bd28	Doris support in memory olap table (#2847 )	2020-02-18 10:45:54 +08:00
LingBin	feef077520	Some refactors on `TabletManager` (#2918 ) 1. Add some comments to make the code easier to understand; 2. Make the metric `create_tablet_requests_failed` to be accurate; 3. Some internal methods use naked pointers directly instead of `shared_ptr`; 4. The `using` in `.h` files are contagious when included by other files, so we should only use it in `.cpp` files; 5. Some formatting changes: such as wrapping lines that are too long 6. Parameters that need to be modified, use pointers instead of references No functional changes in this patch.	2020-02-17 14:50:29 +08:00
LingBin	14c772013b	Fix removing tablet bug from partition_map in TabletManager (#2842 ) When using an iterator of _tablet_map.tablet_arr(`std::list`) to remove a tablet, we should first remove tablet from _partition_map to avoid the iterator becoming invalid.	2020-02-06 09:57:12 +08:00
LingBin	7c4149cf27	Improve comparison and printing of Version (#2796 ) * Improve comparison and printing of Version There are two members in `Version`:` first` and `second`. There are many places where we need to print one `Version` object and compare two `Version` objects, but in the current code, these two members are accessed directly, which makes the code very tedious. This patch mainly do: 1. Adds overloaded methods for `operator<<()` for `Version`, so we can directly print a Version object; 2. Adds the `cantains()` method to determine whether it is an containment relationship; 3. Uses `operator==()` to determine if two `Version` objects are equal. Because there are too many places need to be modified, there are still some naked codes left, which will be modified later. This patch also removes some necessary header file references. No functional changes in this patch.	2020-01-19 18:04:28 +08:00
Dayue Gao	4e2f01a9fa	[Compaction] Fix a bug that CumulativeCompaction compares time of different precision (#2693 ) time(NULL) returns second-resolution timestamp, however all compaction related time in Tablet are in millis-resolution. Therefore should use UnixMillis() instead.	2020-01-07 21:31:36 +08:00
lichaoyong	4c5b0b6dc9	Remove VersionHash used to comparison in BE (#2622 )	2019-12-31 19:38:45 +08:00
Mingyu Chen	1421a9be41	[Compaction] Support compact only one rowset (#2558 ) Support compaction operation to compact only one rowset. After the modification, the last rowset of the tablet will also be compacted. At the same time, we added a `segments_overlap_pb` field to the rowset meta. Used to describe whether the segment data in the rowset overlaps. This field is set by `rowset_writer`. Initially UNKNOWN for compatibility with existing data. In addition, the version hash of the rowset generated after compaction is directly set to the version hash of last rowset participating in compaction, to ensure that the tablet's version hash remains unchanged after compaction.	2019-12-27 10:08:41 +08:00
Mingyu Chen	222f8390c7	[Compaction] Fix the bug that cumulative point grows unreasonably (#2490 ) When there are to many segment in one rowset, which is larger than BE config 'max_cumulative_compaction_num_singleton_deltas', the cumulative compaction will not work and just increase the cumulative point, because there is only once rowset being selected. So when selecting rowset for cumulative compaction, we should meet 2 requirments before finishing the selection logic: 1. compaction score is larger than 'max_cumulative_compaction_num_singleton_deltas' 2. at least 2 rowsets are selected.	2019-12-18 12:59:17 +08:00
Mingyu Chen	e1ba0efbc7	Optimize compaction strategy of tablet on BE (#2473 ) The current compaction selection strategy and cumulative point update logic will cause the cumulative compaction to not work, and all compaction tasks will be completed only by the base compaction. This can cause a large number of data versions to pile up. In the current cumulative point update logic, when a cumulative cannot select enough number of rowsets, it will directly increase the cumulative point. Therefore, when the data version generates the same speed as the cumulative compaction polling, it will cause the cumulative point to continuously increase without triggering the cumulative compaction. The new strategy mainly modifies the update logic of cumulative point to ensure that the above problems do not occur. At the same time, the new strategy also takes into account the problem that compaction cannot be performed if cumulative points stagnate for a long time. Cumulative points will be forced to increase through threshold settings to ensure that compaction has a chance to execute. Also add a new HTTP API to view the compaction status of specified tablet. See `compaction-action.md` for details.	2019-12-17 10:30:43 +08:00
Lijia Liu	4d958ec7a1	Fix BE do_tablet_meta_checkpoint retain _meta_lock for a long time (#2430 ) Add a flag in RowsetMeta to record whether it has been deleted from rowset meta. Before this PR, 37156 rowsets only cost 1642 s. With this PR, 37319 rowsets just cost 1 s.	2019-12-12 23:21:43 +08:00
Mingyu Chen	c39d35df4c	Add tablet compaction score metrics (#2427 ) [Metric] Add tablet compaction score metrics Backend: Add metric "tablet_max_compaction_score" to monitor the current max compaction score of tablets on this Backend. This metric will be updated each time the compaction thread picking tablets to compact. Frontend: Add metric "tablet_max_compaction_score" for each Backend. These metrics will be updated when backends report tablet. And also add a calculated metric "max_tablet_compaction_core" to monitor the max compaction core of tablets on all Backends.	2019-12-12 17:46:59 +08:00
lichaoyong	fbee3c7722	Remove VersionHash used to comparison in BE (#2358 )	2019-12-04 20:09:03 +08:00
kangpinghuang	6b4ef34162	fix AlphaRowsetTest by remove StorageEngine #2078 (#2091 )	2019-10-30 19:39:41 +08:00
Dayue Gao	8aa2cbe12d	Load Rowset only once in a thread-safe manner (#2022 ) [Storage] This PR implements thread-safe `Rowset::load()` for both AlphaRowset and BetaRowset. The main changes are 1. Introduce `DorisCallOnce<ReturnType>` to be the replacement for `DorisInitOnce` . It works for both Status and OLAPStatus. 2. `segment_v2::ColumnReader::init()` is now implemented by DorisCallOnce. 3. `segment_v2::Segment` is now created by a factory open() method. This guarantees all Segment instances are in opened state. 4. `segment_v2::Segment::_load_index()` is now implemented by DorisCallOnce. 5. Implement thread-safe load() for AlphaRowset and BetaRowset	2019-10-21 16:05:12 +08:00
lichaoyong	9fb9dbefca	Get rid of compaction on rowset when making snapshot (#1977 ) When making snapshot for incremental clone, missed singleton versions may have been compacted. So get_rowset_by_version() will not acquire any rowset. This rowset should be acquired from _inc_rs_version_map.	2019-10-14 22:07:31 +08:00
yiguolei	a6a9b0021f	Check tablet state before update it (#1974 ) After alter BE try to set tablet state to running but the tablet maybe dropped	2019-10-14 16:04:47 +08:00
yiguolei	8232261df1	Lost rowset during tablet revise tablet meta (#1967 )	2019-10-12 23:30:11 +08:00
yiguolei	7370b44ab2	Tablet report does not set version miss (#1961 )	2019-10-12 14:36:49 +08:00
yiguolei	e4f3e8fda7	Remove redundant method in rowset meta manager (#1949 )	2019-10-10 19:29:59 +08:00
yiguolei	b72a4a4bc6	Add tablet meta checkpoint mechanism (#1936 )	2019-10-10 09:39:02 +08:00
lichaoyong	09482c9f52	Take segments in singleton rowset into consideration upon cumulative compaction (#1866 ) In previous compaction, only rowsets will be taken into consideration. Doing streaming load, the singleton rowset may is made up of many overlapping segments. Scanning these overlapping segments will result in read amplification. To address this problem, overlapping segments should be taken into consideration when doing cumulative compaction to reduce read amplification.	2019-09-25 15:27:44 +08:00
Mingyu Chen	9aa2045987	Refactor alter job (#1695 )	2019-09-12 16:31:29 +08:00
yiguolei	ca23b7a511	Should create init rowset for alter task v2 (#1767 )	2019-09-09 13:59:19 +08:00
yiguolei	981e0feb99	Check rowset is useful atomicly (#1750 ) * Check rowset is useful atomicly * Only release rowset id when it is added to unused rowset * remove release rowset id when save rowset meta	2019-09-06 17:21:42 +08:00
Dayue Gao	85940a292b	RowsetFactory as a single entry for Rowset creation (#1748 )	2019-09-05 18:10:18 +08:00
Dayue Gao	f76dad289e	Basic implementation for BetaRowsetReader (#1718 )	2019-09-03 13:52:16 +08:00
yiguolei	6f4feca3dc	Add rowset id generator to FE and BE (#1678 )	2019-09-02 18:51:31 +08:00
yiguolei	cc7a2a3eb8	Check all tablet using partition tablet map during publish version (#1619 )	2019-08-14 18:24:52 +08:00
lichaoyong	dcb75729db	Change cumulative compaction for decoupling storage from compution (#1576 ) 1. Calculate cumulative point when loading tablet first time. 2. Simplify pick rowsets logic upon delete predicate. 3. Saving meta and modify rowsets only once after cumulative compaction.	2019-08-13 18:25:56 +08:00
yiguolei	41cbedf57d	Manage tablet by partition id (#1591 )	2019-08-07 20:54:50 +08:00
lichaoyong	0d48a3961c	Refactor Storage Engine (#1478 ) NOTE: This patch would modify all Backend's data. And this will cause a very long time to restart be. So if you want to interferer your product environment, you should upgrade backend one by one. 1. Refactoring be is to clarify the structure the codes. 2. Use unique id to indicate a rowset. Nameing rowset with tablet_id and version will lead to many conflicts among compaction, clone, restore. 3. Extract an rowset interface to encapsulate rowsets with different format.	2019-07-15 21:18:22 +08:00

38 Commits