doris

Author	SHA1	Message	Date
Xin Liao	fa586c00a9	[fix](merge-on-write) fix that missed rows don't match merged rows (#18128 ) Due to concurrent load, there may be duplication in the delete bitmap of historical data and incremental calculations, resulting in duplicate calculations of missed rows.	2023-03-27 23:00:54 +08:00
Pxl	d8f0ca7108	[Chore](schema change) remove some unused code in schema change (#17459 ) remove some unused code in schema change. remove some row-based config and code.	2023-03-07 09:18:34 +08:00
Xin Liao	b0d67c0358	[fix](merge-on-write) fix cu compaction correctness check (#17347 ) During concurrent import, the same row location may be marked delete multiple times by different versions of rowset. Duplicate row location need to be removed.	2023-03-06 21:31:48 +08:00
zhannngchen	3636d0a561	[feature](merge-on-write) add DCHECK in compaction to detect data inconsistency (#16564 ) MoW will mark all duplicate primary key as deleted, so we can add a DCHECK while compaction, if MoW's delete bitmap works incorrectly, we're able to detect this kind of issue ASAP. In Debug version, DCHECK will make BE crush, in release version, compaction will fail and finally load will fail due to -235	2023-02-22 14:59:18 +08:00
plat1ko	52f9e03eea	[fix](cooldown) Use `pending_remote_rowsets` to avoid deleting rowset files being uploaded (#16803 )	2023-02-21 21:58:20 +08:00
Xin Liao	c98a0bf803	[Enchancement](merge-on-write) check the correctness of rowid conversion after compaction (#16689 ) MoW updates the delete bitmap of the imported data during the compaction by rowid conversion. The correctness of rowid conversion is very important to the result of delete bitmap. So I add a rowid conversion result check.	2023-02-20 16:27:18 +08:00
Xin Liao	2a9e748073	[enhancement](merge-on-write) do compaction with merge on read (#16799 ) To avoid data irrecoverable due to delete bitmap calculation error，do compaction with merge on read. Through this way ，even if the delete bitmap calculation is wrong, the data can be recovered by full compaction.	2023-02-16 19:20:15 +08:00
plat1ko	f1b9185830	[feature](cooldown) Implement cold data compaction (#16681 )	2023-02-14 15:21:54 +08:00
AlexYue	1f631c388d	[enhance](cooldown)accelerate cooldown task produce efficiency (#16089 )	2023-02-10 16:58:27 +08:00
Xin Liao	2bee26b05a	[fix](merge-on-write) fix that the query result has duplicate keys (#16336 ) * [fix](merge-on-write) fix that the query result has duplicate keys * add ut	2023-02-06 17:09:53 +08:00
lihangyu	1d8265c5a3	[refactor](row-store) make row store column a hidden column in meta (#16251 ) This could simplfy storage engine logic and make code more readable, and we could analyze the hidden `__DORIS_ROW_STORE_COL__` length etc..	2023-02-02 20:56:13 +08:00
lihangyu	116e17428b	[Enhancement](point query optimize) improve performace of point query on primary keys (#15491 ) 1. support row format using codec of jsonb 2. short path optimize for point query 3. support prepared statement for point query 4. support mysql binary format	2023-01-20 13:33:01 +08:00
yixiutt	d8990522fb	[conf](compaction) enable vertical_compaction ordered_data_compaction (#14945 )	2023-01-13 23:12:42 +08:00
plat1ko	ab186a60ce	[enhancement](compaction) Optimize judging delete rowset and picking candidate rowsets for compaction #15631 Tablet::version_for_delete_predicate should travel all rowset metas in tablet meta which complex is O(N), however we can directly judge whether this rowset is a delete rowset by RowsetMeta::has_delete_predicate which complex is O(1). As we won't call Tablet::version_for_delete_predicate when pick input rowsets for compaction, we can reduce the critical area of Tablet::_meta_lock.	2023-01-10 08:32:15 +08:00
yixiutt	365c3eec16	[enhancement](compaction) vertical compaction support unique-key mow (#15353 )	2023-01-02 22:53:04 +08:00
plat1ko	ad68764977	[enhancement](tablet) Unify redundant `create_rowset_writer` methods (#15519 ) * Remove redundant create_rowset_writer methods * Set resource id when setting FS in rowset meta * fix * fix ut	2022-12-30 22:57:12 +08:00
yiguolei	83a99a0f8b	[refactor](non-vec) Remove non vec code from be (#15278 ) * [refactor](removecode) remove some non-vectorization Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-12-22 23:28:30 +08:00
plat1ko	f3aea7f0f0	[Enhancement](status) Unify error code and enable customed err msg for BE internal errors (#14744 )	2022-12-11 23:33:18 +08:00
yixiutt	204ab4c951	[enhancement](compaction) add some trigger and delete useless log (#14796 ) 1.add a vertical compaction segment file size config, make it more flexible to set segment file size 2.add a config to close skip tablet compaction. If current skip logic has some bug so we can still use old logic 3.delete some useless log	2022-12-07 18:53:55 +08:00
yixiutt	3dde97bff1	(compaction) opt compaction task producer and quick compaction (#13495 ) (#14535 ) 1.remove quick_compaction's rowset pick policy, call cu compaction when trigger quick compaction 2. skip tablet's compaction task when compaction score is too small Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-12-02 10:07:44 +08:00
yixiutt	94a6ffb906	[feature](compaction) support vertical_compaction & ordered_data_compaction (#14524 )	2022-12-01 22:15:41 +08:00
Xinyi Zou	0b945fe361	[enhancement](memtracker) Refactor mem tracker hierarchy (#13585 ) mem tracker can be logically divided into 4 layers: 1)process 2)type 3)query/load/compation task etc. 4)exec node etc. type includes enum Type { GLOBAL = 0, // Life cycle is the same as the process, e.g. Cache and default Orphan QUERY = 1, // Count the memory consumption of all Query tasks. LOAD = 2, // Count the memory consumption of all Load tasks. COMPACTION = 3, // Count the memory consumption of all Base and Cumulative tasks. SCHEMA_CHANGE = 4, // Count the memory consumption of all SchemaChange tasks. CLONE = 5, // Count the memory consumption of all EngineCloneTask. Note: Memory that does not contain make/release snapshots. BATCHLOAD = 6, // Count the memory consumption of all EngineBatchLoadTask. CONSISTENCY = 7 // Count the memory consumption of all EngineChecksumTask. } Object pointers are no longer saved between each layer, and the values of process and each type are periodically aggregated. other fix: In [fix](memtracker) Fix transmit_tracker null pointer because phamp is not thread safe #13528, I tried to separate the memory that was manually abandoned in the query from the orphan mem tracker. But in the actual test, the accuracy of this part of the memory cannot be guaranteed, so put it back to the orphan mem tracker again.	2022-11-08 09:52:33 +08:00
zhengyu	554f566217	[enhancement](compaction) introduce segment compaction (#12609 ) (#12866 ) ## Design ### Trigger Every time when a rowset writer produces more than N (e.g. 10) segments, we trigger segment compaction. Note that only one segment compaction job for a single rowset at a time to ensure no recursing/queuing nightmare. ### Target Selection We collect segments during every trigger. We skip big segments whose row num > M (e.g. 10000) coz we get little benefits from compacting them comparing our effort. Hence, we only pick the 'Longest Consecutive Small" segment group to do actual compaction. ### Compaction Process A new thread pool is introduced to help do the job. We submit the above-mentioned 'Longest Consecutive Small" segment group to the pool. Then the worker thread does the followings: - build a MergeIterator from the target segments - create a new segment writer - for each block readed from MergeIterator, the Writer append it ### SegID handling SegID must remain consecutive after segment compaction. If a rowset has small segments named seg_0, seg_1, seg_2, seg_3 and a big segment seg_4: - we create a segment named "seg_0-3" to save compacted data for seg_0, seg_1, seg_2 and seg_3 - delete seg_0, seg_1, seg_2 and seg_3 - rename seg_0-3 to seg_0 - rename seg_4 to seg_1 It is worth noticing that we should wait inflight segment compaction tasks to finish before building rowset meta and committing this txn.	2022-11-04 14:12:51 +08:00
Mingyu Chen	942611c185	Revert "[enhancement](compaction) opt compaction task producer and quick compaction (#13495 )" (#13833 ) This reverts commit 4f2ea0776ca3fe5315ab5ef7e00eefabfb5771a0.	2022-11-01 14:22:12 +08:00
yixiutt	4f2ea0776c	[enhancement](compaction) opt compaction task producer and quick compaction (#13495 ) 1.remove quick_compaction's rowset pick policy, call cu compaction when trigger quick compaction 2. skip tablet's compaction task when compaction score is too small Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-10-31 12:24:05 +08:00
Yongqiang YANG	8b14c4aa98	[fix](compaction) don't log cumu policy name for quick compaction (#13101 )	2022-10-01 21:40:42 +08:00
Xinyi Zou	c55d08fa2f	[fix](memtracker) Refactor load channel mem tracker to improve accuracy (#12791 ) The mem hook record tracker cannot guarantee that the final consumption is 0, nor can it guarantee that the memory alloc and free are recorded in a one-to-one correspondence. In the life cycle of a memtable from insert to flush, the memory free of hook is more than that of alloc, resulting in tracker consumption less than 0. In order to avoid the cumulative error of the upper load channel tracker, the memtable tracker consumption is reset to zero on destructor.	2022-09-21 20:16:19 +08:00
Pxl	2306e46658	[Enhancement](compaction) reduce VMergeIterator copy block (#12316 ) This pr change make VMergeIterator support return row reference to instead copy a full block.	2022-09-13 16:19:34 +08:00
Pxl	a8c8ebf5cf	[Enhancement](compaction) empty string optimize for binary dict code (#12259 ) improve write empty string perfomance.	2022-09-02 14:25:19 +08:00
yixiutt	60a2fa7dea	[Improvement](compaction) copy row in batch in VCollectIterator&VGenericIterator (#12214 ) In VCollectIterator&VGenericIterator, use insert_range_from to copy rows in a block which is continuous to save cpu cost. If rows in rowset and segment are non overlapping, this whill improve 30% throughput of compaction.If rows are completely overlapping such as load two same files, the throughput goes nearly same as before. Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-09-01 10:20:17 +08:00
Xinyi Zou	1304a17600	[fix](memtracker) Improve performance of tracking real physical memory of PodArray #12021	2022-08-24 14:24:14 +08:00
yixiutt	60fddd56e7	[feature-wip](unique-key-merge-on-write) opt lock and only save valid delete_bitmap (#11953 ) 1. use rlock in most logic instead of wrlock 2. filter stale rowset's delete bitmap in save meta 3. add a delete_bitmap lock to handle compaction and publish_txn confict Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-08-23 14:43:40 +08:00
Xinyi Zou	b300b4faa0	[enhancement](memtracker) Optimize readability of mem exceed limit error message #11877	2022-08-18 14:39:41 +08:00
pengxiangyu	b44c47fc10	[fix] (remote storage) fix bug for storage policy (#11597 )	2022-08-09 09:05:48 +08:00
yiguolei	321107cb40	[refactor](schema change) Using tablet schema shared ptr instead of raw ptr (#11475 ) * Using tabletschema shared ptr instead of raw ptrs Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-08-05 11:04:38 +08:00
Lightman	b35daf0a04	[improvement](light-schema-change) Support tablet schema cache (#11131 )	2022-08-01 12:18:00 +08:00
Xinyi Zou	73d8f5901d	fix mem tracker limiter (#11376 )	2022-08-01 09:44:04 +08:00
Xin Liao	2783267599	[feature-wip](unique-key-merge-on-write) update bitmap after compaction, DSIP-018 (#11289 )	2022-07-30 19:50:47 +08:00
Xinyi Zou	b6bdb3bdbc	[fix] (mem tracker) Fix MemTracker accuracy (#11190 )	2022-07-27 18:59:24 +08:00
Xin Liao	d4fb27125a	[feature-wip](unique-key-merge-on-write) row id conversion for compaction (#11149 )	2022-07-27 16:32:13 +08:00
Xinyi Zou	4960043f5e	[enhancement] Refactor to improve the usability of MemTracker (step2) (#10823 )	2022-07-21 17:11:28 +08:00
plat1ko	523d395527	[refactor] Remove alpha rowset meta (#10933 ) * remove alpha_rowset_meta * remove alpha rowset related codes in compaction * remove alpha rowset related codes in RowsetMeta * fix be ut because some ut use alpha rowsetmeta	2022-07-18 08:45:46 +08:00
Lightman	486cf0ebd4	[Feature] Lightweight schema change of add/drop column (#10136 ) * [Schema Change] support fast add/drop column (#49) * [feature](schema-change) support fast schema change. coauthor: yixiutt * [schema change] Using columns desc from fe to read data. coauthor: Lchangliang * [feature](schema change) schema change optimize for add/drop columns. 1.add uniqueId field for class column. 2.schema change for add/drop columns directly update schema meta Co-authored-by: yixiutt <yixiu@selectdb.com> Co-authored-by: SWJTU-ZhangLei <1091517373@qq.com> [Feature](schema change) fix write and add regression test (#69) Co-authored-by: yixiutt <yixiu@selectdb.com> [schema change] be ssupport that delete use newest schema add delete regression test fix regression case (#107) tmp [feature](schema change) light schema change exclude rollup and agg/uniq/dup key type. [feature](schema change) fe olapTable maxUniqueId write in disk. [feature](schema change) add rpc iface for sc add column. [feature](schema change) add columnsDesc to TPushReq for ligtht sc. resolve the deadlock when schema change (#124) fix columns from fe don't has bitmap_index flag (#134) add update/delete case construct MATERIALIZED schema from origin schema when insert fix not vectorized compaction coredump use segment cache choose newest schema by schema version when compaction (#182) [bugfix](schema change) fix ligth schema change problem. [feature](schema change) light schema change add alter job. (#1) fix be ut [bug] (schema change) unique drop key column should not light schema change [feature](schema change) add schema change regression-test. fix regression test [bugfix](schema change) fix multi alter clauses for light schema change. (#2) [bugfix](schema change) fix multi clauses calculate column unique id (#3) modify PushTask process (#217) [Bugfix](schema change) fix jobId replay cause bdbje exception. [bug](schema change) fix max col unique id repeatitive. (#232) [optimize](schema change) modify pendingMaxColUniqueId generate rule. fix compaction error * fix be ut * fix snapshot load core fix unique_id error (#278) [refact](fe) remove redundant code for light schema change. (#4) [refact](fe) remove redundant code for light schema change. (#4) format fe core format be core fix be ut modify fe meta version fix rebase error flush schema into rowset_meta in old table [refactor](schema change) refact fe light schema change. (#5) delete the change of schemahash and support get max version schema * modify for review * fix be ut * fix schema change test	2022-07-12 19:41:06 +08:00
plat1ko	331fa50501	[feature](cold-data) move cold data to object storage without losing any feature(BE) (#10280 ) This PR supports rowset level data upload on the BE side, so that there can be both cold data and hot data in a tablet, and there is no necessary to prohibit loading new data to cooled tablets. Each rowset is bound to a `FileSystem`, so that the storage layer can read and write rowsets without perceiving the underlying filesystem. The abstracted `RemoteFileSystem` can try local caching strategies with different granularity, instead of caching segment files as before. To avoid conflicts with the code in be/src/io, we temporarily put the file system related code in the be/src/io/fs directory. In the future, `FileReader`s and `FileWriter`s should be unified.	2022-07-08 12:18:39 +08:00
chenlinzhong	4dfebb9852	[Feature] compaction quickly for small data import (#9804 ) * compaction quickly for small data import #9791 1.merge small versions of rowset as soon as possible to increase the import frequency of small version data 2.small version means that the number of rows is less than config::small_compaction_rowset_rows default 1000	2022-06-15 21:48:34 +08:00
Gabriel	79620f6fa2	[Improvement] change the condition of vectorized compaction (#9950 )	2022-06-04 12:29:23 +08:00
Lijia Liu	47dfdd8e09	[fix](storage) Disable compaction before schema change is actually executed(#9032 ) (#9065 ) As in issue, the combination and schema change at the same time may lead to version intersection. Describe the overview of changes. 1. Do not do compaction before schema change is actually executed. 2. Set tablet as bad when it has version intersection. 3. Do not do schema change when it can not find appropriate versions to delete in new tablet. 4. Do not change rowsets after compaction if the rowsets of the tablet has changed.	2022-06-01 23:29:18 +08:00
Xinyi Zou	ca05d1ee01	[fix](memory tracker) Fix lru cache, compaction tracker, add USE_MEM_TRACKER compile (#9661 ) 1. Fix Lru Cache MemTracker consumption value is negative. 2. Fix compaction Cache MemTracker has no track. 3. Add USE_MEM_TRACKER compile option. 4. Make sure the malloc/free hook is not stopped at any time.	2022-05-25 08:56:17 +08:00
yiguolei	2c79d223e4	[refactor][rowset]move rowset writer to a single place (#9368 )	2022-05-19 23:57:02 +08:00
Shuangchi He	73c4ec7167	Fix some typos in be/. (#9681 )	2022-05-19 20:55:39 +08:00

1 2

99 Commits