doris

Author	SHA1	Message	Date
pengxiangyu	20ef8a6e21	[feature-wip](remote storage)(step1) use a struct instead of string for parameter path, add basic remote method (#7098 ) For the first, we need to make a parameter to discribe the data is local or remote. At then, we need to support some basic function to support the operation for remote storage.	2021-12-22 22:58:23 +08:00
EmmyMiao87	d9c927fdc6	[improvement](log)(schema change) Add a clear memory description in the log (#7378 ) If the memory exceeds the limit when be generates a materialized view or schema change, a more detailed log about limit and configuration will be prompted..	2021-12-14 15:56:50 +08:00
HappenLee	8b557c0e70	[Refactor] Refact code of sequence column (#7007 )	2021-11-15 11:10:45 +08:00
HappenLee	1f196442f7	[Bug] Fix the nullptr of core in schema change (#7003 ) schema change fail as memory allocation fail on row block sorting, however, it should do internal sorting first before schema change fail as memory allocation fail on row block sorting in case there are enough memory after internal sorting.	2021-11-05 09:44:08 +08:00
GoGoWen	2d10300547	[Bug] Fix schema change fail as memory allocation on row block sorting (#6932 ) schema change fail as memory allocation fail on row block sorting. however, it should do internal sorting first before schema change fail as memory allocation fail on row block sorting in case there are enough memory after internal sorting.	2021-11-02 16:33:38 +08:00
jiafeng.zhang	b0926a317e	Modify Chinese comment (#6951 ) Modify Chinese comment	2021-10-28 13:56:59 +08:00
Mingyu Chen	fee8e6afc5	[Bug] Fix some bugs (#6665 ) 1.Fix a potential BE coredump of sending batch when loading data. (Fix [Bug] BE crash when loading data #6656) 2.Fix a potential BE coredump when doing schema change. (Fix [Bug] BE crash when doing alter task #6657) 3.Optimize the metric of base_compaction_request_failed. 4.Add Order column in show tablet result. (Fix [Feature] Add order column in SHOW TABLET stmt result #6658) 5.Fix bug that tablet repair slot not being released. (Fix [Bug] Tablet scheduler stop working #6659) 6.Fix bug that REPLICA_MISSING error can not be handled. (Fix [Bug] REPLICA_MISSING error can not be handled. #6660) 7.Modify column name of SHOW PROC "/cluster_balance/cluster_load_stat" 8.Optimize the result of SHOW PROC "/statistic" to show COLOCATE_MISMATCH tablets (Fix [Feature] the health status of colocate table's tablet is not shown in show proc statistic #6663) 9.Fix bug that show load where state='pending' can not be executed. (Fix [Bug] show load where state='pending' can not be executed. #6664)	2021-09-17 10:11:37 +08:00
Zhengguo Yang	8738ce380b	Add long text type STRING, with a maximum length of 2GB. Usage is similar to varchar, and there is no guarantee for the performance of storing extremely long data (#6391 )	2021-08-18 09:05:40 +08:00
Mingyu Chen	c6aa37f5ef	[Alter] Support doing compaction for tablets under alter operation (#6365 ) The problem I want to solve is described in #6355. This CL mainly changes: 1. Support compacting tablets under alter operations On BE side, the compaction logic will select tablets which state is "TABLET_NOTREADY" to do cumulative compaction. 2. Remove "alter_task" field in tablet's meta on BE side. "alter_task" field is never used long time ago 3. Support doing delete operation when table is doing alter operation. Previously, when a table is doing alter operation, execution of delete will return error: Table's state is not NORMAL. But now, delete can be executed successfully only if the condition column is not under schema change. And delete condition will be applied to all materialized indexes.	2021-08-07 21:32:26 +08:00
Mingyu Chen	4a5f0f859d	[Bug] Add readlock when calling get_rowset_by_version() (#6120 )	2021-07-01 09:19:10 +08:00
xinghuayu007	5dabf0bef5	[Alter] validate data file after alter operation success (#6022 ) Co-authored-by: wangxixu <wangxixu@xiaomi.com>	2021-06-20 16:15:14 +08:00
Mingyu Chen	d57c2344e1	[MemTracker] Refactored the hierarchical structure of memtracker (#5956 ) To avoid showing too many memtracker on BE web pages. The MemTracker level now has 3 levels: OVERVIEW, TASK and VERBOSE. OVERVIEW Mainly used for main memory consumption module such as Query/Load/Metadata. TASK is mainly used to record the memory overhead of a single task such as a single query, load, and compaction task. VERBOSE is used for other more detailed memtrackers.	2021-06-16 09:44:24 +08:00
HappenLee	1a81b9e160	[MemTracker] Some enchance of MemTracker (#5783 ) 1 Make some MemTracker have reasonable parent MemTracker not the root tracker 2 Make each MemTracker can be easily to trace. 3 Add show level of MemTracker to reduce the MemTracker show in the web page to have a way to control show how many tracker in web page.	2021-05-19 09:27:50 +08:00
stdpain	5fed34fcfe	[optimize] provide a better defer operator (#5706 )	2021-05-12 10:37:23 +08:00
Yingchun Lai	be733cfa9c	[Metrics] Add some large memtrackers' metric (#5614 ) MemTracker can provide memory consumption for us to find out which module consume more memory, but it's just a current value, this patch add metrics for some large memory consumers, then we can find out which module consume more memory in timeline, it would be useful to troubleshoot OOM problems and optimize configs.	2021-04-21 09:15:04 +08:00
HappenLee	b423274f17	[Enhance] Make MemTracker more accurate (#5515 ) (#5516 ) * [Enhance] Make MemTracker more accurate (#5515) This PR main about: 1. Improve the readability of MemTrackers' name 2. Add the MemTracker of: * Load * Compaction * SchemaChange * StoragePageCache * TabletManager 3. Change SchemaChange to a Singleon * revise some code for Code Review * change the name of mem_tracker * keep reader_context have the same lifetime of rowset_reader in schema change. * change vlog notice to log(warning) in schema change	2021-04-08 09:14:55 +08:00
Mingyu Chen	087fd8159b	[Bug] Fix bug that the stale rowset file will not be deleted (#5527 ) 1. If cumulative compaction compact only one rowset, the old rowset will not be put into `stale_rowset_meta_map` 2. Show rowset id in `/api/compaction/show` Co-authored-by: xxiao2018 <benghua3_1@sina.com>	2021-03-17 22:31:05 +08:00
stdpain	a1bce25677	[BUG] Fix Memory Leak in SchemaChange And Fix some DCHECK error (#5491 )	2021-03-17 09:27:05 +08:00
HappenLee	462efeaf39	[Performance Optimization and Refactor] (#5358 ) (#5364 ) 1. Add BlockColumnPredicate support OR and AND column predicate in RowBlockV2 2. Support evaluate vectorization delete predicate in storage engine not in Reader in SegmentV2	2021-02-07 22:41:33 +08:00
Zhengguo Yang	93a4c7efc1	[LOG] Standardize the use of VLOG in code (#5264 ) At present, the application of vlog in the code is quite confusing. It is inherited from impala VLOG_XX format, and there is also VLOG(number) format. VLOG(number) format does not have a unified specification, so this pr standardizes the use of VLOG	2021-01-21 12:09:09 +08:00
Yingchun Lai	49f7eb69bf	[Refactor] Refactor DeleteHandler and Cond module (2nd) (#5030 ) * [Refactor] Refactor DeleteHandler and Cond module (#4925) This patch mainly do the following refactors: - Use int64_t instead of int32_t for 'version' in DeleteHandler - Move some comments from .cpp to .h file, add some new comments in .h files, and also remove some meaningless comments - Use switch...case... instead of multiple if..else.. for DeleteConditionHandler::is_condition_value_valid - Use range loop to simplify code - Reduce some compare operations in Cond::del_eval - Improve some branch predictions in Reader - Fix and improve some unit tests	2020-12-08 10:01:18 +08:00
sduzh	6fedf5881b	[CodeFormat] Clang-format cpp sources (#4965 ) Clang-format all c++ source files.	2020-11-28 18:36:49 +08:00
HappenLee	55ce88da34	[Schema change] Support More column type in schema change (#4938 ) 1. Support modify column type CHAR to TINYINT/SMALLINT/INT/BIGINT/LARGEINT/FLOAT/DOUBLE/DATE and TINYINT/SMALLINT/INT/BIGINT/LARGEINT/FLOAT/DOUBLE convert to a wider range of numeric types (#4937) 2. Use template to refactor code of types.h and schema_change.cpp to delete redundant code.	2020-11-28 09:52:28 +08:00
sduzh	10e1e29711	Remove header file common/names.h (#4945 )	2020-11-26 17:00:48 +08:00
Zhengguo Yang	09f97f8a05	[Refactor] Fixes some be typo part 2 (#4747 )	2020-10-20 09:28:57 +08:00
Yingchun Lai	2a637f848d	[Refactor] Remove meaningless return value of RowBlock::init (#4627 ) Simplify some code, mainly remove meaningless return value of RowBlock::init.	2020-09-20 20:57:00 +08:00
EmmyMiao87	e8e5f350fe	[BUG] ReAgg when adding agg mv on dup base table (#4587 ) When the keystype of mv and base table is difference, Doris should execute sorting schema change instead of linked schema change. If doesn't, the data size of mv actually is same as base table. This will cause mv to have no pre-aggregation effect at all. The query will not choose mv. This commit fixed this problem. Fixed #4586	2020-09-13 19:17:35 +08:00
Mingyu Chen	a864db03fe	[Bug] Fix bug of load error hub and schema change (#4486 ) 1. When WITH_MYSQL is off, load error hub does not suport MySQL load error hub, we should check its return value. 2. misjudge the return value of `change_row_block` in schema_change.cpp	2020-08-31 23:21:50 +08:00
HangyuanLiu	ad738fa198	Add OLAP_ERR_DATE_QUALITY_ERR error status to display schema change failure (#4388 ) In the process of historical data transformation of materialized views, it may occur that the transformation fails due to data quality. Add an error status code ：OLAP_ERR_DATE_QUALITY_ERR to determine if a data problem is causing the failure #3344	2020-08-27 17:52:53 +08:00
Mingyu Chen	976820ba20	[SegmentV2] Change the default storage format to SegmentV2 (#4387 ) Since the Segment V2 has been released for a long time, we should make it as default storage format for newly created table. This CL mainly changes: 1. For all newly created tables, their default storage format is Segment V2. 2. For all already exist tablets, their storage format remain unchanged. 3. Fix bugs described in Fix #4384 and Fix #4385	2020-08-24 21:51:17 +08:00
HuangWei	10f822eb43	[MemTracker] make all MemTrackers shared (#4135 ) We make all MemTrackers shared, in order to show MemTracker real-time consumptions on the web. As follows: 1. nearly all MemTracker raw ptr -> shared_ptr 2. Use CreateTracker() to create new MemTracker(in order to add itself to its parent) 3. RowBatch & MemPool still use raw ptrs of MemTracker, it's easy to ensure RowBatch & MemPool destructor exec before MemTracker's destructor. So we don't change these code. 4. MemTracker can use RuntimeProfile's counter to calc consumption. So RuntimeProfile's counter need to be shared too. We add a shared counter pool to store the shared counter, don't change other counters of RuntimeProfile. Note that, this PR doesn't change the MemTracker tree structure. So there still have some orphan trackers, e.g. RowBlockV2's MemTracker. If you find some shared MemTrackers are little memory consumption & too time-consuming, you could make them be the orphan, then it's fine to use the raw ptr.	2020-07-31 21:57:21 +08:00
Zhao Chun	08403eed22	[Bug]#4181 Let linked-schema change work for BETA tablet (#4182 ) Let linked-schema change work for BETA tablet	2020-07-30 09:51:18 +08:00
HangyuanLiu	5032b7fe7a	Support materialized view schema change in bitmap hll and count field [#3739 ] (#3873 ) + Building the materialized view function for schema_change here based on defineExpr. + This is a trick because the current storage layer does not support expression evaluation. + count distinct materialized view will set mv_expr with to_bitmap or hll_hash. + count materialized view will set mv_expr with count. + Support to regenerate historical data when a new materialized view is created in BE。 + Support to_bitmap function + Support hll_hash function + Support count(field) function For #3344	2020-07-16 10:45:15 +08:00
WingC	d7893f0fa7	[Bug]Fix some schema change not work right (#4009 ) [Bug]Fix some schema change not work right This CL mainly fix some schema change to varchar type not work right because forget to logic check && Add ConvertTypeResolver to add supported convert type in order to avoid forget logic check	2020-07-11 10:18:29 +08:00
lichaoyong	413d6d2f22	[Bug] Fix core when modifing char to varchar and loading boolean with replace aggregation (#4042 ) 1. Doris support modify char to varchar. There is a bug when use two-level pointer when converting the date. 2. Boolean can be used as metric value with REPLACE and REPLACE_IF_NOT_NULL aggregation function. The aggregation function should be added into aggregation map.	2020-07-08 11:12:42 +08:00
yangzhg	123e1394b1	[Delete] Allow delete duplicated non-key column using delete from (#3424 )	2020-05-15 09:26:36 +08:00
Yingchun Lai	37fccd53c4	[Tablet] A small refactor on class Tablet (#3339 ) There is no functional changes in this patch. Key refactor points are: - Remove meaningless return value of functions in class Tablet, and also some related functions in other classes - Allow RowsetGraph::capture_consistent_versions to pass a nullptr to the output parameter - Use CHECK instead of LOG(FATAL) to simplify code	2020-04-24 22:22:26 +08:00
kangkaisen	625411bd28	Doris support in memory olap table (#2847 )	2020-02-18 10:45:54 +08:00
worker24h	1f844946e9	Fixbug: Invalid memory address in doris::memory_copy (#2919 ) (#2923 ) When I change schema from char(20) to varchar(20), be will cause coredump.	2020-02-17 18:48:38 +08:00
kangpinghuang	7fe6431ac7	Fix delete handler init when schema change (#2767 ) delete handler init failed because there are missed version. Schema change should return failure when get version failed.	2020-01-15 15:42:56 +08:00
yangzhg	c098178f7a	[Index] Implements create drop show index syntax for bitmap index [#2487 ] (#2573 ) ### create table with index ``` CREATE TABLE table1 ( siteid INT DEFAULT '10', citycode SMALLINT, username VARCHAR(32) DEFAULT '', pv BIGINT SUM DEFAULT '0', INDEX index_name [USING BITMAP] (siteid, citycode) COMMENT 'balabala' ) AGGREGATE KEY(siteid, citycode, username) DISTRIBUTED BY HASH(siteid) BUCKETS 10 PROPERTIES("replication_num" = "1"); ``` ### create index ``` CREATE INDEX index_name ON table1 (siteid, citycod) [USING BITMAP] COMMENT 'balabala'; or ALTER TABLE table1 ADD INDEX index_name [USING BITMAP] (siteid, citycod) COMMENT 'balabala'; ``` ### drop index ``` DROP INDEX index_name ON table1; or ALTER TABLE table1 DROP INDEX index_name ``` ### show index ``` SHOW INDEX[ES] FROM table1 ``` output ``` +---------+-------------+-----------------+------------+---------+ \| Table \| Index_name \| Column_name \| Index_type \| Comment \| +---------+-------------+-----------------+------------+---------+ \| table1 \| index_name \| siteid,citycode \| BITMAMP \| balabala\| +---------+-------------+-----------------+------------+---------+ ```	2020-01-03 17:41:26 +08:00
kangpinghuang	368bbfd426	Fix linked schema change bug #2610 (#2613 )	2019-12-30 15:48:52 +08:00
Mingyu Chen	1421a9be41	[Compaction] Support compact only one rowset (#2558 ) Support compaction operation to compact only one rowset. After the modification, the last rowset of the tablet will also be compacted. At the same time, we added a `segments_overlap_pb` field to the rowset meta. Used to describe whether the segment data in the rowset overlaps. This field is set by `rowset_writer`. Initially UNKNOWN for compatibility with existing data. In addition, the version hash of the rowset generated after compaction is directly set to the version hash of last rowset participating in compaction, to ensure that the tablet's version hash remains unchanged after compaction.	2019-12-27 10:08:41 +08:00
WingC	f7032b07f3	Support more schema change from VARCHAR type (#2501 )	2019-12-26 22:38:53 +08:00
kangpinghuang	63ea05f9c7	Add convert tablet rowset type (#2294 ) to solve the issue #2246. scheme is as following: add a optional preferred_rowset_type in TabletMeta for V2 format rollup index tablet add a boolean session variable use_v2_rollup, if set true, the query will v2 storage format rollup index to process the query. test queries will be sent to online service to verify the correctness of segment-v2 by send the the same queries to fe with use_v2_rollup set or not to check whether the returned results are the same.	2019-12-18 18:49:47 +08:00
WingC	c81b1db406	Support convert VARCHAR type to DATE type (#2489 )	2019-12-18 12:58:47 +08:00
WingC	89003b774b	Support Convert Varchar to INT (#2481 )	2019-12-17 22:02:28 +08:00
kangpinghuang	c07f37d78c	[Segment V2] Add a control framework between FE and BE through heartbeat #2247 (#2364 ) The control framework is implemented through heartbeat message. Use uint64_t as flags to control different functions. Now add a flag to set the default rowset type to beta.	2019-12-12 12:18:32 +08:00
WingC	af3d901a06	Convert INT type to DATE type (#2393 )	2019-12-07 21:56:52 +08:00
WingC	102a845131	Support convert date to datetime through alter table (#2385 )	2019-12-05 07:37:45 +08:00

1 2 3

104 Commits