doris

Author	SHA1	Message	Date
WingC	3cff89df7f	[Dynamic Partition] Support for automatically drop partitions (#3081 )	2020-03-25 10:24:46 +08:00
Dayue Gao	e794bb69b7	[BUG] Make default result ordering of SHOW PARTITIONS statement be consist with 0.11 (#3184 )	2020-03-24 17:14:27 +08:00
lichaoyong	e20d905d70	Remove unused KUDU codes (#3175 ) KUDU table is no longer supported long time ago. Remove code related to it.	2020-03-24 13:54:05 +08:00
yangzhg	3b32938140	[Doc] Create CONTRIBUTING.md (#3180 )	2020-03-24 13:42:21 +08:00
HangyuanLiu	d4c1938b5c	Open datetime min value limit (#3158 ) the min_value in olap/type.h of datetime is 0000-01-01 00:00:00, so we don't need restrict datetime min in tablet_sink	2020-03-24 10:52:57 +08:00
EmmyMiao87	dff3c0d57e	Revert "Remove deep copy when doing hash table EvalRow (#3171 )" (#3173 )	2020-03-23 15:29:46 +08:00
Mingyu Chen	d837231fca	[RoutineLoad] Fix bug that job will be paused when table is altering (#3169 ) Also add some debug log to observe the cost time of the process of routine load task	2020-03-23 11:05:00 +08:00
Mingyu Chen	473a67a5b8	[Syntax] Remove all EmptyStmt from the end of multi-statements list (#3140 ) to resolve the ISSUE: #3139 When user execute query by some client library such as python MysqlDb, if user execute like: "select * from tbl1;" (with a comma at the end of statement) The sql parser will produce 2 statements: `SelectStmt` and `EmptyStmt`. Here we discard the `EmptyStmt` to make it act like one single statement. This is for some compatibility. Because in python MysqlDb, if the first `SelectStmt` results in some warnings, it will try to execute a `SHOW WARNINGS` statement right after the SelectStmt, but before the execution of `EmptyStmt`. So there will be an exception: `(2014, "Commands out of sync; you can't run this command now")` I though it is a flaw of python MysqlDb. However, in order to maintain the consistency of user use, here we remove all EmptyStmt at the end to prevent errors.(Leave at least one statement) But if user execute statements like: `"select * from tbl1;;select 2"` If first `select * from tbl1` has warnings, python MysqlDb will still throw exception.	2020-03-23 09:39:22 +08:00
wyb	dd8d748c55	Remove deep copy when doing hash table EvalRow (#3171 ) remove varchar column deep copy in partitioned hash table EvalRow function	2020-03-21 09:52:49 +08:00
yangzhg	d29ed84b6a	[Bug] Fix bug that right semi/anti join is not right (#3167 ) This bug is introduced by PR: #3148. right semi/anti join can not use `insert_unique` in build phase of join.	2020-03-20 20:58:55 +08:00
lichaoyong	47a3d5000b	[UnitTest] Fix unit test bug in BetaRowset and PageCacheTest (#3157 ) 1. BlockManager has been added into StorageEngine. So StorageEngine should be initialized when starting BetaRowset unit test. 2. Cache should not use the same buf to store value, otherwise the address will be freed twice and crash.	2020-03-20 20:37:50 +08:00
kangpinghuang	6beadfda71	[Bug] Fix delete predicate bug for segment v2 (#3164 ) This bug is because the min and max wrapper field is not initialized when there is no predicate of that column.	2020-03-20 20:35:55 +08:00
HappenLee	2dc995df7b	[CodeStyle] Rename new_partition_aggregation_node and new_partitioned_hash_table (#3166 )	2020-03-20 19:59:01 +08:00
HappenLee	5a8fcd263f	[CodeStyle] Delete obsolete code of partition_aggregation_node and partitioned_hash_table (#3162 )	2020-03-20 16:25:29 +08:00
Yingchun Lai	c08d6e4708	[tablet meta] Do some refactor on TabletMeta (#3136 ) remove some functions' return value which always return OLAP_SUCCESS optimize some loops	2020-03-20 15:03:22 +08:00
lichaoyong	2d3dbc2c42	Revert "[CodeStyle] Del obsolete code of partition_aggregation_node (#3154 )" (#3160 ) This reverts commit dae013d797c1c2c9e54246d5ace4bdd90b297d43.	2020-03-20 14:47:25 +08:00
lichaoyong	5f004cb009	Revert "[CodeStyle] Remove unused PartitionedHashTable (#3156 )" (#3159 ) This reverts commit d3fd44f0a2fe076d2c62851babc162fcebe4d63b.	2020-03-20 14:42:40 +08:00
lichaoyong	d3fd44f0a2	[CodeStyle] Remove unused PartitionedHashTable (#3156 )	2020-03-20 12:19:08 +08:00
HappenLee	dae013d797	[CodeStyle] Del obsolete code of partition_aggregation_node (#3154 )	2020-03-20 11:33:55 +08:00
yangzhg	f0db9272dd	[Performance] Improve performence of hash join in some case (#3148 ) improve performent of hash join when build table has to many duplicated rows, this will cause hash table collisions and slow down the probe performence. In this pr when join type is semi join or anti join, we will build a hash table without duplicated rows. benchmark: dataset: tpcds dataset `store_sales` and `catalog_sales` ``` mysql> select count() from catalog_sales; +----------+ \| count() \| +----------+ \| 14401261 \| +----------+ 1 row in set (0.44 sec) mysql> select count(distinct cs_bill_cdemo_sk) from catalog_sales; +------------------------------------+ \| count(DISTINCT `cs_bill_cdemo_sk`) \| +------------------------------------+ \| 1085080 \| +------------------------------------+ 1 row in set (2.46 sec) mysql> select count() from store_sales; +----------+ \| count() \| +----------+ \| 28800991 \| +----------+ 1 row in set (0.84 sec) mysql> select count(distinct ss_addr_sk) from store_sales; +------------------------------+ \| count(DISTINCT `ss_addr_sk`) \| +------------------------------+ \| 249978 \| +------------------------------+ 1 row in set (2.57 sec) ``` test querys: query1: `select count() from (select store_sales.ss_addr_sk from store_sales left semi join catalog_sales on catalog_sales.cs_bill_cdemo_sk = store_sales.ss_addr_sk) a;` query2: `select count() from (select catalog_sales.cs_bill_cdemo_sk from catalog_sales left semi join store_sales on catalog_sales.cs_bill_cdemo_sk = store_sales.ss_addr_sk) a;` benchmark result: \|\|query1\|query2\| \|:--:\|:--:\|:--:\| \|before\|14.76 sec\|3 min 16.52 sec\| \|after\|12.64 sec\|10.34 sec\|	2020-03-20 10:31:14 +08:00
yangzhg	12d1b072ef	[Bug] Fix bug that of union statement (#3137 ) fix a bug of const union query like `select null union select null`, this because the type of SlotDescriptor when clause is `select null` is null ,this will cause BE core dump, and FE find wrong cast function.	2020-03-20 09:51:38 +08:00
gengjun-git	c88e8ab1ab	Add some system variables (#3144 ) Add event_scheduler and storage_engine system variables to compatible with some mysql client connect, say DataGrip of JetBrains.	2020-03-20 09:28:34 +08:00
Mingyu Chen	d90c892bd8	[Bug] Get NPE when executing show alter table statement (#3146 )	2020-03-20 09:20:21 +08:00
lichaoyong	b286f4271b	Remove unused PreAggregtionNode (#3151 )	2020-03-20 09:19:47 +08:00
Dayue Gao	4b3367636d	[Bug] Fix NPE when access follower Fe's web console (#3149 )	2020-03-19 20:34:34 +08:00
Mingyu Chen	0f14408f13	[Temp Partition] Support loading data into temp partitions (#3120 ) Related issue: #2663, #2828. This CL support loading data into specified temporary partitions. ``` INSERT INTO tbl TEMPORARY PARTITIONS(tp1, tp2, ..) ....; curl .... -H "temporary_partition: tp1, tp, .. " .... LOAD LABEL db1.label1 ( DATA INFILE("xxxx") INTO TABLE `tbl2` TEMPORARY PARTITION(tp1, tp2, ...) ... ``` NOTICE: this CL change the FE meta version to 77. There 3 major changes in this CL ## Syntax reorganization Reorganized the syntax related to the `specify-partitions`. Removed some redundant syntax definitions, and unified the syntax related to the `specify-partitions` under one syntax entry. ## Meta refactor In order to be able to support specifying temporary partitions, I made some changes to the way the partition information in the table is stored. Partition information is now organized as follows: The following two maps are reserved in OlapTable for storing formal partitions: ``` idToPartition nameToPartition ``` Use the `TempPartitions` class for storing temporary partitions. All the partition attributes of the formal partition and the temporary partition, such as the range, the number of replicas, and the storage medium, are all stored in the `partitionInfo` of the OlapTable. In `partitionInfo`, we use two maps to store the range of formal partition and temporary partition: ``` idToRange idToTempRange ``` Use separate map is because the partition ranges of the formal partition and the temporary partition may overlap. Separate map can more easily check the partition range. All partition attributes except the partition range are stored using the same map, and the partition id is used as the map key. ## Method to get partition A table may contain both formal and temporary partitions. There are several methods to get the partition of a table. Typically divided into two categories: 1. Get partition by id 2. Get partition by name According to different requirements, the caller may want to obtain a formal partition or a temporary partition. These methods are described below in order to obtain the partition by using the correct method. 1. Get by name This type of request usually comes from a user with partition names. Such as `select * from tbl partition(p1);`. This type of request has clear information to indicate whether to obtain a formal or temporary partition. Therefore, we need to get the partition through this method: `getPartition(String partitionName, boolean isTemp)` To avoid modifying too much code, we leave the `getPartition(String partitionName)`, which is same as: `getPartition(partitionName, false)` 2. Get by id This type of request usually means that the previous step has obtained certain partition ids in some way, so we only need to get the corresponding partition through this method: `getPartition(long partitionId)`. This method will try to get both formal partitions and temporary partitions. 3. Get all partition instances Depending on the requirements, the caller may want to obtain all formal partitions, all temporary partitions, or all partitions. Therefore we provide 3 methods, the caller chooses according to needs. `getPartitions()` `getTempPartitions()` `getAllPartitions()`	2020-03-19 15:07:01 +08:00
kangkaisen	9059afcc80	Delete, update and simplify some FE code (#3125 )	2020-03-19 12:27:08 +08:00
WingC	41815ef176	[Alter]Clear expire alterJobV2 (#3130 ) Too much AlterJobsV2 may consume too much memory, which may cause FullGC. Clear some data for finished or cancelled alterJobs and remove them when expired.	2020-03-18 20:27:10 +08:00
lichaoyong	178bdcb16a	Use DCHECK_GT instead when checking _tablet_map_lock_shard_size (#3138 )	2020-03-18 19:41:04 +08:00
Mingyu Chen	08e4035a41	1 (#3134 )	2020-03-17 20:11:41 +08:00
HangyuanLiu	d01b58bff6	Support 64 bit timestamp in from_unixtime (#3069 ) Support 64 bit timestamp in from_unixtime	2020-03-17 17:30:42 +08:00
Yingchun Lai	33319d659b	[Thrift] Fix a bug when ThriftClientImpl close() error (#3128 ) In ThriftClientImpl close(), the under layer TTransport may throw an exception, this pathch catch the exception to avoid crash.	2020-03-17 15:53:44 +08:00
yangzhg	0959abc1dc	[ExceptNode] Implement except node (#3056 ) implement except node, support statement like: ``` select a from t1 except select b from t2 ```	2020-03-17 10:54:40 +08:00
kangpinghuang	f6374fa9a5	Use default_rowset_type to replace compaction_rowset_type (#3101 ) * use default_rowset_type to replace compaction_rowset_type * segment v2 usage document	2020-03-16 22:23:48 +08:00
HuangWei	a80e9bf229	Fix broker scan node mem limit check (#3123 )	2020-03-16 20:36:46 +08:00
caiconghui	cb87a54c2b	[Syntax] Support schema keyword to be compatible with the mysql syntax (#3115 ) create schema db1; drop schema db1;	2020-03-16 17:17:49 +08:00
Mingyu Chen	ee06ce31ba	[Bug] Fix bug that the file_block_mgr object was incorrectly destructed (#3122 ) During the use of the `block`, some methods in the block manager will be referenced. So `file_block_mgr` should be a resident and globally unique object. I put it in `StorageEngine`. TODO: the `BlockManager`, `Env` need to be reorganized.	2020-03-16 17:07:27 +08:00
Mingyu Chen	14c088161c	[New Stmt] Support setting replica status manually (#1522 ) Sometimes a replica is broken on BE, but FE does not notice that. In this case, we have to manually delete that replica on BE. If there are hundreds of replicas need to be handled, this is a disaster. So I add a new stmt: ADMIN SET REPLICA STATUS which support setting tablet on specified BE as BAD or OK.	2020-03-16 13:42:30 +08:00
Yingchun Lai	64a06ea9d4	[UT] Fix some BE unit tests (#3110 ) And also support graceful exit for StorageEngine to avoid hang too long time in unit test.	2020-03-16 13:31:44 +08:00
LingBin	f4b028915b	Do not build llvm thirdparty (#3116 ) LLVM related codes have already be removed in master branch, so there is no need to build llvm tool(which need a long time to compile it). Currently, some old release of Doris may still need it, so for now, we just comment it, instead of remove it.	2020-03-15 18:34:52 +08:00
Mingyu Chen	e01850e6ec	[Alter] Alter job got stuck because of table is untable (#3106 ) This CL solve the issue #3105 I add a new temporary table state WAITING_STABLE. When an alter job is ready to start, it checks whether the table is stable. If it is not stable, the table state is set to WAITING_STABLE. In this state, the tablet repair logic will continue to repair the tablet until the table becomes stable. After that, the table state will be reset to SCHEMA_CHANGE/ROLLUP and alter operations will begin. This is just a temporary state, it does not need to be persistent, and only the master FE can see this state.	2020-03-14 23:48:36 +08:00
Mingyu Chen	42931d22cb	[Bug] tablet meta is not updated correctly after compaction (#3098 ) This CL try to fix a potential bug describe in ISSUE: #3097. But I'm not sure this is the root cause. Also remove lots of verbose log, and fix a memory leak.	2020-03-14 23:39:11 +08:00
wyb	01a4ab01c4	[Bug] Fix mapping columns not exist in the table schema (#3113 )	2020-03-14 22:45:39 +08:00
Youngwb	14757f61a0	[Backup] Fix table could not load data after restore (#3087 ) Backup job in BE only backup index which is visible, but the backup meta in FE contains the shadow index, after restore from this snapshot, the shadow index is visible to load process, and the tablets is not exist in BE, so load process would be cancelled. we could fix this bug by remove the useless shadow index at backup process.	2020-03-13 22:37:11 +08:00
Mingyu Chen	4c98596283	[MysqlProtocol] Support MySQL multiple statements protocol (#3050 ) 2 Changes in this CL: ## Support multiple statements in one request like: ``` select 10; select 20; select 30; ``` ISSUE: #3049 For simple testing this CL, you can using mysql-client shell command tools: ``` mysql> delimiter // mysql> select 1; select 2; // +------+ \| 1 \| +------+ \| 1 \| +------+ 1 row in set (0.01 sec) +------+ \| 2 \| +------+ \| 2 \| +------+ 1 row in set (0.02 sec) Query OK, 0 rows affected (0.02 sec) ``` I add a new class called `OriginStatement.java`, to save the origin statement in string format with an index. This class is mainly for the following cases: 1. User send a multi-statement to the non-master FE: `DDL1; DDL2; DDL3` 2. Currently we cannot separate the original string of a single statement from multiple statements. So we have to forward the entire statement to the Master FE. So I add an index in the forward request. `DDL1`'s index is 0, `DDL2`'s index is 1,... 3. When the Master FE handle the forwarded request, it will parse the entire statement, got 3 DDL statements, and using the `index` to get the specified the statement. ## Optimized the display of syntax errors I have also optimized the display of syntax errors so that longer syntax errors can be fully displayed.	2020-03-13 22:21:40 +08:00
Mingyu Chen	9832024995	[Insert] Fix bug that insert meet unexpected "label already exists" exception (#3103 ) This CL will abort the transaction of an insert operation when encountering exception thrown in analysis phase. ISSUE: #3102	2020-03-13 20:51:44 +08:00
Seaven	5f18e99cdb	[Doc] Update add fe node description (#3100 )	2020-03-13 18:05:09 +08:00
kangkaisen	aa540966c6	Output null for hll and bitmap column when select * (#2991 )	2020-03-13 11:59:30 +08:00
kangkaisen	d8c756260b	Rewrite count distinct to bitmap and hll (#3096 )	2020-03-13 11:44:40 +08:00
WingC	c5660fcb9d	[UT]Fix unit test for cgroup_util (#3094 ) Co-authored-by: wangcong18 <wangcong18@xiaomi.com>	2020-03-12 22:59:40 +08:00

1 2 3 4 5 ...

1645 Commits