doris

Author	SHA1	Message	Date
caoyang10	03d9f6d8b4	[Feature] support hour time unit with dynamic parition (#4514 ) Many tables are so large that need seperate partitions with "HOUR" time unit. But now dynamic partition doesn't support "HOUR" time unit and it was marked as "TODO". So I support the feature and it works.	2020-09-06 20:25:27 +08:00
ZhangYu0123	13e2cf172f	[Log] Add log for trace broker (#4505 ) Add tracing broker log. When fe get filestatus for distributing load task to broker, the broker maybe get empty files and not give correct error code. Add this log to easy track which broker process filestatus operation and we can get the error log.	2020-09-06 20:25:08 +08:00
Mingyu Chen	a390c9ee9f	[Bug] Fix bug that table ids is not set right for hadoop load job (#4535 ) We store all table ids involved in the Load Job in TransactionState. However, for Hadoop Load job, table ids are set incorrectly. This caused the WAITING_TXN phase to not correctly wait for the completion of the previous load transaction when doing the alter table, which caused some data version loss problems.	2020-09-04 17:39:37 +08:00
xy720	369e86bbe2	[Spark Load] [Bug] Load job's state will stay in ETL state all the time after FE restart (#4528 )	2020-09-04 15:56:54 +08:00
Youngwb	068707484d	Support sequence column for UNIQUE_KEYS Table (#4256 ) * add sequence col Co-authored-by: yangwenbo6 <yangwenbo3@jd.com>	2020-09-04 10:10:17 +08:00
Mingyu Chen	72f04ebdb8	[Bug] Fix bug that query meta fields has been sent twice (#4529 ) Fix bug that query meta fields has been sent twice. This bug is introduced by #4330 , and related PR is #4450	2020-09-04 09:31:14 +08:00
HappenLee	a64c3a7acd	[ODBC SCAN NODE] 3/4 Add ODBC_TABLE and ODBC_SCAN NODE in FE. (#4430 ) we can create odbc_table use SQL like ``` CREATE EXTERNAL TABLE `baseall_oracle` ( `k1` decimal(9, 3) NOT NULL COMMENT "", `k2` char(10) NOT NULL COMMENT "", `k3` datetime NOT NULL COMMENT "", `k5` varchar(20) NOT NULL COMMENT "", `k6` double NOT NULL COMMENT "" ) ENGINE=ODBC PROPERTIES ( "host" = "192.168.0.1", "port" = "8086", "user" = "happenlee", "password" = "doris", "database" = "doris", "table" = "baseall", "driver" = "Oracle 19 ODBC driver", "type" = "oracle" ); ``` Now we only support Oracle and MySQL Database and this feature default turned off by conf enable_odbc_table.	2020-09-04 09:30:01 +08:00
Mingyu Chen	5166a6c6bc	[Bug] function str_to_date()'s behavior on BE and FE is inconsistent (#4495 ) Main CL: 1. Copy the code from BE to implement the `str_to_date()` function in FE. 2. `str_to_date("2020-08-08", "%Y-%m-%d %H:%i:%s")` will return `2020-08-08 00:00:00` instead of `2020-08-08`.	2020-09-03 17:16:19 +08:00
Zhengguo Yang	d0d394ad7e	[SQL][Bug]fix semi/anti join error when table has delete sign column (#4498 ) It is possible to report "Illegal column/field reference'table2.DORIS_DELETE_SIGN' of semi-/anti-join" when executing a semi/anti join statement on a table with hidden columns. This is because the filter conditions of semi/anti join cannot added in the where statement. Now we add delete flag related where predicate in OlapScanNode level.	2020-09-03 17:15:15 +08:00
Zhengguo Yang	ac3bbdd3ab	[BatchDelete] Add a configuration indicating whether to enable the batch delete function (#4493 )	2020-09-03 16:56:37 +08:00
HaiBo Li	c01954719f	[Feature][Cache] Sql cache and partition cache #2581 (#4330 ) 1. Analyze what mode of cache can be used by query 2. Query cache before executing query in StmtExecutor 3. Two cache mode, sqlcache and partitioncache, are implemented	2020-09-02 19:18:42 +08:00
HappenLee	e6367e9ba8	[Bug] Support disable colocate join where join clause has join hint (#4497 ) Fix issue:#4496	2020-09-02 10:54:45 +08:00
xy720	f5ee854b6f	[Spark load][Bug] Fix column terminator for spark load (#4491 ) Support specifying column separator without back slash.	2020-09-02 10:54:03 +08:00
xinghuayu007	1a22f3b2ac	[SQL][Function] Validate the param of rand function in compile step (#4439 ) The param of rand() function should be literal, but current compiler ignore to validate the literal param of rand function, it is validated in execution step. This PR make it validated in compile step, and make it more earlier to find the usage error of rand() function.	2020-09-02 10:50:52 +08:00
Lijia Liu	f3a9f3f87c	Do not add exchange when table's distributioin satisfy the distribution requirements (#4482 ) In DistributedPlanner, do not add the unnecessary Exchanges. For case 1, we only need to judge that the table's distribute hash keys is a subset of the aggregate keys. For case 2, we should judge two conditions: - partition keys are also hash keys. - the table's distribute hash keys is a subset of the aggregate keys.	2020-09-01 11:34:53 +08:00
Mingyu Chen	d49566130b	[Bug] Fix bug of select @@sql_mode (#4484 ) Fix bug that `select @@sql_mode` throw error: Invalid number format.	2020-09-01 11:31:35 +08:00
xy720	7b67da30d2	[Spark Load] Redirect the spark launcher's log to a separated log file (#4470 )	2020-08-30 21:10:04 +08:00
Zhengguo Yang	3b7614e174	[Refactor] Use camelCase in thrift generated java sources (#4443 ) Use camelCase in thrift generated java sources to make us fe's code style is more unified	2020-08-28 13:28:11 +08:00
wyb	ec64789e89	[Bug][Colocation Join] Fix colocation balance endless loop bug (#4471 ) 1. Only one available backend. 2. All backends are checked but this round is not changed. For example, all backends are on the same host.	2020-08-28 09:27:57 +08:00
wyb	82940a4905	[Spark Load] Fix spark load bugs (#4464 ) 1. fix write dpp result when dpp throw exception 2. boolean value：true, false(IgnoreCase), 0, 1 3. wrong dest column for source data check 4. support * in source file path 5. if job state is cancelled or finished, submitPushTasks would throw all partitions have no load data exception, because tableToLoadPartitions was already cleaned up #3433	2020-08-27 23:40:33 +08:00
Youngwb	976e3bb219	[Bug][Compile] Add missing imports (#4468 ) Co-authored-by: yangwenbo6 <yangwenbo3@jd.com>	2020-08-27 18:14:11 +08:00
HangyuanLiu	ad738fa198	Add OLAP_ERR_DATE_QUALITY_ERR error status to display schema change failure (#4388 ) In the process of historical data transformation of materialized views, it may occur that the transformation fails due to data quality. Add an error status code ：OLAP_ERR_DATE_QUALITY_ERR to determine if a data problem is causing the failure #3344	2020-08-27 17:52:53 +08:00
gengjun-git	fe0c21bf93	[Bug] Fix mysql return bug (#4450 ) Send fields after first row arrived so that error packet can be send to client when exception thrown from coord.getNext(). Golang and Python can not identify error if fields packet arrived before error packet.	2020-08-27 12:17:24 +08:00
xueyan.li	3c784b9c90	[SQL] support StringLiteral try to cast BigInt (#4445 )	2020-08-27 12:15:28 +08:00
xy720	8c38c79104	[SparkLoad]Use the yarn command to get status and kill the application (#4383 ) This cl will use yarn command as follows to kill or get status of application running on YARN. ``` yarn --config confdir application <-kill \| -status> <Application ID> ```	2020-08-27 12:08:55 +08:00
Mingyu Chen	f218327dd9	[Mysql Compatibility] Support convert() and signed/unsigned interger cast (#4364 ) 1. Support convert(expr, target_type) function, which is same as CastExpr 2. Support cast (expr as signed/unsigned int) This is just for compatibility, the signed/unsigned specification is meaningless.	2020-08-27 12:07:58 +08:00
EmmyMiao87	78e1615db9	Show column display name on `Show Proc` stmt (#4446 ) The mv column with bitmap_union function is named `mv_bitmap_union_k1` inside of Doris. But this column name should not be shown to user in `Show Proc` stmt. Instead, using define expr is easier to understand. Change-Id: Id07274fef9b3a97c97f1635dd3d6cf7b09561c1e	2020-08-26 10:52:56 +08:00
EmmyMiao87	09129b5ddd	[MV] Keep the scale and precision of type when creating mv (#4436 ) The DECIMAL, CHAR, VARCHAR have their own scale and precision in column. The mv column should keep those scale and precision. Fixed #4433 Change-Id: Ie288738a4356e60d11ea472dd274e54bc7ae6990	2020-08-26 10:51:12 +08:00
EmmyMiao87	b4d8b3d9ba	Forbidden the illegal column types on BITMAP_UNION OR HLL_UNION mv (#4432 ) 1. The base column of bitmap_union could must be integer. The largeint is not supported too. 2. The base column of hll_union could not be decimal. Check error msg of const expr in Union Node If user wants to insert a negative number into bitmap mv, Doris will thrown exception 'invalid input'. The const value in Union Node is checked in this commit.	2020-08-26 10:49:32 +08:00
Stalary	ca5e224594	[Bug] Fix the bug that replication_num in show create table is incorrect (#4393 )	2020-08-26 10:43:59 +08:00
Mingyu Chen	763a42c9af	[MySQL Compatibility 2/4][Bug] Fix bug and improve compatibility with mysql protocol (#4362 ) 1. select database() will only return database name, without cluster name. 2. select user() will return the IP which user connected in.	2020-08-26 10:40:42 +08:00
Mingyu Chen	0040153c51	[MySQL Compatibility 1/4][Bug] Fix bug that set sql_mode with concat() function failed (#4359 ) Support `set sql_mode = concat(@@sql_mode, "STRICT_TRANS_TABLES");`	2020-08-26 10:28:25 +08:00
HangyuanLiu	b1c7841c20	[SQL] Fix TupleIsNull miss in SelectStmt resultExpr (#4279 )	2020-08-26 10:27:50 +08:00
Lijia Liu	d5a0a738f4	[SQL] Rewrite count(distinct if(bool, bitmap, null)) to bitmap_union_count (#4201 ) Add IF(BOOL, BITMAP, BITMAP) function.	2020-08-26 10:26:40 +08:00
wyb	691227922e	[SQL Plan]Fix explicit broadcast join bug (#4424 ) Use broadcast join when users specify explicitly [BROADCAST] in queries.	2020-08-25 22:06:45 +08:00
Mingyu Chen	67b842ce04	[License] Organize and modify the license of the code (#4371 ) 1. Disable the MySQL client and LZO library by default when building the Doris. MySQL client library is used for MySQL external table feature. This feature will be replaced by the new ODBC external table soon. LZO library is used to compress/decompress data of some old data format of Doris, which is no longer used anymore. 2. Add missing license to some files. 3. For all non-Apache-License code, all are explained in NOTICE file and the corresponding license is declared. 4. Remove the js source code from webroot, it will be downloaded as thirdparty	2020-08-24 21:51:55 +08:00
Mingyu Chen	976820ba20	[SegmentV2] Change the default storage format to SegmentV2 (#4387 ) Since the Segment V2 has been released for a long time, we should make it as default storage format for newly created table. This CL mainly changes: 1. For all newly created tables, their default storage format is Segment V2. 2. For all already exist tablets, their storage format remain unchanged. 3. Fix bugs described in Fix #4384 and Fix #4385	2020-08-24 21:51:17 +08:00
Zhengguo Yang	af2b749a87	make some readFields Deprecated (#4399 ) We have changed most of our serialization methods to json. In order to be compatible with previous data, these classes still retain the readFields method. Some prs that involve modifying metadata often modify the readFields method. To avoid this, we should Mark these methods as Deprecated #4398	2020-08-21 22:58:08 +08:00
Zhengguo Yang	d61c10b761	[Delete] Support batch delete [part 1] (#4310 ) * Implements the grammar of the batch delete #4051 * Process create, alter table when table has delete sign column * Support the syntax for enabling the delete column * Automatically filtered deleted data in the select statement. * Automatically add delete sign when create rollup table TODO: * Optimize the reading and compaction logic on the be side, so that the data marked as deleted will be completely deleted during base compaction	2020-08-21 22:57:16 +08:00
EmmyMiao87	76a04de6c4	[MV] Input correct keys type of index meta when `Add Partition` (#4408 ) Define Expr will not serialized in Column `toThrift`. 1. When adding partition, different indexes should use their own keys type instead of using the keys type of base table uniformly. ` 2. There are two kinds of define expr in Column , one is analyzed, and the other is not analyzed. Currently, analyzed define expr is only used when creating materialized views, so the define expr in RollupJob must be analyzed. In other cases, such as define expr in `MaterializedIndexMeta`, it may not be analyzed after being relayed. When executing the load, the analyzed define expr (such as to_bitmap(cast(k1, varchar))) will not be analyzed again. Only a cast function will be added to the inner layer(such as to_bitmap(cast(cast(k1 ,int), varchar))) which is analyzed too. The define expr that has not been analyzed (such as cast(k1, varchar)) will be analyzed when executing the load.	2020-08-21 10:42:41 +08:00
EmmyMiao87	09b1965499	[MV] Fix errors when alter materialized view which based on dup table (#4375 ) 1. Input the correct keys type when mv is updated. The keys type of mv should be used in schema change job rather then keys type of base table. Otherwise, the be will core and thrown exception "Create replicas failed". 2. Forbidden add non-key column on agg mv directly when base table is duplicate model If a dup table has a agg mv, user will not add a non-key column on mv. The non-key column can only be added to dup index.	2020-08-21 10:36:03 +08:00
EmmyMiao87	6bb111b42c	Modify mv rewrite rule on 'Count distinct' (#4382 ) The rewrite rule named `CountToSum` does not distinguish between `Count` and `Count distinct` which causes `Count distinct` is rewritten as `Sum` incorrectly. So this commit modified matching rule. When the function is `Count distinct`, the rewrite rule will not take effect. Fixed #4381	2020-08-20 09:30:35 +08:00
xinghuayu007	bfb39a2826	[SQL][Function] Add replace() function (#4347 ) replace is an user defined function, which is to replace all old substrings with a new substring in a string, as follow: mysql> select replace("http://www.baidu.com:9090", "9090", ""); +------------------------------------------------------+ \| replace('http://www.baidu.com:9090', '9090', '') \| +------------------------------------------------------+ \| http://www.baidu.com: \| +------------------------------------------------------+	2020-08-20 09:28:53 +08:00
Mingyu Chen	38a2a7a269	[Bug] Fix bug that modification of global variable can not be persisted. (#4324 ) When setting global variables, such as `set global default_rowset_type=beta`, the operation is not correctly persisted. This CL change the fe meta version to 90. --------------- The main reason for this problem is that for the modification of global variable, we directly use Java's reflection mechanism to modify static member variables in `GlobalVariable` class. But in the persistence method of the `set` operation, we only persist the value stored in the `globalSessionVariable` variable, and this variable does not contain Global Variable. So I added a new OperationType: `OP_GLOBAL_VARIABLE_V2`, and added a `GlobalVarPersistInfo` class to record all changes.	2020-08-18 16:54:35 +08:00
Mingyu Chen	3359467b9a	[Tablet][Recovery] Support using empty tablet to repair the damaged or missing tablet (#4255 ) In some very special circumstances, such as code bugs, or human misoperation, etc., all replicas of some tablets may be lost. In this case, the data has been substantially lost. However, in some scenarios, the business still hopes to ensure that the query will not report errors even if there is data loss, and reduce the perception of the user layer. At this point, we can use the blank Tablet to fill the missing replica to ensure that the query can be executed normally. Add a new FE config `recover_with_empty_tablet`. default is false. true means to use empty tablet to fill the missing one. Also fix a bug in Fix #4274	2020-08-18 06:13:53 +00:00
caoyang10	53d00d92cc	[Doris On ES][Bug-Fix] ES queries always route at same 3 BE nodes (#4351 ) (#4352 ) resolve the problem of querying ES table always route at same 3 BE nodes because of random strategy	2020-08-18 10:36:18 +08:00
xueyan.li	e69496feaf	[MysqlCompatibility] Support collate field option in expr (#4365 ) Support SQL like: ``` select collation_name, character_set_name, is_default collate utf8_general_ci = 'Yes' as is_default from information_schema.collations ```	2020-08-17 22:52:57 +08:00
EmmyMiao87	38921d4343	[MV]Forbidden aggregated partition key column on mv (#4343 ) The partition column of table also must be the key in materialized view. If not, when user wants to add partition of table, the be will core. The materialized view could not create partition correctly when partition column has been aggregated.	2020-08-15 11:38:50 +08:00
HangyuanLiu	4fa35c9f39	[Bug][RoutineLoad] Fix routine load timezone property invalid (#4339 )	2020-08-13 23:40:54 +08:00
xueyan.li	ac9c7741e9	[SQL]Support datagrip show database information (#4332 ) Support show schema()	2020-08-13 23:39:05 +08:00

1 2 3 4 5 ...

1140 Commits