doris

Author	SHA1	Message	Date
xinghuayu007	1a22f3b2ac	[SQL][Function] Validate the param of rand function in compile step (#4439 ) The param of rand() function should be literal, but current compiler ignore to validate the literal param of rand function, it is validated in execution step. This PR make it validated in compile step, and make it more earlier to find the usage error of rand() function.	2020-09-02 10:50:52 +08:00
Lijia Liu	f3a9f3f87c	Do not add exchange when table's distributioin satisfy the distribution requirements (#4482 ) In DistributedPlanner, do not add the unnecessary Exchanges. For case 1, we only need to judge that the table's distribute hash keys is a subset of the aggregate keys. For case 2, we should judge two conditions: - partition keys are also hash keys. - the table's distribute hash keys is a subset of the aggregate keys.	2020-09-01 11:34:53 +08:00
Mingyu Chen	d49566130b	[Bug] Fix bug of select @@sql_mode (#4484 ) Fix bug that `select @@sql_mode` throw error: Invalid number format.	2020-09-01 11:31:35 +08:00
xy720	7b67da30d2	[Spark Load] Redirect the spark launcher's log to a separated log file (#4470 )	2020-08-30 21:10:04 +08:00
Zhengguo Yang	3b7614e174	[Refactor] Use camelCase in thrift generated java sources (#4443 ) Use camelCase in thrift generated java sources to make us fe's code style is more unified	2020-08-28 13:28:11 +08:00
wyb	ec64789e89	[Bug][Colocation Join] Fix colocation balance endless loop bug (#4471 ) 1. Only one available backend. 2. All backends are checked but this round is not changed. For example, all backends are on the same host.	2020-08-28 09:27:57 +08:00
wyb	82940a4905	[Spark Load] Fix spark load bugs (#4464 ) 1. fix write dpp result when dpp throw exception 2. boolean value：true, false(IgnoreCase), 0, 1 3. wrong dest column for source data check 4. support * in source file path 5. if job state is cancelled or finished, submitPushTasks would throw all partitions have no load data exception, because tableToLoadPartitions was already cleaned up #3433	2020-08-27 23:40:33 +08:00
Youngwb	976e3bb219	[Bug][Compile] Add missing imports (#4468 ) Co-authored-by: yangwenbo6 <yangwenbo3@jd.com>	2020-08-27 18:14:11 +08:00
HangyuanLiu	ad738fa198	Add OLAP_ERR_DATE_QUALITY_ERR error status to display schema change failure (#4388 ) In the process of historical data transformation of materialized views, it may occur that the transformation fails due to data quality. Add an error status code ：OLAP_ERR_DATE_QUALITY_ERR to determine if a data problem is causing the failure #3344	2020-08-27 17:52:53 +08:00
gengjun-git	fe0c21bf93	[Bug] Fix mysql return bug (#4450 ) Send fields after first row arrived so that error packet can be send to client when exception thrown from coord.getNext(). Golang and Python can not identify error if fields packet arrived before error packet.	2020-08-27 12:17:24 +08:00
xueyan.li	3c784b9c90	[SQL] support StringLiteral try to cast BigInt (#4445 )	2020-08-27 12:15:28 +08:00
xy720	8c38c79104	[SparkLoad]Use the yarn command to get status and kill the application (#4383 ) This cl will use yarn command as follows to kill or get status of application running on YARN. ``` yarn --config confdir application <-kill \| -status> <Application ID> ```	2020-08-27 12:08:55 +08:00
Mingyu Chen	f218327dd9	[Mysql Compatibility] Support convert() and signed/unsigned interger cast (#4364 ) 1. Support convert(expr, target_type) function, which is same as CastExpr 2. Support cast (expr as signed/unsigned int) This is just for compatibility, the signed/unsigned specification is meaningless.	2020-08-27 12:07:58 +08:00
EmmyMiao87	78e1615db9	Show column display name on `Show Proc` stmt (#4446 ) The mv column with bitmap_union function is named `mv_bitmap_union_k1` inside of Doris. But this column name should not be shown to user in `Show Proc` stmt. Instead, using define expr is easier to understand. Change-Id: Id07274fef9b3a97c97f1635dd3d6cf7b09561c1e	2020-08-26 10:52:56 +08:00
EmmyMiao87	09129b5ddd	[MV] Keep the scale and precision of type when creating mv (#4436 ) The DECIMAL, CHAR, VARCHAR have their own scale and precision in column. The mv column should keep those scale and precision. Fixed #4433 Change-Id: Ie288738a4356e60d11ea472dd274e54bc7ae6990	2020-08-26 10:51:12 +08:00
EmmyMiao87	b4d8b3d9ba	Forbidden the illegal column types on BITMAP_UNION OR HLL_UNION mv (#4432 ) 1. The base column of bitmap_union could must be integer. The largeint is not supported too. 2. The base column of hll_union could not be decimal. Check error msg of const expr in Union Node If user wants to insert a negative number into bitmap mv, Doris will thrown exception 'invalid input'. The const value in Union Node is checked in this commit.	2020-08-26 10:49:32 +08:00
Stalary	ca5e224594	[Bug] Fix the bug that replication_num in show create table is incorrect (#4393 )	2020-08-26 10:43:59 +08:00
Mingyu Chen	763a42c9af	[MySQL Compatibility 2/4][Bug] Fix bug and improve compatibility with mysql protocol (#4362 ) 1. select database() will only return database name, without cluster name. 2. select user() will return the IP which user connected in.	2020-08-26 10:40:42 +08:00
Mingyu Chen	0040153c51	[MySQL Compatibility 1/4][Bug] Fix bug that set sql_mode with concat() function failed (#4359 ) Support `set sql_mode = concat(@@sql_mode, "STRICT_TRANS_TABLES");`	2020-08-26 10:28:25 +08:00
HangyuanLiu	b1c7841c20	[SQL] Fix TupleIsNull miss in SelectStmt resultExpr (#4279 )	2020-08-26 10:27:50 +08:00
Lijia Liu	d5a0a738f4	[SQL] Rewrite count(distinct if(bool, bitmap, null)) to bitmap_union_count (#4201 ) Add IF(BOOL, BITMAP, BITMAP) function.	2020-08-26 10:26:40 +08:00
wyb	691227922e	[SQL Plan]Fix explicit broadcast join bug (#4424 ) Use broadcast join when users specify explicitly [BROADCAST] in queries.	2020-08-25 22:06:45 +08:00
Mingyu Chen	67b842ce04	[License] Organize and modify the license of the code (#4371 ) 1. Disable the MySQL client and LZO library by default when building the Doris. MySQL client library is used for MySQL external table feature. This feature will be replaced by the new ODBC external table soon. LZO library is used to compress/decompress data of some old data format of Doris, which is no longer used anymore. 2. Add missing license to some files. 3. For all non-Apache-License code, all are explained in NOTICE file and the corresponding license is declared. 4. Remove the js source code from webroot, it will be downloaded as thirdparty	2020-08-24 21:51:55 +08:00
Mingyu Chen	976820ba20	[SegmentV2] Change the default storage format to SegmentV2 (#4387 ) Since the Segment V2 has been released for a long time, we should make it as default storage format for newly created table. This CL mainly changes: 1. For all newly created tables, their default storage format is Segment V2. 2. For all already exist tablets, their storage format remain unchanged. 3. Fix bugs described in Fix #4384 and Fix #4385	2020-08-24 21:51:17 +08:00
Zhengguo Yang	af2b749a87	make some readFields Deprecated (#4399 ) We have changed most of our serialization methods to json. In order to be compatible with previous data, these classes still retain the readFields method. Some prs that involve modifying metadata often modify the readFields method. To avoid this, we should Mark these methods as Deprecated #4398	2020-08-21 22:58:08 +08:00
Zhengguo Yang	d61c10b761	[Delete] Support batch delete [part 1] (#4310 ) * Implements the grammar of the batch delete #4051 * Process create, alter table when table has delete sign column * Support the syntax for enabling the delete column * Automatically filtered deleted data in the select statement. * Automatically add delete sign when create rollup table TODO: * Optimize the reading and compaction logic on the be side, so that the data marked as deleted will be completely deleted during base compaction	2020-08-21 22:57:16 +08:00
EmmyMiao87	76a04de6c4	[MV] Input correct keys type of index meta when `Add Partition` (#4408 ) Define Expr will not serialized in Column `toThrift`. 1. When adding partition, different indexes should use their own keys type instead of using the keys type of base table uniformly. ` 2. There are two kinds of define expr in Column , one is analyzed, and the other is not analyzed. Currently, analyzed define expr is only used when creating materialized views, so the define expr in RollupJob must be analyzed. In other cases, such as define expr in `MaterializedIndexMeta`, it may not be analyzed after being relayed. When executing the load, the analyzed define expr (such as to_bitmap(cast(k1, varchar))) will not be analyzed again. Only a cast function will be added to the inner layer(such as to_bitmap(cast(cast(k1 ,int), varchar))) which is analyzed too. The define expr that has not been analyzed (such as cast(k1, varchar)) will be analyzed when executing the load.	2020-08-21 10:42:41 +08:00
EmmyMiao87	09b1965499	[MV] Fix errors when alter materialized view which based on dup table (#4375 ) 1. Input the correct keys type when mv is updated. The keys type of mv should be used in schema change job rather then keys type of base table. Otherwise, the be will core and thrown exception "Create replicas failed". 2. Forbidden add non-key column on agg mv directly when base table is duplicate model If a dup table has a agg mv, user will not add a non-key column on mv. The non-key column can only be added to dup index.	2020-08-21 10:36:03 +08:00
EmmyMiao87	6bb111b42c	Modify mv rewrite rule on 'Count distinct' (#4382 ) The rewrite rule named `CountToSum` does not distinguish between `Count` and `Count distinct` which causes `Count distinct` is rewritten as `Sum` incorrectly. So this commit modified matching rule. When the function is `Count distinct`, the rewrite rule will not take effect. Fixed #4381	2020-08-20 09:30:35 +08:00
xinghuayu007	bfb39a2826	[SQL][Function] Add replace() function (#4347 ) replace is an user defined function, which is to replace all old substrings with a new substring in a string, as follow: mysql> select replace("http://www.baidu.com:9090", "9090", ""); +------------------------------------------------------+ \| replace('http://www.baidu.com:9090', '9090', '') \| +------------------------------------------------------+ \| http://www.baidu.com: \| +------------------------------------------------------+	2020-08-20 09:28:53 +08:00
Mingyu Chen	38a2a7a269	[Bug] Fix bug that modification of global variable can not be persisted. (#4324 ) When setting global variables, such as `set global default_rowset_type=beta`, the operation is not correctly persisted. This CL change the fe meta version to 90. --------------- The main reason for this problem is that for the modification of global variable, we directly use Java's reflection mechanism to modify static member variables in `GlobalVariable` class. But in the persistence method of the `set` operation, we only persist the value stored in the `globalSessionVariable` variable, and this variable does not contain Global Variable. So I added a new OperationType: `OP_GLOBAL_VARIABLE_V2`, and added a `GlobalVarPersistInfo` class to record all changes.	2020-08-18 16:54:35 +08:00
Mingyu Chen	3359467b9a	[Tablet][Recovery] Support using empty tablet to repair the damaged or missing tablet (#4255 ) In some very special circumstances, such as code bugs, or human misoperation, etc., all replicas of some tablets may be lost. In this case, the data has been substantially lost. However, in some scenarios, the business still hopes to ensure that the query will not report errors even if there is data loss, and reduce the perception of the user layer. At this point, we can use the blank Tablet to fill the missing replica to ensure that the query can be executed normally. Add a new FE config `recover_with_empty_tablet`. default is false. true means to use empty tablet to fill the missing one. Also fix a bug in Fix #4274	2020-08-18 06:13:53 +00:00
caoyang10	53d00d92cc	[Doris On ES][Bug-Fix] ES queries always route at same 3 BE nodes (#4351 ) (#4352 ) resolve the problem of querying ES table always route at same 3 BE nodes because of random strategy	2020-08-18 10:36:18 +08:00
xueyan.li	e69496feaf	[MysqlCompatibility] Support collate field option in expr (#4365 ) Support SQL like: ``` select collation_name, character_set_name, is_default collate utf8_general_ci = 'Yes' as is_default from information_schema.collations ```	2020-08-17 22:52:57 +08:00
EmmyMiao87	38921d4343	[MV]Forbidden aggregated partition key column on mv (#4343 ) The partition column of table also must be the key in materialized view. If not, when user wants to add partition of table, the be will core. The materialized view could not create partition correctly when partition column has been aggregated.	2020-08-15 11:38:50 +08:00
HangyuanLiu	4fa35c9f39	[Bug][RoutineLoad] Fix routine load timezone property invalid (#4339 )	2020-08-13 23:40:54 +08:00
xueyan.li	ac9c7741e9	[SQL]Support datagrip show database information (#4332 ) Support show schema()	2020-08-13 23:39:05 +08:00
wangbo	790779fb6f	[SparkLoad]remove unncessary convert from dataframe to rdd (#4304 )	2020-08-13 23:37:38 +08:00
gengjun-git	48d89e06c3	[Bug fix]fix query id assign bug (#4291 )	2020-08-12 22:42:36 +08:00
EmmyMiao87	98fe80dd5a	[MV]Forbidden no grouping mv on aggregation table (#4317 ) If user wants to create a no grouping mv on aggregation table, the doris will thrown exception. The correct approach is that explicit declare the grouping column. For example: Agg table: k1, k2, sum(k3) Create materialized view stmt: select k1, k2 from agg_table group by k1, k2. Fixed #4316	2020-08-12 20:57:25 +08:00
HappenLee	3354645c77	[BugFix][ColocateJoin] Fix bug of issue 4305 (#4306 ) This PR use fragmentIdToSeqToAddressMap replace seqtoAddresss, Beacause SeqBucket to Address should bind to fragment	2020-08-12 12:11:47 +08:00
hexiang55	48f3ba35ec	[Doris On ES][Bug-Fix] Resolve NullPointerException when multi fields with `text` type (#4300 )	2020-08-11 12:09:17 +08:00
HangyuanLiu	493c88c1d6	[BUG] Fix NPE when distinct in predicate push down (#4294 ) Describe the bug Predicate push down where sub query has distinct may throw NPE To Reproduce Steps to reproduce the behavior: 1. create table like ``` +--------------+--------------+------+-------+---------+---------+ \| Field \| Type \| Null \| Key \| Default \| Extra \| +--------------+--------------+------+-------+---------+---------+ \| event_day \| DATETIME \| No \| true \| NULL \| \| \| title \| VARCHAR(600) \| No \| true \| NULL \| \| \| report_value \| VARCHAR(50) \| No \| false \| NULL \| REPLACE \| +--------------+--------------+------+-------+---------+---------+ ``` 2. exec query ``` ```SELECT * FROM ( SELECT DISTINCT event_day, title FROM click_show_window ) a WHERE a.title IS NOT NULL ``` 4. See error ``` ERROR 1064 (HY000): errCode = 2, detailMessage = Unexpected exception: null ``` This is because DISTINCT generate grouping exprs in agginfo, but this clause does not have a group by clause	2020-08-11 11:07:51 +08:00
Lijia Liu	a480dec7a4	Do not wrap NULL type tuple (#4245 ) Do not wrap NULL type expr to IF(TupleIsNull(tids), NULL, expr)	2020-08-11 09:38:42 +08:00
HangyuanLiu	6abb374d0c	Fix duplicate table export fail (#4293 )	2020-08-11 09:37:43 +08:00
HaiBo Li	4ad943e45d	[Feature][Cache] Cache proxy and coordinator #2581 (#4248 ) * [Feature][Cache] Cache proxy and coordinator #2581 1. Cache's abstract proxy class and BE's Cache implementation 2. Cache coordinator implemented by consistent hashing * Adjusted the formatting code, naming and variables according to the comments	2020-08-10 16:40:25 +08:00
xinghuayu007	411ced5715	Secure singleton mode (#4257 ) Co-authored-by: wangxixu <wangxixu@xiaomi.com>	2020-08-10 11:26:56 +08:00
kangkaisen	f516172f23	Fix window function with limit zero bug 2 (#4235 )	2020-08-10 10:29:05 +08:00
HappenLee	47fff6841b	[Bug][ColocateJoin] Fix bug of #4287 and #4285 of Colocatejoin (#4289 ) 1.Table join itself should have same single partition to valid colocate join. 2.Check eqjoinConjuncts column order to valid colocate join.	2020-08-09 20:48:36 +08:00
gengjun-git	a54b0eab0c	[Bug]fix cancel query bug (#4275 ) ConnectContext.kill() use executor to cancel query, but executor has never been set.	2020-08-08 20:29:32 +08:00

1 2 3 4 5 ...

1127 Commits