Commit Graph

1152 Commits

Author SHA1 Message Date
356bfcf065 fix (#4577)
Fix batch delete not work when using broker load, becuase of merger type not initialized
2020-09-11 09:45:22 +08:00
81784d6471 Revert "Add a session variable to show or hide hidden columns (#4510)" (#4576)
This reverts commit fe0260e54f8dfa37260423cffcf42096de19ed1f.
2020-09-10 15:18:36 +08:00
fe0260e54f Add a session variable to show or hide hidden columns (#4510)
* add session variable to show hidden columns
2020-09-10 13:07:43 +08:00
c4feb82229 [Bug] Fix null pointer when replay RollupJob (#4572)
When fe replay rollup job(v1) with deleted table, it will throw null pointer exception and exit.
This commit ignore this error and print a warning log to avoid fe exit.

Fixed #4571

Change-Id: I302b554a94d42aee645db6b224cd989e00cd3ca6
2020-09-10 11:55:14 +08:00
7d3fae1257 Remove primary key type (#4555)
disable unsupported primay key table, and optimize unique table toSql(),
make show create table output can exec directly
2020-09-10 09:57:06 +08:00
97dee1f7cf [Bug] Fix enable feature error (#4558)
Fix enable feature may failed, because of base index pass to alter table.
2020-09-10 09:55:19 +08:00
e1225067b8 [UT] Fix ut test (#4545)
Fix SparkLoadPendingTask unit test.
2020-09-08 09:25:57 +08:00
e55327bbc7 [Bug] Fix bug that task_worker_pool not work (#4543)
The number of thread initialized in task worker pool is not right.
This bug is introduced from #4440
2020-09-08 09:25:36 +08:00
8be04fac64 [Variable] Add a new global variable "performance_schema" to support Mysql JDBC 8.0.16 or later.#4537# (#4542) 2020-09-08 09:25:20 +08:00
aae942b982 [Spark Load][Bug] Keep the column splitting in spark load consistent with broker load / mini load (#4532) 2020-09-06 20:33:26 +08:00
8f10317e0d [Spark load][Bug] fix that cancelling a spark load in the PENDING phase will not succeed (#4536) 2020-09-06 20:32:47 +08:00
bcb443fb63 [Feature]Support SELECT Optimizer Hints SET_VAR (#4504) 2020-09-06 20:27:53 +08:00
03d9f6d8b4 [Feature] support hour time unit with dynamic parition (#4514)
Many tables are so large that need seperate partitions with "HOUR" time unit.
But now dynamic partition doesn't support "HOUR" time unit and it was marked as "TODO".
So I support the feature and it works.
2020-09-06 20:25:27 +08:00
13e2cf172f [Log] Add log for trace broker (#4505)
Add tracing broker log. When fe get filestatus for distributing load task to broker,
the broker maybe get empty files and not give correct error code.
Add this log to easy track which broker process filestatus operation and we can get the error log.
2020-09-06 20:25:08 +08:00
a390c9ee9f [Bug] Fix bug that table ids is not set right for hadoop load job (#4535)
We store all table ids involved in the Load Job in TransactionState.
However, for Hadoop Load job, table ids are set incorrectly.

This caused the WAITING_TXN phase to not correctly wait for the completion
of the previous load transaction when doing the alter table,
which caused some data version loss problems.
2020-09-04 17:39:37 +08:00
369e86bbe2 [Spark Load] [Bug] Load job's state will stay in ETL state all the time after FE restart (#4528) 2020-09-04 15:56:54 +08:00
068707484d Support sequence column for UNIQUE_KEYS Table (#4256)
* add sequence  col

Co-authored-by: yangwenbo6 <yangwenbo3@jd.com>
2020-09-04 10:10:17 +08:00
72f04ebdb8 [Bug] Fix bug that query meta fields has been sent twice (#4529)
Fix bug that query meta fields has been sent twice.
This bug is introduced by #4330 , and related PR is #4450
2020-09-04 09:31:14 +08:00
a64c3a7acd [ODBC SCAN NODE] 3/4 Add ODBC_TABLE and ODBC_SCAN NODE in FE. (#4430)
we can create odbc_table use SQL like

```
CREATE EXTERNAL TABLE `baseall_oracle` (
  `k1` decimal(9, 3) NOT NULL COMMENT "",
  `k2` char(10) NOT NULL COMMENT "",
  `k3` datetime NOT NULL COMMENT "",
  `k5` varchar(20) NOT NULL COMMENT "",
  `k6` double NOT NULL COMMENT ""
) ENGINE=ODBC
PROPERTIES (
"host" = "192.168.0.1",
"port" = "8086",
"user" = "happenlee",
"password" = "doris",
"database" = "doris",
"table" = "baseall",
"driver" = "Oracle 19 ODBC driver",
"type" = "oracle"
);
```

Now we only support Oracle and MySQL Database and this feature default turned off by conf enable_odbc_table.
2020-09-04 09:30:01 +08:00
5166a6c6bc [Bug] function str_to_date()'s behavior on BE and FE is inconsistent (#4495)
Main CL:
1. Copy the code from BE to implement the `str_to_date()` function in FE. 
2. `str_to_date("2020-08-08", "%Y-%m-%d %H:%i:%s")` will return `2020-08-08 00:00:00` instead of `2020-08-08`.
2020-09-03 17:16:19 +08:00
d0d394ad7e [SQL][Bug]fix semi/anti join error when table has delete sign column (#4498)
It is possible to report "Illegal column/field reference'table2.DORIS_DELETE_SIGN' of semi-/anti-join"
when executing a semi/anti join statement on a table with hidden columns.
This is because the filter conditions of semi/anti join cannot added in the where statement.

Now we add delete flag related where predicate in OlapScanNode level.
2020-09-03 17:15:15 +08:00
ac3bbdd3ab [BatchDelete] Add a configuration indicating whether to enable the batch delete function (#4493) 2020-09-03 16:56:37 +08:00
c01954719f [Feature][Cache] Sql cache and partition cache #2581 (#4330)
1. Analyze what mode of cache can be used by query
2. Query cache before executing query in StmtExecutor
3. Two cache mode, sqlcache and partitioncache, are implemented
2020-09-02 19:18:42 +08:00
e6367e9ba8 [Bug] Support disable colocate join where join clause has join hint (#4497)
Fix issue:#4496
2020-09-02 10:54:45 +08:00
f5ee854b6f [Spark load][Bug] Fix column terminator for spark load (#4491)
Support specifying column separator without back slash.
2020-09-02 10:54:03 +08:00
1a22f3b2ac [SQL][Function] Validate the param of rand function in compile step (#4439)
The param of rand() function should be literal, but current compiler ignore to
validate the literal param of rand function, it is validated in execution step.
This PR make it validated in compile step, and make it more earlier to find the usage error of rand() function.
2020-09-02 10:50:52 +08:00
f3a9f3f87c Do not add exchange when table's distributioin satisfy the distribution requirements (#4482)
In DistributedPlanner, do not add the unnecessary Exchanges.
For case 1, we only need to judge that the table's distribute hash keys is a subset of the aggregate keys.
For case 2, we should judge two conditions:
 - partition keys are also hash keys.
 - the table's distribute hash keys is a subset of the aggregate keys.
2020-09-01 11:34:53 +08:00
d49566130b [Bug] Fix bug of select @@sql_mode (#4484)
Fix bug that `select @@sql_mode` throw error: Invalid number format.
2020-09-01 11:31:35 +08:00
7b67da30d2 [Spark Load] Redirect the spark launcher's log to a separated log file (#4470) 2020-08-30 21:10:04 +08:00
3b7614e174 [Refactor] Use camelCase in thrift generated java sources (#4443)
Use camelCase in thrift generated java sources to make us fe's code style is more unified
2020-08-28 13:28:11 +08:00
wyb
ec64789e89 [Bug][Colocation Join] Fix colocation balance endless loop bug (#4471)
1. Only one available backend.
2. All backends are checked but this round is not changed. For example, all backends are on the same host.
2020-08-28 09:27:57 +08:00
wyb
82940a4905 [Spark Load] Fix spark load bugs (#4464)
1. fix write dpp result when dpp throw exception
2. boolean value:true, false(IgnoreCase), 0, 1
3. wrong dest column for source data check
4. support * in source file path 
5. if job state is cancelled or finished, submitPushTasks would throw all partitions have no load data exception,
    because tableToLoadPartitions was already cleaned up

#3433
2020-08-27 23:40:33 +08:00
976e3bb219 [Bug][Compile] Add missing imports (#4468)
Co-authored-by: yangwenbo6 <yangwenbo3@jd.com>
2020-08-27 18:14:11 +08:00
ad738fa198 Add OLAP_ERR_DATE_QUALITY_ERR error status to display schema change failure (#4388)
In the process of historical data transformation of materialized views, it may occur that the transformation fails due to data quality.
Add an error status code :OLAP_ERR_DATE_QUALITY_ERR to determine if a data problem is causing the failure

#3344
2020-08-27 17:52:53 +08:00
fe0c21bf93 [Bug] Fix mysql return bug (#4450)
Send fields after first row arrived so that error packet can be send to client
when exception thrown from coord.getNext(). 
Golang and Python can not identify error if fields packet arrived before error packet.
2020-08-27 12:17:24 +08:00
3c784b9c90 [SQL] support StringLiteral try to cast BigInt (#4445) 2020-08-27 12:15:28 +08:00
8c38c79104 [SparkLoad]Use the yarn command to get status and kill the application (#4383)
This cl will use yarn command as follows to kill or get status of application running on YARN.

```
yarn --config confdir application <-kill | -status> <Application ID>
```
2020-08-27 12:08:55 +08:00
f218327dd9 [Mysql Compatibility] Support convert() and signed/unsigned interger cast (#4364)
1. Support convert(expr, target_type) function, which is same as CastExpr
2. Support cast (expr as signed/unsigned int)
   This is just for compatibility, the signed/unsigned specification is meaningless.
2020-08-27 12:07:58 +08:00
78e1615db9 Show column display name on Show Proc stmt (#4446)
The mv column with bitmap_union function is named `mv_bitmap_union_k1` inside of Doris.
But this column name should not be shown to user in `Show Proc` stmt.
Instead, using define expr is easier to understand.

Change-Id: Id07274fef9b3a97c97f1635dd3d6cf7b09561c1e
2020-08-26 10:52:56 +08:00
09129b5ddd [MV] Keep the scale and precision of type when creating mv (#4436)
The DECIMAL, CHAR, VARCHAR have their own scale and precision in column.
The mv column should keep those scale and precision.

Fixed #4433
Change-Id: Ie288738a4356e60d11ea472dd274e54bc7ae6990
2020-08-26 10:51:12 +08:00
b4d8b3d9ba Forbidden the illegal column types on BITMAP_UNION OR HLL_UNION mv (#4432)
1. The base column of bitmap_union could must be integer. The largeint is not supported too.
2. The base column of hll_union could not be decimal.

Check error msg of const expr in Union Node

If user wants to insert a negative number into bitmap mv, Doris will thrown exception 'invalid input'.
The const value in Union Node is checked in this commit.
2020-08-26 10:49:32 +08:00
ca5e224594 [Bug] Fix the bug that replication_num in show create table is incorrect (#4393) 2020-08-26 10:43:59 +08:00
763a42c9af [MySQL Compatibility 2/4][Bug] Fix bug and improve compatibility with mysql protocol (#4362)
1. select database() will only return database name, without cluster name.
2. select user() will return the IP which user connected in.
2020-08-26 10:40:42 +08:00
0040153c51 [MySQL Compatibility 1/4][Bug] Fix bug that set sql_mode with concat() function failed (#4359)
Support `set sql_mode = concat(@@sql_mode, "STRICT_TRANS_TABLES");`
2020-08-26 10:28:25 +08:00
b1c7841c20 [SQL] Fix TupleIsNull miss in SelectStmt resultExpr (#4279) 2020-08-26 10:27:50 +08:00
d5a0a738f4 [SQL] Rewrite count(distinct if(bool, bitmap, null)) to bitmap_union_count (#4201)
Add IF(BOOL, BITMAP, BITMAP) function.
2020-08-26 10:26:40 +08:00
wyb
691227922e [SQL Plan]Fix explicit broadcast join bug (#4424)
Use broadcast join when users specify explicitly [BROADCAST] in queries.
2020-08-25 22:06:45 +08:00
67b842ce04 [License] Organize and modify the license of the code (#4371)
1. Disable the MySQL client and LZO library by default when building the Doris.

    MySQL client library is used for MySQL external table feature.
    This feature will be replaced by the new ODBC external table soon.

    LZO library is used to compress/decompress data of some old data format of Doris,
    which is no longer used anymore.

2. Add missing license to some files.

3. For all non-Apache-License code, all are explained in NOTICE file and the corresponding license is declared.

4. Remove the js source code from webroot, it will be downloaded as thirdparty
2020-08-24 21:51:55 +08:00
976820ba20 [SegmentV2] Change the default storage format to SegmentV2 (#4387)
Since the Segment V2 has been released for a long time, we should make it as default storage format for newly created table.

This CL mainly changes:
1. For all newly created tables, their default storage format is Segment V2.
2. For all already exist tablets, their storage format remain unchanged.
3. Fix  bugs described in Fix #4384 and Fix #4385
2020-08-24 21:51:17 +08:00
af2b749a87 make some readFields Deprecated (#4399)
We have changed most of our serialization methods to json. In order to be compatible with previous data, these classes still retain the readFields method. Some prs that involve modifying metadata often modify the readFields method. To avoid this, we should Mark these methods as Deprecated #4398
2020-08-21 22:58:08 +08:00