Commit Graph

6608 Commits

Author SHA1 Message Date
04cc6eaadc [Log] Fix a mistake in DorisDynamicOutputFormat.java (#5963)
Fix a mistake in DorisDynamicOutputFormat.java
2021-06-06 22:06:57 +08:00
4b23bca2be [Refactor] catch throwable instead of exception (#5942)
Co-authored-by: 孙忠健(20113660) <sunzj10@ke.com>
2021-06-06 22:06:02 +08:00
61af76b8fb [Log] fix log error when commit transaction in txn manager (#5937)
Co-authored-by: weizuo <weizuo@xiaomi.com>
2021-06-06 22:05:40 +08:00
f1e881e6f0 [Bug-fix] Show view stmt report error when view references invalid table (#5919) 2021-06-06 22:05:22 +08:00
a5c35eb20f [Bug] Fix the bug of null pointer exception of colocate join (#5961) 2021-06-04 10:19:58 +08:00
3ca6e386c7 [Function] Support Function current_timestamp(), curtime(), current_time() in FE. (#5946)
Support Function `current_timestamp()`, `curtime()`, `current_time()` in FE to do constant fold.
2021-06-03 18:39:19 +08:00
d790cc6a50 [BUG] Fixed the problem that substring function may access illegal address (#5952) 2021-06-03 18:38:10 +08:00
4ef1dbf394 [Bug] Fix lack of rdlock before rowset_with_max_version() in compaction log (#5953) 2021-06-03 10:01:35 +08:00
81ecf3d097 [Bug] Rebuild version graph of a tablet when there are too many orphan vertices (#5945)
The version information of a tablet is stored in memory
in an adjacency graph data structure.
As new versions are written and old versions are deleted,
the graph accumulates empty vertices with no associated edges (orphan vertices).

These orphan vertices should be removed, so the graph is rebuilt when there are too many of them.
2021-06-03 09:59:20 +08:00
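To illustrate the orphan-vertex problem described in #5945, here is a minimal sketch of an adjacency graph that drops orphans once too many accumulate. The class, the method names, the `end + 1` vertex convention and the `max_orphans` threshold are all assumptions made for illustration; this is not the Doris implementation.

```python
# Hypothetical sketch; names and conventions are illustrative, not Doris code.
from collections import defaultdict


class VersionGraph:
    """Adjacency-list graph whose vertices are version boundaries."""

    def __init__(self):
        self.edges = defaultdict(set)  # vertex -> set of neighbouring vertices

    def add_version(self, start, end):
        # Assumption: a version [start, end] links vertex `start` to `end + 1`.
        self.edges[start].add(end + 1)
        self.edges[end + 1].add(start)

    def delete_version(self, start, end):
        # Removing the edge can leave a vertex with no edges (an orphan).
        self.edges[start].discard(end + 1)
        self.edges[end + 1].discard(start)

    def orphan_count(self):
        return sum(1 for neighbours in self.edges.values() if not neighbours)

    def rebuild_if_needed(self, max_orphans):
        # Rebuild (keep only connected vertices) once too many orphans exist.
        if self.orphan_count() > max_orphans:
            kept = {v: n for v, n in self.edges.items() if n}
            self.edges = defaultdict(set, kept)
            return True
        return False


graph = VersionGraph()
for v in range(5):
    graph.add_version(v, v)      # five single-version edges
graph.add_version(0, 4)          # one merged version covering 0..4
for v in range(5):
    graph.delete_version(v, v)   # the old versions are deleted
orphans_before = graph.orphan_count()
rebuilt = graph.rebuild_if_needed(max_orphans=3)
```

Rebuilding only above a threshold keeps the common write path cheap while bounding the memory held by dead vertices.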
9ade3bbd8a Add Doris error code table (#5935) 2021-06-02 09:59:41 +08:00
4c0a98e8bf [BE] Optimize version retrieval efficiency. (#5831)
* [FE] Optimize version retrieval efficiency in high-frequency import/compaction scenarios.

* Jump out of the loop when encountering the reverse edge.
2021-06-02 09:58:21 +08:00
8956641296 [Docs] update PMC Members & Committers (#5941)
* update PMC Members & Committers
2021-06-01 18:11:34 +08:00
8e4b601ff2 [Bug] Fix the error when checking whether a Fragment is a colocate / bucket shuffle join (#5940) 2021-05-31 12:14:44 +08:00
cf2e0cf2c8 [Bug] Fix export job sometimes stuck in exporting state after timeout (#5932)
Fix #5931
The reason is that the method coordinate.exec() is sometimes not called when the job times out,
so the query profile of this coordinate object is not initialized,
which causes an NPE during the execution of ExportExportingTask.
2021-05-30 23:09:29 +08:00
ba868c610f [Optimize] Optimize some tablet scheduling logic (#5926)
1. The partitions set by the admin repair command are prioritized
   to ensure that the tablets of these partitions can be repaired as soon as possible.

2. Add an FE metric "query_begin" to monitor the number of queries submitted to Doris.
2021-05-30 23:08:59 +08:00
0da59aab53 [Bug] Fix bug of multi load and some issue about httpv2 (#5848)
To be compatible with http v1 so that users don't need to change their code.
2021-05-30 23:08:38 +08:00
63c99eb4cb [Cache][Enhancement] Assure sql cache only one version (#5793)
For PR #5792. This patch adds a new param `cache type` to distinguish the sql cache from the partition cache.
When updating the sql cache, we ensure that one sql key has only one cached version.
2021-05-28 13:45:47 +08:00
c844e602a7 [BUG] Fix the bug of Desc Query cause Unknown error and some doc revise (#5921) 2021-05-28 11:21:31 +08:00
a29dd42b47 [BUG][Document] Fix the bug that failed to build the help module (#5917)
There are multiple entries with the same key in the help documents, which causes the help module build to fail.
2021-05-27 22:07:15 +08:00
80f0b5fd1c [BUG] Fix calculation error when the memory parameter is a float value percentage (#5916)
When parsing memory parameters in `ParseUtil::parse_mem_spec`, convert the percentage to `double` instead of `int`.

The currently affected parameters include `mem_limit` and `storage_page_cache_limit`
2021-05-27 22:06:50 +08:00
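The effect of the fix in #5916 can be sketched as follows. This is a simplified Python stand-in, not the actual `ParseUtil::parse_mem_spec` code: the function signature, the `physical_mem_bytes` parameter and the percentage-only handling are assumptions for illustration.

```python
def parse_mem_spec(spec, physical_mem_bytes):
    """Return a byte count for a percentage spec such as '80%' or '12.5%'.

    Hypothetical sketch: only the percentage form is handled here.
    """
    if not spec.endswith("%"):
        raise ValueError("only percentage specs are handled in this sketch")
    percent = float(spec[:-1])  # float keeps the fractional part of '12.5'
    if not 0 <= percent <= 100:
        raise ValueError("percentage out of range")
    return int(physical_mem_bytes * percent / 100)


def parse_mem_spec_truncating(spec, physical_mem_bytes):
    """The buggy variant: the percentage is truncated to an integer first."""
    percent = int(float(spec[:-1]))  # '12.5' becomes 12
    return physical_mem_bytes * percent // 100


correct = parse_mem_spec("12.5%", 1000)
truncated = parse_mem_spec_truncating("12.5%", 1000)
```

With a total of 1000 bytes, the floating-point version yields 125 while the truncating version yields 120, which is the kind of calculation error the commit describes.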
f4ebac0210 [BUG] BE core when FE get_stream_load_record (#5913) 2021-05-27 22:06:26 +08:00
aa17d40865 [Bug-fix] Update correct data partition of fragment which contains Repeat Node (#5910)
The Repeat Node will change the data partition of fragment
  when the origin data partition of fragment is HashPartition.
The Repeat Node will generate some new rows.
The distribution of these new rows is completely inconsistent with the original data distribution;
  their distribution is RANDOM.

If the data distribution is not corrected,
  an error will occur when the agg node determines whether to perform colocate.
Wrong data distribution will cause the agg node to think that agg can be colocated,
  leading to wrong results.
For example, the following query cannot be colocated even though the distribution column of the table is k1:
```
SELECT k1, k2, SUM( k3 )
FROM table
GROUP BY GROUPING SETS ( (k1, k2), (k1), (k2), ( ) )
```
2021-05-27 22:06:10 +08:00
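Why the rows generated for #5910 break the original distribution can be seen in a small sketch. It assumes, for illustration only, that each grouping set produces one copy of every input row with the non-grouped columns set to NULL (`None` here); the function name and the two-node layout are hypothetical.

```python
def grouping_sets_rows(rows, columns, grouping_sets):
    """Emit one copy of each row per grouping set, nulling the other columns."""
    out = []
    for row in rows:
        for gset in grouping_sets:
            out.append(
                tuple(v if c in gset else None for c, v in zip(columns, row))
            )
    return out


columns = ("k1", "k2")
sets = [("k1", "k2"), ("k1",), ("k2",), ()]

# Two nodes, each holding a different k1 value (data is distributed by k1).
node_a = grouping_sets_rows([(1, 10)], columns, sets)
node_b = grouping_sets_rows([(2, 10)], columns, sets)

# Group keys that now exist on both nodes and so need a cross-node merge.
shared_groups = set(node_a) & set(node_b)
```

Here `shared_groups` contains `(None, 10)` and `(None, None)`: the same group key appears on both nodes, so an aggregation that only runs locally on each node would return partial results.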
4343354711 [BUG] Fix in memory table may cause a lot of CPU consumption when LRU Cache evict (#5908)
According to the LRU priority, the `lru list` is split into `lru normal list` and `lru durable list`,
and the two lists are traversed in sequence during LRU evict, avoiding invalid cycles.
2021-05-27 22:05:41 +08:00
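The two-list idea from #5908 can be sketched like this. The class below is a hypothetical Python illustration, not the BE cache code; the assumption is that durable entries are evicted only after the normal list has been exhausted.

```python
from collections import OrderedDict


class TwoListLRU:
    """LRU cache with separate 'normal' and 'durable' lists (sketch)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.normal = OrderedDict()   # oldest entry first
        self.durable = OrderedDict()  # oldest entry first

    def __len__(self):
        return len(self.normal) + len(self.durable)

    def put(self, key, value, durable=False):
        self.normal.pop(key, None)
        self.durable.pop(key, None)
        (self.durable if durable else self.normal)[key] = value
        self._evict()

    def get(self, key):
        for lru in (self.normal, self.durable):
            if key in lru:
                lru.move_to_end(key)  # mark as most recently used
                return lru[key]
        return None

    def _evict(self):
        # Walk the normal list first, then the durable list. No entry is
        # visited only to be skipped because of its priority.
        for lru in (self.normal, self.durable):
            while len(self) > self.capacity and lru:
                lru.popitem(last=False)


cache = TwoListLRU(capacity=2)
cache.put("a", 1, durable=True)
cache.put("b", 2)
cache.put("c", 3)  # over capacity: the oldest normal entry "b" is evicted
```

With a single mixed list, eviction would have to step over every durable entry each time; keeping them in their own list removes those wasted iterations.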
ce3ae764e5 [Bug] Bucket Shuffle Join may cause:Failed to send brpc batch, Not connected to 0.0.0.0:0 (#5901) 2021-05-27 22:05:15 +08:00
d958bbedc9 [Bug] Fix the problem that the result of query from the view is incorrect (#5860) (#5897)
Fix an issue where the priority of CompoundPredicates in a created view does not match the expectation.
2021-05-27 22:04:33 +08:00
0f4a39f82d [LOG]Hiding stack info of memory exceed in the log (#5896)
If a query exceeds its memory limit, detailed information about where the limit was exceeded is required.
However, it is not necessary to return the entire query stack to the end user.
The query stack only needs to be printed in the BE log.
2021-05-27 22:04:17 +08:00
3cb0174aa2 [Doc] Fix demo in 'CREATE TABLE' (#5865) 2021-05-27 22:03:51 +08:00
cbe1b48b71 [Docs] Fix typo in docs zh-CN CREATE TABLE.md (#5864) 2021-05-27 22:03:26 +08:00
ba38973209 use virtual hosted-style request to access object store (#5894)
* use virtual hosted-style requests to access the object store
2021-05-27 15:52:07 +08:00
832faa4377 [Docs] Fix wrong link (#5911) 2021-05-27 10:18:52 +08:00
a415134336 [Release] Release 0.14 and update 0.14 download url (#5906)
* update 0.14 download url
2021-05-26 13:55:40 +08:00
d6076af938 [BUG] fix BE coredump if result sink prepare failed (#5899) 2021-05-26 10:02:55 +08:00
6924637e64 [BUG] fix compression bug while compaction (#5893)
Because the maximum length of LZ4 compression is 2^32, it can cause some memory problems
2021-05-26 10:02:39 +08:00
fc1389240f [Doc]Modify the flink connector document's instructions on enabling http v2 (#5883)
[doc]Modify the flink connector document's instructions on enabling http v2
2021-05-26 10:01:31 +08:00
629e440a67 [Bug] Fix the bug of nullif function: (#5882)
1. Prevent nullif(98, null) from returning NULL in FE
2. Support DecimalV2 in the nullif function to get the right result
2021-05-26 10:01:17 +08:00
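For reference, the standard SQL semantics of `NULLIF` that #5882 relies on can be written as a short sketch, with Python's `None` standing in for NULL and `Decimal` standing in for a decimal column type. This models the expected results only; it is not the FE code.

```python
from decimal import Decimal


def nullif(a, b):
    """SQL NULLIF: return NULL when a = b is true, otherwise return a."""
    # A comparison with NULL is never true, so nullif(98, NULL) yields 98.
    if a is not None and b is not None and a == b:
        return None
    return a


kept = nullif(98, None)
equal_decimals = nullif(Decimal("1.10"), Decimal("1.1"))
different_decimals = nullif(Decimal("1.10"), Decimal("1.2"))
```

`kept` is 98, `equal_decimals` is `None` because the two decimals compare equal, and `different_decimals` keeps the first argument.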
510606ddd4 [DynamicPartition] Support specifying hot data partition (#5877)
In some scenarios, when users use dynamic partitions, they hope to use Doris' hierarchical storage
function at the same time.
For example, for the dynamic partition rule of partitioning by day, we hope that the partitions of the last 3 days
are stored on the SSD storage medium and automatically migrated to the HDD storage medium after expiration.

This CL adds a new dynamic partition property: "hot_partition_num".
This parameter is used to specify how many recent partitions need to be stored on the SSD storage medium.
2021-05-26 10:00:24 +08:00
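The intent of "hot_partition_num" in #5877 can be illustrated with a small sketch for day-based partitions. The function name and the exact boundary rule (today counts as the first hot partition) are assumptions for illustration; the real scheduling rules in Doris may differ.

```python
from datetime import date, timedelta


def storage_medium_by_partition(partition_days, today, hot_partition_num):
    """Map each daily partition to 'SSD' or 'HDD' (hypothetical sketch).

    Assumption: a partition is hot when it falls within the most recent
    `hot_partition_num` days, today included.
    """
    if hot_partition_num <= 0:
        return {day: "HDD" for day in partition_days}
    oldest_hot = today - timedelta(days=hot_partition_num - 1)
    return {
        day: "SSD" if day >= oldest_hot else "HDD"
        for day in partition_days
    }


today = date(2021, 5, 26)
days = [today - timedelta(days=i) for i in range(5)]
placement = storage_medium_by_partition(days, today, hot_partition_num=3)
```

With five daily partitions and `hot_partition_num=3`, the three most recent partitions map to SSD and the two older ones to HDD, matching the "last 3 days" example in the commit message.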
0b12218022 [Log] Change unauthorized access log to debug level (#5873) 2021-05-26 09:59:29 +08:00
9dd54b83b8 [optimize] avoid extra memory alloc in object pool (#5871) 2021-05-26 09:58:21 +08:00
ba69f7a7c8 [Command] [SQL] Add show database/table/partition id command (#5807)
In BE, when a problem occurs, the log contains the database id, table id and partition id,
but not the database name, table name or partition name.

In FE, there is also no way to find the database name/table name/partition name according to the
database id/table id/partition id. Therefore, this patch adds 3 new commands:

1. show database id;
mysql> show database 10002;
+----------------------+
| DbName               |
+----------------------+
| default_cluster:test |
+----------------------+

2. show table id;
mysql> show table 11100;
+----------------------+-----------+-------+
| DbName               | TableName | DbId  |
+----------------------+-----------+-------+
| default_cluster:test | table2    | 10002 |
+----------------------+-----------+-------+

3. show partition id;
mysql> show partition 11099;
+----------------------+-----------+---------------+-------+---------+
| DbName               | TableName | PartitionName | DbId  | TableId |
+----------------------+-----------+---------------+-------+---------+
| default_cluster:test | table2    | p201708       | 10002 | 11100   |
+----------------------+-----------+---------------+-------+---------+
2021-05-26 09:58:02 +08:00
a55b12da90 [Doc] Fix some flaws in hll-type document (#5795)
Fix some flaws in hll-type document.
2021-05-26 09:56:55 +08:00
1ec615c562 [BUG] Fixed some uninitialized variables (#5850)
Fixed some potential bugs caused by uninitialized variables
2021-05-25 10:34:35 +08:00
63662194ab [BUG] Fix Stream Load cost too much memory (#5875) 2021-05-25 10:34:10 +08:00
d0ca7b037c [Bug] NULL value in where and on clause should return EmptySetNode (#5872) 2021-05-24 12:32:59 +08:00
76eca9de56 [Bug] Kill the FE process when writing BDBJE journal failed (#5861)
1. When an OOM error occurs while writing bdbje, catch the error and exit the process.
2. Increase the timeout period of bdbje replica ack and change it to a configuration.
2021-05-22 23:38:47 +08:00
659d6347a0 [BUG] fix some extra memory in bitmap operate (#5857) 2021-05-22 23:38:28 +08:00
591d391bbc [Bug] Fix bug that the buffered reader may read at wrong position. (#5847)
The buffered reader's _cur_offset should be initialized to the same value as the inner file reader's,
to make sure that the reader starts reading at the right position.
2021-05-22 23:38:10 +08:00
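The offset issue fixed in #5847 can be reproduced with a minimal buffered reader. This is a hypothetical Python sketch built on `io.BytesIO`, not the BE reader; the point is the constructor line that takes the starting offset from the inner reader rather than assuming 0.

```python
import io


class BufferedReader:
    """Small read-ahead wrapper around a seekable inner reader (sketch)."""

    def __init__(self, inner, buffer_size=4):
        self.inner = inner
        self.buffer_size = buffer_size
        # Start where the inner reader currently is, not at offset 0.
        self.cur_offset = inner.tell()
        self.buffer = b""
        self.buffer_start = self.cur_offset

    def read(self, n):
        out = b""
        while len(out) < n:
            pos = self.cur_offset - self.buffer_start
            if pos >= len(self.buffer):
                # Refill the buffer from the current logical offset.
                self.inner.seek(self.cur_offset)
                self.buffer = self.inner.read(self.buffer_size)
                self.buffer_start = self.cur_offset
                if not self.buffer:
                    break  # end of file
                pos = 0
            chunk = self.buffer[pos:pos + n - len(out)]
            out += chunk
            self.cur_offset += len(chunk)
        return out


inner = io.BytesIO(b"headerPAYLOAD")
inner.seek(6)  # the inner reader has already skipped the header
reader = BufferedReader(inner)
payload = reader.read(7)
```

If `cur_offset` were initialized to 0 instead, the first `read` would return bytes from the header rather than the payload.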
07ad038870 [Feature][RoutineLoad] Support for consuming kafka from the point of time (#5832)
Support starting consumption from a specified point in time, instead of a specific offset, when creating a kafka routine load.
e.g.:
```
FROM KAFKA
(
    "kafka_broker_list" = "broker1:9092,broker2:9092",
    "kafka_topic" = "my_topic",
    "property.kafka_default_offsets" = "2021-10-10 11:00:00"
);

or

FROM KAFKA
(
    "kafka_broker_list" = "broker1:9092,broker2:9092",
    "kafka_topic" = "my_topic",
    "kafka_partitions" = "0,1,2",
    "kafka_offsets" = "2021-10-10 11:00:00, 2021-10-10 11:00:00, 2021-10-10 12:00:00"
);
```

This PR also reconstructed the analysis method of properties when creating or altering
routine load jobs, and unified the analysis process in the `RoutineLoadDataSourceProperties` class.
2021-05-22 23:37:53 +08:00
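Time-based offsets such as those in #5832 eventually have to become timestamps, since Kafka looks up offsets by time using epoch milliseconds. The sketch below shows one way to convert the property values. The function names are hypothetical, and treating the strings as UTC is an assumption; the actual job may interpret them in a configured time zone.

```python
from datetime import datetime, timezone


def datetime_offset_to_millis(text, tz=timezone.utc):
    """Convert 'YYYY-MM-DD HH:MM:SS' to epoch milliseconds (UTC assumed)."""
    parsed = datetime.strptime(text.strip(), "%Y-%m-%d %H:%M:%S")
    return int(parsed.replace(tzinfo=tz).timestamp() * 1000)


def parse_kafka_offsets(partitions, offsets):
    """Pair each partition id with its timestamp in epoch milliseconds."""
    ids = [int(p) for p in partitions.split(",")]
    times = [datetime_offset_to_millis(o) for o in offsets.split(",")]
    if len(ids) != len(times):
        raise ValueError("partitions and offsets must have the same length")
    return dict(zip(ids, times))


by_partition = parse_kafka_offsets(
    "0,1,2",
    "2021-10-10 11:00:00, 2021-10-10 11:00:00, 2021-10-10 12:00:00",
)
```

The length check mirrors the requirement that every listed partition gets exactly one starting point.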
5a06dba4f1 [Colocate plan][Step2] Colocate aggregation covers more situations (#5555)
The old colocate aggregation can only cover the case where the child is scan.
In fact, as long as the child's data distribution meets the requirements,
no matter what the plan node on the child node is, a colocate aggregation can be performed.

This PR also fixes the correct data partition attribute of fragment.
The data partition of fragment which contains scan node is Hash Partition rather than Random.
This modification is mainly to determine the possibility of colocate
through the correct distribution of child fragments.
2021-05-22 23:36:49 +08:00
d4cc5735ac [Bug-fix][Backup] Modify the persistence logic of backup and restore (#5853)
The old backup and restore persistence logic uses the expose annotation.
This annotation is meant to make serialization and deserialization ignore some variables.
However, it was used incorrectly, and gson did not ignore the variables that should have been ignored.
This results in duplicate initialization when FE is restarted.

This PR uses the Doris-wrapped Gson directly, which eliminates the use of the expose annotation.
It fixes sortedTabletInfoList being repeatedly initialized, which resulted in incorrect numbers.

Fixed #5852
2021-05-21 12:23:07 +08:00
109b55ee5f [Shell] Add build parallel option (#5819)
Add a build parallelism option so that the project can be built with a user-specified degree of parallelism instead of a fixed value.
2021-05-19 09:32:58 +08:00