Commit Graph

12498 Commits

Author SHA1 Message Date
df26fb2de4 [fix][alter table property] fix alter table property failed (#22791) 2023-08-10 17:12:42 +08:00
58e7952eea [refactor](load) use memtable writer in memtable memory limiter (#22780) 2023-08-10 17:08:47 +08:00
1c22742a14 [fix](compaction) remove check rowset overlapping in base compaction (#22599) 2023-08-10 16:36:33 +08:00
fd0c161081 [enhance](ColdHeatSeparation) forbid change storage policy to another one with different storage resource (#22519) 2023-08-10 16:32:09 +08:00
50fbe31f93 [fix](tablet report) fix not add replicas when a backend re join the cluster after changing its ip or port (#22700) 2023-08-10 15:29:28 +08:00
de5603da6b [regresstion][external]fix jdbc cases fail external 0809 (#22761)
fix jdbc cases fail external 0809
2023-08-10 15:23:30 +08:00
ec0cedab51 [opt](stats) Use single connect context for each olap analyze task
1. add some comment 
2. Fix potential NPE caused by deleting a running analyze job
3. Use single connect  context for each olap analyze task
2023-08-10 15:04:28 +08:00
f7d00d467a [fix](multicatlog) fix read hive/iceberg catalog on cosn & fix read data via broker (#22087)
* [fix](multicatlog) fix read hive/iceberg catalog on cosn & fix read data via broker

* Update FileSystemFactory.java
2023-08-10 14:44:53 +08:00
f2658dc7bd [Feature](multi-catalog) Truncate char or varchar columns if size is smaller than file columns or not found in the file column schema. (#22318)
Truncate char or varchar columns if size is smaller than file columns or not found in the file column schema by session var `truncate_char_or_varchar_columns`.
2023-08-10 14:37:20 +08:00
f1db6bd8c1 [feature](hive)append support for struct and map column type on textfile format of hive table (#22347)
1. append support for struct and map column type on textfile format  of hive table.
2. optimizer code that array column type.

```mysql
+------+------------------------------------+
| id   | perf                               |
+------+------------------------------------+
| 1    | {"key1":"value1", "key2":"value2"} |
| 1    | {"key1":"value1", "key2":"value2"} |
| 2    | {"name":"John", "age":"30"}        |
+------+------------------------------------+
```

```mysql
+---------+------------------+
| column1 | column2          |
+---------+------------------+
|       1 | {10, "data1", 1} |
|       2 | {20, "data2", 0} |
|       3 | {30, "data3", 1} |
+---------+------------------+
```
Summarizes support for complex types(support assign delimiter) :

1. array< primitive_type > and array< array< ... > >
2. map< primitive_type , primitive_type >
3. Struct< primitive_type , primitive_type ... >
2023-08-10 13:47:58 +08:00
57fb9799b5 [feature](agg) add aggregation function 'bitmap_agg' (#22768)
This function can be used to replace bitmap_union(to_bitmap(expr)), because bitmap_union(to_bitmap(expr)) need create many many small bitmaps firstly and then merge them into a single bitmap.
bitmap_agg will convert the column value into a bitmap directly. Its performance is better than bitmap_union(to_bitmap(expr)) . In our test , there is about 30% improvement.
2023-08-10 12:18:25 +08:00
94d563f04d [improvement](garbage sweep) garbage sweep sleep for a while to reduce io (#22762) 2023-08-10 12:11:50 +08:00
35dd787ed7 [improvement](transaction) abort txn when be lost heartbeat over 1 min (#22781) 2023-08-10 12:04:42 +08:00
432c8f1d6a [opt](stats) No more sync unknown stats since cannot serialize (#22775)
Gson can't serialize INFINITY under current configuration
2023-08-10 11:46:56 +08:00
c1bc2c289b [doc](stats) Add description for some new configure option in stats related docs (#22723) 2023-08-10 11:37:50 +08:00
f001b9d5c8 [enhance](multi-catalog) support multi name service when config hive catalog #21825
when create catalog with multi-servicename like below:
REATE CATALOG hive_prod_t1 PROPERTIES (
'type'='hms',
'hive.metastore.uris' = 'thrift://10.198.xxx:9011,thrift://11.11.xxx:9001,thrift://10.198.xxx:9011',
'hadoop.username' = 'user',
'dfs.nameservices'='ns1007,ns1017',
'dfs.ha.namenodes.ns1007'='nn1,nn2',
'dfs.namenode.rpc-address.ns1007.nn1'='10.198.xxxx:8120',
'dfs.namenode.rpc-address.ns1007.nn2'='10.198.xxx:8120',
'dfs.client.failover.proxy.provider.ns1007'='org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider',
'dfs.ha.namenodes.ns1017'='nn1,nn2',
'dfs.namenode.rpc-address.ns1017.nn1'='10.198.xxxx:8120',
'dfs.namenode.rpc-address.ns1017.nn2'='10.198.xxxx:8120',
'dfs.client.failover.proxy.provider.ns1017'='org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider'
);

the result will be: ERROR 1105 (HY000): errCode = 2, detailMessage = Missing dfs.ha.namenodes.ns1007,ns1017 property
2023-08-10 10:48:08 +08:00
eafdab0cfd [Enhancement](tvf) Add frontends_disks table-valued-function (#22568)
---------

Co-authored-by: yuxianbing <yuxianbing@yy.com>
Co-authored-by: yuxianbing <iloveqaz123>
2023-08-10 10:40:24 +08:00
b90a7748a6 [Feature](Job Schedule)implement Transient Task Register (#22665)
Implement the TransientTaskRegister to support submitting transient tasks which do not require a timer trigger.
rename some class:
TimerTaskDisruptor -> TaskDisruptor
TimerTaskEvent -> TaskEvent
TimerTaskExpirationHandler -> TaskHandler
AsyncJobManager -> TimerJobManager
MemoryTask -> TransientTask
2023-08-10 10:34:13 +08:00
8591257d74 [fix](nereids) parallel instance number is set to 1 incorrectly (#22748)
make PlanNode.getNumInstance() abstract to force every PlanNode specify how to define its numInstance.
By default, PlanNode.numInstance is 1. PlanNode except exchangeNode should not use this default value directly. PlanNode.numInstance is used for PlanNode which will change numInstance like exchange node.
2023-08-10 10:17:37 +08:00
8a5021c235 [Fix](Sql)NPE when the Delete statement does not specify a where condition (#22766)
Execute Sql

delete from test_table.
2023-08-09 11:51:46,586 WARN (mysql-nio-pool-7|540) [StmtExecutor.analyze():987] Analyze failed. stmt[25, 519f916eeb94a8b-afe8e1094fb39fc1]
java.lang.NullPointerException: null
        at org.apache.doris.rewrite.ExprRewriter.applyRuleBottomUp(ExprRewriter.java:236) ~[classes/:?]
        at org.apache.doris.rewrite.ExprRewriter.applyRule(ExprRewriter.java:226) ~[classes/:?]
        at org.apache.doris.rewrite.ExprRewriter.applyRuleRepeatedly(ExprRewriter.java:216) ~[classes/:?]
        at org.apache.doris.rewrite.ExprRewriter.rewrite(ExprRewriter.java:166) ~[classes/:?]
        at org.apache.doris.rewrite.ExprRewriter.rewrite(ExprRewriter.java:151) ~[classes/:?]
        at org.apache.doris.analysis.DeleteStmt.analyze(DeleteStmt.java:127) ~[classes/:?]
        at org.apache.doris.qe.StmtExecutor.analyze(StmtExecutor.java:983) ~[classes/:?]
        at org.apache.doris.qe.StmtExecutor.executeByLegacy(StmtExecutor.java:660) ~[classes/:?]
        at org.apache.doris.qe.StmtExecutor.execute(StmtExecutor.java:448) ~[classes/:?]
        at org.apache.doris.qe.StmtExecutor.execute(StmtExecutor.java:419) ~[classes/:?]
        at org.apache.doris.qe.ConnectProcessor.handleQuery(ConnectProcessor.java:441) ~[classes/:?]
        at org.apache.doris.qe.ConnectProcessor.dispatch(ConnectProcessor.java:589) ~[classes/:?]
        at org.apache.doris.qe.ConnectProcessor.processOnce(ConnectProcessor.java:826) ~[classes/:?]
        at org.apache.doris.mysql.ReadListener.lambda$handleEvent$0(ReadListener.java:52) ~[classes/:?]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) ~[?:?]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) ~[?:?]
        at java.lang.Thread.run(Thread.java:829) ~[?:?]
Fix Result

[HY000][1105] errCode = 2, detailMessage = Where clause is not set
Affected version

2.0-Alpha +
2023-08-10 10:15:49 +08:00
b25d52b736 [feature](cast) remove some unused in functioncast and support some function in nereids (#22729)
1 ConvertImplGenericFromString do not need a template StringColumnType
2 remove timev1 in function cast
3 support time_to_sec , sec_to_time in nereids
2023-08-10 10:10:32 +08:00
919bfd73f1 [improvement](multi-catalog)add scanner isolation class loader (#22247)
Add scanner isolation class loader to make each plugin non-conflicting.
The BE will get scanner classes by JNI call and use JniClassLoader load them.
In the last version,we always get canner classes from the system class path by default,
so it cannot isolate the classes for each scanner
2023-08-10 10:02:46 +08:00
e6b835617b [fix](regression) fix export case (#22790)
Fix a export case because the machine that performs the export is randomized when there are multiple nodes.
2023-08-10 10:00:17 +08:00
df1f67d835 [improve](insert) Support server side prepare insert stmt (#22353) 2023-08-10 09:59:17 +08:00
768088c95e [refactor](udaf) refactor call udaf function and support map type in return (#22508) 2023-08-09 22:44:07 +08:00
0d75a54d6c [fix](compaction) fix null pointer if single replica compaction gets rowset version from peer (#22717) 2023-08-09 20:55:24 +08:00
3b7a0a4713 [fix](cache) Fix enable sql cache lead to FE Full GC or OOM #22769 2023-08-09 19:24:04 +08:00
Pxl
89dc1f73b2 [Bug](materialized-view) make mv matched when preagg have value column predicate contained in mv'where clause (#22779)
1. make mv matched when preagg have value column predicate contained in mv
'where clause
2. fix `org.apache.doris.common.AnalysisException: errCode = 2, detailMessage = BITMAP_UNION need input a bitmap column, but input INVALID_TYPE`
3. make the error message more detailed when create mv stmt parse failed
2023-08-09 19:17:55 +08:00
HB
5147c096ef [Enhancement] Add an API to query session information for all FEs (#20134)
Currently, Doris only has one interface for querying specific FE session information, and many times we need to know how many session information there are in the current cluster, so I added this API.

`
GET /rest/v1/session/all

{
"msg": "success",
"code": 0,
"data": {
"column_names": ["FE", "Id", "User", "Host", "Cluster", "Db", "Command", "Time", "State", "Info"],
"rows": [{
"FE": "10.14.170.23",
"User": "root",
"Command": "Sleep",
"State": "",
"Cluster": "default_cluster",
"Host": "10.81.85.89:31465",
"Time": "230",
"Id": "0",
"Info": "",
"Db": "db1"
},
{
"FE": "10.14.170.24",
"User": "root",
"Command": "Sleep",
"State": "",
"Cluster": "default_cluster",
"Host": "10.81.85.88:61465",
"Time": "460",
"Id": "1",
"Info": "",
"Db": "db1"
}]
},
"count": 2
}
`
2023-08-09 19:02:45 +08:00
e6a860fc9e [memo](nereids) record the chosen group expression in Group (#22661)
1. remember the chosen plan in group
2. set groupId after RecomputeLogicalPropertiesProcessor
2023-08-09 18:44:46 +08:00
2019bb3870 [fix](bitmap) fix wrong result of bitmap intersect functions (#22735)
* [fix](bitmap) fix wrong result of bitmap intersect functions

* fix test case
2023-08-09 18:31:24 +08:00
21beebde7d [fix](taskgroup) Fix task group overcommit memory GC profile (#22764) 2023-08-09 18:29:46 +08:00
2a13d15d20 [feature](Nereids): disable join order when the join number > 63 (#22708) 2023-08-09 17:09:38 +08:00
9422494064 [docs](docs)Rename Title and URL of HLL Functions (#22728) 2023-08-09 15:53:39 +08:00
58ef388c32 [docs](docs)Rename Title and URL of JSON Functions (#22732) 2023-08-09 15:53:25 +08:00
af5f3ae2a6 [docs](docs)Rename Title & URL and Change Category Name as Numeric of Math Functions (#22733) 2023-08-09 15:52:49 +08:00
2fb7aba9bc [docs](docs)Rename Title and URL of IP Functions (#22741) 2023-08-09 15:52:35 +08:00
910863b329 [docs](docs) Rename Window Functions (#22742) 2023-08-09 15:52:22 +08:00
780ba83d91 [docs](docs)Rename the Files Without Category of Sql Functions (#22746) 2023-08-09 15:51:47 +08:00
61e661d389 [docs](docs)Rename Title and URL of Table Functions (#22747) 2023-08-09 15:51:15 +08:00
c443bce141 [docs](docs)Delete Dash Between Title of Benchmark (#22751) 2023-08-09 15:51:01 +08:00
bf29110856 [docs](docs)Rename Title of FAQ-CN Version (#22752) 2023-08-09 15:50:44 +08:00
4332e15800 [docs](docs)Rename Title and URL of Hash Functions (#22726) 2023-08-09 15:50:23 +08:00
ed91ce5b1a [docs](docs)Rename Title and URL of Conditional Functions (#22725) 2023-08-09 15:49:11 +08:00
1625a7993c [docs](docs)Rename Title and URL of Bitmap Functions (#22721) 2023-08-09 15:48:16 +08:00
1cd99c27e2 [CI](docs)Add deadlink check (#22736) 2023-08-09 14:58:37 +08:00
77d3d4e324 [fix](cache) add sql cache conf cache_result_max_data_size (#22645)
Only the maximum number of rows in sql cache cache_result_max_row_count is not enough. If a row of data is too large, FE may OOM.
2023-08-09 14:46:23 +08:00
1d1077c3b6 [bugfix](fd) Recycle the segment file fds directly when delete stale rowset (#22705) 2023-08-09 14:45:56 +08:00
19a2617d70 [docs](streamload) improve some formatting (#22659) 2023-08-09 14:38:22 +08:00
690a519742 [fix](Nereids) disable or expansion when pipeline engine is disable (#22719) 2023-08-09 14:33:50 +08:00