Commit Graph

18429 Commits

Author SHA1 Message Date
29d4e8ee90 [Fix](Nereids) fix test leading change disable join reorder parameter (#23657)
Problem:
When running the pipeline, test_leading fails randomly.
Reason:
A physical distribute was generated and chosen as the best plan because no statistics are available for an empty table. This produced unexpected results, because the order in the memo cannot be predicted.
Solved:
Add statistics for the columns used in test_leading, and retry repeatedly in the pipeline.
2023-10-15 22:59:45 -05:00
Pxl
d00d029ffb Separate fixed key hash map context creator (#25438)
Separate fixed key hash map context creator
2023-10-16 11:20:30 +08:00
c482c22a74 [case](regresscases) add regress cases for nested type nested type with csv format (#25355)
This PR:
1. Fixes a heap-use-after-free caused by calling podarray push_back() with back() as the argument: when the podarray has reached capacity, push_back() may free the heap memory that back() refers to.
2. Adds cases for the CSV format with nested types. The CSV files come in two forms: without quotes, or written like JSON text.
2023-10-16 11:13:44 +08:00
934d82816c [fix](Nereids) add int type alias 'integer' (#25376) 2023-10-15 22:12:44 -05:00
4c57c31c5c [fix](Nereids) count should not accept complex and json type (#25354) 2023-10-15 22:08:35 -05:00
dfc7d04626 [fix](functions) add quantile_state_empty function signature (#25306) 2023-10-16 11:05:48 +08:00
9649e09aaa [feature](function) support bitmap type in min/max_by agg function (#25430)
support bitmap type in min/max_by agg function
2023-10-16 11:05:32 +08:00
97c0af1a80 [fix](build) aarch64 compilation fix # (#25443)
Issue: #25442

Include execinfo when compiling on aarch64.
2023-10-16 09:53:50 +08:00
08f305dd79 [chore](build) Fix compilation errors reported by GCC-13 (#25439)
1. Fix lots of compilation errors reported by GCC-13.
2. Fix the workflow BE UT (macOS).
2023-10-15 07:57:36 -05:00
7ea456ef91 [fix](insert) make group commit wal_manager exit elegantly (#25250) 2023-10-14 23:14:06 +08:00
471cf2c48b [improvement](auth) support show view priv (#25370)
Currently, a user who has select_priv or load_priv can run show create table view_name,
but this is not safe, so add show_view_priv for show create table view_name.

MySQL SHOW VIEW description: https://dev.mysql.com/doc/refman/8.0/en/privileges-provided.html#priv_show-view
2023-10-14 22:37:51 +08:00
ed3e8f9355 Fix a naming conflict in ccr regression test (#25436) 2023-10-14 22:28:59 +08:00
03316e2355 [fix](fe rest api)api gets execution plan, table name case problem (#25112)
When the user has configured the parameter lower_case_table_names so that the case of table names is ignored, a query run from a SQL client can use the table name in either case.
But when a Connector is used to read Doris data, the table name must match the stored case exactly; otherwise an error is reported.
2023-10-14 19:48:24 +08:00
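The table-name behavior described in #25112 can be pictured with a small lookup sketch. This is only an illustration in Python, not the FE code: `resolve_table_name` and the `ignore_case` flag are hypothetical names standing in for whatever the REST API does once lower_case_table_names is configured to ignore case.

```python
def resolve_table_name(requested, existing_tables, ignore_case):
    """Return the stored table name that matches `requested`.

    Hypothetical helper: `ignore_case` stands in for a
    lower_case_table_names setting that ignores table-name case.
    """
    # An exact match always wins.
    if requested in existing_tables:
        return requested
    if ignore_case:
        lowered = requested.lower()
        for name in existing_tables:
            if name.lower() == lowered:
                return name
    raise KeyError(f"table not found: {requested}")


tables = ["Orders", "customers"]
print(resolve_table_name("orders", tables, ignore_case=True))
```

With `ignore_case=False`, the same call raises `KeyError`, which mirrors the error a Connector would see when the case does not match.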
e5ef0aa6d4 [refactor](mysql result format) use new serde framework to tuple convert (#25006) 2023-10-14 19:46:42 +08:00
b946521a56 [enhancement](regression-test) add single stream multi table case (#25360) 2023-10-14 10:59:50 +08:00
de03c152ce [fix](thrift)cancel thrift msg max size limit (#25194)
On Thrift 0.14.0+, TConfiguration must be used to raise the max message size.
See https://github.com/apache/arrow/pull/11123/files
2023-10-13 20:21:36 +08:00
6bb0c918eb [fix](statistics) use replication_num as replica num (#25325) 2023-10-13 20:14:25 +08:00
3e83fb8729 [enhancement](compaction) record base compaction schedule time and status (#25283) 2023-10-13 19:51:55 +08:00
9f67bcf380 [chore](format) fix tablet_meta.cpp (#25410)
Fix the format error introduced by #25124.
The clang-format check had a bug at the time, so PR #25124 was able to pass it.
2023-10-13 17:58:54 +08:00
b56eecb341 update secure flag to false (#25412)
update secure flag to false
2023-10-13 17:00:58 +08:00
789210bc38 [chore](format) Refactor BaseTablet _full_name by using fmt replacing stringstream (#25400)
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2023-10-13 03:59:03 -05:00
ac8fbdd53c [pipelineX](fix) Fix use-after-free in shuffling (#25409) 2023-10-13 16:57:34 +08:00
37dbda6209 [pipelineX](refactor) Use class template to simplify join (#25369) 2023-10-13 16:51:55 +08:00
Pxl
f4e2eb6564 remove unused code and adjust clang-tidy checks (#25405)
remove unused code and adjust clang-tidy checks
2023-10-13 16:27:37 +08:00
1a25bb65b0 [fix](case) change dynamic_partition.time_unit from day to month to avoid the error that the intert data not in partition (#25361)
[fix](case) change dynamic_partition.time_unit from day to month to avoid the error that the intert data not in partition
Co-authored-by: stephen <hello-stephen@qq.com>
2023-10-13 16:02:17 +08:00
2ec53ff60e [fix](multi-table) fix single stream multi table load can not finish (#25379) 2023-10-13 15:47:16 +08:00
283bd59eba [improvement](scanner) Remove the predicate that is always true for the segment (#25366)
By utilizing the zonemap index of the segment, we can ascertain if a predicate is always true. For example, if the segment’s maximum value is 100 and the predicate is col < 101, then this predicate is always true for this segment.
2023-10-13 15:25:38 +08:00
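The always-true check described in #25366 can be sketched as follows. This is a minimal Python illustration rather than the scanner code: `ZoneMap` and `is_always_true` are hypothetical names, only the four range comparisons are handled, and the column is assumed to contain no NULLs.

```python
from dataclasses import dataclass


@dataclass
class ZoneMap:
    """Per-segment min/max summary (hypothetical, simplified)."""
    min_value: int
    max_value: int


def is_always_true(zone, op, literal):
    """Return True if `col <op> literal` holds for every row in the segment.

    Assumes the column has no NULLs; unknown operators are
    conservatively reported as not always true.
    """
    if op == "<":
        return zone.max_value < literal
    if op == "<=":
        return zone.max_value <= literal
    if op == ">":
        return zone.min_value > literal
    if op == ">=":
        return zone.min_value >= literal
    return False


segment = ZoneMap(min_value=1, max_value=100)
print(is_always_true(segment, "<", 101))  # True: max 100 < 101
print(is_always_true(segment, "<", 100))  # False: the row with 100 fails
```

A predicate that is always true for a segment does not need to be evaluated row by row for that segment, which is the saving the commit is after.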
cee7a6889f [test](fix) case bug (#25363) 2023-10-13 15:25:15 +08:00
9cc0e9526a [enhancement](merge-on-write) consider version count on size-based cu compaction policy (#25352) 2023-10-13 14:52:21 +08:00
96f31ae9a7 [Docs](merge-on-write) Add more docs for partial update using native insert statement (#25356) 2023-10-13 14:48:51 +08:00
6298f90347 [ecosystem](doc) mysql synchronization example add mysql-conf port (#24666) 2023-10-13 01:36:26 -05:00
522faa8cd2 [fix](jni) the offset in map type is int64 (#25394)
The offset in a map type column is int64, but #24810 wrote it as int32, causing an error like:
2023-10-13 14:23:17 +08:00
fc40788018 [enhancement](merge-on-write) refine tablet meta_lock usage and add some trace log (#25124) 2023-10-13 14:22:07 +08:00
6757d2f361 Revert "[Enhancement](show-backends-disks) Add show backends disks (#24229)" (#25389)
This reverts commit 21223e65c59c23cfcb9e8ab610ea321168bcb75a.
2023-10-13 14:08:45 +08:00
6f9a084d99 [Fix](Outfile) Use data_type_serde to export data to parquet file format (#24998) 2023-10-13 13:58:34 +08:00
4f65a9c425 [fix](auth)fix not display be_port (#25197)
Fix be_port not being displayed for users who have ADMIN_PRIV.
2023-10-13 11:56:00 +08:00
509a79988e [FIX](regresstest) fix cases for test_nested_types_insert_into_with_s3 (#25228) 2023-10-13 11:39:29 +08:00
ffacbe7d74 [feature](thrift) Add FE thrift rpc redirect master address (#25371)
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2023-10-13 11:17:46 +08:00
aa0b74d63a [improvement](fe and broker) support specify broker to getSplits, check isSplitable, file scan for HMS Multi-catalog (#24830)
I want to use Doris Multi-catalog to accelerate HMS queries. My organization has a custom distributed file system, and we think wrapping the fs access differences into the broker (listLocatedFiles, openReader..) would be an elegant approach.

This PR introduces the HMS catalog conf `bind.broker.name`. If this conf is set, file split and query scan operations are sent to the broker.

Usage:
Create an HMS catalog that uses a broker:
```
CREATE CATALOG hive_catalog_broker PROPERTIES (
    'type'='hms',
    'hive.metastore.uris' = 'thrift://xxx',
    'broker.name' = 'hdfs_broker'
);
```
When we query this catalog, file split and query scan requests are sent to the broker `hdfs_broker`.

More details about this PR:
1. Introduce the HMS catalog property `bind.broker.name` to specify the broker that does the remote path work. When `broker.name` is set, `enable.self.splitter` must be `true` to ensure that the file splitting process is executed in FE.
2. Introduce two more interfaces to the broker service:
- `TBrokerIsSplittableResponse isSplittable(1: TBrokerIsSplittableRequest request)`, which helps invoke the input format's `isSplitable` interface.
- `TBrokerListResponse listLocatedFiles(1: TBrokerListPathRequest request)`, which helps do `listFiles` or `listLocatedStatus` for the remote file system.
3. Three parts of the overall processing are executed in the broker:
- Check whether the path with the specified input format name `isSplittable`.
- `listLocatedFiles` of table / partition locations.
- `OpenReader` for the specified file splits.

Co-authored-by: chenlinzhong <490103404@qq.com>
2023-10-13 11:04:38 +08:00
ed67d5a2c2 [docs](developer-guide) Improve the be-vscode-gdb document (#25192)
Add miDebuggerPath to the document so that users can set the gdb path.
If miDebuggerPath is not set, VS Code may pick a gdb with a low version.

ref: https://code.visualstudio.com/docs/cpp/launch-json-reference#_midebuggerpath
2023-10-13 11:03:46 +08:00
a30d30e7b5 [improvement](resource-tag) limit the default user's resource tag to 'default' (#25331)
Previously, if the user property `'resource_tags.location'` was not set, the user could use Backends with any resource tag.
This can be confusing: when the DBA assigns some of the Backends to resource group A, existing users
should not be able to use group A until their `'resource_tags.location'` is set.

So this PR changes the behavior: if the user property `'resource_tags.location'` is not set, the user can only use the
Backends with the `default` tag.
2023-10-13 10:50:00 +08:00
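The behavior change in #25331 can be sketched in a few lines. This is a hypothetical Python illustration, not the FE implementation: `allowed_backends`, the backend names, and the tag `group_a` are made up for the example, and an empty tag set stands in for an unset `'resource_tags.location'`.

```python
DEFAULT_TAG = "default"


def allowed_backends(user_tags, backends):
    """Return the names of the Backends a user may use.

    `user_tags` is the set of tags from the user's
    'resource_tags.location' property (empty if the property is unset).
    `backends` maps a Backend name to its resource tag.
    """
    # New behavior: an unset property falls back to the `default` tag
    # instead of matching Backends with any tag.
    effective_tags = user_tags if user_tags else {DEFAULT_TAG}
    return sorted(name for name, tag in backends.items() if tag in effective_tags)


backends = {"be1": "default", "be2": "group_a", "be3": "default"}
print(allowed_backends(set(), backends))        # ['be1', 'be3']
print(allowed_backends({"group_a"}, backends))  # ['be2']
```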
11bbeb9a21 [Enhance](resource group)db support replication_allocation (#25195)
- DB supports replication_allocation: when creating a table, if neither `replication_num` nor `replication_allocation` is set, the value from the DB is used.
- Fix partition properties disappearing when the table partition is not null.
2023-10-13 10:24:01 +08:00
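The fallback order described in #25195 can be sketched as below. This is an assumption-laden Python illustration, not the FE code: `effective_replication` is a hypothetical helper, the property values are sample strings, and returning `None` when nothing is set simply leaves the system default to the caller, which the commit message does not describe.

```python
def effective_replication(table_props, db_props):
    """Pick the replication setting for a new table.

    Table-level `replication_allocation` or `replication_num` wins;
    otherwise the DB-level `replication_allocation` is used.
    Returns a (key, value) pair, or None if nothing is set.
    """
    for key in ("replication_allocation", "replication_num"):
        if key in table_props:
            return key, table_props[key]
    if "replication_allocation" in db_props:
        return "replication_allocation", db_props["replication_allocation"]
    return None  # assumption: the caller applies the system default


db_props = {"replication_allocation": "tag.location.default: 2"}
print(effective_replication({}, db_props))
print(effective_replication({"replication_num": "3"}, db_props))
```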
Pxl
26f50f4f0f fix heap-use-after-free on map_agg (#25380)
fix heap-use-after-free on map_agg
2023-10-13 00:19:25 +08:00
1073ef22f3 [fix](insert) improve group_commit related tests (#25319) 2023-10-12 21:19:29 +08:00
21223e65c5 [Enhancement](show-backends-disks) Add show backends disks (#24229)
* Add a statement to query the disk information corresponding to the data directories of a BE node

[mysql]->'show backends disks;'
+-----------+-------------+------------------------------+---------+-----------+---------------+--------------+-------------------+---------+
| BackendId | Host        | RootPath                     | DirType | DiskState | TotalCapacity | UsedCapacity | AvailableCapacity | UsedPct |
+-----------+-------------+------------------------------+---------+-----------+---------------+--------------+-------------------+---------+
| 10002     | 10.xx.xx.90 | /home/work/output/be/storage | STORAGE | ONLINE    | 7.049 TB      | 2.478 TB     | 4.571 TB          | 35.16 % |
| 10002     | 10.xx.xx.90 | /home/work/output/be         | DEPLOY  | ONLINE    | 7.049 TB      | 2.478 TB     | 4.571 TB          | 35.16 % |
| 10002     | 10.xx.xx.90 | /home/work/output/be/log     | LOG     | ONLINE    | 7.049 TB      | 2.478 TB     | 4.571 TB          | 35.16 % |
+-----------+-------------+------------------------------+---------+-----------+---------------+--------------+-------------------+---------+
2023-10-12 20:24:45 +08:00
8825aa7543 [fix](regression test) use double quota for numbers #25365 2023-10-12 19:25:20 +08:00
66db3c9deb [Fix](mvn source) Fix fe compile java-cup and cup-maven-plugin not found #25348
Use the official address.
2023-10-12 19:21:55 +08:00
0a38546596 [opt](Nereids) reject group commit insert temporarily (#25359)
Group commit insert was introduced by PR #22829. Since Nereids does not
support it yet, we temporarily forbid it on Nereids until it is implemented.
2023-10-12 06:20:59 -05:00
Pxl
1a0344df16 [Improvement](hash) refactor of hash map context (#24966)
refactor of hash map context
2023-10-12 18:10:21 +08:00
04bda138d6 [Enhance](regression)add broker load case (#25350) 2023-10-12 17:59:21 +08:00