doris

Author	SHA1	Message	Date
slothever	26e332c608	[fix](multi-catalog)add exception for unsupported hive input format (#25490 ) add exception for unsupported hive input format	2023-10-17 22:53:53 +08:00
Mingyu Chen	b76e23fb34	[improvement](meta) allow to ignore unknown image module (#25450 ) Add new FE config `ignore_unknown_metadata_module`. Default is false. If set to true, when reading metadata image file, and there are unknown modules, these modules will be ignored and skipped. This is mainly used in downgrade operation, old version can be compatible with new version Image file.	2023-10-17 22:53:31 +08:00
slothever	18c2a13e09	[fix](multi-catalog)fix maxcompute partition filter and session creation (#24911 ) add maxcompute partition support fix maxcompute partition filter modify maxcompute session create method	2023-10-17 22:36:10 +08:00
minghong	d287f53d77	[fix](nereids)in physical plan, print join class simple name not full name #25515	2023-10-17 20:25:14 +08:00
starocean999	9b1cdd3230	[fix](planner) mark join slot should always be nullable (#25433 )	2023-10-17 06:14:13 -05:00
minghong	8eff1486bd	[feature](nereids)print query id with memo and physical tree (#25501 ) print query id with memo and physical tree when dump_nereids_memo switched on. This is used for regression test.	2023-10-17 05:06:11 -05:00
morrySnow	9d6b2dceb2	[fix](Nereids) non-slot filter should not be push through aggregate (#25525 )	2023-10-17 05:02:26 -05:00
morrySnow	af8832389f	[feature](Nereids) add 4 array functions (#25488 ) - array_concat - array_pushback - array_pushfront - array_zip	2023-10-17 04:45:15 -05:00
zhangdong	f38f5f50eb	[fix](ipv6)fix can not resolve host and port (#25254 ) for ipv6,address should be [ip]:port instead of ip:port	2023-10-17 15:46:45 +08:00
zy-kkk	652d6c57c0	[fix](jdbc catalog) fix handle oracle date format (#25487 )	2023-10-17 02:10:28 -05:00
谢健	4d12d8885e	[feature](Nereids): graphSimplifier should compare edge1BeforeEdge2 and edge2BeforeEdge1 (#25416 )	2023-10-17 14:10:21 +08:00
minghong	0ee06f30b0	[feature](nereids)Ignore some node in 'explain shape plan' command (#25485 ) if set ignore_shape_nodes='PhysicalDistribute, PhysicalProject' then explain shape plan will not print project and distribute node	2023-10-17 11:57:36 +08:00
jakevin	410441b516	[enhancement](Nereids): remove LAsscom in Bushy Tree RuleSet (#25465 ) - Bushy Tree RuleSet don't need LAsscom - fix bug: rule pattern shouldn't use same name	2023-10-17 11:22:52 +08:00
zhangstar333	384fddb2ff	[test](case)add some debug log in mv case (#25458 ) * [test](case)change the insert stmt in mv case	2023-10-17 11:04:45 +08:00
Jibing-Li	1130317b91	[Improvement](statistics)Collect stats for hive partition column using metadata (#24853 ) Hive partition columns' stats could be calculated from hive metastore data. Doesn't need to execute sql to get the stats. This PR is using hive partition metadata to collect partition column stats.	2023-10-17 10:31:57 +08:00
Tiewei Fang	85b8497624	[fix](Tvf) return empty set when tvf queries an empty file or an error uri (#25280 ) ### Before: return errors when tvf queries an empty file or an error uri: 1. get parsed schema failed, empty csv file 2. Can not get first file, please check uri. ### Now: we just return empty set when tvf queries an empty file or an error uri. ```sql mysql> select * from s3( "uri" = "https://error_uri/exp_1.csv", "s3.access_key"= "xx", "s3.secret_key" = "yy", "format" = "csv") limit 10; Empty set (1.29 sec) ```	2023-10-17 09:52:53 +08:00
yujun	a194a15442	[improvement](tablet schedule) colocate balance between all groups (#23543 )	2023-10-17 09:33:52 +08:00
yujun	f9a80ecdab	[improvement](sync version) fe sync version with be (#25236 )	2023-10-16 20:34:25 +08:00
Pxl	72920fbd1d	[Improvement](materialized-view) set job failed when toAgentTaskRequest meet error (#25358 ) set job failed when toAgentTaskRequest meet error	2023-10-16 20:10:52 +08:00
zclllyybb	f9df3bae61	[Enhancement](functions) change some nullable mode and clear some smooth upgrade (#25334 )	2023-10-16 19:50:17 +08:00
starocean999	7fd876f3a2	[fix](planner)should call SlotRef'smaterializeSrcExpr() method if the slotRef is materialized (#25467 )	2023-10-16 19:42:12 +08:00
JingDas	e3d0e55794	[feature-wip] (Nereids) Support transforming trino dialect SQL to logical plan (#21855 ) Support transforming trino dialect SQL to logical plan (#21854) ## Proposed changes Issue Number: #21854 Use io.trino.sql.tree.AstVisitor as vistor, visit coorresponding trino node and transform it to doris logical plan. ## Further comments Here are some examples for function transforming as following: ascii('a') function is in doris and codepoint('a') funtion in trino, they have the same feature and have the same method signature, so we can use [TrinoFnCallTransformer](`3b37b76886/fe/fe-core/src/main/java/org/apache/doris/nereids/parser/trino/TrinoFnCallTransformer.java`) to handle them. another example for ComplexTransformer as following: date_diff('second', TIMESTAMP '2020-12-25 22:00:00', TIMESTAMP '2020-12-25 21:00:00')" fuction in trino and seconds_diff(2020-12-25 22:00:00, 2020-12-25 21:00:00)") fuction in doris. They have different method signature, we cant not handle it by TrinoFnCallTransformer simply and we should handle it by individual complex transformer [DateDiffFnCallTransformer](`3b37b76886/fe/fe-core/src/main/java/org/apache/doris/nereids/parser/trino/DateDiffFnCallTransformer.java`).	2023-10-16 05:10:55 -05:00
minghong	cf073ec8ce	[runtimefilter](nerieds)support Non equal runtime filter for nested loop join #25193	2023-10-16 17:49:47 +08:00
Pxl	d41e839ea0	[Chore](sink) add index number check for table sink (#25461 ) add index number check for table sink	2023-10-16 17:03:26 +08:00
AKIRA	9deda929b9	[refactor](stats) Use id instead name in analysis info (#25213 )	2023-10-16 03:49:53 -05:00
qiye	b2e3ecb81d	[opt](load)change `load_to_single_tablet` tablet search algorithm from random to round-robin (#25256 ) At present, `load_to_singlt_tablet` import implementation refers to simple random number remainder, which cannot achieve true averaging. This will lead to uneven disk IO and uneven use of cluster resources. To solve this problem, we are preparing to implement round-robin for each partition tablet imported each time, in order to achieve average load to each tablet. When generating the load query plan, the tablet index record currently imported is passed to BE. Add a deamon task in FE to regularly clean up the `loadTabletRecordMap`. The map will get the bucket_number of the partition and update the `load_tablet_index` when `getCurrentLoadTabletIndex`.	2023-10-16 16:43:25 +08:00
starocean999	e8431e1a97	[fix](planner)should not add TupleIsNullPredicate for inlineview plan (#25338 )	2023-10-16 15:24:13 +08:00
JingDas	8e9e1b1bfd	[fix](planner) Disable infer expr column name when query on old optimizer (#25317 ) Disable infer expr column name when query on old optimizer. This bug is be brought in #24990 if your query SQL is select id, name, sum(target) FROM db_test.table_test2 group by id, name; the result column name when query is as following: \|id\|name \|sum(cast(target as DOUBLE))\| when you create view as following: CREATE VIEW v1 as select id, name, sum(target) FROM db_test.table_test2 group by id, name; then query the view v1, the result is as following: \|id\|name \|__sum_2\|	2023-10-16 02:08:52 -05:00
morrySnow	1a27ac8d56	[opt] use correct column label when execute query in FE (#25372 ) SET @a = '4'; SELECT @a; previous: +-----+ \| '4' \| +-----+ \| 4 \| +-----+ current: +----+ \| @a \| +----+ \| 4 \| +----+	2023-10-16 02:03:33 -05:00
bobhan1	f698f205d5	[Fix](merge-on-write) throw exception when the user don't specify the insert columns in insert statement for partial update (#25437 )	2023-10-16 14:05:06 +08:00
LiBinfeng	29d4e8ee90	[Fix](Nereids) fix test leading change disable join reorder parameter (#23657 ) Problem: when running pipeline, we get randomly failed of test_leading Reason: physical distribute was generated and choosed to be the best plan because we can not get any statistic information of empty table. So we would get some unexpect result because we can not expect the order in memo Solved: Add statistic of columns used in test_leading, try repeatly in pipeline	2023-10-15 22:59:45 -05:00
morrySnow	934d82816c	[fix](Nereids) add int type alias 'integer' (#25376 )	2023-10-15 22:12:44 -05:00
morrySnow	4c57c31c5c	[fix](Nereids) count should not accept complex and json type (#25354 )	2023-10-15 22:08:35 -05:00
zhangstar333	dfc7d04626	[fix](functions) add quantile_state_empty function signature (#25306 )	2023-10-16 11:05:48 +08:00
zhangdong	471cf2c48b	[improvement](auth) support show view priv (#25370 ) Issue Number: close #xxx current ,if user has select_priv or load_priv,he can show create table view_name, but this is not safe，so add show_view_priv for show create table view_name mysql SHOW VIEW description: https://dev.mysql.com/doc/refman/8.0/en/privileges-provided.html#priv_show-view	2023-10-14 22:37:51 +08:00
jiafeng.zhang	03316e2355	[fix](fe rest api)api gets execution plan, table name case problem (#25112 ) The user has configured the parameter lower_case_table_names, which ignores the case of the table name. When executed on the SQL client, the table name can be queried in both case. But when using Connector to read doris data, the table names must be in the same case, otherwise an error will be reported.	2023-10-14 19:48:24 +08:00
zhiqiang	e5ef0aa6d4	[refactor](mysql result format) use new serde framework to tuple convert (#25006 )	2023-10-14 19:46:42 +08:00
Siyang Tang	6bb0c918eb	[fix](statistics) use replication_num as replica num (#25325 )	2023-10-13 20:14:25 +08:00
Petrichor	b56eecb341	update secure flag to false (#25412 ) update secure flag to false	2023-10-13 17:00:58 +08:00
yiguolei	6757d2f361	Revert "[Enhancement](show-backends-disks) Add show backends disks (#24229 )" (#25389 ) This reverts commit 21223e65c59c23cfcb9e8ab610ea321168bcb75a.	2023-10-13 14:08:45 +08:00
Tiewei Fang	6f9a084d99	[Fix](Outfile) Use data_type_serde to export data to `parquet` file format (#24998 )	2023-10-13 13:58:34 +08:00
zhangdong	4f65a9c425	[fix](auth)fix not display be_port (#25197 ) fix not display be_port who has ADMIN_PRIV	2023-10-13 11:56:00 +08:00
Jack Drogon	ffacbe7d74	[feature](thrift) Add FE thrift rpc redirect master address (#25371 ) Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>	2023-10-13 11:17:46 +08:00
DuRipeng	aa0b74d63a	[improvement](fe and broker) support specify broker to getSplits, check isSplitable, file scan for HMS Multi-catalog (#24830 ) I want to use Doris Multi-catalog to accelerate HMS query. My organization has custom distributed file system, and we think wrapping the fs access difference into broker (listLocatedFiles, openReader..) would be a elegant approach. This pr introduce HMS catalog conf `bind.broker.name`. If we set this conf, file split, query scan operation will send to broker. usage: create a hms catalog with broker usage ``` CREATE CATALOG hive_catalog_broker PROPERTIES ( 'type'='hms', 'hive.metastore.uris' = 'thrift://xxx', 'broker.name' = 'hdfs_broker' ); ``` When we try to query from this catalog, file split and query scan request will send to broker `hdfs_broker`. More details about this pr: 1. Introduce HMS catalog proporty `bind.broker.name` to specify broker name to do remote path work. When `broker.name` is set, `enable.self.splitter` must be `true` to ensure file splitting process is executed in Fe 2. Introduce 2 more interfaces to broker service: - `TBrokerIsSplittableResponse isSplittable(1: TBrokerIsSplittableRequest request)`, helps to invoke input format `isSplitable` interface. - `TBrokerListResponse listLocatedFiles(1: TBrokerListPathRequest request)`, helps to do `listFiles` or `listLocatedStatus` for remote file system 3. 3 parts of whole processing will be executed in broker: - Check whether the path with specified input format name `isSplittable` - `listLocatedFiles` of table / partition locations. - `OpenReader` for specified file splits. Co-authored-by: chenlinzhong <490103404@qq.com>	2023-10-13 11:04:38 +08:00
Mingyu Chen	a30d30e7b5	[improvement](resource-tag) limit the default user's resource tag to 'default' (#25331 ) In previous, if user property `'resource_tags.location'` is not set, the can use Backends with any resource tag. It may confuse that when the DBA set part of Backends to resource group A, then the current existing user should not be able to use this group A util it's `'resource_tags.location'` is set. So in this PR, I change the behavior, that if user property `'resource_tags.location'` is not set, it can only use the Backends with `default` tag.	2023-10-13 10:50:00 +08:00
zhangdong	11bbeb9a21	[Enhance](resource group)db support replication_allocation (#25195 ) - db support replication_allocation,when create table,if not set `replication_num` or `replication_allocation `,will use it in db - fix partition property will disappear when table partition is not null	2023-10-13 10:24:01 +08:00
yongjinhou	21223e65c5	[Enhancement](show-backends-disks) Add show backends disks (#24229 ) * Add statement to query disk information corresponding to data directory of BE node [msyql]->'show backends disks;' +-----------+-------------+------------------------------+---------+----------+---------------+-------------+-------------------+---------+ \| BackendId \| Host \| RootPath \| DirType \| DiskState\| TotalCapacity \| UsedCapacity\| AvailableCapacity \| UsedPct \| +-----------+-------------+------------------------------+---------+----------+---------------+-------------+-------------------+---------+ \| 10002 \| 10.xx.xx.90 \| /home/work/output/be/storage \| STORAGE \| ONLINE \| 7.049 TB \| 2.478 TB \| 4.571 TB \| 35.16 % \| \| 10002 \| 10.xx.xx.90 \| /home/work/output/be \| DEPLOY \| ONLINE \| 7.049 TB \| 2.478 TB \| 4.571 TB \| 35.16 % \| \| 10002 \| 10.xx.xx.90 \| /home/work/output/be/log \| LOG \| ONLINE \| 7.049 TB \| 2.478 TB \| 4.571 TB \| 35.16 % \| +-----------+-------------+------------------------------+---------+----------+---------------+-------------+-------------------+---------+	2023-10-12 20:24:45 +08:00
morrySnow	0a38546596	[opt](Nereids) reject group commit insert temporarily (#25359 ) group commit insert introduced by PR #22829. since nereids has not support it, we forbid it temporarily on Nereids until impl it.	2023-10-12 06:20:59 -05:00
Nitin-Kashyap	bdb64eab73	[feature](meta) queries as table valued function (#25052 ) (#25052 ) 1. Add queries view as table function. 2. Proxy result to other FEs and return merged results back to BE. Co-authored-by: yiguolei <676222867@qq.com>	2023-10-12 16:26:14 +08:00
谢健	d6ff9744c9	[feature](Nereids) covert predicate to SARGABLE (#25180 ) covert predicate to SARGABLE 1. support format like `1 - a` 2. support rearrange `year/month/week/day/minutes/seconds_sub/add` function	2023-10-12 14:46:56 +08:00

1 2 3 4 5 ...

4880 Commits