When `show databases`/`show tables`/`show table status` carries a `where xxx` clause, the statement is rewritten into a SelectStmt that selects its result from
`information_schema`. That rewritten query needs the catalog info to scan the schema table; otherwise it may return
database or table info from multiple catalogs.
For example:
```
mysql> show databases where schema_name='test';
+----------+
| Database |
+----------+
| test     |
| test     |
+----------+

MySQL [internal.test]> show tables from test where table_name='test_dc';
+----------------+
| Tables_in_test |
+----------------+
| test_dc        |
| test_dc        |
+----------------+
```
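For reference, a minimal sketch of the kind of `information_schema` query the statement is rewritten to, and the catalog predicate that is missing without catalog info; the column names and predicate below are assumptions for illustration, not the exact rewrite produced by the FE:
```sql
-- Illustrative only: without the CATALOG_NAME predicate (assumed column name),
-- scanning the schema table returns rows from every catalog.
SELECT SCHEMA_NAME AS `Database`
FROM information_schema.schemata
WHERE CATALOG_NAME = 'internal' AND SCHEMA_NAME = 'test';
```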
# Proposed changes
## refactor
- add `AggregateExpression` to hide the differences in `AggregateFunction` before and after disassembly
- request `GATHER` physical properties for the query; since a query always gathers its result to the coordinator, requesting `GATHER` may let the optimizer select a better plan
- refactor `NormalizeAggregate`
- remove some physical fields from `LogicalAggregate`, such as `AggPhase` and `isDisassemble`
- remove `AggregateDisassemble` and `DistinctAggregateDisassemble`, and use `AggregateStrategies` to generate the various `PhysicalHashAggregate` alternatives (e.g. two-phase aggregate, three-phase aggregate), so the cascades framework can automatically select the lowest-cost alternative
- move `PushAggregateToOlapScan` to `AggregateStrategies`
- separate the traverse and visit methods in `FoldConstantRuleOnFE`
  - if an expression does not implement the visit method, the traverse method handles it and rewrites its children by default
  - if an expression implements the visit method, the user-defined traverse (which invokes accept/visit) returns quickly because the default visit method does not forward to the children, and the pre-processing in the traverse method is not skipped
## new feature
- support the `disable_nereids_rules` session variable to skip some rules.
Example:
1. create a 1-bucket table `n`
```sql
CREATE TABLE `n` (
`id` bigint(20) NOT NULL
) ENGINE=OLAP
DUPLICATE KEY(`id`)
COMMENT 'OLAP'
DISTRIBUTED BY HASH(`id`) BUCKETS 1
PROPERTIES (
"replication_allocation" = "tag.location.default: 1",
"in_memory" = "false",
"storage_format" = "V2",
"disable_auto_compaction" = "false"
);
```
2. insert some rows into `n`
```sql
insert into n select * from numbers('number'='20000000')
```
3. query table `n`
```sql
SET enable_nereids_planner=true;
SET enable_vectorized_engine=true;
SET enable_fallback_to_original_planner=false;
explain plan select id from n group by id;
```
the result shows that the planner uses a one-stage aggregate
```
| PhysicalHashAggregate ( aggPhase=LOCAL, aggMode=INPUT_TO_RESULT, groupByExpr=[id#0], outputExpr=[id#0], partitionExpr=Optional.empty, requestProperties=[GATHER], stats=(rows=1, width=1, penalty=2.0E7) ) |
| +--PhysicalProject ( projects=[id#0], stats=(rows=20000000, width=1, penalty=0.0) ) |
| +--PhysicalOlapScan ( qualified=default_cluster:test.n, output=[id#0, name#1], stats=(rows=20000000, width=1, penalty=0.0) ) |
```
4. disable the one-stage aggregate
```sql
explain plan select
/*+SET_VAR(disable_nereids_rules=DISASSEMBLE_ONE_PHASE_AGGREGATE_WITHOUT_DISTINCT)*/
id
from n
group by id
```
the result is a two-stage aggregate
```
| PhysicalHashAggregate ( aggPhase=GLOBAL, aggMode=BUFFER_TO_RESULT, groupByExpr=[id#0], outputExpr=[id#0], partitionExpr=Optional[[id#0]], requestProperties=[GATHER], stats=(rows=1, width=1, penalty=2.0E7) ) |
| +--PhysicalHashAggregate ( aggPhase=LOCAL, aggMode=INPUT_TO_BUFFER, groupByExpr=[id#0], outputExpr=[id#0], partitionExpr=Optional[[id#0]], requestProperties=[ANY], stats=(rows=1, width=1, penalty=2.0E7) ) |
| +--PhysicalProject ( projects=[id#0], stats=(rows=20000000, width=1, penalty=0.0) ) |
| +--PhysicalOlapScan ( qualified=default_cluster:test.n, output=[id#0, name#1], stats=(rows=20000000, width=1, penalty=0.0) ) |
```
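Beyond the `SET_VAR` hint shown above, the same rule can presumably also be disabled for the whole session by setting the variable directly; this is an illustrative usage sketch, not output from the PR:
```sql
-- Illustrative: disable the one-phase aggregate rule for the current session.
SET disable_nereids_rules = 'DISASSEMBLE_ONE_PHASE_AGGREGATE_WITHOUT_DISTINCT';
explain plan select id from n group by id;
```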
Set the FE config `enable_new_load_scan_node` to true by default,
so that all load tasks (broker load, stream load, routine load, insert into) will use FileScanNode instead of BrokerScanNode
to read data.
1. Support loading Parquet files in stream load with the new load scan node.
2. Fix a bug where the new Parquet reader could not read columns without a logical or converted type.
3. Change the JSONB parse function to "jsonb_parse_error_to_null",
so that if the input string is not a valid JSON string, the load task writes NULL into the JSONB column.
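A minimal sketch of the resulting behavior, calling the function directly in a query (illustrative; the load path applies the same function to the source column):
```sql
-- Valid JSON parses normally.
SELECT jsonb_parse_error_to_null('{"k": 1}');
-- Invalid JSON returns NULL instead of failing the row.
SELECT jsonb_parse_error_to_null('not a valid json string');
```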
Add DateV2 and DateTimeV2 support to `Literal.uncheckCastTo()`.
Move the Nereids TPC-H cases into the suite `nereids_tpch_p1`.
Move the Nereids datav2 cases into the suite `nereids_datav2_p1`.
A PhysicalFilter can't be assigned to an ExchangeNode, SortNode, or UnionNode. Nereids now creates a standalone SelectNode to do the filtering properly.
## date_add series
- DATE_ADD
- DAYS_ADD
- ADDDATE
- TIMESTAMPADD
## date_sub series
- DATE_SUB
- DAYS_SUB
- SUBDATE
## NOTE
1. For DAYS_XXX, the time unit can be omitted; it defaults to DAY.
2. There is no TIMESTAMPSUB.
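A short usage sketch of the functions above, assuming the MySQL-compatible signatures; the literals are illustrative only:
```sql
-- Each statement adds or subtracts one day from the given date.
SELECT date_add('2022-01-01', INTERVAL 1 DAY);
SELECT days_add('2022-01-01', 1);              -- unit omitted, defaults to DAY
SELECT adddate('2022-01-01', INTERVAL 1 DAY);
SELECT timestampadd(DAY, 1, '2022-01-01');
SELECT date_sub('2022-01-02', INTERVAL 1 DAY);
SELECT days_sub('2022-01-02', 1);
SELECT subdate('2022-01-02', INTERVAL 1 DAY);
```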
At present, there are multiple user interfaces for accessing HDFS and S3.
Each interface has its own, different configuration, which causes confusion for users.
`CREATE RESOURCE` already supports remote storage resources and resource permission management,
but only `spark`/`odbc_catalog` resources are in use.
Cloud storage resources should be created and managed uniformly through `CREATE RESOURCE`.
This PR contains the following changes:
1. Add `s3`, `hdfs`, and `hms` resources, each containing its own configuration items, and delete the configuration items scattered in other classes.
2. Use a `resource` to create `storage` tools, and use the `storage` tools to access the remote file system.
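A minimal sketch of the intended usage for an `s3` resource; the property names and values below are assumptions for illustration and may differ from the actual syntax:
```sql
-- Illustrative only: create an S3 resource that jobs can then reference by name.
CREATE RESOURCE "remote_s3"
PROPERTIES (
    "type" = "s3",
    "AWS_ENDPOINT" = "s3.us-east-1.amazonaws.com",
    "AWS_REGION" = "us-east-1",
    "AWS_ACCESS_KEY" = "your_access_key",
    "AWS_SECRET_KEY" = "your_secret_key"
);
```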
* [fix](schemachange) Fix a schema change that causes the BE to core dump.
Forbid schema changes that add or modify a key column of the aggregate model as double or float.
* [fix](ut) Try to fix a ConcurrentModificationException bug.
* [Regression](Fix) Fix the regression pipeline and the ConcurrentModificationException failure.
Co-authored-by: morningman <morningman@163.com>