doris

Author	SHA1	Message	Date
starocean999	782001c75b	[fix](planner) project should be done inside subquery (#17630 ) WITH t0 AS( SELECT report.date1 AS date2 FROM( SELECT DATE_FORMAT(date, '%Y%m%d') AS date1 FROM cir_1756_t1 ) report GROUP BY report.date1 ), t3 AS( SELECT date_format(date, '%Y%m%d') AS date3 FROM cir_1756_t2 ) SELECT row_number() OVER(ORDER BY date2) FROM( SELECT t0.date2 FROM t0 LEFT JOIN t3 ON t0.date2 = t3.date3 ) tx; The DATE_FORMAT(date, '%Y%m%d') was calculated in GROUP BY node, which is wrong. This expr should be calculated inside the subquery.	2023-03-13 11:10:27 +08:00
abmdocrt	55c42da511	[Feature](array) Support array<decimalv3> data type (#16640 )	2023-03-13 10:48:13 +08:00
HappenLee	39b5682d59	[Pipeline](shared_scan_opt) Support shared scan opt in pipeline exec engine	2023-03-13 10:33:57 +08:00
Tiewei Fang	13e05c4a5d	[Enhencement](stream load) add some regression test for json format streamload (#17520 )	2023-03-12 20:13:07 +08:00
slothever	455c800405	[feature](parquet-reader) add rle bool and delta decoder to read AWS Glue (#17112 ) Support delta encoding and rle(bool) to read Glue data add delta bit pack decoder, add delta length byte array decoder, add delta byte array decoder. add rle bool decoder. We find some data type is read with delta encoding on AWS Glue, so it should be supported. The definition of delta encoding can refer to the delta encoding in parquet.	2023-03-12 20:09:58 +08:00
Pxl	8328ab69ad	[Chore](Materialized-View) add some mv regression test case (#17345 ) 1. add some mv regression test case 2. rename materialized_view_p0 to mv_p0 (avoid create database failed because long db name)	2023-03-11 10:55:11 +08:00
camby	6dcd791b74	[feature](struct-type) support CAST AS Struct type (#17553 ) 1. add support `CAST AS Struct` from Struct type; 2. fix crash while `CAST('{}' AS Struct)`; 3. `CAST('' AS complext_type)` should return NULL instead of empty object; --------- Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2023-03-10 21:21:16 +08:00
lihangyu	a79b8ede88	[Bug](ColumnArray) Fix array column replicate `replicate_offsets` not matched (#17616 ) the input replicate_offsets should be the same size as ColumnArray's offset. ``` IColumn::Offsets replicate_offsets(get_offsets().size(), 0); // \|---------------------\|-------------------------\|-------------------------\| // [0, begin) [begin, begin + count_sz) [begin + count_sz, size()) // do not need to copy copy counts[n] times do not need to copy ``` we should	2023-03-10 11:52:22 +08:00
bobhan1	e1bf9411de	[feature](array function) add support for array_enumerate_uniq (#17541 ) add support for array_enumerate_uniq()	2023-03-10 10:20:49 +08:00
huangzhaowei	4ba93efc98	[Enhance](DOE)Support parse default es iso datetime string (#17412 ) * support parse default es iso datetime string	2023-03-10 09:59:20 +08:00
morrySnow	006f7a91ac	[fix](planner) should not turn on push agg op when olapscan has conjuncts on it (#17598 ) we should not set PushAggOp to any type, if olap scan already has conjunct on it.	2023-03-10 09:33:08 +08:00
WenYao	a745ab1703	[fix](schema scanner) fix query some schema table report invalid parameter (#17626 ) Example: SELECT ROUTINE_SCHEMA AS PROCEDURE_CAT, NULL AS PROCEDURE_SCHEM,ROUTINE_NAME AS PROCEDURE_NAME,NULL AS NUM_INPUT_PARAMS,NULL AS NUM_OUTPUT_PARAMS,NULL AS NUM_RESULT_SETS,ROUTINE_COMMENT AS REMARKS,IF(ROUTINE_TYPE = 'FUNCTION', 2,IF(ROUTINE_TYPE= 'PROCEDURE', 1, 0)) AS PROCEDURE_TYPE FROM INFORMATION_SCHEMA.ROUTINES WHERE ROUTINE_SCHEMA = DATABASE(); ERROR 1105 (HY000): errCode = 2, detailMessage = invalid parameter This wrong and some BI tools could not work correctly.	2023-03-10 08:52:09 +08:00
Jerry Hu	08f0170895	[fix](olap) The 'scan key' generated by the 'is null' expression causes incorrect query results (#17569 )	2023-03-10 08:51:06 +08:00
Xinyi Zou	f9baf9c556	[improvement](scan) Support pushdown execute expr ctx (#15917 ) In the past, only simple predicates (slot=const), and, like, or (only bitmap index) could be pushed down to the storage layer. scan process: Read part of the column first, and calculate the row ids with a simple push-down predicate. Use row ids to read the remaining columns and pass them to the scanner, and the scanner filters the remaining predicates. This pr will also push-down the remaining predicates (functions, nested predicates...) in the scanner to the storage layer for filtering. scan process: Read part of the column first, and use the push-down simple predicate to calculate the row ids, (same as above) Use row ids to read the columns needed for the remaining predicates, and use the pushed-down remaining predicates to reduce the number of row ids again. Use row ids to read the remaining columns and pass them to the scanner.	2023-03-10 08:35:32 +08:00
Xin Liao	849b5b7b8f	[fix](sequence) fix that the result is wrong when load multiple duplicate keys (#17575 )	2023-03-09 20:59:23 +08:00
morrySnow	6c894be007	[enhancement](Nereids) support decimalv3 and precision derive (#17393 )	2023-03-09 14:12:10 +08:00
zhangstar333	4ef46159ae	[vectorized](udaf) support array type for java-udaf (#17351 )	2023-03-09 11:30:07 +08:00
amory	06dee69174	[Refactor](map) remove using column array in map to reduce offset column (#17330 ) 1. remove column array in map 2. add offsets column in map Aim to reduce duplicate offset from key-array and value-array in disk	2023-03-09 11:22:26 +08:00
lihangyu	368e6a4f9c	[Bug](array filter) Fix bug due to `ColumnArray::filter_generic` invalid inplace `size_at` after `set_end_ptr` (#17554 ) We should make a new PodArray to add items instead of do it inplace	2023-03-09 10:59:29 +08:00
luozenglin	00727e8c11	[fix](in-bitmap) fix result may be wrong if the left side of the in bitmap predicate is a constant (#17570 )	2023-03-09 10:59:05 +08:00
Xinyi Zou	397cc011c4	[fix](function) fix AES/SM3/SM4 encrypt/ decrypt algorithm initialization vector bug (#17420 ) ECB algorithm, block_encryption_mode does not take effect, it only takes effect when init vector is provided. Solved: 192/256 supports calculation without init vector For other algorithms, an error should be reported when there is no init vector Initialization Vector. The default value for the block_encryption_mode system variable is aes-128-ecb, or ECB mode, which does not require an initialization vector. The alternative permitted block encryption modes CBC, CFB1, CFB8, CFB128, and OFB all require an initialization vector. Reference: https://dev.mysql.com/doc/refman/8.0/en/encryption-functions.html#function_aes-decrypt Note: This fix does not support smooth upgrades. during upgrade process, query may report error: funciton not found	2023-03-09 09:51:41 +08:00
starocean999	2b6d971c2f	[fix](nereids)fix first_value/lead/lag window function bug in nereids (#17315 ) * [fix](nereids)fix first_value/lead/lag window function bug in nereids * add more test * add order by to fix test case * fix test cases	2023-03-09 09:35:27 +08:00
minghong	4822b9811a	[feature](nereids)support bitmap runtime filter on nereids (#16927 ) * A in(B) -> bitmap_contains(bitmap_union(B), A) support bitmap runtime filter on nereids * GroupPlan -> Plan * fmt * fix target cast problem remove test code	2023-03-09 09:30:24 +08:00
qiye	f0bd002911	[fix](DOE) Fix esquery not working (#17566 ) Function esquery does not work because there is a problem parsing the first parameter type. The first parameter, which is SlotRef, will be cast to CastExpr. This will cause error while generating ES DSL. Add more types to adapt esquery function.	2023-03-08 21:51:17 +08:00
ElvinWei	bd5ed2b0c2	[enhancement](histogram) optimize the histogram bucketing strategy, etc (#17264 ) * optimize the histogram bucketing strategy, etc * fix p0 regression of histogram	2023-03-08 20:12:05 +08:00
TengJianPing	eea6d770d7	[fix](bitmap) fix wrong result of bitmap_or for null (#17456 ) Result of select bitmap_to_string(bitmap_or(to_bitmap(1), null)) should be 1 instead of null. This PR fix logic of bitmap_or and bitmap_or_count. Other count related funcitons should also be checked and fix, they will be fixed in another PR.	2023-03-08 16:29:01 +08:00
zhengshiJ	aab14922af	[Feature](Nereids) support MarkJoin (#16616 ) # Proposed changes 1.The new optimizer supports the combination of subquery and disjunction.In the way of MarkJoin, it behaves the same as the old optimizer. For design details see:https://emmymiao87.github.io/jekyll/update/2021/07/25/Mark-Join.html. 2.Implicit type conversion is performed when conjects are generated after subquery parsing 3.Convert the unnesting of scalarSubquery in filter from filter+join to join + Conjuncts.	2023-03-08 14:26:24 +08:00
Kang	626fbc34f9	[bugfix](jsonb) Fix create mv using jsonb key cause be crash (#17430 )	2023-03-08 14:18:26 +08:00
bobhan1	4ea0d6c5fa	[feature](array_function) add support for array_popfront (#17416 )	2023-03-08 13:57:38 +08:00
gitccl	b1d65f855d	[Feature](array-function) Support array_concat function (#17436 )	2023-03-08 13:57:16 +08:00
TengJianPing	778acb3c5b	[opt](string) optimize string equal comparision (#17336 ) Optimize string equal and not-equal comparison by using memequal_small_allow_overflow15.	2023-03-08 11:30:00 +08:00
Kang	4b743061b4	[feature](function) support type template in SQL function (#17344 ) A new way just like c++ template is proposed in this PR. The previous functions can be defined much simpler using template function. # map element extract template function [['element_at', '%element_extract%'], 'E', ['ARRAY<E>', 'BIGINT'], 'ALWAYS_NULLABLE', ['E']], # map element extract template function [['element_at', '%element_extract%'], 'V', ['MAP<K, V>', 'K'], 'ALWAYS_NULLABLE', ['K', 'V']], BTW, the plain type function is not affected and the legacy ARRAY_X MAP_K_V is still supported for compatability.	2023-03-08 10:51:31 +08:00
htyoung	69c62b6c6c	[Fix](vectorization) fixed that when a column's _fixed_values exceeds the max_pushdown_conditions_per_column limit, the column will not perform predicate pushdown, but if there are subsequent columns that need to be pushed down, the subsequent column pushdown will be misplaced in _scan_keys and it causes query results to be wrong (#17405 ) the max_pushdown_conditions_per_column limit, the column will not perform predicate pushdown, but if there are subsequent columns that need to be pushed down, the subsequent column pushdown will be misplaced in _scan_keys and it causes query results to be wrong Co-authored-by: tongyang.hty <hantongyang@douyu.tv>	2023-03-08 07:23:56 +08:00
zhangstar333	06468ba627	[vectorized](bug) fix array constructor function change origin column from block (#17296 )	2023-03-07 16:42:23 +08:00
minghong	fd8adb492d	[fix](nereids) fix bugs in nereids window function (#17284 ) fix two problems: 1. push agg-fun in windowExpression down to AggregateNode for example, sql: select sum(sum(a)) over (order by b) Plan: windowExpression( sum(y) over (order by b)) +--- Agg(sum(a) as y, b) 2. push other expr to upper proj for example, sql: select sum(a+1) over () Plan: windowExpression(sum(y) over ()) +--- Project(a + 1 as y,...) +--- Agg(a,...)	2023-03-07 16:35:37 +08:00
ZhangYu0123	8ccc805cd0	[Fix](Lightweight schema Change) query error caused by array default type is unsupported (#17331 ) We have supportted array type default [], but when using lightweight schema Change to add column array type, query failed as follows: Fix "array default type is unsupported" error. Fix the default value filling assignment digit problem.	2023-03-07 16:30:41 +08:00
mch_ucchi	86252e25bf	[regression-test](Nereids) add binary arithmetic regression test cases(#17363 ) add all of the valid binary arithmetic expressions test for nereids. currently, float, double, stringlike(string, char, varchar) doesn't support div, bitand, bitor, bitxor. some results with float type are incorrect because of inaccurate precision of regression-test framework.	2023-03-07 15:48:22 +08:00
Jerry Hu	caacee253d	[fix](olap)Crashing caused by IS NULL expression (#17463 ) Issue Number: close #17462	2023-03-07 15:32:52 +08:00
zhangdong	7e96b06e6c	[Enhance](auth)Users support multiple roles (#17236 ) Describe your changes. 1.support GRANT role [, role] TO user_identity 2.support REVOKE role [, role] FROM user_identity 3.’Show grants‘ Add a column to display the roles owned by users 4.‘alter user’ prohibit deleting user's role 5.Repair Logic of roleName cannot start with RoleManager.DEFAULT_ ROLE	2023-03-07 10:28:56 +08:00
AKIRA	f85f89f240	[fix](planner) Fix incosistency between groupby expression and output of aggregation node (#17438 )	2023-03-07 09:38:20 +08:00
ZhangYu0123	440cf526c8	[fix](type compatibility) fix unsigned int type compatibility problem (#17427 ) Fix unsigned int type compatibility value scope problem. When defining columns, map UNSIGNED INT to BIGINT for compatibility. The problems are as follows: It is not consistent with this doc image We support the unsigned int type to be compatible with mysql types, but the unsigned int type is created as the int at the time of definition. This will cause numerical overflow.	2023-03-07 08:55:38 +08:00
AKIRA	aedbc5fcb1	[fix](planner) Slots in the cojuncts of table function node didn't got materialized #17460	2023-03-07 08:50:33 +08:00
luozenglin	e7cba11680	[fix](array)(parquet) fix be core dump due to load from parquet file containing array types (#17298 )	2023-03-06 15:18:42 +08:00
AlexYue	ee1be6edd7	[chore](fe) enhance_mysql_data_type (#17429 )	2023-03-06 10:42:01 +08:00
WenYao	a8f20eb4ac	[Enhencement](schema_scanner) Optimize the performance of reading information schema tables (#17371 ) batch fill block batch call rpc from FE to get table desc For 34w colunms SELECT COUNT( * ) FROM information_schema.columns; time: 10.3s --> 0.4s	2023-03-06 09:53:01 +08:00
morrySnow	9aecd517b0	[test](Nereids) turn on all test in scalar function w (#17269 ) turn on all test case in scalar function W except width_bucket(fix be bug in next PR) turn off all test case for group_concat(distinct order by) fix return nullable in TimestampArithmetic	2023-03-04 08:23:50 +08:00
ZhaoChangle	e82b827bc8	[optimize](vectorization)Optimize to_string's performance. (#17076 )	2023-03-03 10:35:59 +08:00
Tiewei Fang	ba82cd10c6	[Enhencement](Jdbc catalog) Add two optional properties for jdbc catalog (#17245 ) 1. The first property is `only_specified_database`: In the past, `Jdbc Catalog` will synchronize all database from source database. Now we add a parameter called `only_specified_database` to jdbc catalog to allow only the specified database to be synchronized, eg: ```sql create resource if not exists ${resource_name} properties( "type"="jdbc", "user"="root", "password"="123456", "jdbc_url" = "jdbc:mysql://172.18.0.1:${mysql_port}/doris_test?useSSL=false", "driver_url" = "https://doris-community-test-1308700295.cos.ap-hongkong.myqcloud.com/jdbc_driver/mysql-connector-java-8.0.25.jar", "driver_class" = "com.mysql.cj.jdbc.Driver", "only_specified_database" = "true" ); ``` if `only_specified_database` is `true`, jdbc catalog will only synchronize the database which is specified in `jdbc_url`. 2. The second property is `lower_case_table_names`: This property will synchronize jdbc external data source table names in lower case. ```sql create resource if not exists ${resource_name} properties( "type"="jdbc", "user"="doris_test", "password"="123456", "jdbc_url" = "jdbc:oracle:thin:@172.18.0.1:${oracle_port}:${SID}", "driver_url" = "https://doris-community-test-1308700295.cos.ap-hongkong.myqcloud.com/jdbc_driver/ojdbc8.jar", "driver_class" = "oracle.jdbc.driver.OracleDriver", "lower_case_table_names" = "true" ); ```	2023-03-03 00:47:46 +08:00
morrySnow	3eeeff09fd	[enhancement](nereids) convert string literal to commontype in in-expr and cass-when-expr (#17200 )	2023-03-02 22:05:35 +08:00
starocean999	27352afdf6	[fix](fe)support multi distinct group_concat (#17237 ) * [fix](fe)support multi distinct group_concat * update based on comments	2023-03-02 17:53:13 +08:00

1 2 3 4 5 ...

910 Commits