doris

Author	SHA1	Message	Date
morrySnow	779a9a1fbb	[opt](planner) use string for varchar in ctas if original table is not olap (#30323 )	2024-01-29 19:03:47 +08:00
jakevin	930e3bb701	[feature](Nereids): double eager support mix function (#30468 )	2024-01-29 19:03:47 +08:00
zzzxl	ae38f28280	[feature](invert index) does not create an inverted index to support the match_phrase_prefix feature. (#30414 )	2024-01-29 19:02:46 +08:00
lihangyu	7667fe8570	[Improve)(Variant) do not allow fall back to legacy planner (#30430 )	2024-01-29 19:02:46 +08:00
zhangdong	658c869aac	[improvement](mtmv)mtmv support partition by hms table (#29989 )	2024-01-29 19:02:46 +08:00
Chester	7e19224a6c	[fix](function) fix ipv4 funcs get failed error, improve an ipv6 func and exception message (#30269 )	2024-01-28 18:25:31 +08:00
minghong	5986d5415e	[opt](Nereids) make runtime filter target support expression (#30131 ) the target expression should be: 1. only one numeric slot, or 2. cast for any data type example: select * from T1 join T2 on abs(T1.a) = T2.a RF T2.a->abs(T1.a)	2024-01-27 10:07:10 +08:00
jakevin	04237f60e0	[feature](Nereids): eager aggreagate support mix agg function (#30400 )	2024-01-27 09:11:02 +08:00
starocean999	713798d549	[feature](nereids)support mark join (#30133 ) Co-authored-by: Jerry Hu <mrhhsg@gmail.com>	2024-01-27 09:09:53 +08:00
seawinde	f25af15842	[Fix](Nereids) Fix lost predicate when query has a filter at the right input of the outer join (#30374 ) materialized view def is as following: > select l_shipdate, o_orderdate, l_partkey, l_suppkey, o_orderkey > from lineitem > left join (select * from orders where o_orderdate = '2023-12-10' ) t2 > on lineitem.l_orderkey = t2.o_orderkey; the query as following, should add filter `o_orderdate = '2023-12-10'` on mv when query rewrite by materialized view > select l_shipdate, o_orderdate, l_partkey, l_suppkey, o_orderkey > from lineitem > left join orders > on lineitem.l_orderkey = orders.o_orderkey > where o_orderdate = '2023-12-10' order by 1, 2, 3, 4, 5;	2024-01-27 09:09:02 +08:00
zhannngchen	5e4674ab66	[fix](partial update) mishandling of exceptions in the publish phase may result in data loss (#30366 )	2024-01-27 09:09:02 +08:00
lihangyu	8543167195	[Nereids](Variant) Implement variant type and support new sub column access method (#30348 ) * [Nereids](Variant) Implement variant type in Variant and support new sub column access method The query SELECT v["a"]["b"] from simple_var WHERE cast(v["a"]["b"] as int) = 1 1. During the binding stage, the expression element_at(var, "xxx") is transformed into a SlotReference with a specified path. This conversion is tracked in the StatementContext, where the parent slot is the primary key and the paths are secondary keys. This structure, known as subColumnSlotRefMap in the StatementContext, helps to eliminate duplicates of the same slot derived from identical paths. 2. A new rule, BindSlotWithPaths, is introduced in the analysis stage. This rule is responsible for converting slots with paths into their respective slot suppliers. To ensure that slots with paths are correctly associated with the appropriate LogicalOlapScan, an additional mapping, slotToRelation, is added to the StatementContext. This mapping links the top-level slot to its corresponding relation (i.e., LogicalOlapScan). Consequently, subsequent slots with paths can determine the correct LogicalOlapScan to merge with and modify accordingly.	2024-01-27 09:09:02 +08:00
lihangyu	9aaa6ba351	[Fix](Variant) fix variant lost null info after `cast_column` (#30153 ) This could result incorrect output in hirachinal cases ``` sql """insert into ${table_name} values (-3, '{"a" : 1, "b" : 1.5, "c" : [1, 2, 3]}')""" sql """insert into ${table_name} select -2, '{"a": 11245, "b" : [123, {"xx" : 1}], "c" : {"c" : 456, "d" : "null", "e" : 7.111}}' as json_str union all select -1, '{"a": 1123}' as json_str union all select *, '{"a" : 1234, "xxxx" : "kaana"}' as json_str from numbers("number" = "4096") limit 4096 ;""" mysql> select v["c"] from var_rs where k = -3 or k = -2; +----------------------+ \| element_at(`v`, 'c') \| +----------------------+ \| [1,2,3] \| \| [] \| +----------------------+ 2 rows in set (0.04 sec) ```	2024-01-27 09:08:29 +08:00
Mryange	a954bab81a	[fix](function) fix error result in time_to_sec and timediff (#30248 )	2024-01-27 09:08:29 +08:00
Gavin Chou	6f8c133a37	[chore] Remove unused test_show_create_catalog.out (#30290 )	2024-01-27 09:07:13 +08:00
qiye	2ecc6ed0d4	[opt](inverted index)Add RAM directory null cases (#30353 )	2024-01-25 21:37:33 +08:00
qiye	01c394acc2	[fix](inverted index)support merge null_bitmap during index compaction (#30326 ) `null_bitmap` file is not considered in index compaction process. This will lead wrong query result when doc is contain `NULL` values.	2024-01-25 13:24:52 +08:00
Pxl	5b462194d1	[Feature](materialized-view) support rewrite case when to if on legacy planner to make mv work (#30320 ) support rewrite case when to if on legacy planner to make mv work	2024-01-25 13:24:52 +08:00
jakevin	bee6ae73c7	[minor](Nereids): enable PushDownTopNDistinctThroughJoin (#30275 )	2024-01-25 13:24:52 +08:00
Jibing-Li	86d7a8be44	[improvement](statistics nereids)Nereids support select mv. (#30267 )	2024-01-25 13:24:09 +08:00
jakevin	83ea486b15	[fix](Nereids): Except just can merge with left deep shape (#30270 )	2024-01-25 13:24:09 +08:00
Pxl	7e60369ba2	[Feature](materialized-view) support create mv with count() (#30313 ) support create mv with count()	2024-01-25 13:24:09 +08:00
Guangdong Liu	101b2593fc	[regression test](schema change) add case for tinyint/smallint/int/bigint/float/double type in agg (#30193 )	2024-01-25 13:24:09 +08:00
Guangdong Liu	df504df475	[regression test](schema change) add case for partition (#30195 )	2024-01-25 13:24:09 +08:00
koarz	ca5a314765	[fix](function) make STRLEFT and STRRIGHT and SUBSTR function DEPEND_ON_ARGUMENT (#28352 ) make STRLEFT and STRRIGHT function DEPEND_ON_ARGUMENT	2024-01-25 13:23:59 +08:00
seawinde	2f68aac885	[Improvement](Nereids) Support to query rewrite by materialized view when join input has aggregate (#30230 ) Support to query rewrite by materialized view when join input has aggregate, the aggregate should be simple For example as following: The materialized view def is > select > l_linenumber, > count(distinct l_orderkey), > sum(case when l_orderkey in (1,2,3) then l_suppkey * l_linenumber else 0 end), > max(case when l_orderkey in (4, 5) then (l_quantity 2 + part_supp_a.qty_max) 0.88 else 100 end), > avg(case when l_partkey in (2, 3, 4) then l_discount + o_totalprice + part_supp_a.qty_sum else 50 end) > from lineitem > left join orders on l_orderkey = o_orderkey > left join > (select ps_partkey, ps_suppkey, sum(ps_availqty) qty_sum, max(ps_availqty) qty_max, > min(ps_availqty) qty_min, > avg(ps_supplycost) cost_avg > from partsupp > group by ps_partkey,ps_suppkey) part_supp_a > on l_partkey = part_supp_a.ps_partkey > and l_suppkey = part_supp_a.ps_suppkey > group by l_linenumber; when query is like following, it can be rewritten by mv above > select > l_linenumber, > sum(case when l_orderkey in (1,2,3) then l_suppkey * l_linenumber else 0 end), > avg(case when l_partkey in (2, 3, 4) then l_discount + o_totalprice + part_supp_a.qty_sum else 50 end) > from lineitem > left join orders on l_orderkey = o_orderkey > left join > (select ps_partkey, ps_suppkey, sum(ps_availqty) qty_sum, max(ps_availqty) qty_max, > min(ps_availqty) qty_min, > avg(ps_supplycost) cost_avg > from partsupp > group by ps_partkey,ps_suppkey) part_supp_a > on l_partkey = part_supp_a.ps_partkey > and l_suppkey = part_supp_a.ps_suppkey > group by l_linenumber;	2024-01-25 13:23:59 +08:00
Nitin-Kashyap	f85b04c2c6	[fix](datatype) fixed decimal type implicit cast handling in BinaryPredicate (#30181 )	2024-01-25 13:23:12 +08:00
nanfeng	c7360fd014	[feature](function) support ip function named ipv4_cidr_to_range(addr, cidr) (#29819 ) * support ip function ipv4_cidr_to_range * fix ipv4_cidr_to_range function only support ipv4 type	2024-01-24 10:02:03 +08:00
jakevin	1b9f1f6483	[feature](Planner): Push down TopNDistinct through Join (#30216 ) Push down TopNDistinct through Outer/Cross Join	2024-01-24 09:59:13 +08:00
wuwenchi	8308bc96b9	[fix](paimon)set timestamp's scale for parquet which has no logical type (#30119 )	2024-01-23 13:22:14 +08:00
Pxl	1e74ad3f3b	[Feature](materialized-view) support predicate apprear both on key and value mv column (#30215 ) support predicate apprear both on key and value mv column	2024-01-23 13:22:14 +08:00
morrySnow	ce47354d59	[fix](Nereids) result nullable of sum distinct in scalar agg is wrong (#30221 )	2024-01-23 10:09:54 +08:00
yangshijie	d5d0e5e611	[feature](function) support ip functions named to_ipv4[or_default, or_null](string) and to_ipv6[or_default, or_null](string) (#29838 )	2024-01-23 10:09:54 +08:00
starocean999	24c0900b41	[fix](planner) should return outputTupleDesc's id instead of tupleIds if outputTupleDesc is set in Plan Node (#30150 )	2024-01-23 10:09:54 +08:00
zhiqiang	e5dea910bf	[feature](bitwise function) bit_count/bit_shift_left/bit_shift_right implementation (#30046 )	2024-01-23 10:09:54 +08:00
seawinde	9a58cacf0f	[Improvement](nereids) Make sure to catch and record exception for every materialization context (#29953 ) 1. Make sure instance when change params of StructInfo,Predicates. 2. Catch and record exception for every materialization context, this make sure that if throw exception when one materialization context rewrite, it will not influence others. 3. Support to mv rewrite when hava count function when aggregate without group by	2024-01-23 10:09:54 +08:00
Chester	dfde10d4c8	[improvement](function) switch inet(6)_aton alias origin function (#30196 )	2024-01-23 10:09:54 +08:00
lihangyu	4480f751e6	[Improve](Variant) support implicit cast to numeric and string type (#30029 )	2024-01-23 10:09:54 +08:00
zzzxl	e5f1d8d7ec	[fix](phrase_prefix) fix match_phrase_prefix query incorrect result (#29946 )	2024-01-23 10:09:54 +08:00
minghong	332b9cb619	[opt](nereids) do not change RuntimeFilter Type from IN-OR_BLOOM to BLOOM on broadcast join (#30148 ) 1. do not change RuntimeFilter Type from IN-OR_BLOOM to BLOOM on broadcast join tpcds1T, q48 improved from 4.x sec to 1.x sec 2. skip some redunant runtime filter example: A join B on A.a1=B.b and A.a1 = A.a2 RF B.b->(A.a1, A.a2) however, RF(B.b->A.a2) is implied by RF(B.a->A.a1) and A.a1=A.a2 we skip RF(B.b->A.a2) Issue Number: close #xxx	2024-01-23 10:07:51 +08:00
Chester	ead3b4ac1d	[feature](function) support ip function is_ipv4_compat, is_ipv4_mapped (#29954 )	2024-01-23 10:07:51 +08:00
minghong	ddeed079d4	[opt](Nereids)make orToIn rule appliable to in-pred (#29990 ) make orToIn rule appliable to in-pred	2024-01-19 15:48:56 +08:00
yangshijie	97b2a3b993	[improvement](ip function) refactor some ip functions and remove dirty codes (#30080 )	2024-01-19 15:48:56 +08:00
谢健	e560f31692	[fix](Nereids): fix eliminate join test for pk-fk constraint (#30094 )	2024-01-19 15:48:56 +08:00
qiye	fac0580eae	[opt](docker)optimize ES docker compose (#30068 ) 1. add volume for es logs 2. optimize health check, waiting for es status to be green 3. fix es6 valume path error 4. optimize disk watermark to avoid es disk watermark error 5. fix es6 create index error 6. add custom elasticsearch.yml for es6 7. add log4j2.properties for es6, es7, es8	2024-01-19 15:48:56 +08:00
jakevin	097641b543	[fix](Nereids): fix AssertNumRows StatsCalculator (#30053 )	2024-01-19 15:48:15 +08:00
Pxl	2ccb69dbed	[Feature](materialized-view) support some case unmached to materialized-view (#30036 ) same column appears in key and value like select id,count(id) group by id; complex expr in sum select sum(if(xxx));	2024-01-18 12:03:07 +08:00
zy-kkk	0ccd706a30	[Enhancement](Jdbc Catalog) Map Jdbc Catalog JSON Type to String for Improved Performance and Compatibility (#30035 ) This PR proposes mapping external catalog JSON types to String instead of JsonB in Apache Doris. This change is motivated by the realization that JDBC retrieves JSON data as a String JSON string, regardless of its storage format (Json(String) or Json(Binary)). Mapping to String streamlines data retrieval, simplifies write-backs, and ensures compatibility with all JSON(String) and JSON(Binary) functions, despite potentially misleading displays of JSON data as Strings in Doris. This approach avoids the performance overhead and complexity of converting each row of data from JsonB to String, making the process more efficient and elegant. About Upgrade To ensure query compatibility with existing Catalogs in the upgraded version,we currently still retain the capability to query external JSON types as JSONB. However, once you upgrade to the new version and either refresh the Catalog or create a new one, all external JSON types will be treated as Strings. To ensure consistent behavior,and possible future removal of support for JSON as JSONB query code, it is highly recommended that you manually refresh your Catalog as soon as possible after upgrading to the new version.	2024-01-18 12:03:07 +08:00
wuwenchi	44ba9e102c	[feature](statistics)support statistics for iceberg/paimon/hudi table (#29868 )	2024-01-18 12:03:07 +08:00
amory	ade720470d	[Improve](config)delete confused config for nested complex type (#29988 )	2024-01-18 12:03:07 +08:00

1 2 3 4 5 ...

2450 Commits