doris

Author	SHA1	Message	Date
chenlinzhong	6736e06679	[feature](udf) Vectorization support remote udaf #10683 (#10685 )	2022-07-18 17:15:34 +08:00
Mingyu Chen	9adbd8abbd	[feature](resource-tag) support multi tag for a single Backend (#10901 )	2022-07-18 16:50:45 +08:00
吕飞	091e17ecab	fix(fe): add `,` with json_root property in stmt `show create routine load for xx_job` (#10929 ) Fix issue: https://github.com/apache/doris/issues/10928	2022-07-18 16:44:40 +08:00
jiafeng.zhang	006d7c9225	[fix]The spring boot startup banner is lost, and the maven package does not package the pictures in the resources directory (#10955 )	2022-07-18 16:00:14 +08:00
yinzhijian	dc01ea7ad9	[fix](nereids) Fix the substring compilation error caused by merge (#10965 ) Compilation error after merging due to Literal refactoring. Compilation failure: fe/fe-core/src/main/java/org/apache/doris/nereids/trees/expressions/functions/Substring.java:[40,38] org.apache.doris.nereids.trees.expressions.Literal is abstract; cannot be instantiated	2022-07-18 15:20:25 +08:00
deardeng	8c544b6e13	fix show storage policy null pointer and redundant log (#10906 ) * fix show storage policy null pointer and redundant log	2022-07-18 14:08:54 +08:00
yinzhijian	0b177669d9	[feature](nereids) support substring (#10847 ) support substring, for example: select substr(a, 2), substring(b ,3 ,4) from test1;	2022-07-18 12:38:56 +08:00
Kikyou1997	bf95440c13	[Refactor](nereids)Refactor Literal to an inheritance hierarchy (#10771 ) Use inheritance hierarchy instead of combination to make the framework more clear	2022-07-18 12:01:30 +08:00
starocean999	5c88a74792	[Enhancement] generate runtime filter only for tuples with conjunct (#8745 ) Remove useless runtime filter in some primary-foreign key join scenario in TPCH case.	2022-07-18 09:37:45 +08:00
caiconghui	2e94674cb5	[fix](alter) fix bug that fe crash because npe on rollupBatchTask (#10943 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-07-18 08:47:25 +08:00
jakevin	6b1408ce41	[fix](planner) fix create view when using union (#10849 )	2022-07-17 20:54:40 +08:00
camby	09d19e3f0f	[feature-wip](array-type) explode support more sub types (#10673 ) 1. explode support more sub types; 2. explode support nullable elements; Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-07-17 18:08:30 +08:00
Yongqiang YANG	f0babfdcf8	print image's version when it is higher than FEConstants.meta_version (#10908 )	2022-07-16 19:26:47 +08:00
caiconghui	6751e5b23c	[fix](alter)(tablet-scheduler) fix unexpected exception with compaction_too_slow message when add rollup for olap table (#10827 )	2022-07-15 19:59:00 +08:00
Gabriel	dc6fbcce14	[feature-wip] (datev2) modify datev2 format in memory (#10873 ) * [feature-wip] (datev2) modify datev2 format in memory * update	2022-07-15 19:57:38 +08:00
deardeng	401203da6a	[feature](code-data) move cold data to object storage without losing any feature(FE) (#10693 ) Co-authored-by:platonekosama@gmail.com	2022-07-15 18:00:48 +08:00
Mingyu Chen	97861f517a	Revert "[chore][nereids] Bump the version of antlr4 to 4.10.1 (#10780 )" (#10876 ) This reverts commit b4927a8f151c60357387302723fa808e523d17e3.	2022-07-15 17:05:08 +08:00
Gabriel	ad4751972c	[feature-wip] Support in predicate for datev2 type (#10810 )	2022-07-15 14:32:40 +08:00
Gabriel	cc0f334cb7	[Bug] (UT) fix parsing microseconds ut (#10842 )	2022-07-15 14:15:08 +08:00
jakevin	33e9d5b2da	[enhance](test): remove some `System.out.println` in UT. (#10859 )	2022-07-15 11:16:24 +08:00
Adonis Ling	b4927a8f15	[chore][nereids] Bump the version of antlr4 to 4.10.1 (#10780 )	2022-07-15 10:43:05 +08:00
Gabriel	505758c76b	[BUG] (decimalv3) fix FE UTs (#10834 )	2022-07-14 19:24:50 +08:00
zhannngchen	13e9cb146f	[feature-wip](unique-key-merge-on-write) Add option to enable unique-key-merge-on-write, DSIP-018[5/1] (#10814 ) * Add option in FE * add opt in be * some fix * update * fix code style * fix typo * fix typo * update * code format	2022-07-14 12:10:58 +08:00
zhangstar333	e361eb385e	[vectorized][udf] improvement java-udaf with group by clause (#10296 ) save for file about udaf add bool _destory_deserialize update some code according reviewer change destroy all data at once	2022-07-14 11:23:42 +08:00
Gabriel	3b46242483	[feature-wip] Optimize Decimal type (#10794 ) * [feature-wip](decimalv3) support decimalv3 * [feature-wip] Optimize Decimal type Co-authored-by: liaoxin <liaoxinbit@126.com>	2022-07-14 10:50:50 +08:00
Ashin Gau	077ec4b114	[bug](multi-catalog) empty hadoop configuration when reading iceberg table (#10793 )	2022-07-14 10:18:59 +08:00
924060929	e78cca1009	(Refactor)[Nereids] Combine operator and plan (#10786 ) in #9755, we split plan into plan & operator, but in subsequent development, we found the rule became complex and counter intuition: 1. we must create an operator instance, then wrap a plan by the operator type. 2. relational algebra(operator) not contains children e.g. ```java logicalProject().then(project -> { List<NamedExpression> boundSlots = bind(project.operator.getProjects(), project.children(), project); LogicalProject op = new LogicalProject(flatBoundStar(boundSlots)); // wrap a plan return new LogicalUnaryPlan(op, project.child()); }) ``` after combine operator and plan, the code become to: ```java logicalProject().then(project -> { List<NamedExpression> boundSlots = bind(project.getProjects(), project.children(), project); return new LogicalProject(flatBoundStar(boundSlots), project.child()); }) ``` Originally, we thought it would be convenient for `Memo.copyIn()` after split plan & operator, because Memo don't known how to re-new the plan(assembling child plan in the children groups) by the plan type. So plan must provide the `withChildren()` abstract method to assembling children. The less plan type, the lower code cost we have(logical/physical with leaf/unary/binary plan, about 6 plans, no concrete plan e.g. LogicalAggregatePlan). But the convenient make negative effect that difficult to understand, and people must known the concept then can develop some new rules, and rule become ugly. So we combine the plan & operator, make the rule as simple as possible, the negative effect is we must overwrite some withXxx for all concrete plan, e.g. LogicalAggregate, PhysicalHashJoin.	2022-07-13 19:05:15 +08:00
xy720	bd982ac815	[Bug] Fix array functions arguments mismatch (#10549 ) Currently, we convert array<Int> to array<BigInt> For example, the input array_sum([1, 2, 3]) can match function array_sum(Array<Int>) as well as array_sum(Array<BigInt>). But when a function has more than one argument, the function may be match incorrectly. For example, the input array_contains([1, 2, 3], 2147483648) will match the function array_contains(Array<BigInt>, BigInt), but the correct match should be array_contains(Array<Int>, Int) The correct match should be: array_contains([1, 2, 3], 1) match array_contains(Array<Int>, Int) array_contains([1, 2, 3], 2147483648) match array_contains(Array<Int>, Int) array_contains([2147483648, 2147483649, 2147483650], 2147483648) match array_contains(Array<BigInt>, BigInt) now is: array_contains([1, 2, 3], 1) match array_contains(Array<Int>, Int) array_contains([1, 2, 3], 2147483648) match array_contains(Array<BigInt>, BigInt) array_contains([2147483648, 2147483649, 2147483650], 2147483648) match array_contains(Array<BigInt>, BigInt) And this will cause some trouble. Assume that there are two functions being defined: Int array_functions(Array<Int>, Int) BigInt array_functions(Array<BigInt>, BigInt) And array_functions([1,2,3], 2147483648) will match BigInt array_functions(Array<BigInt>, BigInt), but the result type should not be BigInt, but should be Int.	2022-07-13 14:54:49 +08:00
Stalary	f9f711cd16	FIX: fix datetimev2 decimal error. (#10736 )	2022-07-13 08:32:26 +08:00
Pxl	d6210edcda	[bugfix]set IsNullPredicate to ALWAYS_NOT_NULLABLE (#10785 )	2022-07-13 08:28:00 +08:00
caiconghui	d278f400d4	[enhancement](show data skew) Support show avg_row_count for data skew of one table (#10790 )	2022-07-13 08:27:20 +08:00
Lightman	486cf0ebd4	[Feature] Lightweight schema change of add/drop column (#10136 ) * [Schema Change] support fast add/drop column (#49) * [feature](schema-change) support fast schema change. coauthor: yixiutt * [schema change] Using columns desc from fe to read data. coauthor: Lchangliang * [feature](schema change) schema change optimize for add/drop columns. 1.add uniqueId field for class column. 2.schema change for add/drop columns directly update schema meta Co-authored-by: yixiutt <yixiu@selectdb.com> Co-authored-by: SWJTU-ZhangLei <1091517373@qq.com> [Feature](schema change) fix write and add regression test (#69) Co-authored-by: yixiutt <yixiu@selectdb.com> [schema change] be ssupport that delete use newest schema add delete regression test fix regression case (#107) tmp [feature](schema change) light schema change exclude rollup and agg/uniq/dup key type. [feature](schema change) fe olapTable maxUniqueId write in disk. [feature](schema change) add rpc iface for sc add column. [feature](schema change) add columnsDesc to TPushReq for ligtht sc. resolve the deadlock when schema change (#124) fix columns from fe don't has bitmap_index flag (#134) add update/delete case construct MATERIALIZED schema from origin schema when insert fix not vectorized compaction coredump use segment cache choose newest schema by schema version when compaction (#182) [bugfix](schema change) fix ligth schema change problem. [feature](schema change) light schema change add alter job. (#1) fix be ut [bug] (schema change) unique drop key column should not light schema change [feature](schema change) add schema change regression-test. fix regression test [bugfix](schema change) fix multi alter clauses for light schema change. (#2) [bugfix](schema change) fix multi clauses calculate column unique id (#3) modify PushTask process (#217) [Bugfix](schema change) fix jobId replay cause bdbje exception. [bug](schema change) fix max col unique id repeatitive. (#232) [optimize](schema change) modify pendingMaxColUniqueId generate rule. fix compaction error * fix be ut * fix snapshot load core fix unique_id error (#278) [refact](fe) remove redundant code for light schema change. (#4) [refact](fe) remove redundant code for light schema change. (#4) format fe core format be core fix be ut modify fe meta version fix rebase error flush schema into rowset_meta in old table [refactor](schema change) refact fe light schema change. (#5) delete the change of schemahash and support get max version schema * modify for review * fix be ut * fix schema change test	2022-07-12 19:41:06 +08:00
huangzhaowei	f5036fea63	[enhancement][multi-catalog]Add strong checker for hms table (#10724 )	2022-07-11 23:48:15 +08:00
924060929	5a54d518dc	[Refactor](Nereids) remove generic type from concrete expressions (#10761 ) in the past, we use generic type for plan and expression to support pattern match framework, it can support type inference without unsafely type cast. then, we observed that expression usually traverse or rewrite by visitor pattern, so generic type is useless for expression and introduces complexity. so we remove generic type from concrete expressions.	2022-07-11 22:30:42 +08:00
Gabriel	c51badb1ae	[feature-wip](datev2) add FE functions and fix some bugs (#10767 )	2022-07-11 19:25:31 +08:00
Adonis Ling	deae728fc6	[refactor](nereids) Refine some code snippets (#10672 ) Refine some code snippets: 1. Rename: ExpressionUtils::add -> ExpressionUtils::and 2. Reduce temporary objects when combing expressions.	2022-07-11 16:31:38 +08:00
jakevin	51855633e4	[feature](Nereids): cost and enforcer job in cascades. (#10657 ) Issue Number: close #9640 Add enforcer job for cascades. Inspired by to NoisePage enforcer job, and ORCA paper During this period, we will derive physical property for plan tree, and prune the plan according to the cos.	2022-07-11 15:01:59 +08:00
Mingyu Chen	639f1cd26c	[improvement](parquet-reader) Add some profile for parquet reader (#10740 )	2022-07-11 12:19:06 +08:00
caiconghui	81101fc1c5	[enhancement](alter) Make alter job more robust by ignoring some task failure (#10719 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-07-11 12:16:48 +08:00
Gabriel	8472ea8324	Revert "[Enhancement] Add column prune support for VOlapScanNode (#10615 )" (#10734 )	2022-07-11 12:16:08 +08:00
morrySnow	1dccfa3d84	[enhancement](nereids) make SSB works (#10659 ) enhancement - refactor compute output expression on root fragment in nereids planner - refactor aggregate plan translator - refactor aggregate disassemble rule - slightly refactor sort plan translator - add exchange node on the top of plan node tree if it is needed - slightly refactor PhysicalPlanTranslator#translatePlan fix - slotDescriptor should not reuse between TupleDescriptors - expression's nullable now works fine - remove quotes when parse string literal - set resolvedTupleExprs in SortNode to control output - remove the extra column in sortTupleSlotExprs in SortInfo known issues - aggregate function must be the top expression in output expression (need project in ExecNode in BE) - first phase aggregate could not convert to stream mode. - OlapScanNode do not set data partition - Sort could not process expression like 'order by a + 1' and SortInfo generated in a trick way and should be refactor when we want to support 'order by a + 1' - column prune do not work as expected	2022-07-11 11:33:17 +08:00
Stalary	46662bfee8	[Bug] CTAS varchar length lost (#10738 )	2022-07-10 23:51:36 +08:00
jakevin	a6e4c88663	[improve](planner): split output expr to multiple line. (#10710 ) * [improve](planner): split output expr to multiple line. +---------------------------------------------------+ \| Explain String \| +---------------------------------------------------+ \| PLAN FRAGMENT 0 \| \| OUTPUT EXPRS: \| \| <slot 9> `user_id` \| \| <slot 11> `default_cluster:test`.`tbl`.`date` \| \| <slot 10> `city` \| \| <slot 12> `default_cluster:test`.`tbl`.`age` \| +---------------------------------------------------+ * *: fix UT and regression-test.	2022-07-10 11:35:48 +08:00
HappenLee	1f08f2d144	[Bug][Vectorized] Support array function in where pre in volap_scan_node (#10467 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com> Support array function in where pre in volap_scan_node	2022-07-09 16:22:01 +08:00
huangzhaowei	24d824a783	[improvement](multi-catalog) Impl parallel for file scanner to improve the scanner performance (#10620 ) Add multi-thread support in FileScanNode on be and impl the file spilt logic in fe.	2022-07-09 15:52:53 +08:00
luozenglin	d5ea677282	[feature](tracing) Support query tracing to improve doris observability by introducing OpenTelemetry. (#10533 ) The collection of query traces is implemented in fe and be, and the spans are exported to zipkin. DSIP: https://cwiki.apache.org/confluence/display/DORIS/DSIP-012%3A+Introduce+opentelemetry	2022-07-09 15:50:40 +08:00
jiafeng.zhang	3229730933	[refactor]broker rpc timeout configuration parameterization (#10692 )	2022-07-09 06:27:02 +08:00
Pxl	f58a071605	[Bug][Function] pass intermediate argument list to be (#10650 )	2022-07-08 20:50:05 +08:00
zhengshiJ	e6da00bb26	[feature](nereides) support sort translator (#10678 ) Physical sort: * 1. Build sortInfo * There are two types of slotRef: * one is generated by the previous node, collectively called old. * the other is newly generated by the sort node, collectively called new. * Filling of sortInfo related data structures, * a. ordering use newSlotRef. * b. sortTupleSlotExprs use oldSlotRef. * 2. Create sortNode * 3. Create mergeFragment TODO: 1.Currently, columns that do not exist in select but exist in order by cannot be parsed. eg: select key from table order by value; 2.For the combination of Literal and slotRefrance in select, there is a problem with parsing, eg: select key ,(10-value) from table;	2022-07-08 19:22:48 +08:00
yinzhijian	eeee036cba	[fix](optimizer) join reorder may cause column non-existence problem (#10670 ) for example: select * from t1 inner join t2 on t1.a = t2.b inner join t3 on t3.c = t2.b; If t3 is a large table, it will be placed first after the reorderTable, and the problem that t2.b does not exist will occur in reanalyzing.	2022-07-08 17:28:32 +08:00

... 67 68 69 70 71 ...

5755 Commits