doris

Author	SHA1	Message	Date
Yongqiang YANG	08ebef2992	[Enhancement] check vm.max_map_count before starting (#11052 ) When vectorized engine is enabled, doris uses much more vmas than before, and it leads to core dump due to memory allocation failure.	2022-07-21 21:16:48 +08:00
huangzhaowei	7147a7c290	[feature-wip](multi-catalog) Support s3 storage for file scan node (#10977 ) This is an example of s3 hms_catalog: ```sql CREATE CATALOG hms_catalog properties( "type" = "hms", "hive.metastore.uris"="thrift://localhost:9083", "AWS_ACCESS_KEY" = "your access key", "AWS_SECRET_KEY"="your secret key", "AWS_ENDPOINT"="s3 endpoint", "AWS_REGION"="s3-region", "fs.s3a.paging.maximum"="1000"); ``` All these params are necessary;	2022-07-21 17:38:53 +08:00
Xinyi Zou	4960043f5e	[enhancement] Refactor to improve the usability of MemTracker (step2) (#10823 )	2022-07-21 17:11:28 +08:00
carlvinhust2012	5f6f35e886	Add the supported sub-type for array (#10824 ) 1.This pr is used for adding the supported sub-type for array which has been modified in #9916 2.add regression test for the supported sub-type Co-authored-by: hucheng01 <hucheng01@baidu.com>	2022-07-21 16:29:17 +08:00
Dongyang Li	ae53a8a7e9	[regression] sf1DataPath can be url or local path (#11065 )	2022-07-21 14:35:24 +08:00
924060929	03783ce551	[fix](Nereids) fix merge conflict caused compile error (#11064 ) fix merge conflict by #10882 and #10667 remove duplicate function hashCode	2022-07-21 14:14:26 +08:00
Compilation Success	a1758bd139	[feature-wip](unique-key-merge-on-write) Add agg cache for delete bitmap DSIP-018 (#10921 ) Use global LRU for delete bitmap cache	2022-07-21 12:48:44 +08:00
shee	f8ad2613cf	[Enhancement](Nereids) add some expr rewrite rule and plan rewrite rule of rewrite its expression (#10667 ) # first: Add two expr rewrite rule: 1. remove duplicate expr a = 1 and a = 1 -> a = 1 2. extract common expr (a or b) and (a or c) -> a or (b and c) # second: Add some plan rewrite rule of rewriting expr of operator 1. NormalizeExpressionOfPlan contains normalize expr rewrite rule. Using these normalizerule rewrite LogicalFilter、LogicalAggravate，LogicalProject，LogicalJoin exprs 2. OptimizeExpressionOfPlan contains optimize expr rewrite rule. Using these optimize rule rewrite LogicalFilter、LogicalAggravate，LogicalProject，LogicalJoin exprs	2022-07-21 12:35:28 +08:00
morrySnow	072479fa21	[enhancement](Nereids)expression equals and hashCode function (#10882 ) review and add all missing equals and hashCode function to Expression and its sub class. Alias Arithmetic BoundFunction CompoundPredicate Not UnboundFunction UnboundSlot UnboundStar	2022-07-21 12:20:53 +08:00
yinzhijian	329f70dc02	[enhancement](Nereids) support case when for TPC-H (#10947 ) support case when for TPC-H for example: CASE [expression] WHEN [value] THEN [expression] ... ELSE [expression] END or CASE WHEN [predicate] THEN [expression] ... ELSE [expression] END	2022-07-21 12:02:37 +08:00
Mingyu Chen	d36b927fdb	[improvement](fe-ut) use local journal to make FE ut run fast (#11038 ) * [improvement](fe-ut) use local journal to make FE ut run fast	2022-07-21 09:12:21 +08:00
carlvinhust2012	b59ce73e1d	fix the case fail when enable Hdfs (#11051 )	2022-07-21 07:09:09 +08:00
Dongyang Li	b35d5bc15c	[regressiontest] add tpcds_sf1 test (#10852 ) (#11042 ) * [regressiontest] add tpcds_sf1 test (#10852) Co-authored-by: smallhibiscus <844981280> Co-authored-by: stephen <hello-stephen@qq.com> * ignore q30 temporarily since it encounter latin character Ô Co-authored-by: stephen <hello-stephen@qq.com>	2022-07-21 07:07:05 +08:00
Gabriel	b115b362fb	[Bug] fix bug for function `unix_timestamp` (#11041 ) * [Bug] fix bug for function `unix_timestamp`	2022-07-20 20:17:41 +08:00
jiafeng.zhang	b95dedd07b	[doc]Gis function style (#11015 )	2022-07-20 19:18:35 +08:00
HappenLee	d9b6e07e9d	[Vectorized] Support ODBC sink for vec exec engine (#11045 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-07-20 19:09:41 +08:00
Xin Liao	c037066163	[fix](cache) fix that ShardedLRUCache may coredump when destructor was called (#10995 )	2022-07-20 19:07:04 +08:00
plat1ko	2df1822269	[bugfix]fix DCHECK failure in remove_all_remote_rowsets (#10994 )	2022-07-20 19:06:21 +08:00
Jibing-Li	6aadee9a2e	[data lake]Support hdfs ha for Iceberg table. (#11002 ) * Support Iceberg on HDFS with HA mode enabled.	2022-07-20 19:03:58 +08:00
AlexYue	a607c30ad4	[docs] Fe build idea doc (#10996 ) * [doc](fe): enhance the fe-idea-dev * [doc](fe)add solution for m1 mac compile error Co-authored-by: jackwener <jakevingoo@gmail.com>	2022-07-20 19:03:29 +08:00
Hong Liu	b62e3e7aa0	[regression test]Add ssb sf1 test under unique table with zstd (#11004 ) Co-authored-by: smallhibiscus <844981280>	2022-07-20 18:59:46 +08:00
camby	0a8ae6aeec	Refractor COLLECT_LIST and COLLECT_SET register logic (#10956 ) Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-07-20 18:02:39 +08:00
Dongyang Li	1ca00e0107	[tools] add clickbench tools (#11009 ) * [tools] add clickbench tools Co-authored-by: stephen <hello-stephen@qq.com>	2022-07-20 17:59:04 +08:00
Adonis Ling	e5663f9872	[Bug](array-type) Fix the core dump caused by unaligned __int128 (#11020 ) Fix the core dump caused by unaligned __int128 and change DEFAULT_ALIGNMENT	2022-07-20 16:37:27 +08:00
Lightman	a71822a74d	[refactor]remove col_unique_id (#11025 )	2022-07-20 16:35:14 +08:00
Mingyu Chen	7bdce8f572	[refactor](policy) refactor some policy create and check logic (#11007 ) * [refactor](policy) refactor some policy create and check logic	2022-07-20 16:20:59 +08:00
morrySnow	658a9f7531	[fix](planner)unnecessary cast will be added on children in InPredicate (#11033 )	2022-07-20 16:00:26 +08:00
minghong	6233b5200e	[refactor] (Nereids) rename GroupExpression.getParent() to getOwnerGroup() (#11027 ) GroupExpression.getParent() returns the group which contains this expr. This name is missleading especially in tree structures. So we change the name to getOwnerGroup.	2022-07-20 15:57:59 +08:00
zhannngchen	a1c1cfce47	Add some comments for the feature mow (#11028 )	2022-07-20 15:35:41 +08:00
zhannngchen	ec5471f048	[feature-wip](unique-key-merge-on-write) Implement tablet lookup interface, using rowset-tree, DSIP-018[3/5] (#10938 )	2022-07-20 14:52:14 +08:00
Shuo Wang	9b91f86c38	[Feature](Nereids) Reorder join to eliminate cross join. (#10890 ) Try to eliminate cross join via finding join conditions in filters and changing the join orders. For example: -- input: SELECT * FROM t1, t2, t3 WHERE t1.id=t3.id AND t2.id=t3.id -- output: SELECT * FROM t1 JOIN t3 ON t1.id=t3.id JOIN t2 ON t2.id=t3.id This feature is controlled by session variable enable_nereids_reorder_to_eliminate_cross_join with true by default. Simplify usage of Memo and rewrite rule application. Before this PR, if we want to apply a rewrite rule to a plan, the code is like the below: Memo memo = new Memo(); memo.initialize(root); PlannerContext plannerContext = new PlannerContext(memo, new ConnectContext()); JobContext jobContext = new JobContext(plannerContext, new PhysicalProperties(), 0); RewriteTopDownJob rewriteTopDownJob = new RewriteTopDownJob(memo.getRoot(), ImmutableList.of(new AggregateDisassemble().build()), jobContext); plannerContext.pushJob(rewriteTopDownJob); plannerContext.getJobScheduler().executeJobPool(plannerContext); Plan after = memo.copyOut(); After this PR, we could use chain style calling: new Memo(plan) .newPlannerContext(connectContext) .setDefaultJobContext() .topDownRewrite(new AggregateDisassemble()) .getMemo() .copyOut(); Rename the session variable enable_nereids to enable_nereids_planner to make it more meaningful.	2022-07-20 13:53:54 +08:00
Mingyu Chen	56e036e68b	[feature-wip](multi-catalog) Support runtime filter for file scan node (#11000 ) * [feature-wip](multi-catalog) Support runtime filter for file scan node Co-authored-by: morningman <morningman@apache.org>	2022-07-20 12:36:57 +08:00
Kikyou1997	a5a50726bf	[Ehancement](planner) Rewrite implicit cast to the predicates (#10920 ) During the analysis of BinaryPredicate, it will generate a CastExpr if the slot implicitly in the below case: SELECT * FROM t1 WHERE t1.col1 = '1'; col1 is integer column. This will prevent the binary predicate from pushing down to OlapScan which would impact the performance.	2022-07-20 12:28:29 +08:00
yixiutt	dc2b709f6f	[Bug](compaction) fix uniq key compaction bug that does not count merged rows right(#10971 ) When a rowset includes multiple segments, segments rows will be merged in generic_iterator but merged_rows is not maintained. Compaction will failed in check_correctness. Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-07-20 12:07:45 +08:00
plat1ko	989e6d1cf9	[chore]fix clang compile error (#11021 )	2022-07-20 08:28:47 +08:00
Mingyu Chen	ba9c7e50aa	[doc] missing sidebar for cloudcanal (#10998 )	2022-07-19 23:51:12 +08:00
Jerry Hu	fd2c374426	[fix]Empty string key in aggregation was output as NULL (#11011 )	2022-07-19 23:25:28 +08:00
Pxl	95366de7f6	cast array element to same type (#10980 ) Fix problem when there are element of different types in an array.	2022-07-19 21:47:10 +08:00
Xin Liao	371c7be235	[feature-wip](unique-key-merge-on-write) add segment lookup interface implementation, DSIP-018 (#10922 )	2022-07-19 21:14:32 +08:00
Dongyang Li	d7770db5e2	Revert "[regressiontest] add tpcds_sf1 test (#10852 )" (#11008 ) This reverts commit d2bee602514e8238dd8ef3d3b9b34fb6171bd26f.	2022-07-19 18:41:53 +08:00
ElvinWei	2d90f4b87c	[feature-wip](statistics) step4: collect statistics by implementing statistics tasks (#8861 ) This pull request includes some implementations of the statistics(https://github.com/apache/incubator-doris/issues/6370), it will not affect any existing code and users will not be able to create statistics job. Now only MetaStatisticsTask that directly collects statistics by reading FE meta is implemented. SQLStatisticsTask is still being implemented, it needs to query BE through FE. The following is the function implemented by this pr: 1. Support statistics collection for partitioned and non-partitioned tables. For partitioned tables, the collection of statistics for the specified partition is implemented. 2. When the task is divided, it is divided according to the partition table and the non-partition table. The most fine-grained is to the tablet level. A matetask collects as many statistics as possible. 3. Add partition statistics (Table -> Partition -> Column). For example, the size of the table, the number of rows, the size of the partition, the number of rows, the maximum and minimum values of the columns, etc. 4. Display and modify partition-level statistics. …	2022-07-19 16:22:25 +08:00
Hong Liu	ac4ce4d874	Revert "[regression] Add ssb sf1 test under unique table with zstd (#10957 )" (#10992 ) This reverts commit 216a55c12c0be5c4090523195b2aff9d96c64f65.	2022-07-19 15:44:32 +08:00
Xinyi Zou	d5fa66d9a3	[Enhancement] [Memory] Limit memory usage use process actual physical memory (#10924 )	2022-07-19 11:08:39 +08:00
yuanyuan8983	b70274e2af	[docs] Changing the symbol of dataX doriswriter table creation statement (#10632 ) * Update datax.md	2022-07-19 10:15:27 +08:00
Jet He	f6cb7a838b	[Optimize] Improve performance like/not like filter through pushdown function to storage engine (#10355 ) * support like/not like conjuncts push down to storage engine * vectorized engine support like/not like conjuncts push down to storage engine * support both evaluate and evaluate_vec method in like predicate * reuse remove_pushed_conjuncts and prevent logic error during move function conjuncts * change #ifndef to pragma once as per comments * change enable_function_pushdown default to false Co-authored-by: heguangnan <heguangnan@bytedance.com>	2022-07-19 08:33:04 +08:00
Dongyang Li	d2bee60251	[regressiontest] add tpcds_sf1 test (#10852 ) Co-authored-by: smallhibiscus <844981280> Co-authored-by: stephen <hello-stephen@qq.com>	2022-07-19 08:30:53 +08:00
Yongqiang YANG	2acd5efcd8	[improvement-log]print a log when got a lower image version (#10910 )	2022-07-19 08:29:58 +08:00
Gabriel	842ff2b1e2	[refactor] Refactor time LUT (#10982 )	2022-07-19 08:23:29 +08:00
Stalary	68b9a2936a	[improvement](doe) Step1: Fe generates the DSL and is used to explain (#9895 ) For the first step, I will only change FE and then change BE once I make sure the DSL is ok.	2022-07-18 23:20:58 +08:00
Gabriel	e769597fd2	[Improvement] (datetime) support microsecond for date literal (#10917 ) * [Improvement] (datetime) support microsecond for date literal * remove joda dependency	2022-07-18 21:39:39 +08:00

1 2 3 4 5 ...

5395 Commits