doris

Author	SHA1	Message	Date
zhou zhuohan	1c91257c01	📝 fix create table doc typo (#12269 ) fix create table doc typo	2022-09-02 09:20:46 +08:00
Xujian Duan	061b49b7bf	[doc](website) update SHOW-PROC doc (#12229 )	2022-09-01 19:50:25 +08:00
zy-kkk	58c1d6ce9d	[typo](docs)Modify the maximum handle parameter reference #12244	2022-09-01 19:50:07 +08:00
morrySnow	87086ffe31	[enhancment](Nereids)enable normalize aggregate rule (#12194 ) enable normalize aggregate rule introduced by #12013	2022-09-01 19:20:37 +08:00
Mingyu Chen	3ce305134a	[fix](scan) fix potential wrong cancel when sql has limit (#12224 )	2022-09-01 19:11:40 +08:00
starocean999	f8eb480bec	[fix](emptynode)fix empty node bug in vec engine (#12258 ) * [fix](emptynode)fix empty node bug in vec engine * update fe ut	2022-09-01 18:52:10 +08:00
Henry2SS	ad8e2f4749	[fix](rpc) fix that coordinator rpc timeout too large may make show load blocked for long time (#12152 ) Co-authored-by: wuhangze <wuhangze@jd.com>	2022-09-01 18:05:37 +08:00
morrySnow	068e60145e	[enhancement](Nereids)ban groupPlan() pattern to avoid misuse (#12250 ) `groupPlan()` pattern means to find a `GroupPlan` in memo. Since we have no `GroupPlan` in memo, it is always return nothing. When we want write a pattern to match any GROUP, we should use `group()`. But pattern `groupPlan` is very confusing, and easy misuse. So, this PR ban `groupPlan()` pattern ti avoid misuse.	2022-09-01 14:37:48 +08:00
Gabriel	3bcab8bbef	[feature](function) support now/current_timestamp functions with precision (#12219 ) * [feature](function) support now/current_timestamp functions with precision	2022-09-01 14:35:12 +08:00
pengxiangyu	c5481dfdf7	[fix](remote)Fix bug for Segment::open() in case: config::file_cache_type (#12249 ) * fix bug for Segment::open() in case: config::file_cache_type * fix bug for Segment::open() in case: config::file_cache_type	2022-09-01 14:16:41 +08:00
catpineapple	df51c78593	[fix](dbt)fix dbt run abnormal #12242	2022-09-01 12:10:48 +08:00
TengJianPing	f294d33332	[bugfix](index) index page should not be bitshuffle decoded (#12231 ) * [bugfix](index) index page should not be bitshuffle decoded * minor change	2022-09-01 11:56:44 +08:00
camby	fc05d54f0d	[fix](array-type) array_sort function with empty input #12175 Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-09-01 10:54:09 +08:00
HappenLee	8c8078ad28	[fix](projections) get error row_descriptor when have projections on ExecNode (#12232 ) When ExecNode's projections is not empty, it use output row descriptor to initialize the block before doing projection. But we should use original row descriptor. This PR fix it.	2022-09-01 10:48:10 +08:00
yixiutt	60a2fa7dea	[Improvement](compaction) copy row in batch in VCollectIterator&VGenericIterator (#12214 ) In VCollectIterator&VGenericIterator, use insert_range_from to copy rows in a block which is continuous to save cpu cost. If rows in rowset and segment are non overlapping, this whill improve 30% throughput of compaction.If rows are completely overlapping such as load two same files, the throughput goes nearly same as before. Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-09-01 10:20:17 +08:00
Gabriel	90fb3b7783	[Improvement](load) accelerate tablet sink (#12174 )	2022-09-01 10:08:09 +08:00
Jibing-Li	ec4863b63a	[feature-wip](new-scan)Add new file scan node (#12048 ) Related pr: #11582 This is the new file scan node and scanner for external hms catalog.	2022-09-01 10:01:20 +08:00
luozenglin	65051d67cf	[fix](yearweek) fixed the yearweek result error when mode is set to 1 (#12234 )	2022-09-01 09:46:38 +08:00
starocean999	d7e02a9514	[fix](join)join reorder by mistake (#12113 )	2022-09-01 09:46:01 +08:00
Yongqiang YANG	f3cb0c24ee	[enhancement](test) add restore action and s3 helper methond (#12084 ) Co-authored-by: morrySnow <morrysnow@126.com> Co-authored-by: SWJTU-ZhangLei <1091517373@qq.com>	2022-08-31 23:08:23 +08:00
morrySnow	a49bde8a71	[fix](Nereids)statistics calculator for Project and Aggregate lost some columns (#12196 ) There are some bugs in Nereids' StatsCalculator. 1. Project: return child column stats directly, so its parents cannot find column stats from project's slot. 2. Aggregate: do not return column that is Alias, its parents cannot find some column stats from Aggregate's slot. 3. All: use SlotReference as key of column to stats map. So we need change SlotReference's equals and hashCode method to just using ExprId as we discussed.	2022-08-31 20:47:22 +08:00
morrySnow	57051d3591	[fix](Nereids)cast StringType to DateType failed when bind TimestampArithmetic function (#12198 ) When bind TimestampArithmetic, we always want to cast left child to DateTimeType. But sometimes, we need to cast it to DateType, this PR fix this problem.	2022-08-31 19:52:03 +08:00
siriume	1a198b3777	[typo](docs) Fix Bitmap index and Type QUANTILE_STATE: fix description (#12217 ) Fix Bitmap index and Type QUANTILE_STATE: fix description	2022-08-31 18:10:17 +08:00
Ashin Gau	1cc9eeeb1a	[feature-wip](parquet-reader) read and generate array column (#12166 ) Read and generate parquet array column. When D=1, R=0, representing an empty array. Empty array is not a null value, so the NullMap for this row is false, the offset for this row is [offset_start, offset_end) whose `offset_start == offset_end`, and offset_end is the start offset of the next row, so there is no value in the nested primitive column. When D=0, R=0, representing a null array, and the NullMap for this row is true.	2022-08-31 17:08:12 +08:00
HappenLee	573e5476dd	[Opt](load) Speed up the vectorized load (#12146 ) * [Opt](load) Speed up the vectorized load	2022-08-31 16:23:36 +08:00
xy720	90c5180370	[Bug](array-type) Fix bug in creating view from table with array types (#12200 )	2022-08-31 14:36:31 +08:00
Yankee24	d7e032bc38	Modify the startup script and print the log without using the --daemon parameter. (#12218 )	2022-08-31 14:36:14 +08:00
camby	da4ffd3c56	[Enhancement](metric-type) more readable error message for only metric type #12162 Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-08-31 14:35:48 +08:00
zy-kkk	c2f7e95c7f	[typo](docs)Add somm json format load doc (#12213 ) * Specify JSON root document when adding JSON import, and add JSON file size limit parameter prompt	2022-08-31 14:34:55 +08:00
starocean999	3cdd19821d	[fix](sort)the slot in sort node should be nullable if it's outer joined (#12193 ) The sort node's output expr should be nullable if it is outer joined.	2022-08-31 14:34:14 +08:00
jakevin	8999ba34ae	[improve](Nereids)unify all plan toString() function (#12132 ) Add a Util function to generate uniform format plan toString for easy reading and debugging	2022-08-31 14:28:44 +08:00
camby	8b98e2021e	[enhancement](array-type) Array type do not support compare with '=','>', '<', make the error message more readable (#12181 ) Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-08-31 12:49:10 +08:00
zxealous	254cb321b9	[optimize](remote) Optimize cache reader use a pre-created buffer when downloading the cache (#12165 ) * optimize cache reader * add description for config * optimize cache reader * optimize cache reader	2022-08-31 10:15:40 +08:00
HouRong	f0cde35ea6	[performance improvement] Spark Load, SparkDpp processRDDAggregate performance improvement (#12186 ) Co-authored-by: hourong <hourong@zhihu.com>	2022-08-31 09:14:13 +08:00
Mingyu Chen	8251e7cbfc	[refactor](column) remove confused field (#12187 )	2022-08-31 09:13:31 +08:00
Xinyi Zou	f72d2559cf	[fix](compile) Fix compile error '<unknown>' may be used uninitialized in PODArray::insert_prepare #12202	2022-08-31 09:12:28 +08:00
minghong	f949262ddf	[fix](planner) a slot id is bounded on a wrong tuple id, if cross join has a hash join as child (#12156 )	2022-08-31 09:07:55 +08:00
Mingyu Chen	22430cd7bb	[feature](stmt) add ADMIN COPY TABLET stmt for local debug (#12176 ) Add a new stmt ADMIN COPY TABLET for easy copy a tablet to local env to reproduce problem. See document for more details.	2022-08-31 09:06:49 +08:00
morrySnow	172c213fbc	[fix](Nereids)get NPE from finalize TimestampArithmeticExpr that generated by ExpressionTranslator (#12163 ) This PR: 1. refactor getDataType in TimestampArithmetic 2. set TimeUnit correctly when translate TimestampArithmetic to TimestampArithmeticExpr	2022-08-30 21:59:28 +08:00
jakevin	59e5527eb0	[feature](Nereids)enable CBO optimize stage in Nereids (#12008 ) - enable CBO stage in Nereids - use the `chooseBestPlan()` to get the best plan - add a new rule JoinCommuteProject - test the stage by JoinCommute rule	2022-08-30 21:28:17 +08:00
Shuo Zhang	dda446f490	fix typos in get-starting.md (both en and zh-CN) (#12189 )	2022-08-30 20:52:24 +08:00
Zhengguo Yang	fdd236cc9b	[fix](simdjson) disable simdjson by default, wait for fix the bug (#12179 )	2022-08-30 20:24:46 +08:00
morrySnow	de4bdc7f6f	[fix](Nereids)Sum return DoubleType when child is DecimalType by mistake (#12169 ) When Sum's child is Decimal, Return Double Type by mistake lead to result error, so we should keep the return type to decimal when the child expression's type is decimal.	2022-08-30 19:53:25 +08:00
924060929	f6a10e9ea3	[refactor](Nereids)refactor memo.copyIn (#12147 ) this pr do 2 refactor 1. remove useless parameter from `Plan#computeOutput` 2. refactor memo.copyIn It the past, `memo.copyIn` has complex logic to process init, rewrite and copyIn, It's difficult to understand and easy to meet bug and leak memory for some unreachable group/groupExpression. So I separate it into three methods: 1. `Memo.init` for init Memo by LogicalPlan 2. `Memo.doRewrite` for rewrite 3. `Memo.doCopyIn` for exploration and implementation And separate the UT into 3 files 1. `MemoInitTest` 2. `MemoRewriteTest` 3. `MemoCopyInTest` I have added a lots of UT for `Memo.rewrite`, and add some unreachable DAG check in the PlanChecker, when the plan is changed.	2022-08-30 19:36:04 +08:00
catpineapple	fc400a15db	[fix] (docs) Fix Data type: string length and parameter description (#11565 )	2022-08-30 19:16:00 +08:00
iceqing	9c7016f6e7	[docs](query plan) Add a description `explain graph` and `desc graph` and `explain verbose` (#11697 ) * docs: 增加explain graph与desc graph描述，避免文档前后描述不一致 * docs: add the usage docs of `explain verbose`	2022-08-30 19:12:23 +08:00
xy720	370b92a2ea	[Bug](compile) fix clang build for be (#12183 )	2022-08-30 17:58:54 +08:00
Kikyou1997	9a74ad1702	[feature](Nereids)add the ability of projection on each ExecNode and add column prune on OlapScan (#11842 ) We have added logical project before, but to actually finish the prune to reduce the data IO, we need to add related supports in translator and BE. This PR: - add projections on each ExecNode in BE - translate PhysicalProject into projections on PlanNode in FE - do column prune on ScanNode in FE Co-authored-by: HappenLee <happenlee@hotmail.com>	2022-08-30 16:17:10 +08:00
Yongqiang YANG	fb27e3ef31	[fix](planner) let OlapScanNode turn off preaggragation when there is a filter on DELETE_SIGN (#12118 ) We can skip aggregate on replace column, otherwise it would generate wrong result. e.g. a row in UNIQUE is deleted by delte_sign_column, then it would be returned.	2022-08-30 15:54:37 +08:00
chenlinzhong	eea756fd5b	[docs](hive)add hadoop.username config item (#12109 ) add hadoop.username config item	2022-08-30 12:34:27 +08:00

... 9 10 11 12 13 ...

6608 Commits