doris

Author	SHA1	Message	Date
yiguolei	97996c9275	[fix](Insert) fix 5 concurrent "insert...select..." OOM (#10501 ) * [hotfix](dev-1.0.1) 5 concurrent insert...select... OOM Co-authored-by: minghong <minghong.zhou@163.com> Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-07-01 15:29:26 +08:00
TengJianPing	d9f2da8cf0	[bugfix] temporarily disable RF code to avoid core dump caused by vexpr destruction (#10504 ) Runtime filter handling in volap_scan_node may cause a double free in VExprContext; temporarily disable it to avoid the crash.	2022-06-30 14:54:44 +08:00
Adonis Ling	e42adbb959	Fix compilation error reported by clang (#10494 )	2022-06-29 20:38:06 +08:00
yiguolei	4ec6e3ee81	[refactor] Remove debug action since it is never used. (#10484 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-06-29 20:37:51 +08:00
Pxl	6a566ccb74	[Enhancement][Vectorized] add constexpr_loop_match (#10283 )	2022-06-29 14:58:50 +08:00
huangzhaowei	abd10f0f3e	[feature-wip](multi-catalog) Impl FileScanNode in be (#10402 ) Define a new file scanner node for hms tables in be. This file scanner node differs from the broker scan node as follows: 1. The broker scan node defines src slots and dest slots, so there are two memory copies: the first from file to src slot and the second from src slot to dest slot. FileScanNode has only one memory copy step, directly from file to dest slot. 2. The broker scan node reads all the fields in the file into src slots, while FileScanNode reads only the needed fields. 3. The broker scan node converts every type to string for the src slot and then uses cast to convert to the dest slot type, but FileScanNode produces the final type directly. For now FileScanNode is standalone code, but we will unify the file scan and broker scan in the future.	2022-06-29 11:04:01 +08:00
Tiewei Fang	17eb8c00d3	[feature] add table valued function framework and numbers table valued function (#10214 )	2022-06-28 14:01:57 +08:00
Gabriel	ca94867b4e	[Feature-wip] add date v2 type (#9916 )	2022-06-26 16:07:56 +08:00
Gabriel	eebfbd0c91	Revert "[fix](vectorized) Support outer join for vectorized exec engine (#10323 )" (#10424 ) This reverts commit 2cc670dba697a330358ae7d485d856e4b457c679.	2022-06-25 22:18:08 +08:00
Gabriel	14a9a676e7	[BUG] fix DCHECK failed (#10396 )	2022-06-25 17:08:40 +08:00
Gabriel	476be35961	[TYPO] fix typo 'destory' -> 'destroy' (#10373 )	2022-06-24 19:11:28 +08:00
HappenLee	2cc670dba6	[fix](vectorized) Support outer join for vectorized exec engine (#10323 ) In a vectorized scenario, the query plan will generate a new tuple for the join node. This tuple mainly describes the output schema of the join node. Adding this tuple mainly solves the problem that the input schema of the join node is different from the output schema. For example: 1. The case where the null side column caused by outer join is converted to nullable. 2. The projection of the outer tuple.	2022-06-24 08:59:30 +08:00
yiguolei	3370c10528	[profile] add more detail profile in segment iterator (#10352 )	2022-06-23 15:32:43 +08:00
HappenLee	fa13bef3da	[Bug][Vectorized] Fix coredump when other join conjunct is a const expr (#10223 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-06-23 13:27:32 +08:00
Gabriel	139cd3d11a	[Improvement] remove olap filters when use in key ranges (#10278 )	2022-06-23 09:12:29 +08:00
Yongqiang YANG	274a0f2603	[fix] do not read seq column when reading a compacted rowset (#10344 ) SEQ_COL is used on tables with a unique key to order data within one transaction (rowset). When there is only one rowset and the rowset is compacted, rows in the rowset are sorted and rows with the same keys have been resolved by compaction, so a scanner sets direct_mode to optimize the read iterator and avoid sorting and aggregating, and the iterators do not need SEQ_COL. However, init_return_columns adds SEQ_COL to return_columns, which is passed to SegmentIterator. The segment iterator is then called via get_next with a block without SEQ_COL, and it creates the columns that are included in return_columns but not in the block. SEQ_COL is nullable and the segment iterator does not handle that, so a core dump happens. In the above case the segment iterator does not actually need to read SEQ_COL. When SEQ_COL is really needed, the iterators create the SEQ_COL column in the block, so the segment iterator does not need to create SEQ_COL at all.	2022-06-23 08:44:43 +08:00
Gabriel	200557052a	[BUGFIX] wrong answer with `with as` + two phase agg (#10303 )	2022-06-22 14:39:39 +08:00
Gabriel	588634ddf6	[feature] support runtime filter on vectorized engine (#10103 )	2022-06-20 09:46:38 +08:00
Gabriel	60147ad7a5	[Improvement] build runtime filters asynchronously (#10186 )	2022-06-17 11:09:13 +08:00
Pxl	fd0bd395ac	[Enhancement] Remove some unused include (#10035 )	2022-06-17 10:47:25 +08:00
starocean999	1cca319d18	[fix](vectorized) intersect operator takes too long to execute (#10183 ) * fix intersect operator takes too long to execute * modify code based on review comments	2022-06-17 08:43:53 +08:00
Gabriel	6f5f447aa3	[FOLLOWUP] cherrypick after refactoring scan nodes (#10177 )	2022-06-17 08:41:47 +08:00
camby	96de99525e	[compile&build]clang compile errors fix (#10201 ) Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-06-17 08:41:25 +08:00
HappenLee	8d98c17c4e	[Bug][Vectorized] Fix DCHECK failed in VExchangeNode close twice (#10184 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-06-16 23:56:49 +08:00
lihangyu	f49a4535c4	[Fix] fix vjson_scanner heap use after free when meeting object or array type (#10179 ) Quick merge. It is a serious bug in 1.1.	2022-06-16 16:01:18 +08:00
Gabriel	28e8effc52	[Refactor] Refactor vectorized scan node (#9968 )	2022-06-16 11:10:56 +08:00
zhangstar333	4c24586865	[Vectorized][UDF] support java-udaf (#9930 )	2022-06-15 10:53:44 +08:00
Jing Shen	4a474420c8	[feature](function) Add ntile function (#9867 ) Add ntile function. For the non-vectorized engine, I implemented it like Impala, rewriting ntile to row_number and count. For the vectorized engine, I implemented WindowFunctionNTile.	2022-06-10 10:32:40 +08:00
yinzhijian	19bc14cf8d	[feature-wip](array-type) Add array type support for vectorized parquet-orc scanner (#9856 ) Only one level of array is supported now. For example: - nullable(array(nullable(tinyint))) is supported. - nullable(array(nullable(array(xx)))) is not supported.	2022-06-09 12:11:47 +08:00
HappenLee	94089b9192	[Refactor] Use file factory to replace create file reader/writer (#9505 ) 1. Simplify code logic and improve abstraction 2. Fix the mem leak of raw pointer Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-06-08 15:07:39 +08:00
Gabriel	35c3e4e33c	[Bug] runtime filter is not used as expected (#10001 ) * [Bug] runtime filter is not used as expected * update	2022-06-08 11:10:39 +08:00
HappenLee	35f99faa0a	[Bug][Vectorized] fix core dump on vcase_expr::close (#9893 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-06-01 08:05:09 +08:00
Pxl	d34d631519	[bugfix]fix TableFunctionNode memory leak (#9853 )	2022-05-31 19:20:22 +08:00
Xinyi Zou	c8d303a82c	[bugfix] Fix BE core about vectorized join build thread memtracker switch, and FileStat duplicate	2022-05-31 19:12:42 +08:00
HappenLee	7199102d7c	[Opt][VecLoad] Opt the vec stream load performance (#9772 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-05-31 11:53:32 +08:00
Adonis Ling	f377c26bf7	[refactor][be] Optimize headers (#9708 )	2022-05-30 16:12:10 +08:00
yiguolei	4af2493c42	[Improvement] optimize scannode concurrency query performance in vectorized engine. (#9792 )	2022-05-30 16:04:40 +08:00
EmmyMiao87	0683181fef	[API changed](parser) Remove merge join syntax (#9795 ) Remove merge join sql and merge join node	2022-05-30 09:04:21 +08:00
Amos Bird	63aab5ee5d	[Bugfix(Vec)] Fix some memory leak issues (#9824 )	2022-05-29 23:04:11 +08:00
yinzhijian	cbbda7857b	[feature-wip](parquet-orc) Support orc scanner in vectorized engine (#9541 )	2022-05-26 21:39:12 +08:00
jacktengg	f4dd3bf013	[bugfix] fix memleak in olapscannode(#9736 )	2022-05-26 15:06:54 +08:00
Dongyang Li	90e8cda5f2	[Enhancement](Vectorized)build hash table with new thread, as non-vec… (#9290 ) * [Enhancement][Vectorized]build hash table with new thread, as the non-vectorized engine did; edit after comments * format code with clang format Co-authored-by: lidongyang <dongyang.li@rateup.com.cn> Co-authored-by: stephen <hello-stephen@qq.com>	2022-05-24 10:23:15 +08:00
HappenLee	5039ec4570	[vec][opt] opt hash join build resize hash table before insert data (#9735 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-05-23 15:13:57 +08:00
HappenLee	500c36717d	[Bug-Fix][Vectorized] Full join return error result (#9690 ) Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-05-23 13:29:37 +08:00
HappenLee	8fa677b59c	[Refactor][Bug-Fix][Load Vec] Refactor code of basescanner and vjson/vparquet/vbroker scanner (#9666 ) * [Refactor][Bug-Fix][Load Vec] Refactor code of basescanner and vjson/vparquet/vbroker scanner 1. fix bug of vjson scanner not supporting `range_from_file_path` 2. fix bug of vjson/vbroker scanner core dump when src/dest slot nullability differs 3. fix bug of vparquet filter_block when the reference count of a column is not 1 4. refactor to simplify all the code. It only changes vectorized load, not the original row-based load. Co-authored-by: lihaopeng <lihaopeng@baidu.com>	2022-05-20 11:43:03 +08:00
camby	bfb1ab059d	[BUG] fix incorrect information_schema.columns results on vec engine (#9612 ) * VSchemaScanNode get_next bugfix * add regression-test case for VSchemaScanNode Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-05-18 07:44:32 +08:00
Pxl	26353ba8b5	[clang build]fix clang compile error (#9615 )	2022-05-18 07:42:31 +08:00
yinzhijian	bee5c2f8aa	[feature-wip](parquet-vec) Support parquet scanner in vectorized engine (#9433 )	2022-05-17 09:37:17 +08:00
Adonis Ling	5660815dc6	[chore] Fix compilation errors reported by clang (#9584 )	2022-05-16 22:36:16 +08:00
carlvinhust2012	b817efd652	[feature] add vectorized vjson_scanner (#9311 ) This pr is used to add the vectorized vjson_scanner, which can support vectorized json import in stream load flow.	2022-05-14 09:50:05 +08:00


95 Commits
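
Commit 4a474420c8 above mentions rewriting ntile in terms of row_number and count. As a rough illustration of that idea only, the sketch below computes a bucket number from a row's 1-based position and the partition's row count, following standard SQL NTILE semantics (the first `count % buckets` buckets each get one extra row). The function name and parameters are hypothetical, and this is not the expression Doris or Impala actually generates.

```python
def ntile_from_row_number(row_number: int, count: int, buckets: int) -> int:
    """Return the 1-based NTILE bucket for a row.

    row_number: 1-based position of the row within its ordered partition.
    count:      total number of rows in the partition.
    buckets:    the NTILE argument (number of buckets requested).
    Hypothetical helper for illustration; not taken from the Doris code base.
    """
    if buckets <= 0:
        raise ValueError("buckets must be positive")
    if not 1 <= row_number <= count:
        raise ValueError("row_number must be within 1..count")

    # Every bucket gets `base` rows; the first `extra` buckets get one more.
    base, extra = divmod(count, buckets)
    # Last row number that falls into one of the larger buckets.
    boundary = extra * (base + 1)

    if row_number <= boundary:
        return (row_number - 1) // (base + 1) + 1
    # Past the boundary, base >= 1 is guaranteed (otherwise boundary == count).
    return extra + (row_number - boundary - 1) // base + 1


if __name__ == "__main__":
    rows = 10
    print([ntile_from_row_number(r, rows, 3) for r in range(1, rows + 1)])
```

With 10 rows and 3 buckets, the first bucket holds four rows and the other two hold three each, which matches what a window function evaluating NTILE(3) would assign.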