doris

Author	SHA1	Message	Date
abmdocrt	ca7eb94f23	[improvement](agg-function) Increase the limit maximum number of agg function parameters (#15924 )	2023-01-31 21:03:50 +08:00
Gabriel	471db80f69	[Bug](date) Fix invalid date (#16205 ) Issue Number: close #15777	2023-01-31 10:08:44 +08:00
TengJianPing	a7b030778a	[fix](sort) fix heap-use-after-free error if sort with limit and is spilled (#16267 )	2023-01-31 09:59:03 +08:00
HappenLee	1020fe165b	[Bug](function) positive function coredump in decimal (#16230 )	2023-01-30 22:17:50 +08:00
Jerry Hu	9aa0d86fec	[fix](olap) Incorrect reserving size for PredicateColumn converted from ColumnDictionary (#16249 )	2023-01-30 20:28:22 +08:00
Pxl	322dc2a104	[Bug](function) fix now(int) use_default_implementation_for_nulls && fix dround signature (#16238 )	2023-01-30 18:01:26 +08:00
chenlinzhong	fdc042bb39	[fix](vresultsink) BufferControlBlock may block all fragment handle threads (#16231 ) BufferControlBlock may block all fragment handle threads leads to be out of work modify include: BufferControlBlock cancel after max timeout StmtExcutor notify be to cancel the fragment when unexcepted occur more details see issue #16203	2023-01-30 16:53:21 +08:00
yiguolei	90b12143a3	[refactor](remove unused code) remove runtime tuple structure and useless utils class (#16237 )	2023-01-30 16:45:14 +08:00
Jerry Hu	a9671b6dfd	[feature](agg)support two level-hash map in aggregation node (#15967 )	2023-01-30 16:43:33 +08:00
yiguolei	4b6a4b3cf7	[refactor](remove unused code) Remove unused mempool declare or function params (#16222 ) * Remove unused mempool declare or function params --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-30 13:03:18 +08:00
Ashin Gau	07d58e531a	[improvement](filecache) add profile for file cache (#16223 )	2023-01-30 10:46:31 +08:00
WenYao	69e748b076	[fix](schema scanner)change schema_scanner::get_next_row to get_next_block (#15718 )	2023-01-30 10:01:50 +08:00
HappenLee	7d437d5706	[Bug](function) running_difference function coredump in regression test (#16215 )	2023-01-30 09:58:27 +08:00
Gabriel	f72efd4523	[Improvement](decimal) do not log fatal when precision is invalid (#16207 )	2023-01-30 09:54:22 +08:00
谢健	98649ec9f8	[fix](Nereids): Fix some functions error (#16197 ) * fix bugs in regexp_extract_all * fix rpad * fix weekofday * fix cryptor * fix timestamp * fix st_ function	2023-01-30 00:41:31 +08:00
starocean999	1ec88cbff6	[fix](nereids) AggregationNode process null as key column in wrong way (#16125 ) in AggregationNode, _merge_with_serialized_key_helper method should convert the key column to full column if the key column is null literal.	2023-01-29 20:12:07 +08:00
Pxl	46347a51d2	[Bug](exec) enable warning on ignoring function return value for vctx (#16157 ) * enable warning on ignoring function return value for vctx	2023-01-29 17:23:21 +08:00
abmdocrt	eb7da1c0ee	[fix](datatype) fix some bugs about data type array datetimev2 and decimalv3 (#16132 )	2023-01-29 14:26:08 +08:00
Pxl	2b5f95f08a	[Bug](function) remove datev2 signature of hour_ceil/hour_floor #16168	2023-01-29 11:27:56 +08:00
yiguolei	241a956b20	[refactor](remove unused code) remove partition info from datastream sender (#16162 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-28 19:56:41 +08:00
Gabriel	7cf7706eb1	[Bug](runtimefilter) Fix wrong runtime filter on datetime (#16102 )	2023-01-28 18:16:06 +08:00
yiguolei	e49766483e	[refactor](remove unused code) remove many xxxVal structure (#16143 ) remove many xxxVal structure remove BetaRowsetWriter::_add_row remove anyval_util.cpp remove non-vectorized geo functions remove non-vectorized like predicate Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-28 14:17:43 +08:00
Qi Chen	fa14b7ea9c	[Enhancement](icebergv2) Optimize the position delete file filtering mechanism in iceberg v2 parquet reader (#16024 ) close #16023	2023-01-28 00:04:27 +08:00
Jibing-Li	1589d453a3	[fix](multi catalog)Support parquet and orc upper case column name (#16111 ) External hms catalog table column names in doris are all in lower case, while iceberg table or spark-sql created hive table may contain upper case column name, which will cause empty query result. This pr is to fix this bug. 1. For parquet file, transfer all column names to lower case while parse parquet metadata. 2. For orc file, store the origin column names and lower case column names in two vectors, use the suitable names in different cases. 3. FE side, change the column name back to the origin column name in iceberg while doing convertToIcebergExpr.	2023-01-27 23:52:11 +08:00
yiguolei	adb758dcac	[refactor](remove non vec code) remove json functions string functions match functions and some code (#16141 ) remove json functions code remove string functions code remove math functions code move MatchPredicate to olap since it is only used in storage predicate process remove some code in tuple, Tuple structure should be removed in the future. remove many code in collection value structure, they are useless	2023-01-26 16:21:12 +08:00
yiguolei	615a5e7b51	[refactor](remove non vec code) remove non vec functions and AggregateInfo (#16138 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-25 12:53:05 +08:00
yiguolei	6e8eedc521	[refactor](remove unused code) remove storage buffer and orc reader (#16137 ) remove olap storage byte buffer remove orc reader remove time operator remove read_write_util remove aggregate funcs remove compress.h and cpp remove bhp_lib Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-24 22:29:32 +08:00
yiguolei	79ad74637d	[refactor](remove expr) remove non vectorized Expr and ExprContext related codes (#16136 )	2023-01-24 10:45:35 +08:00
Mingyu Chen	23edb3de5a	[fix](icebergv2) fix bug that delete file reader is not opened (#16133 ) This pr #15836 change the way to use parquet reader by first open() then init_reader(). But we forgot to call open() for iceberg delete file, which cause coredump.	2023-01-24 10:19:46 +08:00
yiguolei	a3cd0ddbdc	[refactor](remove broker scan node) it is not useful any more (#16128 ) remove broker scannode remove broker table remove broker scanner remove json scanner remove orc scanner remove hive external table remove hudi external table remove broker external table, user could use broker table value function instead Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-23 19:37:38 +08:00
zhangstar333	61fccc88d7	[vectorized](analytic) fix analytic node of window function get wrong… (#16074 ) [Bug] 基础函数rank()开窗排序结果错误 #15951	2023-01-23 16:09:46 +08:00
ZhaoChangle	199d7d3be8	[Refactor]Merged string_value into string_ref (#15925 )	2023-01-22 16:39:23 +08:00
yiguolei	8920295534	[refactor](remoe non vec code) remove non vectorized conjunctx from scanner (#16121 ) 1. remove arrow group filter 2. remove non vectorized conjunctx from scanner Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-21 19:23:17 +08:00
zhangstar333	253445ca46	[vectorzied](jdbc) fix jdbc executor for get result by batch and memo… (#15843 ) result set should be get by batch size2. fix memory leak3.	2023-01-21 08:22:22 +08:00
Ashin Gau	de12957057	[debug](ParquetReader) print file path if failed to read parquet file (#16118 )	2023-01-21 08:05:17 +08:00
Tiewei Fang	7814d2b651	[Fix](Oracle External Table) fix that oracle external table can not insert batch values (#16117 ) Issue Number: close #xxx This pr fix two bugs: _jdbc_scanner may be nullptr in vjdbc_connector.cpp, so we use another method to count jdbc statistic. close [Enhencement](jdbc scanner) add profile for jdbc scanner #15914 In the batch insertion scenario, oracle database does not support syntax insert into tables values (...),(...); , what it supports is: insert all into table(col1,col2) values(c1v1, c2v1) into table(col1,col2) values(c1v2, c2v2) SELECT 1 FROM DUAL;	2023-01-21 07:57:12 +08:00
abmdocrt	9ffd109b35	[fix](datetimev2) Fix BE datetimev2 type returning wrong result (#15885 )	2023-01-20 22:25:20 +08:00
yixiutt	171404228f	[improvement](vertical compaction) cache segment in vertical compaction (#16101 ) 1.In vertical compaction, segments will be loaded for every column group, so we should cache segment ptr to avoid too many repeated io. 2.fix vertical compaction data size bug	2023-01-20 16:38:23 +08:00
Tiewei Fang	1638936e3f	[fix](oracle catalog) oracle catalog support `TIMESTAMP` dateType of oracle (#16113 ) `TIMESTAMP` dateType of Oracle will map to `DateTime` dateType of Doris	2023-01-20 14:47:58 +08:00
lihangyu	116e17428b	[Enhancement](point query optimize) improve performace of point query on primary keys (#15491 ) 1. support row format using codec of jsonb 2. short path optimize for point query 3. support prepared statement for point query 4. support mysql binary format	2023-01-20 13:33:01 +08:00
Jibing-Li	3ebc98228d	[feature wip](multi catalog)Support iceberg schema evolution. (#15836 ) Support iceberg schema evolution for parquet file format. Iceberg use unique id for each column to support schema evolution. To support this feature in Doris, FE side need to get the current column id for each column and send the ids to be side. Be read column id from parquet key_value_metadata, set the changed column name in Block to match the name in parquet file before reading data. And set the name back after reading data.	2023-01-20 12:57:36 +08:00
Gabriel	6e090e4daf	[Bug](predicate) fix date predicate (#16053 )	2023-01-19 14:14:48 +08:00
yiguolei	0b5e71d3b4	[refactor](refactor field) remove unused method (#16068 )	2023-01-19 10:16:09 +08:00
lihangyu	3894de49d2	[Enhancement](topn) support two phase read for topn query (#15642 ) This PR optimize topn query like `SELECT * FROM tableX ORDER BY columnA ASC/DESC LIMIT N`. TopN is is compose of SortNode and ScanNode, when user table is wide like 100+ columns the order by clause is just a few columns.But ScanNode need to scan all data from storage engine even if the limit is very small.This may lead to lots of read amplification.So In this PR I devide TopN query into two phase: 1. The first phase we just need to read `columnA`'s data from storage engine along with an extra RowId column called `__DORIS_ROWID_COL__`.The other columns are pruned from ScanNode. 2. The second phase I put it in the ExchangeNode beacuase it's the central node for topn nodes in the cluster.The ExchangeNode will spawn a RPC to other nodes using the RowIds(sorted and limited from SortNode) read from the first phase and read row by row from storage engine. After the second phase read, Block will contain all the data needed for the query	2023-01-19 10:01:33 +08:00
HappenLee	d5a3e8df3a	[Exec](opt) Opt the vexplode_split function performance (#15945 )	2023-01-17 19:02:57 +08:00
starocean999	151ae71761	[fix](be)fix bug of VSetOperationNode::release_resource (#15997 ) should call "ExecNode::release_resource(state)" if child class override the parent's method	2023-01-17 16:16:25 +08:00
Gabriel	d062ca2944	[refactor](vectorized) remove unnecessary vectorization check (#15984 )	2023-01-17 12:21:46 +08:00
Gabriel	7d34512501	[Bug](pipeline) Fix DCHECK failure (#15928 )	2023-01-17 12:01:20 +08:00
HappenLee	9f106161a7	[Bug](join) Fix null aware anti join error in fuzzy mode (#15987 )	2023-01-17 11:32:16 +08:00
YueW	b1caa68706	[Feature-WIP](inverted index) inverted index reader's implementation, and add mysql_fulltext regression case to test fulltext query (#15823 ) Issue Number: Step2 of DSIP-023: Add inverted index for full text search implementation of inverted index reader dependency pr: #14211 #15807 #15821	2023-01-17 09:13:56 +08:00

1 2 3 4 5 ...

1163 Commits