Row mode schema change sometimes fails because the memory limit is exceeded.
When the remaining memory is enough for sorting but not enough for the next block,
it does not flush the data held in row_block_arr before allocating the next block,
so the allocation fails and it returns directly.
Instead, when allocating a block fails, it should flush row_block_arr and retry the
allocation, giving up only when row_block_arr is empty (see the sketch below).
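A minimal sketch of the retry logic described above, using a hypothetical RowBlock type and flush_row_blocks helper rather than the actual schema-change code:

```cpp
#include <cstddef>
#include <cstdio>
#include <memory>
#include <new>
#include <vector>

// Illustrative stand-in for a row block; names here are hypothetical.
struct RowBlock {
    std::vector<char> buf;
    explicit RowBlock(size_t bytes) : buf(bytes) {}
};

// Flush the sorted blocks currently held in memory and release them (stubbed).
void flush_row_blocks(std::vector<std::unique_ptr<RowBlock>>& blocks) {
    std::printf("flushing %zu in-memory blocks\n", blocks.size());
    blocks.clear();
}

// Allocate the next block; if allocation fails, flush row_block_arr and retry,
// giving up only when there is nothing left in memory to flush.
std::unique_ptr<RowBlock> alloc_block_with_retry(
        std::vector<std::unique_ptr<RowBlock>>& row_block_arr, size_t bytes) {
    while (true) {
        try {
            return std::make_unique<RowBlock>(bytes);
        } catch (const std::bad_alloc&) {
            if (row_block_arr.empty()) {
                return nullptr; // genuine OOM: report the failure to the caller
            }
            flush_row_blocks(row_block_arr); // free memory, then try again
        }
    }
}
```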
Two improvements have been added:
1. Translate parquet physical types into Doris logical types (a simplified type-mapping sketch follows this list).
2. Decode parquet column chunks into Doris ColumnPtr, and add unit tests showing how to use the related API.
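As an illustration of the first improvement, a simplified mapping from parquet physical types to Doris logical types; the enum names below are placeholders, and the real translation also consults the parquet logical/converted type:

```cpp
#include <stdexcept>

// Illustrative enums only; the real code maps the parquet type metadata to Doris types.
enum class ParquetPhysicalType { BOOLEAN, INT32, INT64, FLOAT, DOUBLE, BYTE_ARRAY, FIXED_LEN_BYTE_ARRAY };
enum class DorisLogicalType { BOOLEAN, INT, BIGINT, FLOAT, DOUBLE, STRING, DECIMAL };

// Map a parquet physical type to a default Doris logical type (sketch only).
DorisLogicalType to_doris_type(ParquetPhysicalType t) {
    switch (t) {
    case ParquetPhysicalType::BOOLEAN:              return DorisLogicalType::BOOLEAN;
    case ParquetPhysicalType::INT32:                return DorisLogicalType::INT;
    case ParquetPhysicalType::INT64:                return DorisLogicalType::BIGINT;
    case ParquetPhysicalType::FLOAT:                return DorisLogicalType::FLOAT;
    case ParquetPhysicalType::DOUBLE:               return DorisLogicalType::DOUBLE;
    case ParquetPhysicalType::BYTE_ARRAY:           return DorisLogicalType::STRING;
    case ParquetPhysicalType::FIXED_LEN_BYTE_ARRAY: return DorisLogicalType::DECIMAL;
    }
    throw std::runtime_error("unknown parquet physical type");
}
```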
* [bugfix](schema change) when there is a string column with a delete predicate, the schema change may core dump
Co-authored-by: yiguolei <yiguolei@gmail.com>
We should close EOF scanners before the transfer is done; otherwise they are
not closed until the scan node is closed. Since the scan node is closed after
the plan has finished, the query profile misses stats from scanners closed by
ScanNode::close, e.g. SegmentTotalNum in the profile is lower than it should be.
A rough sketch of the intent follows.
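The sketch below uses a simplified Scanner stand-in, not the actual Doris ScanNode/scanner interfaces:

```cpp
#include <memory>
#include <vector>

// Simplified scanner stand-in; the real scanner merges its counters into the
// query profile when it is closed.
struct Scanner {
    bool eof = false;
    void close() { /* merge counters (e.g. SegmentTotalNum) into the profile */ }
};

// Close scanners that have reached EOF before the transfer is marked done,
// instead of leaving them all to ScanNode::close(), so their stats are still
// reflected in the query profile.
void close_eof_scanners(std::vector<std::unique_ptr<Scanner>>& scanners) {
    for (auto& scanner : scanners) {
        if (scanner && scanner->eof) {
            scanner->close();
            scanner.reset();
        }
    }
}
```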
# Proposed changes
Read and decode parquet physical types.
1. The encoding of boolean values is bit-packing; this PR introduces the bit-packing implementation from Impala (see the sketch after this list)
2. Create a parquet file including all the primitive types supported by Hive
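A minimal sketch of decoding bit-packed boolean values (one bit per value, least-significant bit first within each byte, as in parquet PLAIN encoding); the PR itself imports Impala's BitPacking utilities rather than this hand-rolled loop:

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Decode bit-packed booleans: 1 bit per value, LSB first within each byte.
std::vector<bool> unpack_booleans(const uint8_t* data, size_t num_values) {
    std::vector<bool> out;
    out.reserve(num_values);
    for (size_t i = 0; i < num_values; ++i) {
        out.push_back(((data[i / 8] >> (i % 8)) & 1) != 0);
    }
    return out;
}
```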
## Remaining Problems
1. At present, only physical types are decoded; there are no corresponding conversion methods to Doris logical types.
2. Decimal / Timestamp / Date types are not yet parsed or processed.
3. Int_8 / Int_16 are stored as Int_32; how to resolve these types is still open.
* [feature](planner): push limit to olapscan when meeting sort.
* if olap_scan_node's sort_info is set, push sort_limit, read_orderby_key
and read_orderby_key_reverse down to the olap scanner
* There is a common query pattern for finding the latest time-series data,
e.g. SELECT * from t_log WHERE t>t1 AND t<t2 ORDER BY t DESC LIMIT 100.
If the ORDER BY columns are a prefix of the table's sort key, this can be
greatly optimized to read much less data instead of reading all data
between t1 and t2.
By leveraging the fact that the ORDER BY columns and the table's sort key share
the same order, we only read the top LIMIT N rows from each related segment and merge those N-row results.
1. set read_orderby_key to true for read_params and _reader_context
if olap_scan_node's sort info is set.
2. set read_orderby_key_reverse to true for read_params and _reader_context
if is_asc_order is false.
3. the rowset reader forces a merge read of segments if read_orderby_key is true.
4. the block reader and tablet reader force a merge read of rowsets if read_orderby_key is true.
5. for ORDER BY DESC, read and compare in reverse order
5.1 the segment iterator reads backward using a new BackwardBitmapRangeIterator and
reverses the result block before returning it to the caller.
5.2 VCollectIterator::LevelIteratorComparator and VMergeIteratorContext return the
opposite result in their compare functions when _is_reverse is set (a minimal
comparator sketch follows this list).
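The sketch below uses stand-in types, not the exact Doris classes: the comparison is flipped when _is_reverse is set, so the same merge machinery emits rows in descending key order.

```cpp
#include <algorithm>
#include <vector>

struct RowRef {
    int key; // stands in for the sort-key columns of a row
};

struct LevelIteratorComparator {
    bool _is_reverse = false;
    bool operator()(const RowRef& a, const RowRef& b) const {
        // Ascending comparison by default; operands are flipped for ORDER BY DESC.
        return _is_reverse ? (b.key < a.key) : (a.key < b.key);
    }
};

int main() {
    std::vector<RowRef> rows = {{1}, {3}, {2}};
    LevelIteratorComparator cmp;
    cmp._is_reverse = true;
    std::sort(rows.begin(), rows.end(), cmp);
    // rows are now in descending key order: 3, 2, 1
    return 0;
}
```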
Co-authored-by: jackwener <jakevingoo@gmail.com>
In our original design, we calculated the delete bitmap in publish txn, and this
operation cost too much time because it loads segment data and looks up row keys
in previous rowsets and segments. Since publish version tasks must run in order,
this led to timeouts in publish_txn.
In this PR, we separate the delete_bitmap calculation into two parts. One part is
done during memtable flush, so that work can run in parallel. Then we calculate the
final delete_bitmap in publish_txn: we get the set of rowset ids that should be
included and remove rowsets that have been compacted. The rowset difference between
memtable_flush and publish_txn is very small, so publish_txn becomes very fast; in
our test, publish_txn costs about 10ms. A sketch of the set-difference step follows.
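The sketch below uses an illustrative RowsetId alias and function name, not the actual Doris code: only rowsets that were not already handled at memtable flush and are still visible (not compacted away) need delete-bitmap calculation at publish time.

```cpp
#include <set>
#include <string>
#include <vector>

using RowsetId = std::string; // illustrative; Doris has a dedicated RowsetId type

// Rowsets that still need delete-bitmap calculation at publish time: those
// visible now but not already handled at memtable flush. Rowsets that have
// been compacted away simply no longer appear in visible_at_publish.
std::vector<RowsetId> rowsets_remaining_at_publish(
        const std::set<RowsetId>& handled_at_flush,
        const std::set<RowsetId>& visible_at_publish) {
    std::vector<RowsetId> remaining;
    for (const auto& id : visible_at_publish) {
        if (handled_at_flush.count(id) == 0) {
            remaining.push_back(id);
        }
    }
    return remaining;
}
```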
Co-authored-by: yixiutt <yixiu@selectdb.com>
column_ptr becomes a non-nullable column pointer after `column_ptr = &nullable_column->get_nested_column()`,
so we must not cast column_ptr to ColumnNullable anymore (see the illustration below).
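A self-contained illustration with stand-in classes (not the real Doris column types) of why the cast becomes invalid once column_ptr points at the nested column:

```cpp
#include <cassert>

// Minimal stand-ins for the nullable-column hierarchy.
struct IColumn {
    virtual ~IColumn() = default;
};
struct ColumnString : IColumn {};
struct ColumnNullable : IColumn {
    ColumnString nested;
    IColumn& get_nested_column() { return nested; }
};

int main() {
    ColumnNullable nullable_column;
    const IColumn* column_ptr = &nullable_column;

    // After this assignment, column_ptr points at the nested, non-nullable column...
    column_ptr = &nullable_column.get_nested_column();

    // ...so a cast back to ColumnNullable yields nullptr; dereferencing that
    // null pointer is what caused the crash this fix addresses.
    assert(dynamic_cast<const ColumnNullable*>(column_ptr) == nullptr);
    assert(dynamic_cast<const ColumnString*>(column_ptr) != nullptr);
    return 0;
}
```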