doris

Author	SHA1	Message	Date
Jerry Hu	5be8d57f52	[fix](be-ut) fix ColumnFixedLenghtObjectTest on 32 bits system (#23519 )	2023-08-28 14:02:05 +08:00
Adonis Ling	e0bf621fe0	[chore](build) Fix compilation errors for BE UT (#23535 ) Issue Number: close #23536 This issue was introduced by #23414 .	2023-08-27 11:52:13 +08:00
Jerry Hu	f80b067990	[fix](column) add unimplemented function of ColumnFixedLengthObject (#23468 )	2023-08-25 17:38:01 +08:00
Kaijie Chen	d8e499cb55	[fix](UT) fix flaky test in LoadStreamMgrTest (#23459 )	2023-08-25 13:53:20 +08:00
zclllyybb	9cacf9535a	[Opt](functions) Use preloaded cache to accelerate timezone parsing (#22694 ) * opt * bugfix * fix ut * fix stylecheck	2023-08-25 10:00:48 +08:00
Kaijie Chen	71071ba057	[feature](move-memtable)[4/7] add stream sink file writer (#23416 ) Co-authored-by: laihui <1353307710@qq.com>	2023-08-25 00:08:27 +08:00
Kaijie Chen	98d0a2f6c1	[feature](move-memtable)[3/7] add load stream manager and rpc service (#23415 ) Co-authored-by: zhengyu <freeman.zhang1992@gmail.com> Co-authored-by: Yongqiang YANG <dataroaring@gmail.com> Co-authored-by: laihui <1353307710@qq.com>	2023-08-25 00:08:04 +08:00
zclllyybb	51ac92f65c	Revert "[fix](function) to_bitmap parameter parsing failure returns null instead of bitmap_empty (#21236 )" (#23368 ) This reverts commit 1c3cc77a54938ed948ad8186b8dea8385977d23c.	2023-08-23 18:27:35 +08:00
Pxl	8ed4045df9	[Chore](primitive-type) remove VecPrimitiveTypeTraits (#22842 )	2023-08-23 08:37:40 +08:00
yiguolei	bcdb481374	[refactor](fragment) refactor non pipeline fragment executor (#23281 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-08-22 16:00:34 +08:00
Ashin Gau	5ff7b57fc1	[fix](parquet) parquet reader confuses logical/physical/slot id of columns (#23198 ) `ParquetReader` confuses logical/physical/slot id of columns. If only reading the scalar types, there's nothing wrong, but when reading complex types, `RowGroup` and `PageIndex` will get wrong statistics. Therefore, if the query contains complex types and pushed-down predicates, the probability of the result set is incorrect.	2023-08-22 13:35:29 +08:00
Kaijie Chen	0d7a61ae8c	[fix](load) fix duplicate register of memtable writer in memory limiter (#23205 )	2023-08-22 10:05:17 +08:00
Gabriel	12075f9853	[pipelineX](projection) Support projection and blocking agg (#23256 )	2023-08-21 22:23:02 +08:00
amory	33dfa0c454	[Improve](serde) support text serde for nested type-array/map (#22738 ) Now we can not support nested type array/map so this pr aim to: 1. add format option for string convert defined datatype to keep with origin from_string 2. support array map can nested array and map	2023-08-21 10:32:28 +08:00
ZenoYang	1c3cc77a54	[fix](function) to_bitmap parameter parsing failure returns null instead of bitmap_empty (#21236 ) * [fix](function) to_bitmap parameter parsing failure returns null instead of bitmap_empty * add ut * fix nereids * fix regression-test	2023-08-18 14:37:49 +08:00
Mingyu Chen	330f369764	[enhancement](file-cache) limit the file cache handle num and init the file cache concurrently (#22919 ) 1. the real value of BE config `file_cache_max_file_reader_cache_size` will be the 1/3 of process's max open file number. 2. use thread pool to create or init the file cache concurrently. To solve the issue that when there are lots of files in file cache dir, the starting time of BE will be very slow because it will traverse all file cache dirs sequentially.	2023-08-17 16:52:08 +08:00
Kaijie Chen	6cf1efc997	[refactor](load) use smart pointers to manage writers in memtable memory limiter (#23019 )	2023-08-16 16:34:57 +08:00
bobhan1	4510e16845	[improvement](delete) support delete predicate on value column for merge-on-write unique table (#21933 ) Previously, delete statement with conditions on value columns are only supported on duplicate tables. After we introduce delete sign mechanism to do batch delete, a delete statement with conditions on value columns on unique tables will be transformed into the corresponding insert into ..., __DELETE_SIGN__ select ... statement. However, for unique table with merge-on-write enabled, the overhead of inserting these data can be eliminated. So this PR add the ability to allow delete predicate on value columns for merge-on-write unique tables.	2023-08-16 12:18:05 +08:00
flynn	4e880288c6	[refactor]use clear concept to replace std::enable_if_t (#22801 ) --------- Signed-off-by: flynn <fenglv15@mails.ucas.ac.cn>	2023-08-12 15:10:30 +08:00
yujun	b9b9071c9b	[improvement](create partition) create partition require quorum replicas succ (#22554 )	2023-08-11 11:59:05 +08:00
Chuanle Chen	71807ceb5f	[Enhancement](tvf) Table value function support reading local file (#17404 ) I tested the local tvf with tpch queries. First, generate `lineitem` datasets with 6001215 rows, and load it into `lineitem` table by: ``` insert into lineitem select c11, c1, c4, c2, c3, c5, c6, c7, c8, c9, c10, c12, c13, c14, c15, c16 from local( "file_path" = "tools/tpch-tools/bin/tpch-data/lineitem.tbl.1", "backend_id" = "10003", "format" = "csv", "column_separator" = "\|" ); ``` Then, run `q1` and `q16` tpch queries, the query result is correct. It can also analyze the BE's log directly like: ``` mysql> select * from local( "file_path" = "log/be.out", "backend_id" = "10006", "format" = "csv") where c1 like "%start_time%" limit 10; +--------------------------------------------------------+ \| c1 \| +--------------------------------------------------------+ \| start time: 2023年 08月 07日星期一 23:20:32 CST \| \| start time: 2023年 08月 07日星期一 23:32:10 CST \| \| start time: 2023年 08月 08日星期二 00:20:50 CST \| \| start time: 2023年 08月 08日星期二 00:29:15 CST \| +--------------------------------------------------------+ ```	2023-08-10 20:07:42 +08:00
Kaijie Chen	58e7952eea	[refactor](load) use memtable writer in memtable memory limiter (#22780 )	2023-08-10 17:08:47 +08:00
Siyang Tang	4359089b9c	[fix](delete-pred) fix special char in delete sub condition #22667 For some users, their delete condition may contain special chars like '$', which will cause failure in parsing delete condition.	2023-08-09 00:04:26 +08:00
Kaijie Chen	9581d2b4eb	[refactor](load) split memtable writer out of delta writer (#21892 )	2023-08-08 22:02:42 +08:00
Siyang Tang	77e772e103	[enhancement](config) add some pre-process and pre-check for BE storage config attentions in docs (#22486 )	2023-08-07 18:16:57 +08:00
AlexYue	f036cdfde6	[feature](compaction) support delete in cumulative compaction (#19609 )	2023-08-07 15:22:21 +08:00
czzmmc	1a8a1e5b16	[Feature](count_by_enum) support count_by_enum function (#22071 ) count_by_enum(expr1, expr2, ... , exprN); Treats the data in a column as an enumeration and counts the number of values in each enumeration. Returns the number of enumerated values for each column, and the number of non-null values versus the number of null values.	2023-08-06 16:05:14 +08:00
Pxl	7839a0e708	[Bug](brpc) fix brpc failed on big query came concurrently (#22600 ) fix PriorityThreadPool get_info get wrong number change brpc pool from priority to fifo do not use brpc pool when send eos	2023-08-05 21:24:32 +08:00
TengJianPing	b122f9b80c	[fix](concat) ColumnString::chars is resized with wrong size (#22610 ) FunctionStringConcat::execute_impl resized with size that include string null terminator, which causes ColumnString::chars.size() does not match with ColumnString::offsets.back, this will cause problems for some string functions, e.g. like and regexp.	2023-08-04 19:13:35 +08:00
Kaijie Chen	93593a013d	[feature](load) add segment bytes limit in segcompaction (#22526 )	2023-08-04 18:00:52 +08:00
amory	86e6f5d039	[FIX](decimal)fix decimal precision (#22364 ) Now we make wrong for decimal parse from string if given string precision is bigger than defined decimal precision, we will return a overflow error, but only digit part is bigger than typed digit length , we should return overflow error when we traverse given string to decimal value	2023-08-03 21:13:58 +08:00
Chenyang Sun	19d1f49fbe	[improvement](compaction) compaction policy and options in the properties of a table (#22461 )	2023-08-01 22:02:23 +08:00
Mryange	f16a39aea1	[feature](time) using timev2 type to replace the old time type. (#22269 )	2023-08-01 15:59:07 +08:00
HappenLee	3a11de889f	[Opt](exec) opt the performance of date parquet convert by date dict (#22384 ) before： mysql> select count(l_commitdate) from lineitem; +---------------------+ \| count(l_commitdate) \| +---------------------+ \| 600037902 \| +---------------------+ 1 row in set (0.86 sec) after: mysql> select count(l_commitdate) from lineitem; +---------------------+ \| count(l_commitdate) \| +---------------------+ \| 600037902 \| +---------------------+ 1 row in set (0.36 sec)	2023-08-01 12:24:00 +08:00
Gabriel	d585a8acc1	[Improvement](shuffle) Accumulate rows in a batch for shuffling (#22218 )	2023-08-01 09:55:06 +08:00
HHoflittlefish777	ee754307bb	[refactor](load) refactor memtable flush actively (#21634 )	2023-07-30 21:31:54 +08:00
zzzzzzzs	765f1b6efe	[Refactor](load) Extract load public code (#22304 )	2023-07-29 12:56:31 +08:00
huanghaibin	ec1a4d172b	(vertical compaction) fix vertical compaction core (#22275 ) * (vertical compaction) fix vertical compaction core co-author:@zhannngchen	2023-07-28 16:41:00 +08:00
HHoflittlefish777	9e16c69925	[improvement](compression) support LZ4_HC algorithm and parse LZ4_RAW (#22165 )	2023-07-26 18:23:39 +08:00
amory	d4a4c172ea	[Improve](serde)update serialize and deserialize text for data type (#21109 )	2023-07-26 10:06:16 +08:00
Gabriel	103c473b96	[Bug](pipeline) fix pipeline shared scan + topn optimization (#21940 )	2023-07-25 12:48:27 +08:00
Ashin Gau	30c21789c8	[opt](filecache) use weak_ptr to cache the file handle of file segment (#21975 ) Use weak_ptr to cache the file handle of file segment. The max cached number of file handles can be configured by `file_cache_max_file_reader_cache_size`, default `1000000`. Users can inspect the number of cached file handles by request BE metrics: `http://be_host:be_webserver_port/metrics`: ``` # TYPE doris_be_file_cache_segment_reader_cache_size gauge doris_be_file_cache_segment_reader_cache_size{path="/mnt/datadisk1/gaoxin/file_cache"} 2500 ```	2023-07-24 19:09:27 +08:00
zhannngchen	86e80ae175	[enhancement](merge-on-write) support concurrent delete bitmap calc while close_wait (#21488 )	2023-07-24 10:09:28 +08:00
Pxl	19ba6bec38	[Improvement](pipeline) support send eos on local exchange and remove some unused code (#22086 ) support send eos on local exchange and remove some unused code	2023-07-24 09:25:32 +08:00
Chenyang Sun	f7ac827c90	[fix](compaction) fix time series compaction point policy (#21670 )	2023-07-21 23:09:02 +08:00
amory	ce397a8d32	[FIX](map)fix arrow serde with map null key #21955	2023-07-19 12:09:34 +08:00
HappenLee	b35cfc5d5e	[opt](join) Opt the performance of join probe (#21845 )	2023-07-19 01:21:22 +08:00
HHoflittlefish777	c6063ed92f	[Revert](lazy open) revert lazy open and add case (#21821 )	2023-07-18 19:41:33 +08:00
amory	cbddff0694	[FIX](map) fix map key-column nullable for arrow serde #21762 arrow is not support key column has null element , but doris default map key column is nullable , so need to deal with if doris map row if key column has null element , we put null to arrow	2023-07-14 00:30:07 +08:00
amory	3163841a3a	[FIX](serde)Fix decimal for arrow serde (#21716 )	2023-07-12 19:15:48 +08:00

1 2 3 4 5 ...

1166 Commits