To support schema evolution, Iceberg adds schema information to Parquet file metadata.
But early Iceberg versions don't write any schema information to Parquet files.
This PR adds support for reading Parquet files without schema information.
Now we reuse the buffer pool for broadcast shuffle on the pipeline engine. This PR ensures that a pipeline with a broadcast shuffle sink will not be scheduled if there are no available buffers in the buffer pool.
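A minimal sketch of the scheduling guard, with hypothetical names (`BufferPool`, `PipelineTask`, `has_broadcast_sink`) standing in for the actual pipeline engine types:

```
#include <atomic>
#include <cstddef>

// Hypothetical stand-ins for the pipeline engine types; the names are
// illustrative, not the actual Doris classes.
struct BufferPool {
    std::atomic<size_t> free_buffers{0};
    bool has_free_buffer() const { return free_buffers.load() > 0; }
};

struct PipelineTask {
    bool has_broadcast_sink = false;
};

// The scheduler only picks a task with a broadcast shuffle sink when the
// shared pool can hand out at least one buffer; otherwise the task stays
// queued instead of waking up just to find the pool empty.
bool can_schedule(const PipelineTask& task, const BufferPool& pool) {
    return !task.has_broadcast_sink || pool.has_free_buffer();
}
```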
Support IPv6 in Apache Doris. The main changes are:
1. enable binding to an IPv6 address if the priority network in the config file contains an IPv6 CIDR string (see the sketch after this list)
2. BRPC and HTTP support binding to an IPv6 address
3. BRPC and HTTP support visiting IPv6 services
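A minimal sketch of the detection step for change 1, assuming the CIDR string is already split out of the priority network config; `inet_pton` is the standard POSIX call:

```
#include <arpa/inet.h>
#include <string>

// Returns true if a CIDR string from the priority network config describes
// an IPv6 range (e.g. "fe80::/64"), in which case the process binds to an
// IPv6 address; IPv4 CIDRs (e.g. "192.168.0.0/16") keep the old behavior.
bool cidr_is_ipv6(const std::string& cidr) {
    std::string addr = cidr.substr(0, cidr.find('/'));
    in6_addr v6 {};
    return inet_pton(AF_INET6, addr.c_str(), &v6) == 1;
}
```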
Hive 1.x may write ORC files with internal column names (_col0, _col1, _col2, ...).
This causes query results to be NULL, because the column names in the ORC file don't match
the column names in the Doris table schema. This PR adds support for querying Hive ORC files with internal column names (see the positional-matching sketch below).
For now, we haven't seen this problem with Parquet files; we will send a new PR to fix Parquet if any problem shows up in the future.
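A minimal sketch of the positional fallback for the ORC case, with hypothetical helper names; the idea is to match file columns to the table schema by ordinal position when every file column name looks auto-generated:

```
#include <cctype>
#include <string>
#include <vector>

// True for Hive 1.x auto-generated names: _col0, _col1, _col2, ...
static bool is_internal_column_name(const std::string& name) {
    if (name.rfind("_col", 0) != 0 || name.size() <= 4) return false;
    for (size_t i = 4; i < name.size(); ++i) {
        if (!std::isdigit(static_cast<unsigned char>(name[i]))) return false;
    }
    return true;
}

// If every column in the ORC file uses an internal name, map _colN to the
// N-th column of the Doris table schema instead of matching by name, so
// the reader no longer returns NULL for every column.
std::vector<std::string> resolve_read_columns(
        const std::vector<std::string>& file_columns,
        const std::vector<std::string>& table_columns) {
    bool all_internal = !file_columns.empty();
    for (const auto& c : file_columns) {
        all_internal = all_internal && is_internal_column_name(c);
    }
    if (all_internal && file_columns.size() == table_columns.size()) {
        return file_columns; // read by position: _colN -> table column N
    }
    return table_columns; // normal case: match by name
}
```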
* [improvement](block exception safe) make the block queue exception safe
This is part of the exception-safety work: #16366.
---------
Co-authored-by: yiguolei <yiguolei@gmail.com>
In our current logic, the index page will be pre-decoded, but the pre-decode returns OK
because the index page uses BinaryPlainPageBuilder and the first 4 bytes of the page are an offset,
which is highly unlikely to equal EncodingTypePB::DICT_ENCODING (value 5).
Code in bitshuffle_page_pre_decode.h:
```
if constexpr (USED_IN_DICT_ENCODING) {
    // Sniff the first 4 bytes to decide whether this is a dict-encoded page.
    auto type = decode_fixed32_le((const uint8_t*)&data.data[0]);
    if (static_cast<EncodingTypePB>(type) != EncodingTypePB::DICT_ENCODING) {
        return Status::OK();
    }
    size_of_dict_header = BINARY_DICT_PAGE_HEADER_SIZE;
    data.remove_prefix(4);
}
```
But if the offset happens to equal EncodingTypePB::DICT_ENCODING, the code will use
BitShuffle to decode a BinaryPlainPage, which leads to a fatal error.
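To make the failure mode concrete, here is a tiny standalone demo (with `decode_fixed32_le` re-implemented for little-endian hosts) showing how an index-page prefix that happens to hold the offset value 5 is mistaken for DICT_ENCODING:

```
#include <cstdint>
#include <cstdio>
#include <cstring>

// Little-endian fixed32 decode, mirroring what decode_fixed32_le does on
// a little-endian host.
static uint32_t decode_fixed32_le(const uint8_t* buf) {
    uint32_t v;
    memcpy(&v, buf, sizeof(v));
    return v;
}

int main() {
    // An index page built by BinaryPlainPageBuilder starts with an offset.
    // If that offset happens to be 5 (== EncodingTypePB::DICT_ENCODING),
    // the sniffing check above misfires and the page gets BitShuffle-decoded.
    uint8_t page_prefix[4] = {5, 0, 0, 0};
    uint32_t type = decode_fixed32_le(page_prefix);
    printf("sniffed encoding = %u -> %s\n", type,
           type == 5 ? "misclassified as DICT_ENCODING" : "ok");
    return 0;
}
```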
Use the FE cluster token to authenticate stream loads.
This auth is only open to BE; FE auth still supports only HTTP basic auth.
I will use this auth for MySQL load to build a no-auth stream load from FE to BE,
which avoids double auth in MySQL load.
See the design doc for more information.
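A minimal sketch of the BE-side check, assuming a hypothetical `token` request header and handler shape (the actual header name and plumbing live in the stream load code):

```
#include <map>
#include <string>

// Sketch only: if the request carries the cluster token issued by FE,
// accept it without HTTP basic auth; otherwise fall back to basic auth.
bool auth_by_cluster_token(const std::map<std::string, std::string>& headers,
                           const std::string& cluster_token) {
    auto it = headers.find("token"); // header name is an assumption
    if (it == headers.end() || cluster_token.empty()) {
        return false; // no token -> caller tries HTTP basic auth instead
    }
    return it->second == cluster_token;
}
```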
Add a cache for the inverted index query match bitmap to accelerate common query keywords, especially keywords matching many rows (a sketch of the cache shape follows the test results).
Test results:
- large result: matching 99% of 247 million rows shows an 8x speedup.
- small result: matching 0.1% of 247 million rows shows a 2x speedup.
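A sketch of the cache shape, with a placeholder `Bitmap` type standing in for the real roaring bitmap and no eviction policy; it only shows the key and lookup structure, not the production cache:

```
#include <cstdint>
#include <memory>
#include <string>
#include <unordered_map>
#include <vector>

// Placeholder for the real bitmap of matching row ids.
using Bitmap = std::vector<uint32_t>;

// Key: which index, column, and query term produced the bitmap. A real
// implementation would bound the cache (e.g. LRU) and track memory.
struct InvertedIndexQueryCache {
    static std::string make_key(const std::string& index_file,
                                const std::string& column,
                                const std::string& term) {
        return index_file + "|" + column + "|" + term;
    }

    std::shared_ptr<Bitmap> lookup(const std::string& key) const {
        auto it = _cache.find(key);
        return it == _cache.end() ? nullptr : it->second;
    }

    void insert(const std::string& key, std::shared_ptr<Bitmap> bitmap) {
        _cache[key] = std::move(bitmap);
    }

    std::unordered_map<std::string, std::shared_ptr<Bitmap>> _cache;
};
```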
Issue Number: close #16351
A dynamic schema table is a special type of table whose schema changes with the loading procedure. We implemented this feature mainly for semi-structured data such as JSON: since JSON is schema self-described, we can extract schema info from the original documents and infer the final type information. This special table reduces manual schema change operations, and makes it easy to import semi-structured data and extend its schema automatically (a simplified type-widening sketch follows).
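A sketch of the type-widening idea, with a toy three-type lattice (the real implementation covers far more types):

```
#include <algorithm>

// Types are ordered so that the common type of two observations is max().
enum class ColType { UNKNOWN = 0, BIGINT = 1, DOUBLE = 2, STRING = 3 };

// Each JSON scalar observed during load may widen a column's inferred
// type, never narrow it: BIGINT -> DOUBLE -> STRING.
ColType widen(ColType current, ColType observed) {
    return std::max(current, observed);
}
```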
Currently batch_size is not passed on to SegmentIterator, so SegmentIterator uses the hard-coded value 4096 - 32 as the max row count of a block.
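A sketch of the intended plumbing, with hypothetical field names; the point is that the session's batch_size replaces the hard-coded constant as the block row budget:

```
#include <cstddef>

// Before (effectively): constexpr size_t kBlockRows = 4096 - 32;

// After: batch_size flows through the read options into the iterator.
struct ReadOptions {
    size_t block_row_max = 4096 - 32; // default keeps the old behavior
};

struct SegmentIteratorSketch {
    explicit SegmentIteratorSketch(const ReadOptions& opts)
            : _block_row_max(opts.block_row_max) {}
    size_t _block_row_max; // used as the max row count of a block
};
```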
* fix bug
multiget_data runs in a bthread and may block the whole worker pthread of the BRPC framework, affecting other bthreads, so I moved the work into a separate task pool.
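A standalone sketch of the offloading pattern (plain std::thread workers; the real PR uses Doris' own pool utilities, and in BRPC the response would typically be completed from the pool thread rather than by blocking the bthread on the future):

```
#include <condition_variable>
#include <deque>
#include <functional>
#include <future>
#include <mutex>
#include <thread>
#include <vector>

// Blocking work submitted from a bthread runs on dedicated pthreads,
// so BRPC worker threads are never parked on slow IO.
class TaskPool {
public:
    explicit TaskPool(size_t n) {
        for (size_t i = 0; i < n; ++i) _workers.emplace_back([this] { run(); });
    }
    ~TaskPool() {
        { std::lock_guard<std::mutex> l(_mu); _stop = true; }
        _cv.notify_all();
        for (auto& t : _workers) t.join();
    }
    std::future<void> submit(std::function<void()> fn) {
        auto task = std::make_shared<std::packaged_task<void()>>(std::move(fn));
        auto fut = task->get_future();
        { std::lock_guard<std::mutex> l(_mu); _queue.push_back([task] { (*task)(); }); }
        _cv.notify_one();
        return fut;
    }
private:
    void run() {
        for (;;) {
            std::function<void()> job;
            {
                std::unique_lock<std::mutex> l(_mu);
                _cv.wait(l, [this] { return _stop || !_queue.empty(); });
                if (_stop && _queue.empty()) return;
                job = std::move(_queue.front());
                _queue.pop_front();
            }
            job(); // e.g. the blocking multiget_data body
        }
    }
    std::mutex _mu;
    std::condition_variable _cv;
    std::deque<std::function<void()>> _queue;
    std::vector<std::thread> _workers;
    bool _stop = false;
};
```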
When a PG table has unsupported column types such as point, polygon, jsonb, etc.,
the JDBC catalog converts them to the string type in Doris, but the value fetched from the
result set in Java is an org.postgresql.util.PGobject.
Some tests need this PR: #16442