doris

Author	SHA1	Message	Date
zhangstar333	4ef46159ae	[vectorized](udaf) support array type for java-udaf (#17351 )	2023-03-09 11:30:07 +08:00
amory	06dee69174	[Refactor](map) remove using column array in map to reduce offset column (#17330 ) 1. remove column array in map 2. add offsets column in map Aim to reduce duplicate offset from key-array and value-array in disk	2023-03-09 11:22:26 +08:00
lihangyu	368e6a4f9c	[Bug](array filter) Fix bug due to `ColumnArray::filter_generic` invalid inplace `size_at` after `set_end_ptr` (#17554 ) We should make a new PodArray to add items instead of do it inplace	2023-03-09 10:59:29 +08:00
luozenglin	00727e8c11	[fix](in-bitmap) fix result may be wrong if the left side of the in bitmap predicate is a constant (#17570 )	2023-03-09 10:59:05 +08:00
Pxl	65b8dfc7ff	[Enchancement](function) Inline some aggregate function && remove nullable combinator (#17328 ) 1. Inline some aggregate function 2. remove nullable combinator	2023-03-09 10:39:04 +08:00
zxealous	6923bf8d7b	[fix](file cache)fix block file cache can't be configured (#17511 )	2023-03-09 10:12:08 +08:00
Xinyi Zou	397cc011c4	[fix](function) fix AES/SM3/SM4 encrypt/ decrypt algorithm initialization vector bug (#17420 ) ECB algorithm, block_encryption_mode does not take effect, it only takes effect when init vector is provided. Solved: 192/256 supports calculation without init vector For other algorithms, an error should be reported when there is no init vector Initialization Vector. The default value for the block_encryption_mode system variable is aes-128-ecb, or ECB mode, which does not require an initialization vector. The alternative permitted block encryption modes CBC, CFB1, CFB8, CFB128, and OFB all require an initialization vector. Reference: https://dev.mysql.com/doc/refman/8.0/en/encryption-functions.html#function_aes-decrypt Note: This fix does not support smooth upgrades. during upgrade process, query may report error: funciton not found	2023-03-09 09:51:41 +08:00
starocean999	2b6d971c2f	[fix](nereids)fix first_value/lead/lag window function bug in nereids (#17315 ) * [fix](nereids)fix first_value/lead/lag window function bug in nereids * add more test * add order by to fix test case * fix test cases	2023-03-09 09:35:27 +08:00
zhannngchen	2cf90ddfc5	[fix](scanner) remove useless _src_block_mem_reuse to avoid core dump while loading (#17559 ) The _src_block_mem_reuse variable actually not work, since the _src_block is cleared each time when we call get_block. But current code may cause core dump, see issue #17587. Because we insert some result column generated by expr into dest block, and such a column holds a pointer to some column in original schema. When clearing the data of _src_block, some column's data in dest block is also cleared. e.g. coalesce will return a result column which holds a pointer to some original column, see issue #17588	2023-03-09 09:26:32 +08:00
ElvinWei	bd5ed2b0c2	[enhancement](histogram) optimize the histogram bucketing strategy, etc (#17264 ) * optimize the histogram bucketing strategy, etc * fix p0 regression of histogram	2023-03-08 20:12:05 +08:00
TengJianPing	eea6d770d7	[fix](bitmap) fix wrong result of bitmap_or for null (#17456 ) Result of select bitmap_to_string(bitmap_or(to_bitmap(1), null)) should be 1 instead of null. This PR fix logic of bitmap_or and bitmap_or_count. Other count related funcitons should also be checked and fix, they will be fixed in another PR.	2023-03-08 16:29:01 +08:00
AlexYue	f3b50b3472	[enhance](cooldown) skip once failed follow cooldown tablet (#16810 )	2023-03-08 14:14:13 +08:00
Xin Liao	8001d65811	[fix](insert) fix memory leak for insert transaction (#17530 )	2023-03-08 14:10:59 +08:00
AlexYue	273d2100ac	[enhance](cooldown) turn write cooldown meta async (#16813 )	2023-03-08 14:06:21 +08:00
qiye	3a877857ae	[improvement](inverted index)Remove searcher bitmap timer to improve query speed (#17407 ) Timer becomes a bottleneck when the query hit volume is very high.	2023-03-08 14:03:36 +08:00
Xinyi Zou	335c1e5953	[fix](memory) Fix MacOS mem_limit parse error and GC after env Init #17528 Fix MacOS mem_limit parse result is 0. Fix GC after env Init, otherwise, when the memory is insufficient, BE will start failure. * Query id: 0-0 * * Aborted at 1677833773 (unix time) try "date -d @1677833773" if you are using GNU date * * Current BE git commitID: 8ee5f45 * * SIGSEGV address not mapped to object (@0x70) received by PID 24145 (TID 0x7fa53c9fd700) from PID 112; stack trace: * 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t, void) at be/src/common/signal_handler.h:420 1# os::Linux::chained_handler(int, siginfo, void) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 2# JVM_handle_linux_signal in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 3# signalHandler(int, siginfo, void) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 4# 0x00007FA56295A400 in /lib64/libc.so.6 5# doris::MemTrackerLimiter::log_process_usage_str(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool) at be/src/runtime/memory/mem_tracker_limiter.cpp:208 6# doris::MemTrackerLimiter::print_log_process_usage(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool) at be/src/runtime/memory/mem_tracker_limiter.cpp:226 7# doris::Daemon::memory_maintenance_thread() at be/src/common/daemon.cpp:245 8# doris::Thread::supervise_thread(void*) at be/src/util/thread.cpp:455 9# start_thread in /lib64/libpthread.so.0 10# clone in /lib64/libc.so.6	2023-03-08 14:00:57 +08:00
bobhan1	4ea0d6c5fa	[feature](array_function) add support for array_popfront (#17416 )	2023-03-08 13:57:38 +08:00
gitccl	b1d65f855d	[Feature](array-function) Support array_concat function (#17436 )	2023-03-08 13:57:16 +08:00
Pxl	e2ac06d6d6	[Chore](execution) change PipelineTaskState to enum class && remove some row-based code (#17300 ) 1. change PipelineTaskState to enum class 2. remove some row-based code on FoldConstantExecutor::_get_result 3. reduce memcpy on minmax runtime filter function(Now we can guarantee that the input data is aligned) 4. add Wunused-template check, and remove some unused function, change some static function to inline function.	2023-03-08 12:41:15 +08:00
TengJianPing	778acb3c5b	[opt](string) optimize string equal comparision (#17336 ) Optimize string equal and not-equal comparison by using memequal_small_allow_overflow15.	2023-03-08 11:30:00 +08:00
yiguolei	9213dd906a	[enhancement](exception) add exception structure and using unique ptr in VExplodeBitmapTableFunction (#17531 ) add exception class in common. using unique ptr in VExplodeBitmapTableFunction support single exception or nested exception, like this: ---SingleException [E-100] test OS_ERROR bug @ 0x55e80b93c0d9 doris::Exception::Exception<>() @ 0x55e80b938df1 doris::ExceptionTest_NestedError_Test::TestBody() @ 0x55e82e16bafb testing::internal::HandleSehExceptionsInMethodIfSupported<>() @ 0x55e82e15ab3a testing::internal::HandleExceptionsInMethodIfSupported<>() @ 0x55e82e1361e3 testing::Test::Run() @ 0x55e82e136f29 testing::TestInfo::Run() @ 0x55e82e1376e4 testing::TestSuite::Run() @ 0x55e82e148042 testing::internal::UnitTestImpl::RunAllTests() @ 0x55e82e16dcab testing::internal::HandleSehExceptionsInMethodIfSupported<>() @ 0x55e82e15ce4a testing::internal::HandleExceptionsInMethodIfSupported<>() @ 0x55e82e147bab testing::UnitTest::Run() @ 0x55e80c4b39e3 RUN_ALL_TESTS() @ 0x55e80c4a99b5 main @ 0x7f0a619d0493 __libc_start_main @ 0x55e80b84602a _start @ (nil) (unknown)	2023-03-08 10:44:14 +08:00
yiguolei	4692d6764c	[refactor](remove string val) remove string val structure, it is same with string ref (#17461 ) remove stringval, decimalv2val, bigintval	2023-03-08 10:42:20 +08:00
qiye	a767472c56	[fix](DOE)Fix es p0 case error (#17502 ) Fix es array parse error, introduced by #16806	2023-03-08 08:06:30 +08:00
htyoung	69c62b6c6c	[Fix](vectorization) fixed that when a column's _fixed_values exceeds the max_pushdown_conditions_per_column limit, the column will not perform predicate pushdown, but if there are subsequent columns that need to be pushed down, the subsequent column pushdown will be misplaced in _scan_keys and it causes query results to be wrong (#17405 ) the max_pushdown_conditions_per_column limit, the column will not perform predicate pushdown, but if there are subsequent columns that need to be pushed down, the subsequent column pushdown will be misplaced in _scan_keys and it causes query results to be wrong Co-authored-by: tongyang.hty <hantongyang@douyu.tv>	2023-03-08 07:23:56 +08:00
zxealous	5334a5899e	[fix](remote)fix whole file cache and sub file cache (#17468 )	2023-03-07 19:55:18 +08:00
zhangstar333	06468ba627	[vectorized](bug) fix array constructor function change origin column from block (#17296 )	2023-03-07 16:42:23 +08:00
ZhangYu0123	8ccc805cd0	[Fix](Lightweight schema Change) query error caused by array default type is unsupported (#17331 ) We have supportted array type default [], but when using lightweight schema Change to add column array type, query failed as follows: Fix "array default type is unsupported" error. Fix the default value filling assignment digit problem.	2023-03-07 16:30:41 +08:00
Jerry Hu	caacee253d	[fix](olap)Crashing caused by IS NULL expression (#17463 ) Issue Number: close #17462	2023-03-07 15:32:52 +08:00
Pxl	d8f0ca7108	[Chore](schema change) remove some unused code in schema change (#17459 ) remove some unused code in schema change. remove some row-based config and code.	2023-03-07 09:18:34 +08:00
Jerry Hu	6f3801d9da	[chore](config) Increase the default maximum depth limit for expressions (#17418 )	2023-03-07 08:53:00 +08:00
Tiewei Fang	48c2d806d7	[enhencement](jdbc catalog) Use Druid instead of HikariCP in JdbcClient (#17395 ) This pr does three things: 1. Use Druid instead of HikariCP in JdbcClient 2. when download udf jar, add the name of the jar package after the local file name. 3. refactor some jdbcResource code	2023-03-07 08:51:10 +08:00
Ashin Gau	1d858db617	[feature](filecache) add a const parameter to control the cache version (#17441 ) * [feature](filecache) add a const parameter to control the cache version * fix	2023-03-07 08:03:18 +08:00
Xin Liao	b0d67c0358	[fix](merge-on-write) fix cu compaction correctness check (#17347 ) During concurrent import, the same row location may be marked delete multiple times by different versions of rowset. Duplicate row location need to be removed.	2023-03-06 21:31:48 +08:00
xueweizhang	4b13d81151	[fix](publish) fix when TabletPublishTxnTask::handle() error, transaction publish success, and query table error (#17409 ) be use EnginePublishVersionTask to publish all replica of all tablets of table of one transaction, and EnginePublishVersionTask use TabletPublishTxnTask to truly publish tablet and make rowset visible. but if TabletPublishTxnTask error, tablet id will add _error_tablet_ids but no return some errors, and EnginePublishVersionTask will not report any error to fe, and fe make this transaction visible, and partition's version add 1. but if you query this table, will return error like "MySQL [test]> select * from test12;ERROR 1105 (HY000): errCode = 2, detailMessage = [INTERNAL_ERROR]failed to initialize storage reader. tablet=14023.730105214.d742d664692db946-386daa993d84d89d, res=[INTERNAL_ERROR][9.134.167.25]fail to find path in version_graph. spec_version: 0-3, backend=9.134.167.25". after this pr, _error_tablet_ids will report to fe, this transaction will not be visible and add ErrMsg like "publish on tablet 14038 failed.". Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-03-06 19:57:04 +08:00
Ashin Gau	dca16796ad	[fix](ParquetReader) definition level of repeated parent is wrong (#17337 ) Fix three bugs: 1. `repeated_parent_def_level ` should be the definition of its repeated parent. 2. Failed to parse schema like `decimal(p, s)` 3. Fill wrong offsets for array type	2023-03-06 18:15:57 +08:00
yiguolei	9477c48ef8	[refactor](functioncontext) remove duplicate type definition in function context (#17421 ) remove duplicate type definition in function context remove unused method in function context not need stale state in vexpr context because vexpr is stateless and function context saves state and they are cloned. remove useless slot_size in all tuple or slot descriptor. remove doris_udf namespace, it is useless. remove some unused macro definitions. init v_conjuncts in vscanner, not need write the same code in every scanner. using unique ptr to manage function context since it could only belong to a single expr context. Issue Number: close #xxx --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-03-06 16:07:09 +08:00
luozenglin	e7cba11680	[fix](array)(parquet) fix be core dump due to load from parquet file containing array types (#17298 )	2023-03-06 15:18:42 +08:00
ZhaoChangle	8e6c34dd11	[Opt](Vec) Use const_col to opt current functions. (#17324 )	2023-03-06 11:55:41 +08:00
Xinyi Zou	9617f46fa5	[improvement](memory) Modify `mem_limit` default value (#17322 ) Modify the default value of mem_limit to auto. auto means process mem limit is equal to max(physical mem * 0.9, 6.4G). 6.4G is the maximum memory reserved for the system.	2023-03-06 10:53:27 +08:00
AlexYue	400b4bf7a7	[enhance](report) add local and remote size in tablet meta header action (#17406 )	2023-03-06 10:43:57 +08:00
AlexYue	ee1be6edd7	[chore](fe) enhance_mysql_data_type (#17429 )	2023-03-06 10:42:01 +08:00
WenYao	a8f20eb4ac	[Enhencement](schema_scanner) Optimize the performance of reading information schema tables (#17371 ) batch fill block batch call rpc from FE to get table desc For 34w colunms SELECT COUNT( * ) FROM information_schema.columns; time: 10.3s --> 0.4s	2023-03-06 09:53:01 +08:00
Mingyu Chen	3d0beec01d	[fix](orc) fix heap-use-after-free and potential memory leak of orc reader (#17431 ) fix heap-use-after-free The OrcReader has a internal FileInputStream, If the file is empty, the memory of FileInputStream will leak. Besides, there is a Statistics instance in FileInputStream. FileInputStream maybe delete if the orc reader is inited failed, but Statistics maybe used when orc reader is closed, causing heap-use-after-free error. Potential memory leak When init file scanner in file scan node, the file scanner prepare failed, the memory of file scanner will leak.	2023-03-06 08:42:35 +08:00
Xin Liao	0801883604	[fix](merge-on-write) fix that delete bitmap is not calculated correctly when clone tablet (#17334 )	2023-03-05 22:04:28 +08:00
Xin Liao	5190a496ac	[fix](rebalance) fix that the clone operation is not performed due to incorrect condition judgment (#17381 )	2023-03-05 21:58:33 +08:00
zhengyu	d08b231073	[fix](segcompaction) core when doing segcompaction for cancelling load(#16731 ) (#17432 ) segcompaction is async and in parallel with load job. If the load job is canncelling, memory structures will be destroyed and cause segcompaction crash. This commit will wait segcompaction finished before destruction.	2023-03-05 21:24:32 +08:00
yongkang.zhong	779d94f932	[fix](metrics)Delete the extra underline for metrics (#17397 )	2023-03-05 16:38:43 +08:00
lihangyu	59bf305c5d	[Improve](point query) put tablet fetch interface which is high concurrent point query operation to light_work_pool (#17400 ) Since the point query lookup is very light weight	2023-03-05 10:36:50 +08:00
yiguolei	b9b028099d	[enhancement](stream load pipe) using queryid or load id to identify stream load pipe instead of fragment instance id (#17362 ) * [enhancement](stream load pipe) using queryid or load id to identify stream load pipe instead of fragment instance id NewLoadStreamMgr already has pipe and other info. Do not need save the pipe into fragment state. and FragmentState should be more clear. But this pr will change the behaviour of BE. I will pick the pr to doris 1.2.3 and add the load id to FE support. The user could upgrade from 1.2.3 to 2.x Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-03-04 16:19:36 +08:00
lihangyu	2b014a0464	[Improve](doris::Status performance) fix the performance issue due to copy of std::string (#17411 )	2023-03-04 15:08:59 +08:00

1 2 3 4 5 ...

3873 Commits