doris

Author	SHA1	Message	Date
luozenglin	8a810cd554	[fix](bitmapfilter) fix core dump caused by bitmap filter (#15296 ) Do not push down the bitmap filter to a non-integer column	2022-12-23 16:42:45 +08:00
Gabriel	b085ff49f0	[refactor](non-vec) delete non-vec data sink (#15283 ) * [refactor](non-vec) delete non-vec data sink Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-12-23 14:10:47 +08:00
Xinyi Zou	77c15729d4	[fix](memory) Fix too many repeat cause OOM (#15217 )	2022-12-22 17:16:18 +08:00
Gabriel	e9a201e0ec	[refactor](non-vec) delete some non-vec exec node (#15239 ) * [refactor](non-vec) delete some non-vec exec node	2022-12-22 14:05:51 +08:00
Gabriel	af54299b26	[Pipeline](projection) Support projection on pipeline engine (#15220 )	2022-12-21 15:47:29 +08:00
zbtzbtzbt	9d48154cdc	[minor](non-vec) delete unused interface in RowBatch (#15186 )	2022-12-20 13:06:34 +08:00
Lijia Liu	938f4f33d6	[Pipeline] Add MLFQ when schedule (#15124 )	2022-12-20 11:49:15 +08:00
Zhengguo Yang	98cdeed6e0	[chore](routine load) remove deprecated property of librdkafka reconnect.backoff.jitter.ms #15172	2022-12-20 10:13:56 +08:00
Xin Liao	03ea2866b7	[fix](load) add to error tablets when delta writer failed to close (#15118 ) The result of load should be failed when all tablets delta writer failed to close on single node. But the result returned to client is success. The reason is that the committed tablets and error tablets are both empty, so publish will be success. We should add it to error tablets when delta writer failed to close, then the transaction will be failed.	2022-12-19 14:22:25 +08:00
Tiewei Fang	8a08085356	[enhancement](signal) output query_id when 'be' core dumped #15080	2022-12-19 09:47:38 +08:00
Gabriel	13bc8c2ef8	[Pipeline](runtime filter) Support runtime filters on pipeline engine (#15040 )	2022-12-18 21:48:00 +08:00
zhannngchen	0cd791ec57	[fix](load) delta writer init failed might cause data inconsistency between multiple replicas (#15058 ) In the following case, data inconsistency would happen between multiple replicas current delta writer only writes a few lines of data (which meas the write() method only called once) writer failed when init()(which is called at the fist time we call write()), and current tablet is recorded in _broken_tablets delta writer closed, and in the close() method, delta writer found it's not inited, treat such case as an empty load, it will try to init again, which would create an empty rowset. tablet sink received the error report in rpc response, marked the replica as failed, but since the quorum replicas are succeed, so the following load commit operation will succeed. FE send publish version task to each be, the one with empty rowset will publish version successfully. We got 2 replica with data and 1 empty replica.	2022-12-16 22:07:00 +08:00
Gabriel	94e0955687	[Bug](thread token) Fix invalid thread token (#15110 )	2022-12-15 21:54:57 +08:00
Gabriel	5ef4c42a80	[Bug](datev2) Fix wrong result when use datev2 as partition key (#15094 )	2022-12-15 21:27:05 +08:00
Tiewei Fang	c6d93f739c	[feature-wip](file reader) Merge stream_load_pipe to the new file reader (#15035 ) Currently, there are two sets of file readers in Doris, this pr rewrites the old stream_load_pipe with the new file reader.	2022-12-15 16:31:22 +08:00
HappenLee	284a3351f4	[Refactor](exec) refactor the code of datasink eos logic (#15009 )	2022-12-13 15:33:08 +08:00
Pxl	c25a7235f9	[Pipeline](load) support pipeline broker load (#14940 ) support pipeline broker load	2022-12-13 00:28:36 +08:00
plat1ko	f3aea7f0f0	[Enhancement](status) Unify error code and enable customed err msg for BE internal errors (#14744 )	2022-12-11 23:33:18 +08:00
Gabriel	b213ce6ffd	[Bug](pipeline) Fix double prepare on pipeline engine (#14959 )	2022-12-09 14:35:34 +08:00
Xinyi Zou	dffa3c0db2	[enhancement](memory) Support query memroy overcommit #14948 Add conf enable_query_memroy_overcommit If true, when the process does not exceed the soft mem limit, the query memory will not be limited; when the process memory exceeds the soft mem limit, the query with the largest ratio between the currently used memory and the exec_mem_limit will be canceled. If false, cancel query when the memory used exceeds exec_mem_limit, same as before.	2022-12-09 14:09:05 +08:00
Pxl	375e0e08ca	[Bug](predicate) fix ccore dump on varchar with in list predicate (#14881 ) * fix ccore dump on varchar with in list predicate * update case * Update sqlsmith01.sql	2022-12-08 17:14:23 +08:00
Pxl	dbaa02d3a0	[Pipeline](fix) fix enable_pipeline_engine variable not work (#14909 )	2022-12-08 14:52:52 +08:00
Pxl	48a9166aa4	[Pipeline](sink) support olap table sink operator (#14872 ) * support olap table sink operator * update config	2022-12-07 15:29:56 +08:00
Xinyi Zou	cdbbf1e4ee	[enhancement](memory) Add Memory GC when the available memory of the BE process is lacking (#14712 ) When the system MemAvailable is less than the warning water mark, or the memory used by the BE process exceeds the mem soft limit, run minor gc and try to release cache. When the MemAvailable of the system is less than the low water mark, or the memory used by the BE process exceeds the mem limit, run fucc gc, try to release the cache, and start canceling from the query with the largest memory usage until the memory of mem_limit * 20% is released.	2022-12-07 15:28:52 +08:00
Xinyi Zou	9e51e0263d	[fix](memory leak) Fix load fragment `QueryFragmentsCtx` is not destroyed (#14840 )	2022-12-07 08:45:53 +08:00
Xinyi Zou	8726bfa121	[enhancement](memory) Add tablet schema cache metrics (#14742 )	2022-12-05 18:19:13 +08:00
HappenLee	12304bc0ee	[Pipeline](exec) Support pipeline exec engine (#14736 ) Co-authored-by: Lijia Liu <liutang123@yeah.net> Co-authored-by: HappenLee <happenlee@hotmail.com> Co-authored-by: Jerry Hu <mrhhsg@gmail.com> Co-authored-by: Pxl <952130278@qq.com> Co-authored-by: shee <13843187+qzsee@users.noreply.github.com> Co-authored-by: Gabriel <gabrielleebuaa@gmail.com> ## Problem Summary: ### 1. Design DSIP: https://cwiki.apache.org/confluence/display/DORIS/DSIP-027%3A+Support+Pipeline+Exec+Engine ### 2. How to use: Set the environment variable `set enable_pipeline_engine = true; `	2022-12-02 17:11:34 +08:00
Xinyi Zou	176f519fa1	[enhancement](memtracker) Optimize exec node memory tracking (#14711 )	2022-12-01 14:52:21 +08:00
Kang	79688a54d6	[bug](jsonb) fix be core at insert invalid json to JSONB column (#14686 )	2022-11-30 14:00:50 +08:00
AlexYue	898d0d42f1	[improvement](load)add more log for better bug tracing experience for be write (#14424 ) Recently when tracing one bug happened in version 1.1.4 I found out there were some places we can add more log for a better tracing.	2022-11-29 22:28:39 +08:00
Xinyi Zou	e1f0fa069c	[enhancement](memory) Refactored process memory statistics periodically refresh, and fix catch bad_alloc (#14580 )	2022-11-29 10:15:25 +08:00
Yongqiang YANG	0702277196	[improvement](tcmalloc) add moderate mode and avoid oom with a lot of cache (#14374 ) ReleaseToSystem aggressively when there are little free memory.	2022-11-28 20:17:51 +08:00
camby	73a600fba8	bug fix for outfile (#14550 ) Co-authored-by: cambyzju <zhuxiaoli01@baidu.com>	2022-11-28 15:46:41 +08:00
Xinyi Zou	d5d3f7e0b7	[fix](memtracker) Fix thrift BackendService thread local is not initialized, memtracker init fail (#14589 )	2022-11-26 13:04:39 +08:00
Kang	52c6ba051e	[feature](jsonb type)refactor JSONB type using column and add testcase (#13778 ) 1. Refactor JSONB type using ColumnString instead making a copy. 2. Add regression testcase for JSONB load and functions.	2022-11-26 10:06:15 +08:00
luozenglin	4728e75079	[feature](bitmap) Support in bitmap syntax and bitmap runtime filter (#14340 ) 1.Support in bitmap syntax, like 'where k1 in (select bitmap_column from tbl)'; 2.Support bitmap runtime filter. Generate a bitmap filter using the right table bitmap and push it down to the left table storage layer for filtering.	2022-11-25 15:22:44 +08:00
Jerry Hu	9103ded1dd	[improvement](join)optimize sharing hash table for broadcast join (#14371 ) This PR is to make sharing hash table for broadcast more robust: Add a session variable to enable/disable this function. Do not block the hash join node's close function. Use shared pointer to share hash table and runtime filter in broadcast join nodes. The Hash join node that doesn't need to build the hash table will close the right child without reading any data(the child will close the corresponding sender).	2022-11-24 21:06:44 +08:00
TengJianPing	6c7f758ef7	[improvement](hashjoin) support partitioned hash table in hash join (#14480 )	2022-11-24 14:16:47 +08:00
wxy	6472d5506f	[fix](cache) fix cache overflow problem #14515 (#14516 ) Co-authored-by: wangxiangyu@360shuke.com <wangxiangyu@360shuke.com>	2022-11-24 11:18:46 +08:00
Gabriel	496a92b668	[JavaUDF](loader) Fix compatible problem for JAVA 11 (#14519 )	2022-11-23 23:36:39 +08:00
yiguolei	fd3af489a4	[memory](chunkallocator) disable chunkallocator when reserved bytes == 0 (#14494 ) disable chunkallocator when reserved bytes == 0 disable chunkallocator by default	2022-11-23 17:12:53 +08:00
Adonis Ling	249b688663	[chore](github) Add a workflow to check BE UT on macOS (#14506 )	2022-11-23 08:38:28 +08:00
zhangstar333	b04ec41c1d	[Vectorized](udaf) fix java-udaf couldn't get jar core dump (#14393 ) fix java-udaf couldn't get jar core dump	2022-11-22 20:49:02 +08:00
zhannngchen	3e1e8db173	[fix](exec) fix thread token shutdown (#14418 ) Fix Thread pool token was shut down error. This is because when there are more than 1 fragment of a query on one BE, the thread token maybe reset incorrectly, causing thread token shutdown earlier. cherry-pick from master Introduced from #13021	2022-11-20 00:04:48 +08:00
Gabriel	2c42f0a905	[refactor](decimalv3) Refine code for DecimalV3 (#14394 )	2022-11-19 16:57:17 +08:00
starocean999	1f326fc0d6	[enhancement](be)limit mem cost to 16m when pre serialize keys in agg node (#14321 ) * [enhancement](be)limit mem cost to 16m when pre serialize keys in agg node * use only one chunk memory when serializing keys in agg node	2022-11-18 12:31:52 +08:00
Xinyi Zou	bd5a593403	[enhancement](memtracker) Use proc/meminfo MemAvailable to control memory and optimize MemTracker log printing (#14335 )	2022-11-17 22:46:07 +08:00
TengJianPing	a382bb95e7	[fix](runtimefilter) fix heap-user-after-free of runtime filter merge (#14362 )	2022-11-17 19:38:45 +08:00
Adonis Ling	333c6390ee	[fix](be-ut) AddressSanitizer detects container-overflow issues (#14255 ) * [chore] Fix the container-overflow errors detected by address sanitizer * Fix compilation errors	2022-11-15 15:49:55 +08:00
Xinyi Zou	cffdeff4ec	[fix](memory) Fix memory leak by calling boost::stacktrace (#14269 ) boost::stacktrace::stacktrace() has memory leak, so use glog internal func to print stacktrace. The reason for the memory leak of boost::stacktrace is that a state is saved in the thread local of each thread but not actively released. The test found that each thread leaked about 100M after calling boost::stacktrace. refer to: boostorg/stacktrace#118 boostorg/stacktrace#111	2022-11-15 08:58:57 +08:00

1 2 3 4 5 ...

817 Commits