doris

Author	SHA1	Message	Date
Zhengguo Yang	b51ce415e7	[Feature](load) Add submitter and comments to load job (#16878 ) * [Feature](load) Add submitter and comments to load job	2023-02-28 09:06:19 +08:00
zhannngchen	84413f33b8	[enhancement](merge-on-write) add skip_delete_bitmap session variable for debug purpose (#17127 )	2023-02-27 23:31:28 +08:00
zhangstar333	94927b3b1c	[vectorized](bug) fix open fold constant cause be core dump (#17055 ) add a defer in fold constant to close. add more type when call _get_result function in fold constant.3. fix in can't handle null. eg:select 1 in (2, NULL, 1); in java udf jni_ctx will be nullptr, so call close will be core dump. Describe your changes.	2023-02-26 12:30:03 +08:00
Xin Liao	c071c327e7	[fix](load) fix add broken tablet core dump (#17104 )	2023-02-24 23:59:03 +08:00
yiguolei	03a4fe6f39	[enhancement](streamload) make stream load context as shared ptr and save it in global load mgr (#16996 )	2023-02-24 11:15:29 +08:00
amory	7229751bd9	[Improve](map-type) Add contains_null for map (#16948 ) Add contains_null for map type.	2023-02-23 20:47:26 +08:00
Xinyi Zou	a1c0054b4c	[fix](memory) fix memory GC details and join probe catch bad_alloc (#16989 ) Fix Redhat 4.x OS /proc/meminfo has no MemAvailable, disable MemAvailable to control memory. vm_rss_str and mem_available_str recorded when gc is triggered, to avoid memory changes during gc and cause inaccurate logs. join probe catch bad_alloc, this may alloc 64G memory at a time, avoid OOM. Modify document doris_be_all_segments_num and doris_be_all_rowsets_num names.	2023-02-23 08:33:30 +08:00
zxealous	29c46d6926	[fix](struct-type) fix be core when load array orc file (#16978 ) * fix be core when load array orc file	2023-02-22 10:15:39 +08:00
zhengyu	09d41c3479	[fix](log) clarify error msg for tablet writer write failure (#14078 ) (#16954 ) (#16950 ) fmt::format dosen't support non-template object as args, even if it implements `to_string()` or `operator<<`. so orignal code may cause `false` to be printed instead of real cause of the failure. So to_string() need to be manually invoked. Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>	2023-02-21 19:42:49 +08:00
Xinyi Zou	2074b83c67	[enhancement](third-party) Upgrade JEMalloc version from 5.2.1 to 5.3.0 (#14871 ) https://github.com/jemalloc/jemalloc/releases	2023-02-20 00:00:40 +08:00
Pxl	da147f1d1c	[Chore](build) remove memory_copy and remove some wno build check (#16831 ) * remove memory_copy and remove some wno cbuild check	2023-02-17 14:43:24 +08:00
Gabriel	3d6077efe0	[pipeline](profile) Support real-time profile report in pipeline (#16772 )	2023-02-17 10:01:34 +08:00
Gabriel	0bb6005143	[Improvement](thrift) optimize thrift messages (#16383 ) Now we use a thrift message per fragment instance. However, there are many same messages between instances in a fragment. So this PR aims to extract the same messages and we only need to send thrift message once for a fragment	2023-02-16 11:07:46 +08:00
Pxl	f50edff59d	[Chore](build) enable fallthrough check annd fix some fallthrough bug (#16748 ) * enable fallthrough check annd fix some fallthrough bug * fix * fix	2023-02-15 15:58:43 +08:00
zhengshengjun	d013d529c8	[Feature](ipv6)Support IPV6 (#14063 ) Support IPV6 in Apache Doris, the main changes are: 1. enable binding to IPV6 address if network priority in config file contains an IPV6 CIDR string 2. BRPC and HTTP support binding to IPV6 address 3. BRPC and HTTP support visiting IPV6 Services	2023-02-14 21:43:10 +08:00
YueW	ed3420000e	[fix](bthread) fix bthread hang (#16594 )	2023-02-14 00:08:57 +08:00
huangzhaowei	f41a2055d3	[feature](Load)Remove user/password in properties for mysql load to avoid double auth. (#16073 ) Use FE cluster token to auth stream load. This auth is only open for be, and fe auth still only support http basic auth. I will use this auth for mysql load to build a no-auth stream load from fe to be. And this will avoid double auth in mysql load. More information to see the design doc.	2023-02-13 10:00:08 +08:00
奕冷	cf739e7496	[Enhancement](Stmt) Set insert_into timeout session variable separately (#16343 )	2023-02-12 16:56:10 +08:00
Kang	aba843bb2b	[Improvement](inverted index) inverted index query match bitmap cache (#16578 ) Add cache for inverted index query match bitmap to accelerate common query keyword, especially for keyword matching many rows. Tests result: - large result: matching 99% out of 247 million rows shows 8x speed up. - small result: matching 0.1% out of 247 million rows shows 2x speed up.	2023-02-11 13:38:58 +08:00
lihangyu	37d1519316	[WIP](dynamic-table) support dynamic schema table (#16335 ) Issue Number: close #16351 Dynamic schema table is a special type of table, it's schema change with loading procedure.Now we implemented this feature mainly for semi-structure data such as JSON, since JSON is schema self-described we could extract schema info from the original documents and inference the final type infomation.This speical table could reduce manual schema change operation and easily import semi-structure data and extends it's schema automatically.	2023-02-11 13:37:50 +08:00
YueW	b155fc07f6	[fix](fragment thread) fix thread in fragment thread pool hang (#16608 ) process the return status for exec_state->execute() in function FragmentMgr::_exec_actual	2023-02-11 09:05:10 +08:00
xy720	1b3902baa2	[Feature](Complex-type) Add struct and map type to Doris (#16444 ) This commit support: 1、Insert + select for struct/map type 2、Json stream load for struct type 3、m[key] function for map type How to use: Set the fe config to create table for struct and map type 1、admin set frontend config("enable_struct_type" = "true"); 2、admin set frontend config("enable_map_type" = "true"); #16547 Co-authored-by: xy720 <xuyang25@baidu.com> Co-authored-by: amory <wangqiannan@selectdb.com> Co-authored-by: cambyzju <zhuxiaoli01@baidu.com> Co-authored-by: hucheng01 <hucheng01@baidu.com>	2023-02-10 11:00:33 +08:00
yiguolei	4fcd6cd236	[refactor](remove unused code) remove load stream mgr (#16580 ) remove old stream load pipe remove old stream load manager --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-02-10 07:46:18 +08:00
yiguolei	646ba2cc88	[bugfix](scannode) 1. make rows_read correct 2. use single scanner if has limit clause (#16473 ) make rows_read correct so that the scheduler could using this correctly. use single scanner if has limit clause. Move it from fragment context to scannode. --------- Co-authored-by: yiguolei <yiguolei@gmail.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2023-02-09 14:12:18 +08:00
yiguolei	6fdd35a6f2	[enhancement](mpp process) remove unused method and make report process more clear (#16441 ) both update status and open_vectorized_internal will call send_report and stop report thread. move update_status code to open method and remove unnecessary send_report and stop_report_thread. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-02-07 12:28:55 +08:00
Kang	737c73dcf0	[Improvement](topn) order by key topn query optimization (#15663 )	2023-02-06 15:36:05 +08:00
lihangyu	f2fd47f238	[Improve](row-store) support row cache (#16263 )	2023-02-06 11:16:39 +08:00
yiguolei	c689b8918a	[enhancement](runtimefilter) no need wait for fragment because two phase exec fragment (#16418 ) call pthread condition wait may block brpc thread. no need wait for fragment because two phase exec fragment already guarantee that the fragment instance exits when runtime filter comes. So that I remove the condition wait code. Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-02-05 10:09:31 +08:00
Xinyi Zou	63d57b83f3	[fix](memory) Fix request jemallloc metrics wait lock je_malloc_mutex_lock_slow #16381 MetricRegistry::trigger_all_hooks holds the metrics lock and is stuck in get_je_metrics, to_prometheus is waiting for MetricRegistry::trigger_all_hooks to release the lock, so get_je_metrics is no longer called in MetricRegistry::trigger_all_hooks.	2023-02-04 22:49:22 +08:00
Pxl	5e4bb98900	[Chore](build) enable -Wpedantic and update lowest gcc version to 11.1 (#16290 ) enable -Wpedantic and update lowest gcc version to 11.1	2023-02-03 11:28:48 +08:00
Mingyu Chen	cb6875b5a4	[improvement](multi-catalog) use date/datetimev2 as default col type for catalog table (#16304 ) 1. When mapping column from external datasource, use date/datetimev2 as default type 2. check `is_cancelled` when read data, to avoid endless loop after query is cancelled	2023-02-02 17:35:48 +08:00
yiguolei	eba70f972e	[improvement](global context) remove some unused method from runtime state (#16329 ) This is part of #16296. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-02-02 10:24:55 +08:00
Xinyi Zou	63042a38bd	[fix](memtracker) Fix high frequency load slow lock in memtracker (#16244 ) Global lock stuck in memtracker when bthread is frequently created	2023-02-02 09:53:44 +08:00
Kang	aa0837f198	[bugfix](topn) fix topn runtime predicate getting value bug for decimal type (#16331 ) * fix topn runtime predicate getting value bug for decimal type * fix cast_to_string bug for TYPE_DECIMALV2	2023-02-02 09:13:32 +08:00
Pxl	ca73c60442	[Chore](build) enable ignored-qualifiers check (#16196 ) enable ignored-qualifiers check	2023-02-01 15:15:59 +08:00
zhannngchen	6470ae58ea	[enhancement](config) remove config load_process_max_memory_limit_bytes (#15686 )	2023-01-31 21:36:34 +08:00
plat1ko	00a598a839	[feature](cooldown) Decouple storage policy and resource (#15873 )	2023-01-31 14:13:47 +08:00
yiguolei	c59a8cb15d	[refactor](remove unused code) remove log error hub (#16183 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-30 16:53:56 +08:00
chenlinzhong	fdc042bb39	[fix](vresultsink) BufferControlBlock may block all fragment handle threads (#16231 ) BufferControlBlock may block all fragment handle threads leads to be out of work modify include: BufferControlBlock cancel after max timeout StmtExcutor notify be to cancel the fragment when unexcepted occur more details see issue #16203	2023-01-30 16:53:21 +08:00
yiguolei	90b12143a3	[refactor](remove unused code) remove runtime tuple structure and useless utils class (#16237 )	2023-01-30 16:45:14 +08:00
Jerry Hu	a9671b6dfd	[feature](agg)support two level-hash map in aggregation node (#15967 )	2023-01-30 16:43:33 +08:00
yiguolei	4b6a4b3cf7	[refactor](remove unused code) Remove unused mempool declare or function params (#16222 ) * Remove unused mempool declare or function params --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-30 13:03:18 +08:00
yiguolei	3235b636cc	[refactor](remove unused code) remove thread pool manager (#16179 ) * remove thread resource manager * remove string buffer --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-29 13:03:08 +08:00
Xinyi Zou	e9afd3210c	[improvement](memory) Optimize the log of process memory insufficient and support regular GC cache (#16084 ) 1. When the process memory is insufficient, print the process memory statistics in a more timely and detailed manner. 2. Support regular GC cache, currently only page cache and chunk allocator are included, because many people reported that the memory does not drop after the query ends. 3. Reduce system available memory warning water mark to reduce memory waste 4. Optimize soft mem limit logging	2023-01-29 10:02:04 +08:00
yiguolei	241a956b20	[refactor](remove unused code) remove partition info from datastream sender (#16162 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-28 19:56:41 +08:00
Gabriel	7cf7706eb1	[Bug](runtimefilter) Fix wrong runtime filter on datetime (#16102 )	2023-01-28 18:16:06 +08:00
Xinyi Zou	949a065f22	[improvement](memory) load support overcommit memory (#16083 ) Overcommit memory means that when the memory is sufficient, it no longer checks whether the memory of query/load exceeds the exec mem limit. Instead, when the process memory exceeds the limit or the available system memory is insufficient, cancel the top overcommit query in minor gc, and cancel top memory query in full gc. Previously only query supported overcommit memory, this pr supports load, including insert into and stream load. Detailed explanation, I will update the memory document in these two days~ `15bd56cd43/docs/zh-CN/docs/admin-manual/maint-monitor/memory-management/memory-limit-exceeded-analysis.md`	2023-01-28 16:10:18 +08:00
yiguolei	e49766483e	[refactor](remove unused code) remove many xxxVal structure (#16143 ) remove many xxxVal structure remove BetaRowsetWriter::_add_row remove anyval_util.cpp remove non-vectorized geo functions remove non-vectorized like predicate Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-01-28 14:17:43 +08:00
zhannngchen	4e64ff6329	[enhancement](load) avoid schema copy to reduce cpu usage (#16034 )	2023-01-28 11:13:57 +08:00
yiguolei	adb758dcac	[refactor](remove non vec code) remove json functions string functions match functions and some code (#16141 ) remove json functions code remove string functions code remove math functions code move MatchPredicate to olap since it is only used in storage predicate process remove some code in tuple, Tuple structure should be removed in the future. remove many code in collection value structure, they are useless	2023-01-26 16:21:12 +08:00

1 2 3 4 5 ...

907 Commits