doris

Author	SHA1	Message	Date
Mingyu Chen	38c5030f97	[opt](log) refactor the log dir config (#32933 ) Refactor the config for log dir of FE and BE TLDR: - Use env variable `LOG_DIR` to set root log dir - Remove `sys_log_dir` for FE and BE Details: 1. FE 1. The root log dir is set by env variable `LOG_DIR` in `fe.conf` 2. The default value of `audit_log_dir` is same as `${LOG_DIR}/` 3. The default value of `spark_launcher_log_dir` is `${LOG_DIR}/spark_launcher_log` 4. The default value of `nereids_trace_log_dir` is `${LOG_DIR}/nereids_trace_log` 5. The origin `sys_log_dir` is deprecated, and default value is `""`. But for compatibility, if user already set `sys_log_dir` before, Doris will still use it as root log dir. 2. BE 1. The root log dir is set by env variable `LOG_DIR` in `be.conf` 2. Remove `pipeline_tracing_log_dir`, use `${LOG_DIR}` directly. 3. The origin `sys_log_dir` is deprecated, and default value is `""`. But for compatibility, if user already set `sys_log_dir` before, Doris will still use it as root log dir.	2024-04-17 23:41:59 +08:00
Qi Chen	e841d82ffb	[Enhancement](hive-writer) Adjust table sink exchange rebalancer params. (#33397 ) Issue Number: #31442 Change table sink exchange rebalancer params to node level and adjust these params to improve write performance by better balance. rebalancer params: ``` DEFINE_mInt64(table_sink_partition_write_min_data_processed_rebalance_threshold, "26214400"); // 25MB // Minimum partition data processed to rebalance writers in exchange when partition writing DEFINE_mInt64(table_sink_partition_write_min_partition_data_processed_rebalance_threshold, "15728640"); // 15MB ```	2024-04-12 13:09:56 +08:00
abmdocrt	1b3a11a02b	[Enhancement](merge-on-write) Support dynamic delete bitmap cache (#32991 ) * The default delete bitmap cache is set to 100MB, which can be insufficient and cause performance issues when the amount of user data is large. To mitigate the problem of an inadequate cache, we will take the larger of 5% of the total memory and 100MB as the delete bitmap cache size.	2024-04-10 14:53:56 +08:00
TengJianPing	517c12478f	[improvement](spill) spill trigger improvement (#32641 )	2024-04-10 14:52:46 +08:00
Xinyi Zou	cf7595d423	[opt](memory) Optimize mem tracker accuracy (#32039 ) (#33140 )	2024-04-10 11:42:19 +08:00
Mingyu Chen	c758a25dd8	[opt](fqdn) Add DNS Cache for FE and BE (#32869 ) In previously, when enabling FQDN, Doris will call dns resolver to get IP from hostname each time when 1) FE gets BE's grpc client. 2) BE gets other BE's brpc client. So when in high concurrency case, the dns resolver be overloaded and failed to resolve hostname. This PR mainly changes: 1. Add DNSCache for both FE and BE. The DNSCache will run on every FE and BE node. It has a cache, key is hostname and value is IP. Caller can get IP by hostname from this cache, and if hostname does not exist, it will try to resolve it and update the cache. In addition, DNSCache has a daemon thread to refresh the cache every 1 min, in case that the IP may be changed at anytime. There are other implements of this dns cache: 1. `36fed13997` This is for BE side, but it does not handle the IP change case. 3. https://github.com/apache/doris/pull/28479 This is for FE side, but it can only work with Master FE. Other FE node will not be aware of the IP change. And there are a bunch of BackendServiceProxy, this PR only handle cache in one of them.	2024-04-07 22:16:04 +08:00
airborne12	0122b8a6b4	[Update](inverted index) add config for inverted index query cache shards (#32666 )	2024-03-26 20:27:33 +08:00
zhangstar333	4ebbabf15e	[test](fuzzy) test fuzzy in BE (#31607 ) test fuzzy in BE	2024-03-24 08:06:13 +08:00
qiye	a4a191fe56	[fix](index compaction)Fix MOW index compaction core (#32121 ) (#32657 )	2024-03-22 14:20:19 +08:00
Mingyu Chen	e99b33c274	[opt](file-meta-cache) reduce file meta cache size and disable cache for some cases (#32340 ) File meta cache on BE is used to cache the meta for external table's file such as parquet footer. This cache is counted by number, not memory consumption. So if the cache object is big(eg, a large parquet footer), the total memory consumption of this cache will be large and causing OOM. This PR mainly changes: 1. Add a new method `exceed_prune_limit()` for `CachePolicy` For `ObjLRUCache`, it always return true so that the minor of full gc on BE will prune the cache each time. 2. Reduce the default capability of file meta cache, from 20000 to 1000 Also change the default capability of hdfs file handle cache, from 20000 to 1000 4. Change judgement of whether enable file meta cache when querying If the number of file need to be read is larger than the 1/3 of the file meta cache's capability, file meta cache will be disabled for this query. Because cache is useless if there are too many files.	2024-03-21 14:07:22 +08:00
Mingyu Chen	ef2151ae66	[Feature-WIP](multi-catalog) Add Hive sink on BE side. (#32306 ) (#32364 ) bp #32306 Co-authored-by: Qi Chen <kaka11.chen@gmail.com>	2024-03-18 11:23:01 +08:00
Gabriel	4bf202db04	[pipelineX](exchange) Make exchange buffer size configurable (#32201 )	2024-03-16 20:58:20 +08:00
ryanzryu	c5ffeff833	[fix](s3 client)add default ca cert list for s3 client to avoid problem:'curlCode:77' (#32285 ) Co-authored-by: ryanzryu <ryanzryu@tencent.com>	2024-03-16 20:55:28 +08:00
TengJianPing	3358f76a7f	[feature](spill) Implement spill to disk for hash join, aggregation and sort for pipelineX (#31910 ) Co-authored-by: Jerry Hu <mrhhsg@gmail.com>	2024-03-12 14:12:09 +08:00
airborne12	daa171ee3a	[Update](cloud) add inverted index tmp dir support (#31484 )	2024-03-02 01:08:51 +08:00
zhangstar333	c40c16b8b3	[improve](conf)refactor fuzzy mode in BE (#31412 ) refactor the code of fuzzy in BE, and will be add more variables in it, then could test case at different mode.	2024-02-29 19:51:07 +08:00
zclllyybb	82add8dfc1	[Fix](timezone) Introduce a config to use Doris tzdata directly (#31561 )	2024-02-29 12:38:03 +08:00
zclllyybb	b177b26d39	[branch-2.1](tracing) Pick pipeline tracing and relative bugfix (#31367 ) * [Feature](pipeline) Trace pipeline scheduling (part I) (#31027) * [fix](compile) Fix performance compile fail #31305 * [fix](compile) Fix macOS compilation issues for PURE macro and CPU core identification (#31357) * [fix](compile) Correct PURE macro definition to fix compilation on macOS * 2 --------- Co-authored-by: zy-kkk <zhongyk10@gmail.com>	2024-02-29 08:42:35 +08:00
AlexYue	f18c853495	[enhance](S3) Init default retry strategy for aws s3 sdk (#31329 )	2024-02-28 13:08:36 +08:00
Xinyi Zou	4c3a96e7df	[fix](memory) Fix LRU cache frequent prune (#31220 )	2024-02-22 19:51:20 +08:00
Mingyu Chen	a8d8c6a271	[fix](file-writer) opt s3 file writer and fix empty file related issue #28983 #30703 #31169 (#31213 ) * (feature)(cloud) Use dynamic allocator instead of static buffer pool for better elasticity. (#28983) * [fix](outfile) Fix unable to export empty data (#30703) Issue Number: close #30600 Fix unable to export empty data to hdfs / S3, this behavior is inconsistent with version 1.2.7, version 1.2.7 can export empty data to hdfs/ S3, and there will be exported files on S3/HDFS. * [fix](file-writer) avoid empty file for segment writer (#31169) --------- Co-authored-by: AlexYue <yj976240184@gmail.com> Co-authored-by: zxealous <zhouchangyue@baidu.com>	2024-02-21 16:48:54 +08:00
Xin Liao	9a708806e0	[fix](segcompaction) enable segcompaction by default (#30810 )	2024-02-19 19:04:22 +08:00
yiguolei	a3c78dd21a	[chore](refactor) refactor some rf code and delete rpc file (#31031 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-02-18 11:50:17 +08:00
huanghaibin	bbbe3e666a	[improvement](group_commit) Rename fail wal to tmp should only use in test P0 scenario (#30959 )	2024-02-16 10:12:24 +08:00
HHoflittlefish777	d123abc903	disable check segment when build rowset meta by default (#30857 )	2024-02-05 22:00:36 +08:00
Kaijie Chen	499fd27ed0	[config](move-memtable) set StreamWait timeout default to 10min (#30831 )	2024-02-05 21:59:55 +08:00
zy-kkk	4f8730d092	[improvement](jdbc catalog) Optimize connection pool parameter settings (#30588 ) This PR makes the following changes to the connection pool of JDBC Catalog 1. Set the maximum connection survival time, the default is 30 minutes - Moreover, one-half of the maximum survival time is the recyclable time, - One-tenth is the check interval for recycling connections 2. Keepalive only takes effect on the connection pool on BE, and will be activated based on one-fifth of the maximum survival time. 3. The maximum number of existing connections is changed from 100 to 10 4. Add the connection cache recycling thread on BE, and add a parameter to control the recycling time, the default is 28800 (8 hours) 5. Add CatalogID to the key of the connection pool cache to achieve better isolation, requires refresh catalog to take effect 6. Upgrade druid connection pool to version 1.2.20 7. Added JdbcResource's setting of default parameters when upgrading the FE version to avoid errors due to unset parameters.	2024-02-03 20:26:03 +08:00
Xin Liao	bc8c1875ac	[enhencement](config) change default write buffer size (#30749 )	2024-02-03 20:25:25 +08:00
Pxl	0f47f7f389	[Feature](runtime filter) normalize ignore runtime filter (#30152 ) normalize ignore runtime filter	2024-02-03 20:24:39 +08:00
HHoflittlefish777	1d632f1af4	[improvement](move-memtable) enable stream write to socket in background bthread (#30586 )	2024-01-31 23:53:39 +08:00
deardeng	cc3c6d1479	[improvement](create tablet) backend create tablet round robin among … (#30530 ) * [improvement](create tablet) backend create tablet round robin among … (#29818) * [improvement](create tablet) be choose disk tolerate with little skew (#30354) --------- Co-authored-by: yujun <yu.jun.reach@gmail.com>	2024-01-30 10:20:35 +08:00
Yongqiang YANG	bedad15f03	[enhancement](scanner) add a lower bound for bytes in scanner queue (#29624 )	2024-01-27 09:13:21 +08:00
AlexYue	46cadc9856	[minor](Prefetch) log slow prefetch io operation #30415	2024-01-27 09:11:02 +08:00
Kaijie Chen	90ed663eb0	[fix](move-memtable) all sinks wait stream close for load timeout (#30356 )	2024-01-25 21:37:33 +08:00
wangbo	9e0c518aaf	[Feature](executor)Workload Group support Non-Pipeline Execution (#30164 )	2024-01-23 10:11:25 +08:00
Kaijie Chen	9dd368f8dc	[config](load) set default memtable_flush_running_count_limit to 2 (#30146 )	2024-01-23 10:07:51 +08:00
Pxl	30378f9bbc	[Chore](config) remove some unused config (#29983 ) remove some unused config	2024-01-18 12:03:07 +08:00
HHoflittlefish777	2fa511f80e	[improve](multi-table-load) avoid plan and execute too many plan at once (#29951 )	2024-01-16 21:14:35 +08:00
Ashin Gau	a0355a6580	[opt](scanner) optimize the number of threads of scanners, follow up #28640 (#30030 ) follow up #28640, doris_scanner_thread_pool_thread_num may be to large in machines with 128 cores.	2024-01-16 20:23:09 +08:00
yujun	8ca807578f	[fix](migrate disk) fix migrate disk lost data during publish version (#29887 ) Co-authored-by: Yongqiang YANG <98214048+dataroaring@users.noreply.github.com>	2024-01-16 18:37:06 +08:00
slothever	25428bd7fb	[fix](kerberos) fix BE kerberos ccache renew, optimize kerbero options (#29291 ) 1. we need remove BE kinit, and use jni login with keytab, because kinit cannot renew TGT for doris in many complex cases. > This pull requet will support new instance from keytab: https://github.com/apache/doris-thirdparty/pull/173, so now we won't need kinit cmd, just login with keytab and principal 2. add `kerberos_ccache_path` to set kerberos credentials cache path manually. 3. add `max_hdfs_file_handle_cache_time_ms` to set hdfs fs handle cache time.	2024-01-16 18:35:29 +08:00
Kaijie Chen	620cfc3cd7	[fix](move-memtable) set idle timeout equal to load timeout (#29839 )	2024-01-16 18:33:51 +08:00
yujun	e3a1138da7	[fix](migrate disk) fix tablet disk migration timeout too large (#29895 )	2024-01-16 18:33:51 +08:00
wangbo	ad986a78ae	[Fix](executor)Fix Grayscale upgrade be code dump when report statistics #29843	2024-01-12 13:58:19 +08:00
Lightman	d525f576e1	[improve] Use lru cache to count the number of column in tablet schema to control memory (#29668 )	2024-01-12 13:58:19 +08:00
wangbo	0d691c638b	[Feature](profile)Support report runtime workload statistics #29591	2024-01-12 11:59:27 +08:00
Kaijie Chen	28f18a33ea	[improve](config) print value in error message (#29207 )	2024-01-06 18:43:52 +08:00
deardeng	f40cce1406	[Fix](partition) Skip rowset partition id eq 0 smaller than config wh… (#29510 )	2024-01-05 19:39:51 +08:00
huanghaibin	eea6219920	[improvement](group commit) Add an option of enable relay wal on all regression test (#28757 )	2024-01-05 18:27:31 +08:00
Yongqiang YANG	d6cb2d6d5c	[improvement](compaction) start 1 cumu compaction thread each disk by default (#29430 )	2024-01-03 20:48:11 +08:00

1 2 3 4 5

247 Commits