doris

Author	SHA1	Message	Date
Mingyu Chen	e848e456be	[config] modify tablet_shard to 4 and add some log (#18416 ) modify the default value of BE config tablet_map_shard_size to 4. To reduce lock contention. Add log when failed writing disk test file, for debug	2023-04-06 17:18:16 +08:00
gitccl	7f8d92656e	[fix](streamload) fix stream load failed when enable profile (#18364 ) #18015 enables stream load profile log, however be will encounter rpc fail when loading tpch data(see #18291). This is because when `is_report_success` is true, be will reportExecStatus to fe, but fe cannot find QueryInfo in `coordinatorMap`, thus it will return error to be.	2023-04-05 01:01:46 +08:00
Xinyi Zou	dd78001cc1	[fix](memory) Fix memtable flush mem tracker #18330	2023-04-03 20:37:14 +08:00
Mingyu Chen	7e61a85331	[refactor](libhdfs) introduce hadoop libhdfs (#18204 ) 1. Introduce hadoop libhdfs 2. For Linux-X86 platform, use the hadoop libhdfs 3. For other platform, use libhdfs3, because currently we don't have hadoop libhdfs binary for other platform Co-authored-by: adonis0147 <adonis0147@gmail.com>	2023-03-31 18:41:39 +08:00
lihangyu	35bae25568	[Improve](row store) add more profile info in log for point query and make row column page size more configurable (#18181 ) save about 20% FE cpu cost for point query with prepared statement which table contains 100 columns	2023-03-31 10:58:59 +08:00
Xinyi Zou	e5793249cd	[opt](hashtable) Modify default filled strategy to 75% (#18242 )	2023-03-31 09:28:11 +08:00
Lijia Liu	2ee1468576	[improvement](executor) Support task group schedule in pipeline engine (#17615 )	2023-03-30 10:49:50 +08:00
Xinyi Zou	01d012bab7	[fix](memory) Remove page cache regular clear, disabled jemalloc prof by default (#18218 ) Remove page cache regular clear Now the page cache is turned off by default. If the user manually opens the page cache, it can be considered that the user can accept the memory usage of the page cache, and then can consider adding a manual clear command to the cache. fix memory gc cancel top memory query jemalloc prof is not enabled by default	2023-03-30 09:39:37 +08:00
Mingyu Chen	05db6e9b55	[refactor](file-system)(step-2) remove env, file_utils and filesystem_utils (#18009 ) Follow #17586. This PR mainly changes: Remove env/ Remove FileUtils/FilesystemUtils Some methods are moved to LocalFileSystem Remove olap/file_cache Add s3 client cache for s3 file system In my test, the time of open s3 file can be reduced significantly Fix cold/hot separation bug for s3 fs. This is the last PR of #17764. After this, all IO operation should be in io/fs. Except for tests in #17586, I also tested some case related to fs io: clone concurrency query on local/s3/hdfs load error log create and clean disk metrics	2023-03-29 09:00:52 +08:00
yiguolei	359f5be53e	[refactor](cgroup) remove cgroup manager it is useless (#18124 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-03-27 23:02:18 +08:00
yiguolei	7ae51c856e	[refactor](unify exception) unify exception definition and error code (#18006 ) * [refactor](unify exception) unify exception definition and error code --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-03-25 12:41:07 +08:00
gitccl	0523860877	[Enhancement](streamload) print profile for streamload (#18015 ) When both enable_profile and enable_stream_load_profile_log is true, stream load profile is printed to the log	2023-03-24 20:17:33 +08:00
qiye	cedd36c786	[improvement](compaction)Support segcompaction for inverted index (#17874 ) Since Doris supports segcompaction #12866 during loading, inverted index support is also needed.	2023-03-23 14:41:30 +08:00
Mingyu Chen	cb79e42e5c	[refactor](file-system)(step-1) refactor file sysmte on BE and remove storage_backend (#17586 ) See #17764 for details I have tested: - Unit test for local/s3/hdfs/broker file system: be/test/io/fs/file_system_test.cpp - Outfile to local/s3/hdfs/broker. - Load from local/s3/hdfs/broker. - Query file on local/s3/hdfs/broker file system, with table value function and catalog. - Backup/Restore with local/s3/hdfs/broker file system Not test: - cold & host data separation case.	2023-03-21 21:08:38 +08:00
HappenLee	c29582bd57	[pipeline](split by segment)support segment split by scanner (#17738 ) * support segment split by scanner * change code by cr	2023-03-16 15:25:52 +08:00
airborne12	2e0af4e33c	[Enhancement](inverted-index) use read buffer when read index bytes in compound reader (#17306 ) Read IO would be a problem when reading inverted index from disk. Using read buffer to reduce IO. Set use buffer flag to be true when reading internal bytes in compound reader for inverted index.	2023-03-14 10:10:59 +08:00
Pxl	16fc3a0e22	[Chore](compile) remove some unused static on inline function to reduce compile time (#17603 ) remove some unused static on inline function to reduce compile time	2023-03-13 11:11:59 +08:00
HappenLee	39b5682d59	[Pipeline](shared_scan_opt) Support shared scan opt in pipeline exec engine	2023-03-13 10:33:57 +08:00
Gabriel	0432ba8b33	[refactor](status) refactor status judgement (#17592 )	2023-03-09 17:40:25 +08:00
zxealous	6923bf8d7b	[fix](file cache)fix block file cache can't be configured (#17511 )	2023-03-09 10:12:08 +08:00
Xinyi Zou	335c1e5953	[fix](memory) Fix MacOS mem_limit parse error and GC after env Init #17528 Fix MacOS mem_limit parse result is 0. Fix GC after env Init, otherwise, when the memory is insufficient, BE will start failure. * Query id: 0-0 * * Aborted at 1677833773 (unix time) try "date -d @1677833773" if you are using GNU date * * Current BE git commitID: 8ee5f45 * * SIGSEGV address not mapped to object (@0x70) received by PID 24145 (TID 0x7fa53c9fd700) from PID 112; stack trace: * 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t, void) at be/src/common/signal_handler.h:420 1# os::Linux::chained_handler(int, siginfo, void) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 2# JVM_handle_linux_signal in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 3# signalHandler(int, siginfo, void) in /usr/local/jdk/jre/lib/amd64/server/libjvm.so 4# 0x00007FA56295A400 in /lib64/libc.so.6 5# doris::MemTrackerLimiter::log_process_usage_str(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool) at be/src/runtime/memory/mem_tracker_limiter.cpp:208 6# doris::MemTrackerLimiter::print_log_process_usage(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, bool) at be/src/runtime/memory/mem_tracker_limiter.cpp:226 7# doris::Daemon::memory_maintenance_thread() at be/src/common/daemon.cpp:245 8# doris::Thread::supervise_thread(void*) at be/src/util/thread.cpp:455 9# start_thread in /lib64/libpthread.so.0 10# clone in /lib64/libc.so.6	2023-03-08 14:00:57 +08:00
Pxl	e2ac06d6d6	[Chore](execution) change PipelineTaskState to enum class && remove some row-based code (#17300 ) 1. change PipelineTaskState to enum class 2. remove some row-based code on FoldConstantExecutor::_get_result 3. reduce memcpy on minmax runtime filter function(Now we can guarantee that the input data is aligned) 4. add Wunused-template check, and remove some unused function, change some static function to inline function.	2023-03-08 12:41:15 +08:00
yiguolei	9213dd906a	[enhancement](exception) add exception structure and using unique ptr in VExplodeBitmapTableFunction (#17531 ) add exception class in common. using unique ptr in VExplodeBitmapTableFunction support single exception or nested exception, like this: ---SingleException [E-100] test OS_ERROR bug @ 0x55e80b93c0d9 doris::Exception::Exception<>() @ 0x55e80b938df1 doris::ExceptionTest_NestedError_Test::TestBody() @ 0x55e82e16bafb testing::internal::HandleSehExceptionsInMethodIfSupported<>() @ 0x55e82e15ab3a testing::internal::HandleExceptionsInMethodIfSupported<>() @ 0x55e82e1361e3 testing::Test::Run() @ 0x55e82e136f29 testing::TestInfo::Run() @ 0x55e82e1376e4 testing::TestSuite::Run() @ 0x55e82e148042 testing::internal::UnitTestImpl::RunAllTests() @ 0x55e82e16dcab testing::internal::HandleSehExceptionsInMethodIfSupported<>() @ 0x55e82e15ce4a testing::internal::HandleExceptionsInMethodIfSupported<>() @ 0x55e82e147bab testing::UnitTest::Run() @ 0x55e80c4b39e3 RUN_ALL_TESTS() @ 0x55e80c4a99b5 main @ 0x7f0a619d0493 __libc_start_main @ 0x55e80b84602a _start @ (nil) (unknown)	2023-03-08 10:44:14 +08:00
Pxl	d8f0ca7108	[Chore](schema change) remove some unused code in schema change (#17459 ) remove some unused code in schema change. remove some row-based config and code.	2023-03-07 09:18:34 +08:00
Jerry Hu	6f3801d9da	[chore](config) Increase the default maximum depth limit for expressions (#17418 )	2023-03-07 08:53:00 +08:00
Xinyi Zou	9617f46fa5	[improvement](memory) Modify `mem_limit` default value (#17322 ) Modify the default value of mem_limit to auto. auto means process mem limit is equal to max(physical mem * 0.9, 6.4G). 6.4G is the maximum memory reserved for the system.	2023-03-06 10:53:27 +08:00
Mingyu Chen	3d0beec01d	[fix](orc) fix heap-use-after-free and potential memory leak of orc reader (#17431 ) fix heap-use-after-free The OrcReader has a internal FileInputStream, If the file is empty, the memory of FileInputStream will leak. Besides, there is a Statistics instance in FileInputStream. FileInputStream maybe delete if the orc reader is inited failed, but Statistics maybe used when orc reader is closed, causing heap-use-after-free error. Potential memory leak When init file scanner in file scan node, the file scanner prepare failed, the memory of file scanner will leak.	2023-03-06 08:42:35 +08:00
lihangyu	2b014a0464	[Improve](doris::Status performance) fix the performance issue due to copy of std::string (#17411 )	2023-03-04 15:08:59 +08:00
Jerry Hu	823d968452	[fix](expr) avoid crashing caused by big depth of expression tree (#17314 )	2023-03-02 16:55:53 +08:00
Mingyu Chen	30df268c1f	[fix](hdfs)(catalog) fix BE crash when hdfs-site.xml not exist in be/conf and fix compute node logic (#17244 ) We set LIBHDFS3_CONF env in start_be.sh, so libhdfs3 will try to read this hdfs-site.xml, if file does not exist, it will throw error. But Doris does not handle this error, cause BE crash. This CL mainly changes: Modify start_be.sh to only set LIBHDFS3_CONF if hdfs-site.xml exist. Refactor the HDFSCommonBuilder so that it can return error correctly. Add BE IP info in status, so that we can get ip from error msg like: ERROR 1105 (HY000): errCode = 2, detailMessage = [INTERNAL_ERROR]failed to init reader for file 000.snappy.orc, err: [INTERNAL_ERROR][172.21.0.101]failed to init HDFSCommonBuilder, please check check be/conf/hdfs-site.xml The logic of prefer compute node is wrong, which causing the external table query can only assign up to 3 backends. This CL refactor this logic and also change some FE config: prefer_compute_node_for_external_table If set to true, query on external table will prefer to assign to compute node. And the max number of compute node is controlled by min_backend_num_for_external_table. If set to false, query on external table will assign to any node. min_backend_num_for_external_table Only take effect when prefer_compute_node_for_external_table is true. If the compute node number is less than this value, query on external table will try to get some mix node to assign, to let the total number of node reach this value. If the compute node number is larger than this value, query on external table will assign to compute node only.	2023-03-02 11:09:55 +08:00
Xinyi Zou	b7677beab7	[enhancement](memtracker) Add special counter for memtracker and fix thread create and destroy track #17301 Add a special counter for memtracker, faster, but relaxed ordering and not accurate in real time Track thread create and destroy memory, which was previously removed due to performance loss and added back	2023-03-02 08:55:00 +08:00
Xinyi Zou	a1e3b908d7	[fix](memory) split mem usage thread and gc thread to different threads (#17213 ) Ensure that the memory status is refreshed in time Avoid frequent GC	2023-03-01 19:19:05 +08:00
shee	6eeba204f9	[Enhancement] path scan causes disk io to skyrocket (#16968 )	2023-02-25 09:15:15 +08:00
zhannngchen	e5f884a6fc	[enhancement](cache) make segment cache prune more effectively (#17011 ) BloomFilter in MoW table may consume lots of memory, and it's life cycle is same as segment. This patch try to improve the efficiency of recycling segment cache, to release the memory in time.	2023-02-23 18:24:18 +08:00
Lijia Liu	8eeb435963	[improvement](meta) Enhance Doris's fault tolerance to disk error (#16472 ) Sense io error. Retry query when io error. Greylist: When finds one disk is completely broken, or the diff of tablet number in BE and FE meta is too large,reduce the query priority of the BE.	2023-02-23 08:40:45 +08:00
Xinyi Zou	a1c0054b4c	[fix](memory) fix memory GC details and join probe catch bad_alloc (#16989 ) Fix Redhat 4.x OS /proc/meminfo has no MemAvailable, disable MemAvailable to control memory. vm_rss_str and mem_available_str recorded when gc is triggered, to avoid memory changes during gc and cause inaccurate logs. join probe catch bad_alloc, this may alloc 64G memory at a time, avoid OOM. Modify document doris_be_all_segments_num and doris_be_all_rowsets_num names.	2023-02-23 08:33:30 +08:00
YueW	7b0fc17c04	[enhancement](inverted index) Support fulltext index evaluate equal query and list query (#16994 ) Fulltext index is the inverted index of the specified tokenizer, before this pr, fulltext index only can evaluate match predicate, this pr to support evaluate equal predicate and list predicate.	2023-02-22 20:18:10 +08:00
chenlinzhong	0e3be4eff5	[Improvement](brpc) Using a thread pool for RPC service avoiding std::mutex block brpc::bthread (#16639 ) mainly include: - brpc service adds two types of thread pools. The number of "light" and "heavy" thread pools is different Classify the interfaces of be. Those related to data transmission are classified as heavy interfaces and others as light interfaces - Add some monitoring to the thread pool, including the queue size and the number of active threads. Use these - indicators to guide the configuration of the number of threads	2023-02-22 14:15:47 +08:00
Mingyu Chen	c0bb2e33a8	[improvement](scan) separate scanner into local and remote scanner pool (#16891 ) There are 2 kinds for scanner thread pool, local and remote. Local is for local file read, specially for olap scanner. Remote is for other external data source, such as file scanner, jdbc scanner. This PR mainly changes: For olap scanner, use cold or hot rowset to decide whether to use local or remote pool. For other scanner, user remote pool by default. Add a new BE config doris_max_remote_scanner_thread_pool_thread_num, default is 512, indicate the max thread number of the remote scanner thread pool This will alleviate the problem of interaction between olap queries with load job and external queries.	2023-02-21 14:13:09 +08:00
lihangyu	113023fb86	(Enhancement)[load-json] support simdjson in new json reader (#16903 ) be config: enable_simdjson_reader=true related PR #11665	2023-02-21 11:31:00 +08:00
YueW	30dafd6a44	[improve](inverted index) Add element count limit for inverted index searcher cache (#16758 ) The element in InvertedIndexSearcherCache is inverted index searcher, which is a file descriptor of inverted index file, so InvertedIndexSearcherCache is actually cache file descriptor of inverted index file. If open file descriptor limit of the Linux system is set too small and config inverted_index_searcher_cache_limit is too big, during high pressure load maybe cause "Too many open files". So, when insert inverted index searcher into InvertedIndexSearcherCache, need also check whether reach file_descriptor_number limit for inverted index file.	2023-02-17 11:53:07 +08:00
lihangyu	2426d8e6e8	[chore](be-config) set disable_storage_row_cache default true to default disable row cache (#16827 )	2023-02-17 10:21:28 +08:00
zhengshengjun	d013d529c8	[Feature](ipv6)Support IPV6 (#14063 ) Support IPV6 in Apache Doris, the main changes are: 1. enable binding to IPV6 address if network priority in config file contains an IPV6 CIDR string 2. BRPC and HTTP support binding to IPV6 address 3. BRPC and HTTP support visiting IPV6 Services	2023-02-14 21:43:10 +08:00
plat1ko	f1b9185830	[feature](cooldown) Implement cold data compaction (#16681 )	2023-02-14 15:21:54 +08:00
plat1ko	5014ad03e7	[feature](cooldown) Auto delete unused remote files (#16588 )	2023-02-13 23:59:39 +08:00
huangzhaowei	f41a2055d3	[feature](Load)Remove user/password in properties for mysql load to avoid double auth. (#16073 ) Use FE cluster token to auth stream load. This auth is only open for be, and fe auth still only support http basic auth. I will use this auth for mysql load to build a no-auth stream load from fe to be. And this will avoid double auth in mysql load. More information to see the design doc.	2023-02-13 10:00:08 +08:00
caiconghui	1de4e312cc	[fix](metric) Fix be core when set enable_system_metrics to false in be (#16646 ) when enable_system_metrics is false, we should not use system_metrics any more Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2023-02-12 23:01:41 +08:00
Kang	aba843bb2b	[Improvement](inverted index) inverted index query match bitmap cache (#16578 ) Add cache for inverted index query match bitmap to accelerate common query keyword, especially for keyword matching many rows. Tests result: - large result: matching 99% out of 247 million rows shows 8x speed up. - small result: matching 0.1% out of 247 million rows shows 2x speed up.	2023-02-11 13:38:58 +08:00
lihangyu	37d1519316	[WIP](dynamic-table) support dynamic schema table (#16335 ) Issue Number: close #16351 Dynamic schema table is a special type of table, it's schema change with loading procedure.Now we implemented this feature mainly for semi-structure data such as JSON, since JSON is schema self-described we could extract schema info from the original documents and inference the final type infomation.This speical table could reduce manual schema change operation and easily import semi-structure data and extends it's schema automatically.	2023-02-11 13:37:50 +08:00
YueW	43eca4f209	[Feature-WIP](inverted index) Implementation for alter inverted index. (#16371 ) implementation for add/drop inverted index.	2023-02-10 17:56:17 +08:00

1 2 3 4 5 ...

499 Commits