doris

Author	SHA1	Message	Date
zxealous	54fc038dc5	[Fix](remote) Fix thread safety issue in cache (#11984 )	2022-08-24 18:14:14 +08:00
Mingyu Chen	05da3d947f	[feature-wip](new-scan) add scanner scheduling framework (#11582 ) There are currently many types of ScanNodes in Doris. And most of the logic of these ScanNodes is the same, including: Runtime filter Predicate pushdown Scanner generation and scheduling So I intend to unify the common logic of all ScanNodes. Different data sources only need to implement different Scanners for data access. So that the future optimization for scan can be applied to the scan of all data sources, while also reducing the code duplication. This PR mainly adds 4 new class: VScanner All Scanners' parent class. The subclasses can inherit this class to implement specific data access methods. VScanNode The unified ScanNode, and is responsible for common logic including RuntimeFilter, predicate pushdown, Scanner generation and scheduling. ScannerContext ScannerContext is responsible for recording the execution status of a group of Scanners corresponding to a ScanNode. Including how many scanners are being scheduled, and maintaining a producer-consumer blocks queue between scanners and scan nodes. ScannerContext is also the scheduling unit of ScannerScheduler. ScannerScheduler schedules a ScannerContext at a time, and submits the Scanners to the scanner thread pool for data scanning. ScannerScheduler Unified responsible for all Scanner scheduling tasks Test: This work is still in progress and default is disabled. I tested it with jmeter with 50 concurrency, but currently the scanner is just return without data. The QPS can reach about 9000. I can't compare it to origin implement because no data is read for now. I will test it when new olap scanner is ready. Co-authored-by: morningman <morningman@apache.org>	2022-08-23 08:45:18 +08:00
Pxl	089fe01aea	[Feature](vectorized alter table) set vectorized alter table to default open (#11897 )	2022-08-19 10:57:00 +08:00
lihangyu	01383c3217	[Enhancement](stream-load-json) using simdjson to parse json (#11665 ) Currently we use rapidjson to parse json document, It's fast but not fast enough compare to simdjson.And I found that the simdjson has a parsing front-end called simdjson::ondemand which will parse json when accessing fields and could strip the field token from the original document, using this feature we could reduce the cost of string copy(eg. we convert everthing to a string literal in _write_data_to_column by sprintf, I saw a hotspot from the flamegrame in this function, using simdjson::to_json_string will strip the token(a string piece) which is std::string_view and this is exactly we need).And second in _set_column_value we could iterate through the json document by for (auto field: object_val) {xxx}, this is much faster than looking up a field by it's field name like objectValue.FindMember("k1").The third optimization is the at_pointer interface simdjson provided, this could directly get the json field from original document.	2022-08-16 14:49:50 +08:00
plat1ko	fecfdd78bf	[enhancement](status) Fix Status related macros to enable RVO or move ctor (#11753 )	2022-08-16 14:40:35 +08:00
Xinyi Zou	c124470408	[enhancement](memory) Fix too much cache leads to less memory available for queries (#11751 ) Disable Chunk Allocator in Vectorized Allocator, this will reduce memory cache. For high concurrent queries, using Chunk Allocator with vectorized Allocator can reduce the impact of gperftools tcmalloc central lock. Jemalloc or google tcmalloc have core cache, Chunk Allocator may no longer be needed after replacing gperftools tcmalloc.	2022-08-16 14:35:57 +08:00
plat1ko	4047c3577d	[enhancement](Status) Optimize Status implementation	2022-08-12 11:39:35 +08:00
Jerry Hu	c8418d13b5	[improvement](config)Use session variable to replace configuration for 'enable_function_pushdown' (#11641 )	2022-08-10 19:25:02 +08:00
Ashin Gau	37d1180cca	[feature-wip](parquet-reader)decode parquet data (#11536 )	2022-08-08 12:44:06 +08:00
Pxl	2cd3bf80dc	[bugfix](schema change)fix core dump on vectorized_alter_table (#11538 )	2022-08-08 10:45:28 +08:00
yixiutt	bd4048f8fb	[enhancement](compaction) add idle schedule and max_size limit for base compaction (#11542 ) Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-08-07 16:21:57 +08:00
pengxiangyu	a943adac1a	[feature](cache) Add FileCache for RemoteFile (#11186 ) Add FileCache for RemoteFile, it will be opened in StoragePolicy. Cold data in remote file will be download to local cache files.	2022-08-04 10:57:32 +08:00
weizuo93	f730a048b1	[feature-wip](load) Support single replica load (#10298 ) During load process, the same operation are performed on all replicas such as sort and aggregation, which are resource-intensive. Concurrent data load would consume much CPU and memory resources. It's better to perform write process (writing data into MemTable and then data flush) on single replica and synchronize data files to other replicas before transaction finished.	2022-08-02 11:44:18 +08:00
Mingyu Chen	abbf75d302	[doc][refactor](metrics) Reorganize FE and BE metrics and add document (#11307 )	2022-08-02 11:34:06 +08:00
Luwei	d6f937cb01	(performance)[scanner] Isolate local and remote queries using different scanner… (#11006 )	2022-07-29 19:14:46 +08:00
luozenglin	d3c88471ad	[tracing] Support opentelemtry collector. (#10864 ) * [tracing] Support opentelemtry collector. 1. support for exporting traces to multiple distributed tracing system via collector; 2. support using collector to process traces.	2022-07-29 16:49:40 +08:00
plat1ko	a6537a90cd	[Enhancement] Garbage collection of unused data on remote storage backend (#10731 ) * [Feature](cold_on_s3) support unused remote rowset gc * return aborted when skip drop tablet * perform unused remote rowset gc	2022-07-29 14:38:39 +08:00
Pxl	4e6a59df4c	[Improvement][chore] add const to all operator== (#11251 )	2022-07-27 21:46:47 +08:00
Xinyi Zou	b6bdb3bdbc	[fix] (mem tracker) Fix MemTracker accuracy (#11190 )	2022-07-27 18:59:24 +08:00
yixiutt	01e108cb7b	[feature-wip](unique-key-merge-on-write) update delete bitmap while publish version (#11195 ) 1.make version publish work in version order 2.update delete bitmap while publish version, load current version rowset primary key and search in pre rowsets 3.speed up publish version task by parallel tablet publish task Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-07-27 16:26:42 +08:00
Compilation Success	0d50a658f5	[fix](config) Fix uninitialized config validator (#11022 ) * [fix](config) Fix uninitialized config validator If we don't set any validator, the original implementation will cause seg-fault due to nullptr of `RegisterConfValidator::_s_field_validator`, which initial value relies on the first config validator definition. We need to skip finding validator from uninitialized `RegisterConfValidator::_s_field_validator`.	2022-07-25 15:10:55 +08:00
xy720	40c8853c5d	[Fix] Fix select external table return “Lost connection to MySQL server during query” error	2022-07-22 11:24:09 +08:00
Xinyi Zou	4960043f5e	[enhancement] Refactor to improve the usability of MemTracker (step2) (#10823 )	2022-07-21 17:11:28 +08:00
Compilation Success	a1758bd139	[feature-wip](unique-key-merge-on-write) Add agg cache for delete bitmap DSIP-018 (#10921 ) Use global LRU for delete bitmap cache	2022-07-21 12:48:44 +08:00
Jet He	f6cb7a838b	[Optimize] Improve performance like/not like filter through pushdown function to storage engine (#10355 ) * support like/not like conjuncts push down to storage engine * vectorized engine support like/not like conjuncts push down to storage engine * support both evaluate and evaluate_vec method in like predicate * reuse remove_pushed_conjuncts and prevent logic error during move function conjuncts * change #ifndef to pragma once as per comments * change enable_function_pushdown default to false Co-authored-by: heguangnan <heguangnan@bytedance.com>	2022-07-19 08:33:04 +08:00
Gabriel	842ff2b1e2	[refactor] Refactor time LUT (#10982 )	2022-07-19 08:23:29 +08:00
slothever	8a366c9ba2	[feature](multi-catalog) read parquet file by start/offset (#10843 ) To avoid reading the repeat row group, we should align offsets	2022-07-18 20:51:08 +08:00
TengJianPing	890fd70620	[improvement] dynamically calculate max rows to read in a batch to avoid oom (#10972 )	2022-07-18 17:43:53 +08:00
Tiewei Fang	3d52bff8d1	[improvement]output query_id when be core dumped. (#10822 )	2022-07-14 10:55:28 +08:00
Gabriel	3b46242483	[feature-wip] Optimize Decimal type (#10794 ) * [feature-wip](decimalv3) support decimalv3 * [feature-wip] Optimize Decimal type Co-authored-by: liaoxin <liaoxinbit@126.com>	2022-07-14 10:50:50 +08:00
Tiewei Fang	def59a686e	[improvement]output fetal log to stderr (#10789 )	2022-07-13 16:34:37 +08:00
Tiewei Fang	6063c0c9c8	[enhancement](singal) output git commit id when the program coredump (#10788 ) * [enhancement](singal) output git commit id when the program coredump * modift output info	2022-07-13 08:24:58 +08:00
lihangyu	b04a791895	[Enhancement] support compile with jemalloc (#10542 ) A test feature to use jemalloc as default malloc.	2022-07-11 12:15:35 +08:00
Kidd	4cb80c5733	[memtracker]fix fix_memtracker_performance_ (#10629 )	2022-07-11 08:35:05 +08:00
luozenglin	d5ea677282	[feature](tracing) Support query tracing to improve doris observability by introducing OpenTelemetry. (#10533 ) The collection of query traces is implemented in fe and be, and the spans are exported to zipkin. DSIP: https://cwiki.apache.org/confluence/display/DORIS/DSIP-012%3A+Introduce+opentelemetry	2022-07-09 15:50:40 +08:00
slothever	c358a43f35	[feature-wip] support parquet predicate push down (#10512 )	2022-07-08 23:11:25 +08:00
plat1ko	331fa50501	[feature](cold-data) move cold data to object storage without losing any feature(BE) (#10280 ) This PR supports rowset level data upload on the BE side, so that there can be both cold data and hot data in a tablet, and there is no necessary to prohibit loading new data to cooled tablets. Each rowset is bound to a `FileSystem`, so that the storage layer can read and write rowsets without perceiving the underlying filesystem. The abstracted `RemoteFileSystem` can try local caching strategies with different granularity, instead of caching segment files as before. To avoid conflicts with the code in be/src/io, we temporarily put the file system related code in the be/src/io/fs directory. In the future, `FileReader`s and `FileWriter`s should be unified.	2022-07-08 12:18:39 +08:00
yiguolei	89e56ea67f	[refactor] remove alpha rowset related code and vectorized row batch related code (#10584 )	2022-07-05 20:33:34 +08:00
caiconghui	1a173a854e	[fix](routine-load) Fix that routine load cannot work with old kafka version (#10554 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-07-04 10:47:50 +08:00
Tiewei Fang	c9f86bc7e2	[refactor] Refactoring Status static methods to format message using fmt(#9533 )	2022-07-02 18:58:23 +08:00
yiguolei	97996c9275	[fix](Insert) fix 5 concurrent "insert...select..." OOM (#10501 ) * [hotfix](dev-1.0.1) 5 concurrent insert...select... OOM Co-authored-by: minghong <minghong.zhou@163.com> Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-07-01 15:29:26 +08:00
Pxl	8e713ddfcf	[Feature] [Vectorized] support vectorized schema-change (#10187 )	2022-06-29 15:07:21 +08:00
Xinyi Zou	deeb3028ad	[Enhancement] [Memory] [Vectorized] Stress test and optimize memory allocation (#9581 ) * vec stress test, Allocator introduce chunkallocator * fix comment	2022-06-29 02:57:51 +08:00
ZenoYang	4ca257a1cd	[improvement] Modify the default value of doris_scan_range_max_mb (#10232 ) * [improvement] Modify the default value of doris_scan_range_max_mb * fix regression-test	2022-06-25 19:48:49 +08:00
carlvinhust2012	1541dcd919	fix some typo in comments (#10374 )	2022-06-24 07:20:08 +08:00
Yongqiang YANG	f466668d48	[improvement] each tuple starting at aligned address to build with ubsan enabled (#8831 ) When I builded doris be with ubsan enabled and enabled vectorization, be core dump at doris::DecimalV2Value::operator long(). It cored because accessing on a non-aligned address by sse. With ubsan enabled, compile generates different assemble code including sse instruction. A sender serializes tuples to a contiguous memory area, while a receiver just copy it. So we should align each tuple offset to 16 bytes. For compatibility, we should use a config to control it. BTW: with tools like ubsan, asan, tsan we can find bugs more easily, e.g. #8815. It is difficult to find the bug without ubsan. Anyway, we should use modern tools to be more productive.	2022-06-23 14:03:01 +08:00
Xinyi Zou	6ad024a2bf	[fix] (mem tracker) Refactor memtable mem tracker, fix flush memtable DCHECK failed (#10156 ) 1. Added memory leak detection for `DeltaWriter` and `MemTable` mem tracker 2. Modify memtable mem tracker to virtual to avoid frequent recursive consumption of parent tracker. 3. Disable memtable flush thread attach memtable tracker, ensure that memtable mem tracker is completely accurate. 4. Modify `memory_verbose_track=false`. At present, there is a performance problem in the frequent switch thread mem tracker. - Because the mem tracker exists as a shared_ptr in the thread local. Each time it is switched, the atomic variable use_count in the shared_ptr of the current tracker will be -1, and the tracker to be replaced use_count +1, multi-threading Frequent changes to the same tracker shared_ptr are slow. - TODO: 1. Reduce unnecessary thread mem tracker switch, 2. Consider using raw pointers for mem tracker in thread local.	2022-06-19 16:48:42 +08:00
yixiutt	f35b235c3b	[opt](compaction) optimize compaction in concurrent load (#10153 ) add some logic to opt compaction: 1.seperate base&cumu compaction in case base compaction runs too long and affect cumu compaction 2.fix level size in cu compaction so that file size below 64M have a right level size, when choose rowsets to do compaction, the policy will ignore big rowset, this will reduce about 25% cpu in high frequency concurrent load 3.remove skip window restriction so rowset can do compaction right after generated, cause we'll not delete rowset after compaction. This will highly reduce compaction score in concurrent log. 4.remove version consistence check in can_do_compaction, we'll choose a consecutive rowset to do compaction, so this logic is useless after add logic above, compaction score and cpu cost will have a substantial optimize in concurrent load. Co-authored-by: yixiutt <yixiu@selectdb.com>	2022-06-17 17:49:45 +08:00
Pxl	fd0bd395ac	[Enhancement] Remove some unused include (#10035 )	2022-06-17 10:47:25 +08:00
Pxl	5805f8077f	[Feature] [Vectorized] Some pre-refactorings or interface additions for schema change part2 (#10003 )	2022-06-16 10:50:08 +08:00

1 2 3 4 5 ...

321 Commits