doris

Author	SHA1	Message	Date
Pxl	f87fad97e1	[Bug](storage) add lock on base tablet when create_tablet #21915	2023-07-19 00:47:19 +08:00
starocean999	fff1983f40	[fix](planner)use tupleId of agg node to get its unsigned conjuncts (#21949 )	2023-07-19 00:46:49 +08:00
yujun	beec0e9169	[Improvement](tablet clone) impr tablet sched speed and fix tablet sched failed too many times (#21856 )	2023-07-18 23:25:22 +08:00
Ashin Gau	dcb165cc9f	[opt](hudi) get hudi split concurrently by using parallelStream (#21871 ) This PR contains two optimizations: 1. Using parallel stream to get hoodie splits concurrently. It reduce the split time from 1min20s to 12s when splitting 10,000 partitions. 2. Reading hoodie meta table to get table partitions. It reduce the getting partition time from 12min to 3s when reading 10,000 partitions.	2023-07-18 23:19:34 +08:00
AKIRA	28dfcd8785	[fix](pipeline) Fix pipeline that cause plenty timeout of p0 cases #21917	2023-07-18 23:15:49 +08:00
AlexYue	d2b199955a	[bugfix](deserialize ) pack struct to avoid parse wrong content for file header (#21907 ) Recently we encountered one strange bug where the log is file length is not match. file=/mnt/hdd01/master/NO_AVX2/doris.HDD/snapshot/20230713122303.26.72000/45832/536215111/45832.hdr, file_length=, real_file_length=0 when running restore P2 case, after checking the file on the remote storage we doubt it's the local file deserialize who caused this situation. Then we analyzed the layout for the struct and the content of the hdr file then we found out that it must be the wrong layout which cause reading wrong content.	2023-07-18 22:32:41 +08:00
TengJianPing	a9ea138caf	[fix](two level hash table) fix dead loop when converting to two level hash table for zero value (#21899 ) When enable two level hash table , if there is zero value in the existing one level hash table, it will cause dead loop when converting to two level hash table, because the PartitionedHashTable::_is_partitioned flag is not set correctly when doing the converting.	2023-07-18 19:50:30 +08:00
HHoflittlefish777	c6063ed92f	[Revert](lazy open) revert lazy open and add case (#21821 )	2023-07-18 19:41:33 +08:00
zhangstar333	87556b5741	[bug](test) fix regression test case failed with curdate (#21922 ) fix regression test case failed with curdate	2023-07-18 19:10:55 +08:00
morrySnow	d6d27ef428	[fix](Nereids) join other conjuncts should get slot from join output (#21840 )	2023-07-18 18:22:40 +08:00
Kaijie Chen	2013dcd0e9	[refactor](load) cleanup segment flush logic in beta rowset writer (#21635 )	2023-07-18 18:17:57 +08:00
Mryange	c36d225a27	[feature](profile) add process hashtable time in join node (#21878 ) add process hashtable time in join node	2023-07-18 18:09:42 +08:00
Pxl	3089e4b3b6	[Bug](excution) fix ScannerContext is done make query failed (#21923 ) fix ScannerContext is done make query failed	2023-07-18 17:58:00 +08:00
Siyang Tang	e654b5ddfc	[enhancement](broker-load) support special partition path pattern (#21778 ) Some users may have non-ACID path like `/path/to/k=v/1/filename`, introducing by HQL statement `insert into union all`, for which path partition `k=v` should be parsed normally in broker load.	2023-07-18 14:50:37 +08:00
starocean999	ec12a4159a	[fix](planner) push conjuncts into SetOperationStmt inline view (#21718 ) * [fix](planner)push conjuncts into SetOperationStmt inline view	2023-07-18 14:17:07 +08:00
Xiangyu Wang	50b81a9c13	[Fix](multi-catalog) Filter invisible files for hive table. (#21867 ) In fact, hive can not read files which startswith "." or "_", so we need filter these files.	2023-07-18 13:08:12 +08:00
Pxl	417e3e5616	[Feature](delete) support fold constant on delete stmt (#21833 ) support fold constant on delete stmt	2023-07-18 12:56:28 +08:00
Pxl	19492b06c1	[Bug](decimalv3) fix failed on test_dup_tab_decimalv3 due to wrong precision (#21890 ) fix failed on test_dup_tab_decimalv3 due to wrong precision	2023-07-18 12:53:09 +08:00
mch_ucchi	e1a116af94	[fix](planner)normalize the behavior of from_unixtime() according to Nereids planner (#21723 ) if from_unixtime() receive an integer out of int range, the function returns null.	2023-07-18 12:15:38 +08:00
starocean999	07e720e65d	[fix](planner)need recalculate nullable info of output slots for join node (#21650 ) * [fix](planner)need recalculate nullable info of output slots for join node	2023-07-18 12:10:27 +08:00
Pxl	b3d3ffa2de	[Bug](pipeline) adjust scanner scheduler.submit and _num_scheduling_ctx maintain (#21843 ) adjust scanner scheduler.submit and _num_scheduling_ctx maintain	2023-07-18 11:55:21 +08:00
Jibing-Li	489171e4c1	[Fix](multi catalog)Fix hive partition value contains special character such as / bug (#21876 ) Hive escapes some special characters in partition value to %XX, for example, / is escaped to %2F. Doris didn't handle this case which will cause doris failed to list the files under partition with special characters. This pr is to fix this bug.	2023-07-18 11:20:38 +08:00
yujun	ebd2a4b707	[fix](dynamic partition) fix create hot partition failed without error response (#20996 )	2023-07-18 10:56:37 +08:00
ZhenchaoXu	e24867e138	[typo][docs] Modify the description of CREATE-TABLE (#21858 )	2023-07-18 10:29:47 +08:00
Xin Liao	726e0d5ebf	[fix](load) fix dead loop in _handle_mem_exceed_limit function when reduce memory for load (#21886 )	2023-07-18 09:49:36 +08:00
Mryange	b656f31cf2	[Enchancement](compatible) show decimalv3 to decimal (#21782 )	2023-07-18 09:17:14 +08:00
zhangstar333	b6517ed83b	[Enhance](function) add boolean type for sum agg function (#21862 ) before the sum agg not register for boolean type, so it need cast to other type can execute.	2023-07-18 08:06:52 +08:00
wangbo	37ca133845	[feature](profile)add monotonice timer for pipeline task #21791 Add monotonice timer for piplinetask; WaitBfTime Task1BeginExecuteTime Task2EosTime Task3SrcPendingFinishOverTime Task4DstPendingFinishOverTime Task5TotalTime Task6ClosePipelineTime	2023-07-18 07:57:14 +08:00
Tiewei Fang	83e5a29855	[Fix](Export) fix nullptr exception when upgrading from 1.2.3 to 2.0 (#21799 )	2023-07-18 00:07:09 +08:00
Tiewei Fang	12784f863d	[fix](Export) Fixed the bug that would be core when exporting large amounts of data (#21761 ) A heap-buffer-overflow error occurs when exporting large amounts of data to orc format. Reserve 50B for buffer to avoid this problem.	2023-07-18 00:06:38 +08:00
AKIRA	05cf095506	[feature](stats) Support full auto analyze (#21192 ) 1. Auto analyze all tables except for internal tables 2. make resource used by analyze configurable	2023-07-17 20:42:57 +08:00
yujun	be750e88b2	[fix](clone) fix cannot further repair clone replica which miss version data (#21382 )	2023-07-17 20:00:50 +08:00
Luzhijing	ebc1e9e9f9	[docs](releasenote)add 1.2.6 release note (#21875 )	2023-07-17 17:56:08 +08:00
zy-kkk	014b34bebb	[enhancement](jdbc catalog) Add mysql jdbc url param `rewriteBatchedStatements=true` (#21864 ) When `rewriteBatchedStatements=false`, the JDBC driver will not merge multiple insert statements into one larger insert statement. Therefore, during the batch insertion process, each insert statement needs to be sent to the MySQL server individually, leading to a higher number of network roundtrips. Network latency could potentially be a significant factor contributing to the performance degradation. For this reason, we propose to set this parameter to true by default, to enhance the performance of prepared statement batch inserts.	2023-07-17 17:39:26 +08:00
ZhenchaoXu	1c36b77024	[typo][docs] Modify a typo in the aggr_type description for CREATE-TABLE (#21861 ) Modify a typo in the CREATE-TABLE's aggr_type description to change "后倒入" to "后导入".	2023-07-17 17:02:39 +08:00
ZhenchaoXu	4cea785f13	[typo][docs] Delete the extra characters in the tablet-local-debug Chinese document. (#21846 )	2023-07-17 17:02:16 +08:00
catpineapple	020f238fbc	[feature](dbt) read table columns from model config file (#21831 ) 1、read table columns (datat_ype) from model config file 2、read table description(comment) from model config file	2023-07-17 15:45:12 +08:00
Calvin Kirs	def6e6b158	[Fix](Sonar)sonar not working due to changing thrift code generation method (#21870 )	2023-07-17 15:38:46 +08:00
Jibing-Li	a92508c3f9	[Fix](statistics) Fix analyze db always use internal catalog bug (#21850 ) `Analyze database db_name ` command couldn't use current catalog, it is always using the internal catalog. This will cause the command failed to find the db. This pr is to fix this bug.	2023-07-17 15:28:54 +08:00
zhangstar333	29146c680e	[refactor](profile)add node id info in pipeline profile (#21823 )	2023-07-17 15:24:02 +08:00
wangbo	c98d2bf0d3	Fix incorrect priority queue refactor (#21857 )	2023-07-17 14:56:24 +08:00
Mingyu Chen	5fc0a84735	[improvement](catalog) reduce the size thrift params for external table query (#21771 ) ### 1 In previous implementation, for each FileSplit, there will be a `TFileScanRange`, and each `TFileScanRange` contains a list of `TFileRangeDesc` and a `TFileScanRangeParams`. So if there are thousands of FileSplit, there will be thousands of `TFileScanRange`, which cause the thrift data send to BE too large, resulting in: 1. the rpc of sending fragment may fail due to timeout 2. FE will OOM For a certain query request, the `TFileScanRangeParams` is the common part and is same of all `TFileScanRange`. So I move this to the `TExecPlanFragmentParams`. After that, for each FileSplit, there is only a list of `TFileRangeDesc`. In my test, to query a hive table with 100000 partitions, the size of thrift data reduced from 151MB to 15MB, and the above 2 issues are gone. ### 2 Support when setting `max_external_file_meta_cache_num` <=0, the file meta cache for parquet footer will not be used. Because I found that for some wide table, the footer is too large(1MB after compact, and much more after deserialized to thrift), it will consuming too much memory of BE when there are many files. This will be optimized later, here I just support to disable this cache.	2023-07-17 13:37:02 +08:00
lihangyu	1101d7d947	[chore](topn opt) disable two phase read when light schema change is disabled (#21809 )	2023-07-17 12:46:28 +08:00
zy-kkk	03b575842d	[Feature](table function) support explode_json_array_json (#21795 )	2023-07-17 11:40:02 +08:00
zclllyybb	d0775f8209	[log](profile) add doris version info to query profile (#21501 )	2023-07-17 11:18:05 +08:00
Pxl	86841d8653	[Bug](materialized-view) fix some problems of mv and make ssb mv work on nereids (#21559 ) fix some problems of mv and make ssb mv work on nereids	2023-07-17 10:08:25 +08:00
herry2038	6fba092741	[optimization](show-frontends) Add start time in Show frontends (#21844 ) --------- Co-authored-by: yuxianbing <iloveqaz123>	2023-07-17 05:09:43 +08:00
abmdocrt	c409fa0f58	[Feature](Compaction)Support full compaction (#21177 )	2023-07-16 13:21:15 +08:00
HappenLee	a7eb186801	[Bug](CSVReader) fix null pointer coredump in CSVReader in p2 (#20811 )	2023-07-15 22:50:10 +08:00
Xin Liao	d2ff68ac2d	[fix](unique-key) fix query results show duplicate key for unique key table after upgrading (#21814 )	2023-07-15 17:17:41 +08:00

1 2 3 4 5 ...

11896 Commits