doris

Author	SHA1	Message	Date
Mingyu Chen	104fcea725	[fix](mtmv) mtmv disable dynamic partition (#35686 ) (#35949 ) bp #35686 Co-authored-by: zhangdong <493738387@qq.com>	2024-06-06 14:43:40 +08:00
wangbo	5cecbfc6ea	[cherry-pick]Add workload metric query_be_memory (#35911 )	2024-06-06 14:33:30 +08:00
slothever	b6ab0c4e59	[fix](multi-catalog)fix hive partition insert regression case (#35846 ) from #35821	2024-06-06 11:35:04 +08:00
amory	b5a35b9cef	[FIX] Pick array inverted index bugfix (#35837 ) here with some array with inverted index bugfix: see also: https://github.com/apache/doris/pull/34766 https://github.com/apache/doris/pull/35086 https://github.com/apache/doris/pull/34683 https://github.com/apache/doris/pull/34076	2024-06-06 09:54:14 +08:00
airborne12	efe17245b0	[Fix](inverted index) fix index_id wrong size in V2 (#35909 ) This pull request modifies the index_id type in inverted index storage format v2 to int64_t. The index_id is now stored in the inverted index file using 4 bytes.	2024-06-05 20:36:40 +08:00
starocean999	bcde9c6524	[enhancement](nereids)eliminate repeat node if there is only 1 grouping set and no grouping scalar function (#35872 )	2024-06-05 18:03:20 +08:00
Yulei-Yang	af31e96c4e	[fix](storage_policy) fix cannot cancel a partition's storage policy (#35890 )	2024-06-05 15:53:26 +08:00
Siyang Tang	fdd87fe008	[enhancement](delete-pred) enable delete sub predicate v2 for compaction (#35859 ) (#35895 ) ## Proposed changes This PR enable `delete sub predicate v2` for compaction, and legacy version of delete predicate will be processed in the original way.	2024-06-05 12:05:21 +08:00
AlexYue	630fd06ccf	Pick "[feature](Resource) Support to specify the root path for hdfs resource #32632 " (#35848 ) same as #32632	2024-06-05 08:59:20 +08:00
plat1ko	c2b830e1e7	Pick "[Fix](Tablet) Fix the issue of redundant loading of stale rowset (#35768 )" (#35882 )	2024-06-05 07:55:04 +08:00
zhannngchen	5541fd11e9	[branch-2.1](partial update)add logs for partial update (#35416 ) add logs for partial update the master PR is #35802 If this is a relatively large or complex change, kick off the discussion at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why you chose the solution you did and what alternatives you considered, etc...	2024-06-04 22:47:48 +08:00
Yulei-Yang	0585de12b5	[Improvement](coldhot) add statement to show objects which use storage policy (#35839 )	2024-06-04 19:24:31 +08:00
Mingyu Chen	5c8f87e01e	[opt](log) refine the FE logger (#35679 ) Previously, FE logs were written to files. The main FE logs include fe.log, fe.warn.log, fe.audit.log, fe.out, and fe.gc.log. In a K8s deployment environment, logs usually need to be output to standard output, and then other components process the log stream. This PR made the following changes: 1. Modified the log4j configuration template - When started with `--daemon`, logs are still written to various files, and the format remains unchanged. - When started with `--console`, all logs are output to standard output and marked with different prefixes: - `StdoutLogger`: logs for standard output - `StderrLogger`: logs for standard error output - `RuntimeLogger`: logs for fe.log or fe.warn.log - `AuditLogger:` logs for fe.audit.log - No prefix: logs for fe.gc.log Examples are as follows: ``` RuntimeLogger 2024-06-03 14:54:51,229 INFO (binlog-gcer\|62) [BinlogManager.gc():359] begin gc binlog ``` 2. Added a new FE config: `enable_file_logger` Defaults to true. Indicates that logs will be recorded to files regardless of the startup method. For example, if it is started with `--console`, the log will be output to both the file and the standard output. If it is `false`, the log will not be recorded in the file regardless of the startup method. 3. Optimized the log format of standard output The byte streams of stdout and stderr are captured. The logs previously outputted using `System.out` will be captured in fe.log for unified management.	2024-06-04 18:20:30 +08:00
deardeng	f94222a04e	[fix](log) Support fe log rollover size strategy (#34446 )	2024-06-04 18:18:16 +08:00
deardeng	db3bbc2437	[feature](merge-cloud) Change fe log rolling max size (#32777 )	2024-06-04 18:17:33 +08:00
feiniaofeiafei	4b5163c905	[Feat](nereids) add transform rule MergePercentileToArray (#35809 ) cherry-pick #34313 to branch-2.1 MergePercentileToArray is to perform a transformation in this case: select ss_item_sk, percentile(ss_quantity,0.9), percentile(ss_quantity,0.6), percentile(ss_quantity,0.3) from store_sales group by ss_item_sk; ==> select ss_item_sk, percentile_array(ss_quantity,[0.3,0.6,0.9]) from store_sales group by ss_item_sk;	2024-06-04 17:50:36 +08:00
starocean999	c23ab25474	[fix](nereids)keep equal predicate as join conjunct even if it can be fold to null literal (#35842 ) pick from master https://github.com/apache/doris/pull/35811 ## Proposed changes Issue Number: close #xxx <!--Describe your changes.-->	2024-06-04 14:46:58 +08:00
Kang	bc6b316e87	[chore](index) add config enable_create_bitmap_index_as_inverted_index default true #33434 (#35521 )	2024-06-04 12:07:03 +08:00
yiguolei	398919d090	2.1.4-rc02	2024-06-03 23:24:33 +08:00
amory	fe1a4c4136	[Feature](IP) support ipv4/ipv6 with inverted index and conjuncts for query (#35734 ) support data type ipv4/ipv6 with inverted index and then we can query like "> or < or >= or <= or in/not in " this conjuncts expr for ip with inverted index speeding up	2024-06-03 23:24:03 +08:00
AlexYue	ba0161c8b9	Revert "[feature-wip](Cloud) Introduce azure core C++ sdk (#35208 )" (#35826 ) ## Proposed changes Issue Number: close #xxx <!--Describe your changes.--> This reverts commit #35641 because of the compilation of such is not successful on arm plateform.	2024-06-03 23:22:56 +08:00
Ashin Gau	940e26f341	[feat](nereids) support Iceberg time travel syntax (#35812 ) backport: #34681 Co-authored-by: Butao Zhang <zhangbutao@cmss.chinamobile.com>	2024-06-03 20:24:11 +08:00
yiguolei	9cbf8cb3bd	2.1.4-rc01	2024-06-03 18:57:32 +08:00
morrySnow	958687f7d4	[fix](Nereids) unix_timestamp compute signature and fold const is wrong (#35727 ) 1. compute signature should call super#computeSignature first 2. fold const return type not changed after signature changed in #26827 we already have p0 for this case, but our regression framework has bug that it report success when compare decimal type if real result lose scale	2024-06-03 18:56:53 +08:00
yiguolei	f80b856405	[enhancement](oom) return error when bloom filter allocate memory failed (#35790 ) ## Proposed changes 1. return error when bloom filter allocate memory failed 2. return error when deserialize a block， it may need a lot of memory. --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-06-03 18:22:11 +08:00
wangbo	3e096dda91	[Cherry-pick]Support kill query in be (#35794 ) ## Proposed changes pick #35602 ``` mysql [information_schema]>kill query '2047df937c66704d-3ac4cfaf17f65eae'; Query OK, 0 rows affected (0.01 sec) I20240603 15:21:50.373333 3355508 internal_service.cpp:592] Cancel query 2047df937c66704d-3ac4cfaf17f65eae, reason: USER_CANCEL ```	2024-06-03 15:39:30 +08:00
Ashin Gau	4f0365e0bf	[fix](s3) move s3 providers to fe-common to be accessible for jni reader (#35779 ) backport: #35690 `PropertyConverter.setS3FsAccess` has add customized s3 providers: ``` public static final List<String> AWS_CREDENTIALS_PROVIDERS = Arrays.asList( DataLakeAWSCredentialsProvider.class.getName(), TemporaryAWSCredentialsProvider.class.getName(), SimpleAWSCredentialsProvider.class.getName(), EnvironmentVariableCredentialsProvider.class.getName(), IAMInstanceCredentialsProvider.class.getName()); ``` And these providers are set as configuration value of `fs.s3a.aws.credentials.provider`, which will be used as configuration to build s3 reader in JNI readers. However, `DataLakeAWSCredentialsProvider` is in `fe-core`, that is not dependent by JNI readers, so we have to move s3 providers to `fe-common'.	2024-06-03 14:04:39 +08:00
Thearas	d4956bfaf5	do not use path style to access s3 (#35788 ) ## Proposed changes	2024-06-03 13:57:13 +08:00
zhangstar333	a086111a96	[improve](session) print more error msg when set a wrong session variable name (#35775 ) ## Proposed changes when set a wrong session variable, eg: mysql [(none)]>set enable_profileXXXXXXX=true; ERROR 1228 (HY000): errCode = 2, detailMessage = Unknown system variable 'enable_profileXXXXXXX', the similar variables are {'enable_profile', 'enable_force_spill', 'enable_projection'} <!--Describe your changes.-->	2024-06-03 12:45:58 +08:00
walter	f25b7fb4eb	[fix](binlog) Add is temp for UpsertRecord (#35774 ) Cherry-pick #35636. The ccr-syncer does not support syncing temporary partitions, so this PR adds a field to record whether this upsert record comes from a temporary partition.	2024-06-03 12:41:29 +08:00
yiguolei	c033c71aed	[enhancement](cancel) add cancel query ability on be	2024-06-03 11:43:57 +08:00
yiguolei	f03cee5e30	[enhancement](oom) add exception in olap data convertor when memory is not enough to prevent oom (#35761 ) Issue Number: close #xxx <!--Describe your changes.--> --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-06-02 21:12:53 +08:00
Mingyu Chen	e755d64e62	[feature](be jvm monitor)append enable_jvm_monitor in be.conf to control jvm monitor. (#35608 ) (#35764 ) bp #35608 Co-authored-by: daidai <2017501503@qq.com>	2024-06-02 00:18:44 +08:00
zhannngchen	72489a04c3	[cherry-pick](branch-2.1) remove some CHECKs in Tablet::revise_tablet_meta (#31268 ) (#34702 ) ## Proposed changes Issue Number: close #xxx cherry-pick #31268 ## Further comments If this is a relatively large or complex change, kick off the discussion at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why you chose the solution you did and what alternatives you considered, etc...	2024-06-02 00:15:31 +08:00
Mingyu Chen	75fd8c4226	[fix](drop table) Improve error prompts when deleting materialized views (#35437 ) (#35763 ) bp #35437 Co-authored-by: HB <hubiao01@corp.netease.com>	2024-06-01 23:48:52 +08:00
Mingyu Chen	2675014491	[fix](s3) fix invalid s3 properties checking logic (#35757 ) Introduced from #35747 pick part of #35762	2024-06-01 23:30:22 +08:00
谢健	bc95902fed	[fix](Nereids): Convert VarcharLiteral to StringLikeLiteral in Function Signatures (#35536 ) ## Proposed changes This pull request updates the function signatures where VarcharLiteral is currently used, replacing it with StringLikeLiteral. This change aims to enhance flexibility and consistency across functions that handle similar types of string data. By adopting StringLikeLiteral, we can support a broader range of string-like types beyond the basic VARCHAR type, facilitating more robust and versatile string handling capabilities in our codebase. This update ensures better type abstraction and promotes code reusability. <!--Describe your changes.--> ## Further comments If this is a relatively large or complex change, kick off the discussion at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why you chose the solution you did and what alternatives you considered, etc...	2024-06-01 14:01:29 +08:00
airborne12	f67bd4b03b	[Fix](inverted index) fix fast execute condition for vexpr (#35673 )	2024-06-01 11:24:54 +08:00
zhangstar333	8df1a3c849	[Bug](load) fix s3 load not display the progress info (#35719 ) ## Proposed changes should display the load progress info, so the user could know it loading step. ``` JobId: 49088 Label: rpt_10002184_syqzzywqkb10 State: FINISHED Progress: 100.00% (10/10) ``` <!--Describe your changes.-->	2024-06-01 11:24:54 +08:00
HHoflittlefish777	d83c714824	[branch-2.1](routine-load) adjusting the default configuration of routing load (#35753 ) #34898	2024-06-01 11:22:21 +08:00
Yongqiang YANG	4e06f136f0	[fix](segment cache) estimate momory consumed by segment (#35647 ) (#35751 ) The memory consumed in segment cache is 0 after https://github.com/apache/doris/pull/35432/files. The pr also tracks memory usage of column readers.	2024-06-01 09:34:32 +08:00
yiguolei	9dd573888a	[bugfix](stdcallonce) replace std callonce with a lock because it is not exception safe (#35126 )	2024-06-01 08:00:42 +08:00
Qi Chen	2bad561ede	(Fix)[hive-writer] Fixed the issue when partition values contain spaces when writing to s3. (#35645 ) ## Proposed changes Issue Number: close #31442 (Fix) [hive-writer] Fixed the issue when partition values contain spaces when writing to s3. ### Error msg ``` org.apache.doris.common.UserException: errCode = 2, detailMessage = java.net.URISyntaxException: Illegal character in path at index 114: oss://xxxxxxxxxxx/hive/tpcds1000_partition_oss/call_center/cc_call_center_sk=1/cc_mkt_class=A bit narrow forms matter animals. Consist/cc_market_manager=Daniel Weller/cc_rec_end_date=2001-12-31/f6b5ff4253414b06-9fd365ef68e5ddc5_133f02fb-a7e0-4109-9100-fb748a28259e-0.zlib.orc at org.apache.doris.common.util.S3URI.validateUri(S3URI.java:134) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.common.util.S3URI.parseUri(S3URI.java:120) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.common.util.S3URI.<init>(S3URI.java:116) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.common.util.S3URI.create(S3URI.java:108) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.fs.obj.S3ObjStorage.deleteObject(S3ObjStorage.java:194) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.fs.remote.ObjFileSystem.delete(ObjFileSystem.java:150) ~[doris-fe.jar:1.2-SNAPSHOT] at org.apache.doris.fs.remote.SwitchingFileSystem.delete(SwitchingFileSystem.java:92) ~[doris-fe.jar:1.2- ``` ### Root Cause Hadoop partition names will encode some special characters, but not space characters, which is different from URI encoding. Therefore, an error will be reported when constructing URI. ### Solution The solution is to use regular expressions to parse URI, and then pass in each part of URI to construct URI. This URI constructor will encode each part of URI.	2024-06-01 00:49:25 +08:00
wuwenchi	cb96a79d07	[bugfix](iceberg)fix datetime conversion error and data path error (#35708 ) ## Proposed changes Issue #31442 <!--Describe your changes.--> 1. The unit of the seventh parameter of `ZonedDateTime.of` is nanosecond, so we should multiply the microsecond by 1000. 2. When writing to a non-partitioned iceberg table, the data path has an extra slash	2024-06-01 00:42:48 +08:00
daidai	bc062a2595	[fix](orc)fix orc reader missing column. (#35735 ) ## Proposed changes bp #35583 Issue Number: close #xxx <!--Describe your changes.-->	2024-05-31 22:51:44 +08:00
Mingyu Chen	1d89dd7607	[fix](catalog) close connection on refresh (#35426 ) (#35743 ) bp #35426	2024-05-31 22:51:13 +08:00
Gabriel	b864aa7aa2	[fix](pipeline) Fix query hang up if limited rows is reached (#35513 ) (#35746 ) Follow-up for #35466. We should assure closed tasks will not block other tasks. ## Proposed changes Issue Number: close #xxx <!--Describe your changes.-->	2024-05-31 22:50:57 +08:00
Mingyu Chen	9468227842	[Fix](tvf) Pass through user-defined properties (#35515 ) (#35747 ) bp #35515 Co-authored-by: Tiewei Fang <43782773+BePPPower@users.noreply.github.com>	2024-05-31 22:50:26 +08:00
seawinde	5315df36c0	[opt](mtmv) Improve the mv rewrite performance by optimize code usage (#35674 ) Improve the performance from two points, one is optimize decide model method and another is to reuse the mv struc info: 1. Instead of use java.util.List#containsAll by java.util.Set#containsAll in method AbstractMaterializedViewRule#decideMatchMode 2. Reuse the mv struct info in different query, because mv struct info is immutable. Notes: tableBitSet in struct info is relevant to the statementContext in cascadesContext, if reuse the mv struct info for different query, we should re generate table bitset and construct new struct info with method StructInfo#withTableBitSet	2024-05-31 22:46:29 +08:00
lw112	48d4601ee3	[regression-test](load) add something like $.tag.[a.b] key's json case (#35134 )	2024-05-31 22:45:09 +08:00

1 2 3 4 5 ...

18969 Commits