doris

Author	SHA1	Message	Date
YueW	2596d68424	[fix](schema change) Change table state to NORMAL by SchemaChangeJob instead of SchemaChangeHandler (#19838 ) fix problem: If there is an unfinished schema change job (job-2), and before this time, another schema change job (job-1) of the same table has been finished. Then restart fe, will replay edit log (pending log and waiting_txn log) for job-2, and the table's state is set to SHCEMA_CHANGE, but when loadAlterJob after replayJournal, will add job-1 to schema change handler, and then run the job-1 will set the table to NORMAL because of job-1 is done, but at this point, the job-2 is doing runWaitingTxnJob, in this function will check table's state, if not normal will throw exception, not change the job's state, and cannot cancel the job because the table is not under schema change.	2023-05-23 18:23:12 +08:00
xy720	8b184cc5ef	[bug](compile) fix fe compile error #19946 Fix fe maven package has a version conflict for package grpc-core.	2023-05-23 18:20:48 +08:00
xy720	c88ba85e10	[Bug](schema-change) fix varchar can not change to datev2 #19952	2023-05-23 18:18:55 +08:00
slothever	6f511ac859	[fix](s3)fix s3 resource check (#19933 ) fix s3 resource check: ERROR 1105 (HY000): Unexpected exception: org.apache.doris.common.DdlException: errCode = 2, detailMessage = Missing [AWS_ACCESS_KEY] in properties. we should use new properties to check s3 available	2023-05-23 16:20:07 +08:00
morrySnow	7247ac9b75	[fix](Nereids) join reorder lead to circle in memo (#19935 ) If we have join as the root node, then after some join reorder join, the root Group in Memo will have a GroupExpression including LogicalProject as its plan and the children is its ownerGroup. This PR add a rewrite rule to ensure we have a Project on the top of the top Join of plan to avoid circle in Memo.	2023-05-23 15:22:32 +08:00
Calvin Kirs	ebe3f6ec42	[refactor](routineload)Refactored routineload to improve scalability (#19834 ) - The data source parameters are sunk into the specific data source class - Simplify some code logic to reduce code complexity - Provide a data source factory class to extract public logic - Code that removes tests from production code. We should not include code for testing purposes in any production code.	2023-05-23 14:05:47 +08:00
Jibing-Li	4398b91576	[Fix](multi catalog)Change all partition names to lower case (#19816 ) Iceberg table partition name may contain upper case characters, for example: City=xxx, Nation=xxx. But in Doris, all column names are in lower case. Here we transfer the partition name to lower case to keep consist with column name.	2023-05-23 09:31:31 +08:00
jakevin	633989c78e	[fix](Nereids): commute non-inner join for DPHyp (#19929 )	2023-05-23 09:30:50 +08:00
Ashin Gau	bd74890cf7	[fix](multi-catalog) JDBC Catalog Unknown UNSIGNED type of mysql, type: [DOUBLE] (#19912 )	2023-05-23 09:29:57 +08:00
Gabriel	3dcdadcea6	[Improvement](function) support decimalv3 for function `least` and `greatest` (#19931 )	2023-05-22 22:48:44 +08:00
Pxl	e9223f6a19	[Feature](aggregation) add agg_state define and ddl support (#19824 ) add agg_state define and ddl support	2023-05-22 11:45:53 +08:00
minghong	76dc5841dc	[opt](nereids)compute runtime filter size by the ndv of build side (#18803 )	2023-05-22 10:38:11 +08:00
Mingyu Chen	7c539575c7	[refactor](hudi) remove hudi external table (#19908 ) Hudi external table is deprecated since 1.2. We should remove it now. Recommend to use "multi-catalog" feature to connect to Hudi. User can not create Hudi external table. When restarting FE, all hudi external table will still be replayed but can not be read. And when doing checkpoint, all these tables will be discarded.	2023-05-22 09:02:34 +08:00
luozenglin	33fd965b5c	[feature-wip](resouce-group) Supports memory soft isolation of resource group (#19802 ) create resource groups name properties( 'enable_memory_overcommit' = 'true' // whether to enable memory soft isolation )	2023-05-21 19:33:57 +08:00
Mingyu Chen	a7f3bfec89	[refactor](cluster)(step-2) remove cluster related to Backend (#19842 )	2023-05-21 09:00:35 +08:00
Mingyu Chen	777bdce5a5	[minor](clone) add more debug log for tablet scheduler (#19892 ) Sometimes I find that the tablet scheduler can not schedule tablet, and with no more info for debugging. So I add some debug log for this process. No logic is changed.	2023-05-20 15:59:26 +08:00
wangbo	8b9813663d	[test](executor)add crud regression test for resource group (#19659 ) dd crud regression test for resource group (#19659)	2023-05-20 13:49:02 +08:00
Nick Young	499f443779	[feature](iceberg) Support read iceberg data on gcs (#19815 )	2023-05-20 12:40:03 +08:00
HB	1b119704f8	[Enhancement] show total transactions in show proc "/transactions" (#19492 ) In a scenario where multiple DBs are simultaneously imported with high concurrency, a significant number of transactions will be generated. Without a summary field, we cannot clearly see how many transactions there are in the current cluster. Therefore, I have enhanced this point. ``` mysql> show proc "/transactions"; +-------+-----------------------------------+-----------------------+ \| DbId \| DbName \| RunningTransactionNum \| +-------+-----------------------------------+-----------------------+ \| 10002 \| default_cluster:xxxx \| 0 \| \| 14005 \| default_cluster:__internal_schema \| 0 \| \| Total \| 2 \| 0 \| +-------+-----------------------------------+-----------------------+ 3 rows in set (0.02 sec) ```	2023-05-20 11:26:28 +08:00
zhangdong	a81db3e984	[improvement](FQDN) broker support fqdn (#19821 ) 1.broker support fqdn 2.change 'master_only' attr of 'enable_fqdn_mode'	2023-05-20 11:25:58 +08:00
zhangdong	178d6cc529	[improvement](multi-catalog)hms sync event log more info #19887	2023-05-20 08:25:14 +08:00
HappenLee	77dfdfdd50	[Bug][pipeline] Fix regression tpcds failed in nereid planner (#19885 )	2023-05-19 22:30:48 +08:00
jakevin	24b2fab943	[fix](Nereids): BuildAggForUnion forgot to convert Qualifier Type. (#19883 )	2023-05-19 22:18:38 +08:00
Gabriel	5547bbbaef	[decimalv3](function) support function width_bucket (#19806 )	2023-05-19 20:28:59 +08:00
LiBinfeng	78bcc68ab8	[Fix](Nereids) fix serialize colocate table index concurrent bug (#19862 ) When doing serialization of minidump input, we can find that when serializing colocate table index, the size and entry get by the hash map always unmatched when concurrent occur. So a write lock be added to ensure concurrency.	2023-05-19 19:51:22 +08:00
zy-kkk	ae1577e95c	[improvement](jdbc catalog) set oceanbase mysql mode jdbc param `useCursorFetch` default true (#19856 )	2023-05-19 19:45:22 +08:00
jakevin	68be81363b	[enhance](Nereids): Pushdown Filter Through Project in Post Processor. (#19873 ) Originally, PushdownFilterThroughProject is in CBO phase, but it will increase Memo size. So, we move it into PostProcessor	2023-05-19 19:27:52 +08:00
amory	67dc68630b	[Improve](complex-type)improve array/map/struct creating and function with decimalv3 (#19830 )	2023-05-19 17:43:36 +08:00
zhangdong	2ab844550f	[feature-wip](MTMV) support multi catalog (#19854 ) * mtmv support multi catalog * mtmv support multi catalog	2023-05-19 16:44:55 +08:00
Gabriel	0fc8d2e029	[Bug](decimal) fix variance_samp and avg_weighted #19861	2023-05-19 16:44:36 +08:00
airborne12	9d54545bac	[Fix](inverted index) add datev2/datetimev2 for inverted index column type (#19845 ) When we try to query array of datetimev2 column by inverted index, it returns an error like this: CREATE TABLE `nested` ( `qid` bigint(20) NULL, `tag` array<text> NULL, `creationDate` datetime NULL, `title` text NULL, `user` text NULL, `answers.user` array<text> NULL, `answers.date` array<datetimev2(0)> NULL, INDEX tag_idx (`tag`) USING INVERTED PROPERTIES("parser" = "english") COMMENT '', INDEX creation_date_idx (`creationDate`) USING INVERTED COMMENT '', INDEX title_idx (`title`) USING INVERTED COMMENT '', INDEX user_idx (`user`) USING INVERTED COMMENT '', INDEX answers_user_idx (`answers.user`) USING INVERTED COMMENT '', INDEX answers_date_idx (`answers.date`) USING INVERTED COMMENT '' ) ENGINE=OLAP DUPLICATE KEY(`qid`) COMMENT 'OLAP' DISTRIBUTED BY HASH(`qid`) BUCKETS 18 PROPERTIES ( "replication_allocation" = "tag.location.default: 1", "storage_format" = "V2", "compression" = "ZSTD", "light_schema_change" = "true", "dynamic_schema" = "true", "disable_auto_compaction" = "false" ); mysql> select * from nested.nested where tag match 'java' and `answers.date` element_le '2012-04-08T21:15:33.873Z' limit 10; ERROR 1105 (HY000): errCode = 2, detailMessage = no function found for MATCH_ELEMENT_LE,`answers.date` MA	2023-05-19 14:57:01 +08:00
jiawei liang	f46f0c84b2	[Enhancement](meta) Show remote data usage via SHOW DATA #19533 (#19752 ) * [Enhancement](meta) Show remote data usage via SHOW DATA #19533 * [fix] correct some unit test results	2023-05-19 14:23:50 +08:00
Gabriel	c4900eb658	[Bug](DecimalV3) fix decimalv3 functions (#19801 )	2023-05-19 14:10:01 +08:00
jakevin	fcffb1d3de	[minor](Nereids): add toString() for LogicalProperties (#19851 )	2023-05-19 13:46:47 +08:00
morrySnow	92c6a3c53b	[fix](Nereids) normalize repeat generate push down project with error nullable (#19831 )	2023-05-19 13:15:42 +08:00
yiguolei	9c86cad4ec	[improvement](session variable) add max execution time session variabe like mysql and add setter attributes in variables (#19759 ) 1. add session variable max_execution_time to an alias of query timeout, if user set max_execution_time, the query timeout will be modified too. 2. add a setter attribute to session variable, so that we could add some logic in setter method instead of field reflection.	2023-05-19 12:42:47 +08:00
lihangyu	cf7083d58b	[explain](point query) modify explain for SHORT-CIRCUIT query (#19820 )	2023-05-19 11:50:08 +08:00
yixiutt	609b20bd02	[Feature](planner) use partial update in update from & delete from (#19262 )	2023-05-19 09:46:29 +08:00
minghong	84bad03ccb	[feature](nereids) set proper min/max value for column stats when minExpr/maxExpr is not avialable #19673	2023-05-19 09:02:40 +08:00
luozenglin	0dd361dbf7	[fix](tracing) fix the issue that a trace may track multiple queries (#19804 )	2023-05-19 08:58:53 +08:00
minghong	6f6d744a2a	[fix](nereids) avoid 0 row count in stats derive #19640 row count of join estimation is at least 1 to make less error propagation.	2023-05-19 08:54:24 +08:00
Mingyu Chen	14620a6766	[minor](log) add details for unqueryable replicas (#19792 ) Add a new FE config: show_details_for_unaccessible_tablet. Default is false, when set to true, if a query is unable to select a healthy replica, the detailed information of all the replicas of the tablet including the specific reason why they are unqueryable, will be printed out.	2023-05-19 08:53:57 +08:00
minghong	dc8a992bba	[improve](nereids) check be status when column stats is unknown #19742 when forbid_unknown_col_stats is open and some column stats is unknown, we will check the be status by StatisticsUtil.statsTblAvailable(), and report error according to be status.	2023-05-19 08:53:34 +08:00
Xinyi Zou	1e8eb1c756	[fix](profile) Fix pipeline load channel profile #19828	2023-05-19 08:51:02 +08:00
Adonis Ling	adc5522c9b	[bug](MTMV) Fix the wrong interpretation for NEVER REFRESH (#19800 )	2023-05-18 23:56:56 +08:00
yongkang.zhong	dfc4432e83	[improvement](jdbc catalog) Add adaptation to Oracle special character `/` table names (#19809 )	2023-05-18 22:58:33 +08:00
yongkang.zhong	f2b2a568de	[fix](jdbc catalog)fixed oceanbase catalog row limit bug (#19796 )	2023-05-18 22:05:51 +08:00
yongjinhou	40ab4ce305	fix select resource groups bug (#19808 )	2023-05-18 21:54:31 +08:00
WenYao	481e9aebdb	[Refactor](spark load) remove parquet scanner (#19251 )	2023-05-18 19:19:13 +08:00
luozenglin	f68d3a660e	[improvement](opentelemetry) upgrade opentelemetry jar to v1.26.0 and opentelemetry-cpp to v1.8.3 (#19733 ) why upgrade? anything wrong? Try to fix the problem about opentelemetry::v1::ext::http::client::curl::HttpOperation::Send(), I have updated the pr info.	2023-05-18 18:46:20 +08:00

... 71 72 73 74 75 ...

8289 Commits