doris

Author	SHA1	Message	Date
AKIRA	c6a92955ca	[refacotr](optimizer) Remove useless check #24237 Check stats table status at first Comment histgram_tbl check since it useless for now Do preheat both in master and follower	2023-09-14 19:35:56 +08:00
bobhan1	3ee89aea35	[Feature](merge-on-write)Support ignore mode for merge-on-write unique table (#21773 )	2023-09-14 18:03:51 +08:00
Kang	9c6734e68e	[bugfix](index) Fix build index limitations (#24358 ) 1. skip existed index on column with different id on build index 2. allow build index for CANCELED or FINISHED state	2023-09-14 17:53:22 +08:00
Lei Zhang	eaa35649bc	[fix](bdbje) handle `ReplicaWriteException` in `BDBJEJournal.write` (#24259 ) * When BDBJEJournal.write meet `ReplicaWriteException`, we should not retry. Because at the monment the bdbje node state is `REPLICA` (not `MASTER`) if we still retry write, at the same time trigger election, the orgin `REPLICA` node may transfer to `MASTER` and will cause incorrect journalId Co-authored-by: yiguolei <676222867@qq.com>	2023-09-14 17:49:28 +08:00
starocean999	d035a58374	[feature](nereids) support unnest subquery in LogicalOneRowRelation (#24355 ) select (select 1); before : ERROR 1105 (HY000): errCode = 2, detailMessage = Subquery is not supported in the select list. after: mysql> select (select 1); +---------------------------------------------------------------------+ \| (SCALARSUBQUERY) (LogicalOneRowRelation ( projects=[1 AS `1`#0] )) \| +---------------------------------------------------------------------+ \| 1 \| +---------------------------------------------------------------------+ 1 row in set (0.61 sec)	2023-09-14 17:22:08 +08:00
AKIRA	0be0b8ff58	[opt](stats) Support display of auto analyze jobs (#24135 ) ### Support dispaly of auto analyze jobs After this PR, users and DBA could use such grammar to check the execution status of auto analyze jobs: ```sql SHOW AUTO ANALYZE [tbl_name] [WHERE STATE='SOME STATE'] ``` Record count of history auto analyze job could be configured by setting FE option: auto_analyze_job_record_count, default value is 2000 ### Enhance auto analyze After this PR, auto jobs those created automatically will no longer execute beyond a specific time frame.	2023-09-14 17:10:04 +08:00
zclllyybb	4fbb25bc55	[Enhancement](function) Support date_trunc(date) and use it in auto partition (#24341 ) Support date_trunc(date) and use it in auto partition	2023-09-14 16:53:09 +08:00
zclllyybb	b6d7116dea	[fix](datetime) fix compare of DatetimeLiteral (#24343 ) fix compare of DatetimeLiteral	2023-09-14 16:51:50 +08:00
谢健	7ff76c5a1e	[test](Nereids) add normalize sort test (#24230 )	2023-09-14 16:33:36 +08:00
谢健	ace3e79498	[test](Nereids) add test for uncorrelatedApplyFilter #24220 add test for uncorrelatedApplyFilter rule	2023-09-14 16:10:08 +08:00
starocean999	ccba5a729a	[fix](planner)cast string to float like type should return NULL literal if it fails (#24222 )	2023-09-14 15:59:20 +08:00
starocean999	40e1c2af45	[fix](nereids)the common type of decimalv2 and decimalv3 shoud be decimalv3 in BinaryArithmetic operator (#24215 ) the common type of decimalv2 and decimalv3 shoud be decimalv3 in BinaryArithmetic operator	2023-09-14 15:53:23 +08:00
HHoflittlefish777	51a5895464	[Feature](RoutineLoad) Support max filter ratio for routine load (#24035 )	2023-09-14 15:30:40 +08:00
神技圈子	d8feca2530	[Enhancement]The page cache can be parameterized by the session variable of fe. (#23981 )	2023-09-14 14:28:19 +08:00
lihangyu	1ef22d7f7c	[Feature](variant) add variant type (#24170 ) Add variant type for metadata Add persistent information for variant, including the path of variant sub-columns, persisting them to the segment footer and tablet schema of the rowset.	2023-09-14 14:21:53 +08:00
Pxl	ec7b1790f9	[Improvement](ddl) make create table with mv column name report error msg more readable (#24349 ) make create table with mv column name report error msg more readable	2023-09-14 14:18:49 +08:00
Pxl	42ca0fc857	[Improvement](delete) cast delete binary predicate rhs type to lhs type (#24321 ) cast delete binary predicate rhs type to lhs type	2023-09-14 14:15:27 +08:00
jakevin	b044ff0556	[test](Nereids): add ut for PullUpProjectUnderApply (#24331 )	2023-09-14 12:56:27 +08:00
xu tao	f8692bef4b	[fix](io): use try with resource make io stream close automatically to avoid resource leak (#24297 )	2023-09-14 11:51:30 +08:00
Lei Zhang	adde012de0	[enhancement](fe) Add more detail log for replayJournal (#24218 )	2023-09-14 11:45:48 +08:00
Tiewei Fang	4eccf72bdd	[Fix](S3 tvf) fix that S3 tvf can not run properly (#24289 ) This bug is caused by this PR: #23635	2023-09-14 11:45:29 +08:00
zzzxl	ed108d48fa	[fix](invert index) fix query use char filter (#24268 )	2023-09-14 11:42:47 +08:00
zhangdong	1f0844b992	[fix](env)mock env.isCheckpointThread #24280 ssue Number: close #xxx ShowTableStmtTest.testNoDb and DropDbStmtTest.testNoPriv are unstable cases，error msg is: java.lang.Exception: Unexpected exception, expected<org.apache.doris.common.AnalysisException> but was<mockit.internal.expectations.invocation.MissingInvocation> reason is missing mock env.isCheckpointThread	2023-09-14 11:37:01 +08:00
jakevin	354cb3970b	[feature](Nereids): normalize two-digit basic date/datetime (#24333 ) normalize two-digit basic date/datetime 220201 -> 20220201 220201T010101 -> 20220201T010101 ......	2023-09-14 11:25:00 +08:00
morrySnow	46f5988245	[fix](Nereids) set operation children output order not same (#24060 ) we generate project for all set operation's children to ensure the order of all children are not changed. However, some rules, such as PushDownProjectThroughLimit could remove these projects involuntarily. When it happen, the column order is wrong and lead to BE core dump. This PR use a new variable in SetOperation to save the output order of children of set operation. Then the children's output order could be changed and never affect to SetOperation at all.	2023-09-14 11:09:58 +08:00
Calvin Kirs	64337a8698	[Improve](metadata)Start the script to set metadata_failure_recovery (#24308 )	2023-09-14 10:02:35 +08:00
minghong	1a4929b59e	[fix](planner) having clause analyze bug #24288	2023-09-14 09:54:09 +08:00
Tiewei Fang	9847f7789f	[Feature](Export) `Export` sql supports to export data of `view` and `exrernal table` (#24070 ) Previously, EXPORT only supported the export of the olap table, This pr supports the export of view table and external table.	2023-09-13 22:55:19 +08:00
jakevin	d7e5f97b74	[feature](Nereids): eliminate AssertNumRows (#23842 )	2023-09-13 22:24:02 +08:00
zy-kkk	dbfacdc4af	[improvement](jdbc catalog) Optimize Loop Performance by Caching `isNebula` Method Result (#24260 )	2023-09-13 21:40:28 +08:00
zy-kkk	5238be24a2	[fix](jdbc catalog) Ensure Thread Safety by Refactoring isDoris&convertDateToNull Static Variable in JdbcMySQLClient (#24253 )	2023-09-13 20:19:44 +08:00
minghong	dad671af8e	[feature](nereids)prune runtime filter (tpch part) #19312 A rf is effective if it could filter target data. In this pr, a rf is effective if any one of following conditions is satisfied: A filter is applied on rf src, like T.A =1 A effective rf applied on this rf's src, denote X as src and target insertsection range. src.ndv with respect to X is smaller than target.ndv explaination of condition 2 Supplier join Nation on s_nationkey = n_nationkey join Region on n_regionkey = r_regionkey RF(nation->supplier) is effective because nation is filtered by an effective rf: RF(region->nation)	2023-09-13 20:12:08 +08:00
AKIRA	786a721e03	[feat](stats) Support analyze with sample automatically (#23978 ) 1. Analyze with sample automatically when table size is greater than huge_table_lower_bound_size_in_bytes(5G by default). User can disable this feature by fe option enable_auto_sample 2. Support grammer like `ANALYZE TABLE test WITH FULL` to force do full analyze whatever table size is 3. Fix bugs that tables stats doesn't get updated properly when stats is dropped, or only few column is analyzed	2023-09-13 19:42:10 +08:00
jakevin	05722b4cfd	[feature](Nereids): date/datetime parser support many complex case (#24287 ) - feature: normalize date/datetime with leading 0 - feature: support 'HH' offset in date/datetime - feature: normalize() add missing Minute/Second in Time part - feature: normalize offset HH to HH:MM - correct DateTimeFormatterUtilsTest	2023-09-13 17:30:58 +08:00
starocean999	231038f050	[fix](planner)allow infer predicate for external table (#24227 ) CREATE EXTERNAL TABLE `dim_server` ( `col1` varchar(50) NOT NULL, `col2` varchar(50) NOT NULL ) create view ads_oreo_sid_report ( `col1` , `col2` ) AS select tmp.col1,tmp.col2 from ( select 'abc' as col1,'def' as col2 ) tmp inner join dim_server ds on tmp.col1 = ds.col1 and tmp.col2 = ds.col2; select * from ads_oreo_sid_report where col1='abc' and col2='def'; before this pr, col1='abc' and col2='def' can't be pushed to dim_server. now the 2 predicates can be pushed to odbc table.	2023-09-13 17:22:39 +08:00
Siyang Tang	d87b852e18	[enhancement](delete-handler) split Deletehandler#commitJob and add preconditions to intercept NPE(#24086 )	2023-09-13 14:34:12 +08:00
谢健	335064f897	[feature](Nereids) add lambda argument and array_map function (#23598 ) add array_map function SELECT ARRAY_MAP(x->x+1, ARRAY(87, 33, -49)) +----------------------------------------------------------------------+ \| array_map([x] -> (x + 1), x#1 of array(87, 33, -49)) \| +----------------------------------------------------------------------+ \| [88, 34, -48] \| +----------------------------------------------------------------------+	2023-09-13 14:24:16 +08:00
mch_ucchi	f985b28ac6	[fix](Nereids) default partition be prunned by mistake (#24186 ) ```sql CREATE TABLE IF NOT EXISTS t ( k1 tinyint NOT NULL, k2 smallint NOT NULL, k3 int NOT NULL, k4 bigint NOT NULL, k5 decimal(9, 3) NOT NULL, k8 double max NOT NULL, k9 float sum NOT NULL ) AGGREGATE KEY(k1,k2,k3,k4,k5) PARTITION BY LIST(k1) ( PARTITION p1 VALUES IN ("1","2","3","4"), PARTITION p2 VALUES IN ("5","6","7","8"), PARTITION p3 ) DISTRIBUTED BY HASH(k1) BUCKETS 5 properties("replication_num" = "1") select * from t where k1=10 ``` The query will return 0 rows because p3 is pruned, we fix it by skip prune default partitions. TODO: prune default partition if filter do not hit it	2023-09-13 12:04:20 +08:00
jakevin	7025293e17	[refactor](Nereids): new Date/Datetime parser to support more condition (#24224 ) * unify all Date/Datetime use one string-parser * support microsecond & ZoneOffset both exist * add many UT case * add determineScale() to get scale of datetime, original code just get length of part after . * reject more bad condition like 2022-01-01 00:00:00., we don't allow . without microsecond. * .....	2023-09-13 11:20:27 +08:00
AKIRA	f205473426	[feat](stats) enable set auto analyze time by set global session variable (#24026 )	2023-09-13 10:59:25 +08:00
morrySnow	1a3b70bf4a	[fix](Nereids) fix ctas bugs (#24267 ) 1. ctas should support without distribution desc 2. ctas should support column name list 3. ctas should throw exception when excution failed 4. ctas should convert null type to tinyint 5. ctas should support type conversion 6. ctas should convert first column from string to varchar	2023-09-13 09:17:57 +08:00
daidai	ebe3749996	[fix](tvf)support s3,local compress_type and append regression test (#24055 ) support s3,local compress_type and append regression test.	2023-09-13 00:32:59 +08:00
Qi Chen	9df72a96f3	[Feature](multi-catalog) Support hadoop viewfs. (#24168 ) ### Feature Support hadoop viewfs. ### Test - Regression tests: - hive viewfs test. - tvf viewfs test. - Broker load with broker and with hdfs tests manually.	2023-09-13 00:20:12 +08:00
Mingyu Chen	c402d48f97	[fix](query-cache) fix query cache with empty set (#24147 ) If the query result set is empty, the query cache will not cache the result. This PR fix it.	2023-09-12 20:11:20 +08:00
zclllyybb	d3f1388717	[Feature](partitions) Support auto-partition (#24153 ) Co-authored-by: zhangstar333 <2561612514@qq.com>	2023-09-12 15:23:15 +08:00
TengJianPing	4bb9a12038	[function](bitmap) support bitmap_remove (#24190 )	2023-09-12 14:52:04 +08:00
yujun	9e0d843501	[fix](publish) publish go ahead even if quorum is not met (#23806 ) Co-authored-by: Yongqiang YANG <dataroaring@gmail.com>	2023-09-12 14:29:01 +08:00
Jibing-Li	2e2e174804	[fix](forward master op)Set default catalog and db only when they exist in master FE while executing forwarded stmt (#24212 ) In this case, forward to master will throw catalog or db not found exception: Connect to a follower: 1. create database test 2. use test 3. drop database test 4. create database test This is because after step 2, the default db in follower has been set to `test`, drop database will not change the default db. In step 4, the default db `test` is set and forwarded to master, and master will fail to find it because it is already dropped. This pr is to set the default catalog and db only when they exist. The actual reason is that, when Follower handle the `drop db` stmt, it will forward to master to execute it, but can not unset its own "current db"	2023-09-12 14:12:18 +08:00
Calvin Kirs	232f120edc	[Improve](Job)Support other types of Job query interfaces (#24172 ) - Support MTMV job - Task info add create time and sql - Optimize scheduling logic	2023-09-12 13:55:56 +08:00
谢健	5ab2aea8af	add test for bindExpr (#24032 ) add unit test for bindExpression rule	2023-09-12 11:00:57 +08:00

1 2 3 4 5 ...

5911 Commits