The current column statistics cache loader loads data from the column_statistics OLAP table.
This PR changes the cache loader logic to first load from the column_statistics OLAP table and, if no data was loaded, fall back to table metadata. This is mainly to support fetching statistics for external catalogs through the HMS or Iceberg API.
This is the first PR; the next one will implement the fetch logic for the different external catalogs.
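A minimal sketch of the fallback flow, using hypothetical class and method names rather than the actual Doris API:

```
import java.util.Optional;

// All names below are hypothetical; the real Doris classes differ.
record StatisticsCacheKey(long catalogId, long tableId, String colName) {}
record ColumnStatistic(double ndv, double nullCount, double rowCount) {}

class ColumnStatisticCacheLoader {
    ColumnStatistic load(StatisticsCacheKey key) {
        // Step 1: try the internal column_statistics OLAP table first.
        return loadFromStatisticsTable(key)
                // Step 2: nothing stored there, so fall back to table metadata,
                // e.g. statistics fetched through the HMS or Iceberg API.
                .orElseGet(() -> loadFromTableMetadata(key));
    }

    Optional<ColumnStatistic> loadFromStatisticsTable(StatisticsCacheKey key) {
        return Optional.empty(); // stub: query the column_statistics table here
    }

    ColumnStatistic loadFromTableMetadata(StatisticsCacheKey key) {
        return new ColumnStatistic(0, 0, 0); // stub: catalog-specific fetcher goes here
    }
}
```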
Two optimizations:
1. Insert string bytes directly to remove the decoding/encoding round trip.
2. Use the native reader to read a Hudi base file when it has no log file (sketched below). Use `explain` to show how many splits are read natively.
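A minimal sketch of the per-split decision in optimization 2; the names are hypothetical, not the actual planner code:

```
import java.util.List;

// Hypothetical sketch of the per-split decision; the real Doris planner differs.
record HudiSplit(String baseFilePath, List<String> logFilePaths) {
    // A base file with no delta log files is plain Parquet/ORC, so the
    // native reader can scan it directly; otherwise fall back to the
    // Java-based Hudi reader, which can merge the log files.
    boolean readableByNativeReader() {
        return logFilePaths.isEmpty();
    }
}
```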
When we use a label to load data, that label cannot be used twice. After executing the SQL `CLEAN LABEL [label] FROM db;`, the same label should become usable again.
However, the SQL above does not work. This PR fixes that problem.
If we do type coercion on a subquery, it returns the data type after type coercion, which leads to the following error:
```
Both side of binary arithmetic is not numeric. left type is DECIMALV3(2, 1) and right type is DECIMAL(27, 9)')
```
In hash join conditions, some equality conjuncts are trustable and some are not.
An equality is trustable if one side is almost unique, like a primary key; for such a condition we can estimate more accurately.
The problem is that in the rewritten q20 there are 2 equality conditions, one trustable and one not, but we treated both of them as trustable.
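One possible shape of such a heuristic, sketched with hypothetical names and a made-up damping factor; it illustrates the idea, not the exact formula this PR implements:

```
import java.util.List;

// Hypothetical sketch of the idea, not the formula this PR implements.
final class JoinEstimate {
    record EqCondition(double leftNdv, double rightNdv, double rightRowCount) {
        // Trustable: one side is almost unique, e.g. a primary key whose
        // NDV is close to its row count.
        boolean trustable() {
            return rightNdv >= 0.9 * rightRowCount;
        }
    }

    static double estimateRows(double leftRows, double rightRows, List<EqCondition> conds) {
        double selectivity = 1.0;
        for (EqCondition c : conds) {
            double s = 1.0 / Math.max(c.leftNdv(), c.rightNdv());
            // Only a trustable equality takes full effect; treating an
            // untrusted one the same way (as before this fix) drives the
            // estimate far too low.
            selectivity *= c.trustable() ? s : Math.sqrt(s);
        }
        return leftRows * rightRows * selectivity;
    }
}
```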
Test result:
- On tpch100, q20 goes from 2.2 sec to 0.44 sec.
- No impact on other tpch queries.
- No performance impact on tpcds queries.
1. Don't write to table_statistics when creating a sync analyze job anymore, since it's meaningless.
2. Catch exceptions when creating each system analyze job, so that a single job creation failure doesn't fail the creation of all automatic collection jobs (see the sketch after this list).
3. Mark the job type of auto-triggered periodic jobs as system.
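A minimal sketch of item 2, with hypothetical names: catching failures per job inside the loop keeps one bad job from aborting the whole batch.

```
import java.util.List;
import java.util.logging.Logger;

// Hypothetical names; the real Doris job manager differs.
class AutoAnalyzeScheduler {
    private static final Logger LOG = Logger.getLogger("AutoAnalyzeScheduler");

    void createSystemJobs(List<TableRef> tables) {
        for (TableRef table : tables) {
            try {
                createAnalyzeJob(table); // may throw for a single table
            } catch (Exception e) {
                // Log and continue: one failing table must not prevent
                // the remaining automatic collection jobs from being created.
                LOG.warning("failed to create analyze job for " + table + ": " + e);
            }
        }
    }

    void createAnalyzeJob(TableRef table) throws Exception { /* stub */ }
}

record TableRef(String db, String name) {}
```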
Sometimes we encounter a Hive table with the parameter "transactional" = "true" whose format is Parquet, which is not supported.
So we need to check the input format of transactional tables.
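A minimal sketch of such a check, assuming hypothetical names; full-ACID transactional Hive tables are ORC-based, so a non-ORC input format is rejected:

```
import java.util.Map;

// Hypothetical sketch; the real check lives in the Doris Hive scan planning.
final class TransactionalHiveCheck {
    static void checkInputFormat(Map<String, String> tableParams, String inputFormat) {
        boolean transactional = "true".equalsIgnoreCase(tableParams.get("transactional"));
        // Full-ACID transactional Hive tables are ORC-based; a transactional
        // table backed by Parquet (or any non-ORC format) is not supported.
        if (transactional && !inputFormat.toLowerCase().contains("orc")) {
            throw new UnsupportedOperationException(
                    "transactional hive table with input format " + inputFormat + " is not supported");
        }
    }
}
```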
Converting query results into the MySQL format requires transforming columnar storage into row-based storage. This raises the question of choosing between sequential reading (iterating column by column) and sequential writing (iterating row by row); in practice, sequential writing is the better choice for performance.
In a test with 9M rows and more than 20 columns, this patch reduces the conversion time from 20s to 11s.
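A generic illustration of the loop-order choice (not the Doris implementation): the row-major loop makes writes to the destination sequential, which keeps the hot output buffer in cache.

```
// Generic sketch of column-to-row conversion; not the Doris code.
final class ColumnToRow {
    // Column-major loop: reads each column sequentially, but writes
    // jump between row buffers on every element.
    static void sequentialRead(long[][] columns, long[][] rows) {
        for (int c = 0; c < columns.length; c++) {
            for (int r = 0; r < columns[c].length; r++) {
                rows[r][c] = columns[c][r];
            }
        }
    }

    // Row-major loop: writes each output row sequentially; this is the
    // faster layout here because the hot destination stays in cache.
    static void sequentialWrite(long[][] columns, long[][] rows) {
        for (int r = 0; r < rows.length; r++) {
            for (int c = 0; c < columns.length; c++) {
                rows[r][c] = columns[c][r];
            }
        }
    }
}
```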
Fix a bug in left and full outer join with other conjuncts. When the number of equal-matched rows for a probe row exceeds batch_size, the _join_node->_is_any_probe_match_row_output flag is sometimes not set correctly, which results in outputting extra rows for that probe row.
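A generic sketch of the invariant behind the fix, with hypothetical types (the real code is in the BE): the flag must persist across output batches for the same probe row.

```
import java.util.List;

// Generic sketch of the invariant behind the fix; not the Doris BE code.
final class OuterJoinProbe {
    // For a left/full outer join with other conjuncts, the matches of one
    // probe row can span multiple output batches when they exceed batch_size.
    // The "any match output" flag must survive across those batches; resetting
    // it per batch emits an extra null-padded row for the probe row.
    static void outputProbeRow(Row probe, List<List<Row>> matchBatches, Emitter out) {
        boolean anyMatchOutput = false; // persists across all batches
        for (List<Row> batch : matchBatches) {
            for (Row match : batch) {
                if (out.otherConjunctsPass(probe, match)) {
                    out.emitJoined(probe, match);
                    anyMatchOutput = true;
                }
            }
        }
        if (!anyMatchOutput) {
            out.emitNullPadded(probe); // build side padded with nulls
        }
    }
}

record Row(Object[] values) {}

interface Emitter {
    boolean otherConjunctsPass(Row probe, Row match);
    void emitJoined(Row probe, Row match);
    void emitNullPadded(Row probe);
}
```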
1. Get the inverted index size before the segment writer's column writers are cleared, then add the size to the total data size and total index size.
2. Do the same in vertical compaction.
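A hypothetical sketch of the ordering (the real code is C++ in the BE segment writer): the index size must be read before the column writers are cleared.

```
import java.util.List;

// Hypothetical sketch; the real code is in the BE segment writer (C++).
final class SegmentFinalizer {
    long totalDataSize;
    long totalIndexSize;

    void finalizeColumns(List<ColumnWriter> columnWriters) {
        long invertedIndexSize = 0;
        // Read the inverted index size BEFORE the column writers are
        // cleared; after clear() the size information is unreachable.
        for (ColumnWriter w : columnWriters) {
            invertedIndexSize += w.invertedIndexFileSize();
        }
        columnWriters.clear();
        // Count the index bytes in both totals, as this PR does.
        totalDataSize += invertedIndexSize;
        totalIndexSize += invertedIndexSize;
    }
}

interface ColumnWriter {
    long invertedIndexFileSize();
}
```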
The metadata indexes in MaterializedIndexMeta were introduced in the 2.0-beta version, and writing inverted indexes relies on them. When upgrading Doris from an older version to 2.0-beta, the metadata indexes in MaterializedIndexMeta are empty, so no inverted index file will be written.
This PR adds compatibility logic: if the indexes in MaterializedIndexMeta are empty and the MaterializedIndexMeta's indexId equals the table's baseIndexId, use the metadata indexes from the table.
This PR also fixes missing metadata indexes during schema change.
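A minimal sketch of the compatibility fallback, with hypothetical type names standing in for the real FE classes:

```
import java.util.List;

// Hypothetical sketch of the fallback; the real Doris FE classes differ.
final class IndexCompatibility {
    static List<Index> resolveIndexes(MaterializedIndexMetaLike meta, OlapTableLike table) {
        List<Index> indexes = meta.getIndexes();
        // Metadata written before 2.0-beta has no per-index list. Fall back
        // to the table-level indexes, but only for the base index, which is
        // what the table-level list describes.
        if (indexes.isEmpty() && meta.getIndexId() == table.getBaseIndexId()) {
            return table.getIndexes();
        }
        return indexes;
    }
}

record Index(String name, List<String> columns) {}

interface MaterializedIndexMetaLike {
    long getIndexId();
    List<Index> getIndexes();
}

interface OlapTableLike {
    long getBaseIndexId();
    List<Index> getIndexes();
}
```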