doris

Author	SHA1	Message	Date
Gabriel	b1eef30b49	[pipelineX](dependency) Wake up task by dependencies (#26879 ) --------- Co-authored-by: Mryange <2319153948@qq.com>	2023-11-18 03:20:24 +08:00
Jack Drogon	38e4779fde	[fix](ccr) Mark getBinlog,getBinlogLag,getMeta,getBackendMeta as from master (#27211 ) Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>	2023-11-18 00:25:11 +08:00
Xiaocc	0aec436ef8	[chore](be) format reader parameter settings (#22964 )	2023-11-18 00:11:46 +08:00
Guangdong Liu	4b6330cb93	[regression test](http stream) add case for strict_mode (#27098 )	2023-11-18 00:10:02 +08:00
yujun	2841c5cfd2	[improvement](transaction) txn prune memory after visible (#27128 )	2023-11-18 00:09:27 +08:00
Luwei	fc47eb51f7	[regression test](schema change) add some schema change regression cases (#27112 )	2023-11-18 00:08:53 +08:00
Lei Zhang	c20487888f	[refact](fe) Refact GlobalTransactionMgr (#27070 ) * Only export `GlobalTransactionMgr` method, avoid using `DatabaseTransactionMgr`	2023-11-18 00:08:20 +08:00
HHoflittlefish777	fc18dc6a05	[fix](regression-test) enhance stream load test user password (#27170 )	2023-11-18 00:01:39 +08:00
walter	9319d173dd	[refactor](planner) filter empty partitions in a unified location (#27190 )	2023-11-17 23:58:21 +08:00
zy-kkk	5fb27eb652	[fix](compile) fix BE compile failure on Mac (#27206 )	2023-11-17 23:52:51 +08:00
zy-kkk	b477839bce	[enhancement](jdbc catalog) Add lowercase column name mapping to Jdbc data source & optimize database and table mapping (#27124 ) This PR adds the processing of lowercase Column names in Oracle Jdbc Catalog. In the previous behavior, we changed all Oracle columns to uppercase queries by default, but could not handle the lowercase case. This PR can solve this situation and improve All Jdbc Catalog works	2023-11-17 23:51:47 +08:00
huanghaibin	5d548935e0	[improvement](insert) support schema change and decommission for group commit (#26359 )	2023-11-17 21:41:38 +08:00
xy720	e3e249c584	[Bug](SchemeChange) Loading tasks during alter job cause modify column failed (#26975 ) When a table is doing schema-change, it adds _doris_shadow prefix in name of modified columns in shadow index. The writes during schema-change will generate rowset schema with _doris_shadow prefix in BE. If the alter task arrives at be after the write request, it will use the rowset schema with max version which has the _doris_shadow prefix. And an error will be thrown as below: a shadow column is encountered __doris_shadow_p_retailprice [INTERNAL_ERROR]failed due to operate on shadow column This commit will disable shadow prefix in rowset meta schema.	2023-11-17 21:24:53 +08:00
Kang	0a1a6cf02f	[fix](topn) add defensive code in topn opt to avoid crash due to column not in tablet schema	2023-11-17 21:14:10 +08:00
Mingyu Chen	c459408580	[fix](jni) avoid BE crash and NPE when close paimon reader (#27129 ) 1. Do not use FATAL log when jni encounter error, to avoid crash. 2. Fix NPE when closing PaimonReader, the reader may not be assigned if PaimonReader open failed.	2023-11-17 20:01:08 +08:00
yujun	635a339bce	[fix](fe ut) fix unstable ut DecommissionTest (#27082 )	2023-11-17 19:30:24 +08:00
Lei Zhang	ee12297cd9	[fix](test) Fix fe ut `BDBJEJournalTest` not stable (#27192 )	2023-11-17 19:24:12 +08:00
deardeng	a8720e645f	[fix](fe ut) Fix borrow oject throw npe (#27072 ) occasional failure of fe ut, borrowObject throw npe ``` get agent task request. type: CREATE, signature: 10008, fe addr: null java.lang.NullPointerException at java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:936) at org.apache.commons.pool2.impl.GenericKeyedObjectPool.register(GenericKeyedObjectPool.java:1079) at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:347) get agent task request. type: CREATE, signature: 10012, fe addr: TNetworkAddress(hostname:127.0.0.1, port:56072) at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:277) at org.apache.doris.common.GenericPool.borrowObject(GenericPool.java:99) at org.apache.doris.utframe.MockedBackendFactory$DefaultBeThriftServiceImpl$1.run(MockedBackendFactory.java:219) at java.lang.Thread.run(Thread.java:750) ```	2023-11-17 19:16:29 +08:00
Ashin Gau	52995c528e	[fix](iceberg) iceberg use customer method to encode special characters of field name (#27108 ) Fix two bugs: 1. Missing column is case sensitive, change the column name to lower case in FE for hive/iceberg/hudi 2. Iceberg use custom method to encode special characters in column name. Decode the column name to match the right column in parquet reader.	2023-11-17 18:38:55 +08:00
DuRipeng	f8b61d3d8e	[Enhance](fe) select BE local broker to scan Hive table when 'broker.name' in hms catalog is specified (#27122 ) Since #24830 introduce `broker.name` in hms catalog, data scan will run on specified brokers. And [doris operator](https://github.com/selectdb/doris-operator) support BE and broker deployed in same pod, BE access local broker is the fastest approach to access data. In previous logic, every inputSplit will select one BE to execute, then randomly select one broker for actual data access, BE and related broker are always located on separate K8S pod. This pr optimizes the broker select strategy to prioritize BE-local broker when `broker.name` is specified in hms catalog.	2023-11-17 18:29:55 +08:00
amory	0ece18d6cd	[FIX](regresstest) fix test_map_nested_array csv file for id(#27105 )	2023-11-17 04:20:02 -06:00
morrySnow	fa7e1b7fc7	[fix](Nereids) result type of add precision is 1 more than expected (#27136 )	2023-11-17 04:13:09 -06:00
xy	fdec286e82	[optimize](cooldown)Shorten the _meta_lock lock interval (#27118 ) Change the two passes of _rs_version_map to one, reducing cpu overhead and shortening the lock interval of _meta_lock Co-authored-by: xingying01@corp.netease.com <xingying01@corp.netease.com>	2023-11-17 16:59:36 +08:00
xy	ab322eaa2b	[improvement](detailMessage) add AvailCapacity prompt in detailMessage (#26328 ) Co-authored-by: xingying01 <xingying01@corp.netease.com>	2023-11-17 16:54:31 +08:00
airborne12	593e3662b0	[Fix](match) fix match null for no index (#26983 ) This pull request addresses an issue observed with inverted index tables or tables without indices when querying null values using the MATCH function. Previously, executing a query like `SELECT * FROM table WHERE column MATCH null;` would yield incorrect results. The update introduces enhanced handling of nullable columns within the MATCH function, ensuring accurate query results when null values are involved.	2023-11-17 15:57:50 +08:00
minghong	9b040b3fbd	[fix](nereids) partition prune fails in case of NOT expression (#27047 ) * handle not and add regression test	2023-11-17 15:50:09 +08:00
Jibing-Li	ec92ba4af1	[fix](statistics)Fix alter column stats bug (#27093 ) Encode the min and max value with base64 encoder while inject the column stats.	2023-11-17 15:40:47 +08:00
Kaijie Chen	4d2fb1fffb	[fix](load) add lock in active_memtable_mem_consumption (#27101 )	2023-11-17 15:03:15 +08:00
camby	e1b180d53d	[improve](streamload) Explicitly judge the return value of close #27134	2023-11-17 14:17:09 +08:00
AKIRA	285c617a5f	[minor](stats) Add start/end time for analyze job, precise to seconds of TableStats update time #27123	2023-11-17 13:59:53 +08:00
Guangdong Liu	b359fff097	[regression test](http_stream) Case for Invalid file format (#27133 )	2023-11-17 13:46:17 +08:00
Qi Chen	a0661ed9d2	[Fix](multi-catalog) Fix complex type crash when using dict filter facility in the parquet-reader. (#27151 ) - Fix complex type crash when using the dict filter facility in the parquet-reader by turning off the dict filter facility in this case. - Add orc complex types regression test.	2023-11-17 13:43:58 +08:00
Guangdong Liu	c7d961cb11	[regression test](stream load) add case for strict_mode=true and max_filter_ratio=0.5 (#27125 )	2023-11-17 13:39:01 +08:00
Guangdong Liu	ee08958526	[regression test](http_stream) case for timezone (#27149 ) It does not work now, anyway we need a case.	2023-11-17 13:36:41 +08:00
Kang	4fff9a5937	[Improvement](inverted index) delay inverted index col read to reduce IO (#26080 ) (#26337 )	2023-11-17 13:12:12 +08:00
starocean999	06f0c10c8b	[fix](nereids) count in correlated subquery shoud not output null value (#27064 ) consider sql: SELECT * FROM t1 WHERE t1.a <= (SELECT COUNT(t2.a) FROM t2 WHERE (t1.b = t2.b)); when unnest correlated subquery, we create a left join node. Assume outer query is left table and subquery is right one. If there is no match, the row from right table is filled with nulls. But COUNT function is always not nullable. So wrap COUNT with Nvl to ensure it's result is 0 instead of null to get the correct result	2023-11-16 22:31:42 -06:00
TengJianPing	4ac460af28	[decimal](tests) add test case for least/greatest for decimalv3 type (#26930 )	2023-11-17 12:09:59 +08:00
Mryange	91af86bc78	[fix](function) fix error when use negative number in explode_numbers #27020	2023-11-17 12:02:14 +08:00
Pxl	1188d88a10	[Chore](status) catch some error status on storage (#27132 ) catch some error status on storage	2023-11-17 12:00:39 +08:00
Lei Zhang	43ffcc5012	[fix](fe) Fix `enable_nereids_planner` forward not take effect (#26782 ) * The java reflection method `getFields()` only return public fields, but enable_nereids_planner is private	2023-11-17 11:13:07 +08:00
Chester	334260dff7	[feature](function) support ip function ipv4stringtonum(ordefault, ornull), inet_aton (#25510 )	2023-11-17 10:27:07 +08:00
lihangyu	a4d78682ff	[Optimize](point query) clear names to reduce mem consumption and cpu cost related to block column name (#26931 )	2023-11-17 10:18:21 +08:00
Mryange	0c264c8a14	[fix](pipelineX) fix scheduling bug in union operator (#27131 )	2023-11-17 10:02:54 +08:00
lsy3993	a510b5be81	[regression](delete) add regression test for every type delete (#26954 )	2023-11-16 08:03:31 -06:00
Pxl	fd6a2cba5e	[Chore](clang-tidy)enable readability-function-size.LineThreshold and readability-functi… set readability-function-size.LineThreshold to 80 and enable readability-function-cognitive-complexity	2023-11-16 20:37:12 +08:00
wangbo	492a22dced	select coordinator node from user's tag when exec streaming load (#27106 )	2023-11-16 19:55:50 +08:00
catpineapple	0ac3984d4b	[doc](fix) en docs for k8s operator (#27049 )	2023-11-16 18:40:56 +08:00
Kaijie Chen	afffcfd14c	[fix](load) skip cancel already cancelled channels (#27111 )	2023-11-16 18:38:40 +08:00
Kaijie Chen	e29d8cb110	[feature](move-memtable) support pipelineX in sink v2 (#27067 )	2023-11-16 15:00:55 +08:00
HowardQin	54989175fb	[case] Load json data with enable_simdjson_reader=false (#26601 )	2023-11-16 14:40:59 +08:00

1 2 3 4 5 ...

14903 Commits