doris

Author	SHA1	Message	Date
Mingyu Chen	26e930eed1	[Fix](multi-catalog) Make BE selection policy works fine when enable prefer_compute_node_for_external_table (#19346 )	2023-05-12 15:32:50 +08:00
Mingyu Chen	860ce97622	[feature](torc) support insert only transactional hive table on FE side (#19419 ) * [feature](torc) support insert only transactional hive table on FE side * 3 * commit * 1	2023-05-12 15:32:26 +08:00
zzzzzzzs	feef5afa0b	[typo](doc) Fixed typos in SHOW-ROUTINE-LOAD.md (#19573 )	2023-05-12 14:37:28 +08:00
Zhang Wenxin	a1da57c63e	[opt](Nereids)(WIP) optimize agg and window normalization step 2 #19305 1. refactor aggregate normalization to avoid data amplification before aggregate 2. remove useless aggreagte processing in ExtractAndNormalizeWindowExpression 3. only push distinct aggregate function children TODO: 1. push down redundant expression in aggregate functions 2. refactor normalize repeat rule 3. move expression normalization and optimization after plan normalization to avoid unexpected expression optimization.	2023-05-12 14:00:13 +08:00
Mingyu Chen	0477a9f5de	[fix](dateformat) Fix hour date format (#19569 ) Introduced from #19265. The hour format should support both "5" and "05".	2023-05-12 13:38:41 +08:00
luozenglin	56a6431b55	[fix](pipeline) fix query returns empty result instead of an error occasionally after being cancelled (#19561 )	2023-05-12 12:40:41 +08:00
Gabriel	56bc8a762d	[decimalv3](literal) use decimalv3 literal if enable_decimal_conversion is true (#19559 )	2023-05-12 12:01:54 +08:00
Mingyu Chen	9bf6ecca48	[minor](log) change debug log to info to observe the storage medium change #19529 When user set default_storage_medium to true, the storage medium of all partitions should be SSD, and cooldown time should be 9999-12-31 23:59:59. So that it won't change to HDD. But looks like sometimes it still change to HDD. So I change the debug log to info to observer it.	2023-05-12 11:02:55 +08:00
yiguolei	8ef9212ddc	[enhancement](exceptionsafe) force check exec node method's return value (#19538 )	2023-05-12 10:21:00 +08:00
AlexYue	157ec5757a	[fix](s3FileWriter) don't use bthread countdown event to sync #19534 Unfortunately BthreadCountDownEvent will not serve as one sync primitive for this scenario where are all pthread workers. BthreadCountDownEvent::time_wait is used for bthread so it will result in some confusing sync problem like heap buffer use after free.	2023-05-12 09:19:57 +08:00
Jerry Hu	bd6a36091e	[chore](cmake) fix DORIS_JAVA_HOME from JAVA_HOME (#19521 )	2023-05-12 09:12:38 +08:00
Yongqiang YANG	1296a920c2	[chore](collaborator) add several collaborators to manage issue (#19550 )	2023-05-12 09:09:52 +08:00
Mingyu Chen	868bae47f6	[improvement](docker) update compilation Dockerfile (#19563 )	2023-05-12 09:06:45 +08:00
starocean999	e9392780a9	[fix](nereids)fix some nereids planner bugs (#19509 ) 1.some encrypt and decrypt functions have wrong blockEncryptionMode 2.topN node should compare tuples from intermediate_row_desc with first_sort_slot.tuple_id 3.must keep the limit if it's an uncorrelated in-subquery with limit on sort, like select a from t1 where a in ( select b from t2 order by xx limit yy )	2023-05-12 09:06:16 +08:00
Chuang Li	a041f8eabe	[fix](fe) Fx SimpleDateFormatter thread unsafe issue by replacing to DateTimeFormatter. (#19265 ) DateTimeFormatter replace SimpleDateFormat in fe module because SimpleDateFormat is not thread-safe.	2023-05-11 22:50:24 +08:00
jakevin	d58498841a	[fix](Nereids) Should copy JoinReorderContext for PushdownProject (#19508 ) 1. should copy JoinReorderContext 2. verify bushy tree join reorder	2023-05-11 21:05:12 +08:00
yongkang.zhong	9568de303a	[Chore](build) update clang-format version check (#19542 ) update clang-format version check	2023-05-11 19:38:58 +08:00
Zhang Wenxin	35c4de9fea	[fix](Nereids) convert decimalv2 type to decimalv3 type by mistake (#19491 )	2023-05-11 19:11:51 +08:00
minghong	c5a53e0caa	[tpch](nereids) estimate cost with unknown column stats #19046 make nereids generate more reasonable plans with table row count, but without column stats. TODO: q5 and q7 is not good, because of column correlation ps_suppkey and ps_partkey	2023-05-11 19:03:11 +08:00
xy720	39ec8aa64c	[refactor](complex-type) refactor array/map/struct literal to not invoke execute() function in prepare state (#19068 )	2023-05-11 18:44:37 +08:00
LiBinfeng	99cef84acf	[Feature](Nereids) Add nereids minidump (#18747 )	2023-05-11 18:36:30 +08:00
AKIRA	45c89c1d3c	[Fix](stats) Stats persistence failed when a column is all null values (#19412 )	2023-05-11 17:44:44 +08:00
lsy3993	e8f1ce4eaa	[typo](docs) fix doc bug of 'collect_list' and 'deploy' (#19531 )	2023-05-11 16:53:29 +08:00
yongkang.zhong	3cc8bbb93f	[chore](Java UDF)remove the error code and add the copy jar (#19503 ) * [chore](Java UDF)remove the error code and add the copy jar * [chore](Java UDF)remove the error code and add the copy jar	2023-05-11 16:17:29 +08:00
Xiangyu Wang	589dd8a9b3	[Fix](multi-catalog) Fix query hms tbl with compressed data files. (#19387 ) If submit a query contains hms tbls which data files are compressed (bz2,lzo,lz4 ...), a error will occurs like this: ```[INTERNAL_ERROR]Only support csv data in utf8 codec``` . This is because `org.apache.doris.planner.external.HiveScanNode` set `fileFormatType` as `TFileFormatType.FORMAT_CSV_PLAIN` whether the real compress algo of data files are. This pr try to fix this problem.	2023-05-11 14:53:58 +08:00
yangshijie	ed8a4b4120	[feature-wip](duplicate_no_keys) skip sort function if the table is duplicate without keys (#19483 )	2023-05-11 14:44:16 +08:00
wangyongfeng	3edde1fa4a	[fix](docs) update build-extension.yml (#19528 ) Co-authored-by: wangyf0555 <wangyongfeng@flywheels.com>	2023-05-11 14:38:04 +08:00
Qi Chen	0b25376cf8	[feature](torc) support insert only transactional hive table on be side (#19518 )	2023-05-11 14:15:09 +08:00
AKIRA	6d2070c59d	[enhancement](stats) Make stats cache item size configurable (#19205 )	2023-05-11 13:59:37 +08:00
jakevin	dc497e11bb	[fix](Nereids) avoid to push top Project of JoinCluster in PushdownProjectThroughJoin (#19441 ) We shouldn't push top Project of JoinCluster in PushdownProjectThroughJoin like ``` * Project (id + 1) if this project is top project of Join Cluster * \| * Join * / \ * Join Join * / .... * Join ```	2023-05-11 13:58:54 +08:00
herry2038	834bf2eab7	[feature](array) Add array_last lambda function (#18388 ) Add array_last lambda function	2023-05-11 13:15:54 +08:00
zhannngchen	5167dc1251	[feature](merge-on-write) enable merge on write by default (#19017 )	2023-05-11 11:10:48 +08:00
yiguolei	1d421a26d9	[bugfix](memory) merge block may allocate failed (#19507 )	2023-05-11 10:42:47 +08:00
zhangstar333	1d1b2f98c3	[refactor](function) let agg functions exception safety (#19109 )	2023-05-11 10:17:11 +08:00
abmdocrt	71f7e9e185	[test](cast func) add test for cast float text to int when nereids is on #19517	2023-05-11 08:24:54 +08:00
Ashin Gau	d7ad299154	[fix](NestedType) throw error when reading complex nested type in orc&parquet (#19489 ) Doris block does not support complex nested type now, but orc and parquet reader has generated complex nested column, which makes the output of mysql client wrong and users confused.	2023-05-11 07:51:02 +08:00
Ashin Gau	3ba3b6c66f	[opt](FileCache) use modification time to determine whether the file is changed (#18906 ) Get the last modification time from file status, and use the combination of path and modification time to generate cache identifier. When a file is changed, the modification time will be changed, so the former cache path will be invalid.	2023-05-11 07:50:39 +08:00
Qi Chen	4418eb36a3	[Fix](multi-catalog) Fix some hive partition issues. (#19513 ) Fix some hive partition issues. 1. Fix be will crash when using hive partitions field of `date`, `timestamp`, `decimal` type. 2. Fix hdfs uri decode error when using `timestamp` partition filed which will cause some url-encoding for special chars, such as `%3A` will encode `:`.	2023-05-11 07:49:46 +08:00
Tiewei Fang	95833426e8	[BugFix](table-value-function) Fix backends() tvf (#19452 ) Change the `Alive/SystemDecommissioned/ClusterDecommissioned` field type of the `backends()`tvf to bool	2023-05-11 07:49:27 +08:00
Jibing-Li	2d1f597413	[Fix](statistics)Fix hive table statistic bug (#19365 ) Fix hive table statistic bug. Collect table/partition level statistics.	2023-05-11 07:48:58 +08:00
DeadlineFen	22f95fca97	set ccache configs in env.h (#19502 )	2023-05-11 06:54:54 +08:00
Yulei-Yang	41d4ed8367	[Improvement](multicatalog) support show_partitions for hms catalog (#19242 ) * [Improvement](multicatalog) support show_partitions for hms catalog * update according review advice	2023-05-11 01:17:23 +08:00
lsy3993	840dbdc7c0	[typo](docs) add comment of partition and key/value column (#19448 ) * change docker compose to 'docker-compose' * modify sql of mysql * fix docker start and stop cmd * new commit * add comment of partition and key/value column * Update cn doc format --------- Co-authored-by: Luzhijing <82810928+luzhijing@users.noreply.github.com>	2023-05-11 01:14:17 +08:00
Jibing-Li	68505a1192	[Test](multi catalog)Add test case for Iceberg External Table. #19488	2023-05-11 01:13:40 +08:00
Jerry Hu	47edc5a06e	[fix](functions) Support nullable column for multi_string functions (#19498 )	2023-05-11 01:13:13 +08:00
zclllyybb	28e088aee1	[optimization](be) optimization for ColumnConst when writing mysql result (#19122 ) * opt for result * fix	2023-05-11 01:04:18 +08:00
Lei Zhang	8845c2cf44	[fix](bdbje) remove `System.exit(-1)` in BDBEnvironment.close() (#19335 ) * https://github.com/apache/doris/issues/18766	2023-05-11 01:01:38 +08:00
Xiangyu Wang	0f6c69de53	[Fix](multi-catalog) Fix sync hms event failed when start FE soon. (#19344 ) * [Fix](multi-catalog) Fix sync hms event failed when start FE soon after. * [Fix](multi-catalog) Fix sync hms event failed when start FE soon after. --------- Co-authored-by: wangxiangyu@360shuke.com <wangxiangyu@360shuke.com>	2023-05-11 01:00:55 +08:00
zhangdong	b129c9901b	[improvement](FQDN)Change the implementation of fqdn (#19123 ) Main changes: 1. If fqdn is enabled in the configuration file, when fe starts, localAddr will obtain fqdn instead of IP, priority_ Networks will fail 2. The IP and host names of Backend and Front are combined into one field, host. When fqdn is enabled, it represents the host name, and when not enabled, it represents the IP address 3. The communication between clusters directly uses fqdn, and various Connection pool add authentication mechanisms to prevent the IP address of the domain name from changing and the connection between nodes from making errors 4. No longer requires polling to verify if the IP has changed, delete fqdnManager 5. Change the method of verifying the legitimacy of nodes between FEs from obtaining client IP to displaying the identity of the transmitting node itself in the HTTP request header or the message body of the throttle 6. When processing the heartbeat, if BE finds that the host stored by itself is inconsistent with the host stored by the master, after verifying the legitimacy of the host, it will change its own host instead of directly reporting an error 7. Simplify the generation logic of fe name Scope of influence: 1. Establishing communication connections between clusters 2. Determine whether it is the same node through attributes such as IP 3. Print Log 4. Information display 5. Address Splicing 6. k8s deployment 7. Upgrade compatibility Test plan: 1. Change the IP address of the node, while keeping the fqdn unchanged, change the IP addresses of fe and be, and verify whether the cluster can read and write data normally 2. Use the master code to generate metadata, and use the previous metadata on the current pr to verify whether it is compatible with the old version (upgrading is no longer supported if fqdn has been enabled before) 3. Deploy fe and be clusters using k8s to verify whether the cluster can read and write data normally 4. According to https://doris.apache.org/zh-CN/docs/dev/admin-manual/cluster-management/fqdn?_highlight=fqdn#%E6%97%A7%E9%9B%86%E7%BE%A4%E5%90%AF%E7%94%A8fqdn Upgrading old clusters 5. Use streamload to specify the fqdn of fe and be to import data separately 6. Use different users to start transactions and write data using insert statements	2023-05-11 00:44:48 +08:00
yongkang.zhong	3a22af836e	[fix](jdbc catalog) fix error to clickhouse uint64 type Conversion (#19463 ) * [fix](jdbc catalog) fix error to clickhouse uint64 type Conversion * add test case	2023-05-10 21:53:30 +08:00

1 2 3 4 5 ...

10435 Commits