doris

Author	SHA1	Message	Date
morrySnow	3eeeff09fd	[enhancement](nereids) convert string literal to commontype in in-expr and cass-when-expr (#17200 )	2023-03-02 22:05:35 +08:00
jakevin	93d2d461b4	[feature](Nereids): pushdown complex project through left semi/anti Join. (#17186 )	2023-03-02 21:41:08 +08:00
morrySnow	a1399043fe	[fix](Nereids) fold constant on BE could not process alias (#17259 ) 1. could not use static INSTANCE for FoldConstantOnBE rule, because it is stateful 2. if expression root is Alias, should use its child to do const collection	2023-03-02 19:16:23 +08:00
starocean999	27352afdf6	[fix](fe)support multi distinct group_concat (#17237 ) * [fix](fe)support multi distinct group_concat * update based on comments	2023-03-02 17:53:13 +08:00
谢健	33349e1457	[fix](Nereids) fold 'version()' function (#17172 ) For compatibility with legacy planner, we fold version() with GlobalVariable.version in Nereids	2023-03-02 17:35:41 +08:00
Jerry Hu	823d968452	[fix](expr) avoid crashing caused by big depth of expression tree (#17314 )	2023-03-02 16:55:53 +08:00
Mingyu Chen	39f59f554a	[improvement](dry-run)(tvf) support csv schema in tvf and add "dry_run_query" variable (#16983 ) This CL mainly changes: Support specifying csv schema manually in s3/hdfs table valued function s3 ( 'URI' = 'https://bucket1/inventory.dat', 'ACCESS_KEY'= 'ak', 'SECRET_KEY' = 'sk', 'FORMAT' = 'csv', 'column_separator' = '\|', 'csv_schema' = 'k1:int;k2:int;k3:int;k4:decimal(38,10)', 'use_path_style'='true' ) Add new session variable dry_run_query If set to true, the real query result will not be returned, instead, it will only return the number of returned rows. mysql> select * from bigtable; +--------------+ \| ReturnedRows \| +--------------+ \| 10000000 \| +--------------+ This can avoid large result set transmission time and focus on real execution time of query engine. For debug and analysis purpose.	2023-03-02 16:51:27 +08:00
yiguolei	17f4990bd3	[enhancement](functioncontext) function context should use shared ptr and simply function context (#17311 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-03-02 16:23:54 +08:00
DongLiang-0	698e9cd243	[fix](demo)fix cdc failed to synchronize datetime type in mysql, and added JsonDebeziumSchemaSerializer (#16971 ) * [fix](demo)fix cdc failed to synchronize datetime type in mysql, and added JsonDebeziumSchemaSerializer * add licenses for DateToStringConverter	2023-03-02 14:14:58 +08:00
xueweizhang	9f088f6e90	[feature](json) add json_valid function (#17247 ) add json_valid function Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-03-02 14:08:52 +08:00
xueweizhang	9155d8b9d1	[fix](delete) fix 'is null' or 'is not null' delete predicate will get wrong result (#17190 ) fix 'is null' or 'is not null' delete predicate will get wrong result Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-03-02 14:05:44 +08:00
YueW	707f814fc2	[fix](inverted index) fix still execute match query after drop inverted index (#17293 ) background： At the moment, match query must with inverted index, problem description: After drop inverted index which is the only index in table, there still can use match query for this index column. fix it: The index should be updated on BE regardless of whether the indexes_desc from FE is empty.	2023-03-02 11:12:54 +08:00
Mingyu Chen	30df268c1f	[fix](hdfs)(catalog) fix BE crash when hdfs-site.xml not exist in be/conf and fix compute node logic (#17244 ) We set LIBHDFS3_CONF env in start_be.sh, so libhdfs3 will try to read this hdfs-site.xml, if file does not exist, it will throw error. But Doris does not handle this error, cause BE crash. This CL mainly changes: Modify start_be.sh to only set LIBHDFS3_CONF if hdfs-site.xml exist. Refactor the HDFSCommonBuilder so that it can return error correctly. Add BE IP info in status, so that we can get ip from error msg like: ERROR 1105 (HY000): errCode = 2, detailMessage = [INTERNAL_ERROR]failed to init reader for file 000.snappy.orc, err: [INTERNAL_ERROR][172.21.0.101]failed to init HDFSCommonBuilder, please check check be/conf/hdfs-site.xml The logic of prefer compute node is wrong, which causing the external table query can only assign up to 3 backends. This CL refactor this logic and also change some FE config: prefer_compute_node_for_external_table If set to true, query on external table will prefer to assign to compute node. And the max number of compute node is controlled by min_backend_num_for_external_table. If set to false, query on external table will assign to any node. min_backend_num_for_external_table Only take effect when prefer_compute_node_for_external_table is true. If the compute node number is less than this value, query on external table will try to get some mix node to assign, to let the total number of node reach this value. If the compute node number is larger than this value, query on external table will assign to compute node only.	2023-03-02 11:09:55 +08:00
jakevin	4682b4564c	[enhance](Nereids): delete output in olapscan toString() (#17288 )	2023-03-02 10:53:24 +08:00
morrySnow	a5ae3072e5	[fix](planner) ignore aux expr when do push agg op (#17239 )	2023-03-02 10:44:40 +08:00
yixiutt	de5112bd90	[bugfix](merger) traverse rs_meta in lock (#17271 ) tablet_schema(version) will traverse rowset_meta and it should call in meta_lock.	2023-03-02 09:47:44 +08:00
Xinyi Zou	b7677beab7	[enhancement](memtracker) Add special counter for memtracker and fix thread create and destroy track #17301 Add a special counter for memtracker, faster, but relaxed ordering and not accurate in real time Track thread create and destroy memory, which was previously removed due to performance loss and added back	2023-03-02 08:55:00 +08:00
Gabriel	d7ee542dd4	[refactor](function) refine function geo #17289 remove unused constant args	2023-03-02 08:42:16 +08:00
yinzhijian	201cf9c8df	Revert "[enhancement](k8s) Support fqdn mode for fe in k8s enviroment (#16315 )" (#17278 ) This reverts commit 48afd77e37d63e2989cd85ab12b39a273fcd284e. There is meta problem	2023-03-02 00:44:54 +08:00
Pxl	527eb5b059	[Enchancement](function) nullable inline refactor of min_max_by/bitmap && add register_functio… (#17228 ) 1. nullable inline refactor of min_max_by/bitmap/group_concat/histogram/topn 2. add register_function_both method 3. add datetimev2 type creator of min_max_by 4. remove uint16/32/64 in FOR_INTEGER_TYPES	2023-03-02 00:00:01 +08:00
xueweizhang	bb88f2ec7d	[fix](multi-catalog) fix not find dbname from internal catalog (#17119 ) Signed-off-by: nextdreamblue <zxw520blue1@163.com> fix not find dbname from internal catalog	2023-03-01 23:59:12 +08:00
silaskenneth	5523859fd5	[typo](readme)Fix typo in README	2023-03-01 23:43:27 +08:00
gitccl	b0c5250bf9	[Enhancement](tvf) support trim_double_quotes and skip_lines for S3 and HDFS table valued function (#17224 ) support trim_double_quotes and skip_lines for S3 and HDFS table valued function	2023-03-01 23:41:31 +08:00
HappenLee	1244eed1cd	[Opt](exec) opt the dispose nullable column logic (#17192 )	2023-03-01 23:25:40 +08:00
Gabriel	633f2d52a4	[minor](log) add some logs (#17287 )	2023-03-01 22:41:50 +08:00
Jibing-Li	543539cf18	[Feature](multi catalog)(nereids)Support ES external table for new planner. (#17290 ) Support ES external table query using Nereids planner.	2023-03-01 22:32:41 +08:00
Gabriel	6de02f1f46	[minor](jvm) add more error logs for JNI (#17270 )	2023-03-01 22:09:57 +08:00
morrySnow	722755efe9	[fix](planner) change back legacy planner type coercion (#17070 ) revert legacy planner change in #16844	2023-03-01 20:55:56 +08:00
xueweizhang	34c5e84e9f	[fix](insert) fix txn error reason clearly (#16997 ) Signed-off-by: nextdreamblue <zxw520blue1@163.com>	2023-03-01 20:28:41 +08:00
Tiewei Fang	f1db0d9501	[Enhencement](File Reader) delete old file_reader (#17261 ) * delete old file_reader * fix 1	2023-03-01 20:24:03 +08:00
starocean999	6b70faa638	[fix](planner) should call Expr's unwrapSlotRef instead of getSrcSlotRef o prevent null pointer (#17265 )	2023-03-01 20:07:36 +08:00
YueW	b839353c2d	[fix](inverted index) fix BE coredump because of not ignore case ensitivity for column name when create index (#17276 )	2023-03-01 19:32:39 +08:00
Xinyi Zou	3871e989ac	[fix](memory) Avoid repeating meaningless memory gc #17258	2023-03-01 19:23:33 +08:00
Xinyi Zou	a1e3b908d7	[fix](memory) split mem usage thread and gc thread to different threads (#17213 ) Ensure that the memory status is refreshed in time Avoid frequent GC	2023-03-01 19:19:05 +08:00
Mingyu Chen	d44c4b1300	[improvement][fix](catalog) check required properties when creating catalog and fix jdbc catalog issue (#17209 ) Check required properties when creating catalog. To avoid some strange error when missing required properties This PR add checks for: hms catalog: check the validation of dfs.ha properties jdbc catalog: check jdbc_url, driver_url, driver_class is set. Fix NPE when init MasterCatalogExecutor The MasterCatalogExecutor may be called by FrontendServiceImpl from BE, which does not have ConnectionContext. Add more jdbc url param to resolve Chinese issue add useUnicode=true&characterEncoding=utf-8 by default in jdbc catalog when connecting to MySQL Update FAQ doc of catalog	2023-03-01 17:08:36 +08:00
xueweizhang	a5bd71c03a	(udf) create native function return error (#17021 ) after 1.2.0, doris does not support native udf, return error when create native function	2023-03-01 16:56:54 +08:00
jakevin	e203678e18	[fix](Nereids): fix master problem (#17275 )	2023-03-01 16:34:23 +08:00
Dongyang Li	b8d8cf1ac9	[regression](test) script for teamcity to check if pr need run build (#16937 ) * [regression](test) script for teamcity to check if pr need run build * Update check-pr-if-need-run-build.sh fix * Update check-pr-if-need-run-build.sh fix --------- Co-authored-by: stephen <hello_stephen@@qq.com>	2023-03-01 15:59:31 +08:00
xy720	48ef61780d	[refactor](struct-type) refactor and clean unused code for struct type (#17257 ) remove unused code for struct type	2023-03-01 15:49:31 +08:00
xy720	0732eb54bc	[feature](struct-type) support csv format stream load for struct type (#17143 ) Refactor from_string method in data_type_struct.cpp to support csv format stream load for struct type.	2023-03-01 15:48:48 +08:00
Pxl	62440f3140	[Bug](Materialized-View) forbiden mv rewrite on create view and remove duplicate method getIsM… (#17194 ) 1. forbiden mv rewrite on create view to avoid select fail 2. remove duplicate method getIsMaterialized	2023-03-01 13:46:56 +08:00
yadao	ff8902370c	[improvement](doc) Supplementary Bulk Deletion Notes (#17113 ) * 补充批量删除注意事项 * 按照批量删除文档前文的介绍, 用户可能会开启`show_hidden_columns`的session variable来查看表是否支持批量删除. * 后续按示例进行DELETE/MERGE的导入作业后, 如果在同一个session中执行`select count() from xxx`语句时, 可能会发现结果与预期不一致可能无法快速联想到是因为之前开启的session variable导致被删除的语句也被查出来了. * supplement batch deletion notes for English doc	2023-03-01 13:35:20 +08:00
Gabriel	b8ebcdff78	[Bug](bloomfilter) Fix wrong result using bloomfilter with date type (#17225 )	2023-03-01 12:29:20 +08:00
Gabriel	979cf42d7a	[Bug](decimalv3) Use correct decimal scale for function round (#17232 ) Co-authored-by: maochongxin <maochongxin@gmail.com>	2023-03-01 12:28:41 +08:00
jakevin	cbdf1af2d5	[feature](Nereids): pushdown Alias through Join. (#17150 )	2023-03-01 11:33:37 +08:00
zhengyu	62ec74f4e7	segcompaction featuring verticalcompaction (#16731 ) This patchset applies the following changes: using vertical compaction machanism to do segcompaction basic (WIP) refraction to separate segcompaction logic from BetaRowsetWriter add segcompaction specific ut and regression tests	2023-03-01 10:55:40 +08:00
yinzhijian	48afd77e37	[enhancement](k8s) Support fqdn mode for fe in k8s enviroment (#16315 )	2023-03-01 10:54:39 +08:00
zhangstar333	774f66c6bc	[Enhencement](test) enhencement regression test of java udf (#17251 ) when run regression test of java udf many times. if failed in some reason, the next time will meet error like: function already exist Issue Number: close #xxx	2023-03-01 09:34:40 +08:00
Yongqiang YANG	e687f3badd	Revert "[feature-wip](BE http)Support BE http service using brpc (#16123 )" (#17219 ) This reverts commit 049ecccc578802496e5421db19e21e7eb256699d. Merge back after streamload is handled.	2023-03-01 09:18:25 +08:00
Ashin Gau	2f471de675	[fix](FileCache) load file cache before start up daemon threads (#17199 ) Daemon threads in doris_main.cpp will upload tablet metrics periodically, which will use StorageEngine::instance(). However loading file cache is a process in main thread, when it takes a lot of time to load file cache, StorageEngine::instance() will be a null pointer in daemon threads.	2023-03-01 08:35:57 +08:00

1 2 3 4 5 ...

9050 Commits