Commit Graph

343 Commits

SHA1 Message Date
6e855dd198 [feature](sql-dialect) support converting sql using the sql convertor service (#27581)
Add a new FE config `sql_convertor_service`.
If this config is set and the session variable `sql_dialect` is also set,
Doris will try to use a standalone SQL converter service to convert the user's input SQL based on the
specified SQL dialect, e.g.:

```
mysql> set sql_dialect="presto";
Query OK, 0 rows affected (0.02 sec)

mysql> select * from db1.tbl1 where "k1" = 1;  # will be converted to select * from db1.tbl1 where `k1` = 1;
+------+------+
| k1   | k2   |
+------+------+
|    1 |    2 |
+------+------+
1 row in set (0.08 sec)
```

The SQL converter service should be an HTTP service.
The request and response body formats can be found in `SQLDialectUtils.java`.
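A minimal sketch of wiring this up, assuming the config is runtime-mutable (the converter URL below is a placeholder):

```
-- Point Doris at a converter service (placeholder URL), then pick a dialect:
ADMIN SET FRONTEND CONFIG ("sql_convertor_service" = "http://127.0.0.1:5001/api/v1/convert");
SET sql_dialect = "presto";
```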
2023-12-18 10:32:52 +08:00
92a4a9770c [improvement](hint) query fail print tablet detail info (#28476) 2023-12-16 12:54:25 +08:00
e6b135c76a [improvement](fe) Add reason log when Env is not ready (#28286) 2023-12-15 12:22:06 +08:00
c4242ab69e [Chore](Job)Add the configuration of the maximum number of persistence tasks for the job (#28411) 2023-12-15 11:14:06 +08:00
ee24667b9f [fix](group commit) Fix some group commit problems (#28319) 2023-12-14 14:38:56 +08:00
3f202477ec [minor](import) modify some imports (#28206) 2023-12-12 11:39:54 +08:00
573b594df3 [improvement](Variant Type) Support displaying subcolumns expanded for the variant column (#27764) 2023-12-08 20:34:58 +08:00
ec08850c08 [Config](multi-catalog) Enable query hive views as default. (#27906)
Remove the EXPERIMENTAL tag for `enable_query_hive_views` and set it to true by default.
This feature has been running for several months on our cluster, which has more than a hundred thousand tables; I think it is fine to enable it by default.
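If the old behavior is needed, a sketch of reverting it, assuming the config stays runtime-mutable:

```
-- Turn Hive view querying back off on the current FE:
ADMIN SET FRONTEND CONFIG ("enable_query_hive_views" = "false");
```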
2023-12-06 20:46:09 +08:00
393c491820 [FIX](map/struct)fix map/struct literal from fe (#28026) 2023-12-06 13:56:56 +08:00
cbf1f8620a [Feature](job)support cancel task and fix log invalid (#27703)
- Running tasks can now be shown, and a failure to cancel them has been fixed
- When an insert task's scheduling cycle arrives while a previous task is still running, the new scheduling round is canceled
- Refactor the job status change SQL
- Fix a timer job window error
- Support canceling a task (see the sketch below)
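A sketch of canceling a running task, assuming the `CANCEL TASK` syntax this job framework exposes (job name and task id are placeholders):

```
-- Cancel one running task of a job by job name and task id (placeholders):
CANCEL TASK WHERE jobName = 'example_job' AND taskId = 1001;
```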
2023-12-06 10:44:09 +08:00
80f528bf26 [enhancement](backup-restore) add config for upload/download task num per be (#27772)
Add a config to set the upload/download task number per BE, improving the overall upload/download speed and the performance of backup and recovery.
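A sketch of raising the parallelism at runtime; the config names below are my assumption of what this PR adds, so verify them against fe.conf:

```
-- Raise per-BE task counts for backup upload and restore download
-- (config names assumed from this PR):
ADMIN SET FRONTEND CONFIG ("backup_upload_task_num_per_be" = "6");
ADMIN SET FRONTEND CONFIG ("restore_download_task_num_per_be" = "6");
```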

---------

Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-12-04 11:19:45 +08:00
ba893a4e60 [log](table) add table lock failed log (#27659) 2023-12-03 23:34:21 +08:00
66cfcc67cb [Fix](executor) Fix Follower FE query queue may not work when exec alter #27831 2023-12-02 20:19:50 +08:00
9daa7dc6b5 [refactor](http) disable snapshot and get_log_file api (#27724)
Disable two HTTP APIs by default:

1. BE's `/api/snapshot`
2. FE's `/get_log_file`
2023-11-29 16:11:51 +08:00
7ac97c1650 [fix](bdbje) add free disk config (#27578) 2023-11-27 21:29:02 +08:00
bb68900bed [fix](bdbje) Fix bdbje logging level not work (#27597)
* `EnvironmentConfig.FILE_LOGGING_LEVEL` only sets the FileHandler level; we should
   set the logger level first, otherwise it will not take effect.
2023-11-27 21:24:34 +08:00
1b4cd24b36 [opt](Nereids) support where, group by, having, order by clause without from clause in query statement (#27006)
Support where, group by, having, and order by clauses without a from clause in a query statement.
For example:

```
SELECT 1 AS a, COUNT(*), SUM(2), AVG(1), RANK() OVER() AS w_rank
WHERE 1 = 1
GROUP BY a, w_rank
HAVING COUNT(*) IN (1, 2) AND w_rank = 1
ORDER BY a;
```

this returns:

+------+----------+--------+--------+--------+
| a    | count(*) | sum(2) | avg(1) | w_rank |
+------+----------+--------+--------+--------+
|    1 |        1 |      2 |    1.0 |      1 |
+------+----------+--------+--------+--------+

Another example:

```
select 1 c1, 2 union (select "hell0", "") order by c1;
```

The second column's datatype will be varchar(65533), where 65533 is the default varchar length.

this returns:

+-------+------+
| c1    | 2    |
+-------+------+
| 1     | 2    |
| hell0 |      |
+-------+------+
2023-11-27 12:05:14 +08:00
9aafcf2e22 [Enhance](fe) Support setting initial root password when FE firstly launch (#27438) 2023-11-27 11:03:27 +08:00
7134da63ac [improvement](nereids) add config for nereids trace dir (#27495)
Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-11-25 10:37:23 +08:00
6b1428dba1 Fix auto analyze doesn't filter unsupported type bug. (#27559)
Catch Throwable in the auto analyze thread for each database; otherwise the thread quits once one database fails to create jobs, and all other databases never get analyzed.
Rename the FE config item `full_auto_analyze_simultaneously_running_task_num` to `auto_analyze_simultaneously_running_task_num`.
2023-11-25 10:22:52 +08:00
6142a539f4 [deprecated](external) remove deprecated hudi and iceberg external table (#27456)
The creation of hudi and iceberg tables has been disallowed since v1.2.
All these features are covered by the hudi/iceberg catalogs.
We should remove the code in v2.1.

The PR mainly changes:
1. remove the code of the hudi/iceberg external table.
2. remove the code of the iceberg database.
3. disallow the creation of hive external tables.
4. disable odbc, mysql, and broker external tables by default, and add the FE config `disable_odbc_mysql_broker_table` to control it (see the sketch below).
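A sketch of re-enabling the legacy table types, assuming `disable_odbc_mysql_broker_table` is runtime-mutable:

```
-- Allow creating odbc/mysql/broker external tables again (not recommended):
ADMIN SET FRONTEND CONFIG ("disable_odbc_mysql_broker_table" = "false");
```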
2023-11-24 23:18:07 +08:00
dfe3a2dd01 [feature](mtmv)(3)Implementing multi table materialized views (#26146)
Introduction to the main classes:
- MTMVService: MTMV services for other modules to call
- MTMVHookService: all operations that affect the MTMV
  - MTMVJobManager: all operations that affect the MTMV job
  - MTMVCacheManager: all operations that affect the MTMV cache
- MTMVTask & MTMVJob: inherit from the job framework (see the sketch below)
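A minimal sketch of the user-facing DDL this framework backs, assuming the async materialized view syntax (table and column names are placeholders); MTMVJobManager creates the refresh job and MTMVTask instances execute the refreshes:

```
-- A multi-table MV refreshed on a schedule (names are placeholders):
CREATE MATERIALIZED VIEW mv_orders_daily
BUILD DEFERRED REFRESH AUTO ON SCHEDULE EVERY 1 DAY
DISTRIBUTED BY HASH(o_orderkey) BUCKETS 2
PROPERTIES ('replication_num' = '1')
AS
SELECT o.o_orderkey, o.o_orderdate, SUM(l.l_extendedprice) AS revenue
FROM orders o JOIN lineitem l ON o.o_orderkey = l.l_orderkey
GROUP BY o.o_orderkey, o.o_orderdate;
```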
2023-11-24 12:34:38 +08:00
8e3b4e99d9 [improve](move-memtable) add switch for stream load in fe.conf (#27440) 2023-11-23 15:11:17 +08:00
5d548935e0 [improvement](insert) support schema change and decommission for group commit (#26359) 2023-11-17 21:41:38 +08:00
dbac12bae8 [fix](memory)Modify the default conf values of mem_limit and cache_last_version_interval_second (#26945)
mem_limit from 80% to 90%
cache_last_version_interval_second from 900 to 30
2023-11-15 14:02:58 +08:00
13bc6b702b [refactor](Job)Refactor JOB (#26845)
## Motivation:
In the past, our JOB only supported Timer JOB, which could only schedule fixed-time tasks. Meanwhile, the JOB was solely responsible for execution, and during execution states could become inconsistent: a task might execute successfully while the JOB's recorded task status was never updated.
Such inconsistency in the task states recorded by the JOB could not guarantee the correctness of the JOB's status. As more businesses gradually integrated with the JOB, such as the export job and the mtmv job, we found that scaling became difficult and the JOB became particularly bulky. Hence, we decided to refactor JOB.

## Refactoring Goals:
- Provide a unified external registration interface so that all JOBs can be registered through this interface and be scheduled by the JobScheduler.

- The JobScheduler can schedule instant JOBs, timer JOBs, and manual JOBs.

- JOB should provide a unified external extension class. All JOBs can be extended through this extension class, which can provide special functionalities like JOB status restoration, Task execution, etc.

- Extended JOBs should manage task states on their own to avoid inconsistent state maintenance issues.

- Different JOBs should use their own thread pools for processing to prevent inter-JOB interference.

### Design:
- The JOBManager provides a unified registration interface through which all JOBs can register and then be scheduled by the JobScheduler.
- The TimerJob periodically fetches JOBs that need to be scheduled within a time window and hands them over to the Time Wheel for triggering. To prevent excessive tasks in the Time Wheel, it distributes the tasks to the dispatch thread pool, which then assigns them to corresponding thread pools for execution.
- ManualJob or Instant Job directly assigns tasks to the corresponding thread pool for execution.
- The JOB provides a unified extension class that all JOBs can utilize for extension, providing special functionalities like JOB status restoration, Task execution, etc.
- To implement a new JOB, one only needs to implement AbstractJob.class and AbstractTask.class.
<img width="926" alt="image" src="https://github.com/apache/doris/assets/16631152/3032e05d-133e-425b-b31e-4bb492f06ddc">

## NOTICE
This will cause the master's metadata to be incompatible
2023-11-14 18:18:59 +08:00
0a9d71ebd2 [Fix](Planner) fix varchar does not show real length (#25171)
Problem:
When we create a table with datatype varchar (no length), we regard it as max length by default. But when we desc the table, it does not show the
real length, only varchar().
Reason:
When upgrading from 2.0.1 to 2.0.2, we supported the new feature of creating a length-less varchar, and it is shown the same way as in the
DDL schema, so users would be confused about the varchar's real length.
Solved:
Change the display of varchar() to varchar(65533), which is compatible with hive (see the sketch below).
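A sketch of the behavior after the fix (table name is a placeholder):

```
-- A length-less varchar column now reports its real max length:
CREATE TABLE t_demo (k1 INT, v1 VARCHAR)
DISTRIBUTED BY HASH(k1) BUCKETS 1
PROPERTIES ('replication_num' = '1');
DESC t_demo;  -- v1 is shown as varchar(65533) instead of a bare varchar()
```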
2023-11-14 10:49:21 +08:00
ebc15fc6cc [fix](transaction) Fix concurrent schema change and txn cause dead lock (#26428)
Concurrent schema change and txn may cause a dead lock. An example:

1. Txn T commits but is not yet published;
2. A schema change or rollup runs on T's related partition and adds alter replica R;
3. sc/rollup adds a sched txn watermark M;
4. FE restarts;
5. After the FE restart, T's loadedTblIndexes is cleared because it is not saved to disk;
6. T publishes the version to all tablets, including sc/rollup's new alter replica R;
7. Since R does not contain the txn data, T fails and then waits forever for R's data;
8. sc/rollup waits for txns before M to finish; only after that will it let R copy history data;
9. Since T never finishes, sc/rollup waits forever, so R never copies history data;
10. Txn T and sc/rollup wait for each other forever, causing a dead lock.

Fix: because sc/rollup ensures double write after the sched watermark M, when finishing a transaction and checking an alter replica:

- if the txn id is bigger than M, check it just like a normal replica;
- otherwise skip checking this replica; the BE will modify the history data later.
2023-11-13 21:39:28 +08:00
c0fda8c5c2 [improve](group commit) Add a switch to wait internal group commit lo… (#26734)
* [improve](group commit) Add a switch to make internal group commit load finish

* modify group commit tvf plan
2023-11-13 10:35:35 +08:00
0749d632c4 [feature](diagnose) diagnose for cluster balance (#26085) 2023-11-10 15:31:58 +08:00
a5565f68b2 [Refactor](opentelemetry) Remove opentelemetry (#26605) 2023-11-09 18:05:34 +08:00
45c2fa62a4 [pipeline](exec) disable shared scan in default and disable shared scan in limit with where scan (#25952) 2023-11-08 17:51:12 +08:00
a3666aa87e [feature](decimal) support decimal256 when creating table (#26308) 2023-11-08 15:21:01 +08:00
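A sketch of using it, assuming the feature is gated by the `enable_decimal256` session variable:

```
-- decimal256 raises the max precision to 76 digits (session variable assumed):
SET enable_decimal256 = true;
CREATE TABLE t_dec256 (k1 INT, v1 DECIMAL(76, 10))
DISTRIBUTED BY HASH(k1) BUCKETS 1
PROPERTIES ('replication_num' = '1');
```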
47ba4aaf30 [Enhancement](load) add timer and partitions number limit (#26549)
2023-11-08 11:22:40 +08:00
290070074a [refactor](stats) refactor collection logic and opt some config (#26163)
1. no longer collect partition stats
2. merge the inserts of stats
3. delete the period collector since it is useless
4. remove enable_auto_sample
5. move some stats-related configs to global session variables

Before this PR, when analyzing a table, the insert count equaled the column count times 2.

After this PR, the insert count of analyzing a table is reduced to column count / insert_merge_item_count.

According to my test, when analyzing tpch lineitem, the insert sql count is 1.
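A sketch of what that means in practice, assuming the TPC-H schema is loaded:

```
-- With insert merging, this issues a single merged stats insert rather than
-- roughly column_count * 2 separate ones (lineitem has 16 columns):
ANALYZE TABLE lineitem;
```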
2023-11-08 11:03:44 +08:00
1a83a39aec Revert "[fix](auto-partition) Fix auto partition concurrent conflict (#26166)" (#26448)
This reverts commit f22611769944e78c28f1b0a1eeb7b7414a16e8db.
2023-11-06 16:39:19 +08:00
f226117699 [fix](auto-partition) Fix auto partition concurrent conflict (#26166) 2023-11-06 10:34:26 +08:00
6ef2e62600 [FIX](struct) fix core dump when reading nested struct on BE (#26270) 2023-11-03 09:55:39 -05:00
9243de1898 [fix](planner) add IP type to compatibilityMatrix (#26095) 2023-11-03 01:19:22 -05:00
93a934e775 [Improve](map) support map cast with map literal and implicit nested scalar cast (#26126) 2023-11-02 09:56:42 -05:00
a5ef90dacc [enhancement](recover) support skipping missing version in select by session variable (#25654) 2023-11-02 20:01:51 +08:00
89cf828f03 [enhancement](udf) add the switch enable_java_udf to control the creation and use of java_udf. (#26213) 2023-11-02 19:33:17 +08:00
dd8bcc831c [keyword](decimalv2) Add DecimalV2 keyword (#26283) 2023-11-02 16:27:12 +08:00
1770224322 [fix](planner) Fix decimal precision and scale wrong when create table like (#25802)
Using a field datatype such as decimal(10, 0) with CREATE TABLE LIKE: because the scale is 0, the precision and scale were lost once CREATE TABLE LIKE finished. This fixes the bug.

**Before the fix, create a table with the following SQL**:

```
CREATE TABLE IF NOT EXISTS db_test.table_test
(
    `name` varchar COMMENT "1m size",
    `id` SMALLINT COMMENT "[-32768, 32767]",
    `timestamp0` decimal null comment "c0",
    `timestamp1` decimal(38, 0) null comment "c1"
)
DISTRIBUTED BY HASH(`id`) BUCKETS 1
PROPERTIES ('replication_num' = '1');
```

**and then run**:

```
CREATE TABLE db_test.table_test_like LIKE db_test.table_test;
SHOW CREATE TABLE db_test.table_test_like;
```

The field `timestamp1` comes out as decimal(9, 0), which is wrong. This fixes it.
2023-11-01 03:11:43 -05:00
ac7678926f [improvement](tablet clone) add config for decommission tablet wait time (#26000) 2023-10-31 20:36:26 +08:00
111b8e2b4f [minor](planner) remove len on integral type toSql (#25951) 2023-10-31 02:37:08 -05:00
6eb2cb2d48 [date](parser) Support DateV1 keyword (#25414) 2023-10-30 18:39:22 +08:00
c32cd5b25a FE SSL certificates are of various formats #26039 #26044 2023-10-28 22:37:38 +08:00
2464a22ac5 [fix](meta) fix follower sync image timeout after checkpoint. (#26003)
The image file of our cluster reached 2.3 GB. After a checkpoint, Followers timed out synchronizing the image, resulting in continuous growth of the bdb directory.

related pr: #25768
2023-10-27 16:58:42 +08:00
c1d64a7128 [Feature](datatype) Add IPv4/v6 data type for doris (#24965) 2023-10-26 17:33:28 +08:00
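A sketch of using the new types (table name is a placeholder):

```
-- Store addresses natively instead of as strings:
CREATE TABLE t_ip (id INT, ip4 IPV4, ip6 IPV6)
DISTRIBUTED BY HASH(id) BUCKETS 1
PROPERTIES ('replication_num' = '1');
INSERT INTO t_ip VALUES (1, '127.0.0.1', '::1');
```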