Commit Graph

8289 Commits

Author SHA1 Message Date
cd6c61347d [Feature](tvf)(avro-jni) avro-jni add projection push down (#26885) 2023-11-27 10:33:27 +08:00
baadc14e60 [Enhancement](function) support unix_timestamp with float (#26827)
---------

Co-authored-by: YangWithU <plzw8@outlook.com>
2023-11-27 09:58:53 +08:00
3791de3cfa [feature](mtmv)(6)implement cancel method (#27541)
1.implement cancel task method
2.fix `show create table ` not display `comment`
2023-11-27 09:49:46 +08:00
04033dce01 [improve](routine_load) add db and table name in create routine load job log (#27500) 2023-11-25 23:41:34 +08:00
b08865982a [improvement](disk balance) Impr disk rebalancer sched when be load r… (#26412) 2023-11-25 23:35:16 +08:00
ff1a06abcf [test](regression) add routine load sequence and error test (#27519) 2023-11-25 23:30:20 +08:00
7a1b4267d5 [improve](transaction) lightweight transaction lock logic (#27564) 2023-11-25 23:24:36 +08:00
d4f2db74f9 [fix](nereids) fix stats error when using dateTime type filter (#27571)
Currently doris doesn't support datetime type filter stats estimation, but only for date type.
It will cause the filter using datetime type column with the same date and different time computing out a inaccurate selectivity and estimate a wrong row count, such as :

where o.book_time >= '2020-03-01 00:00:00.0' and o.book_time <= '2020-03-01 23:59:59.0';

This pr adds the datetime type(only support hh:mm:ss scale) filter estimation and improve the row count estimation for the above case.
2023-11-25 11:04:51 +08:00
7134da63ac [improvement](nereids) add config for nereids trace dir (#27495)
Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-11-25 10:37:23 +08:00
cc395f5428 [Fix](hive-transactional-table) Fix NPE when query empty hive transactional table. (#27563) 2023-11-25 10:29:39 +08:00
6b1428dba1 Fix auto analyze doesn't filter unsupported type bug. (#27559)
Fix auto analyze doesn't filter unsupported type bug.
Catch throwable in auto analyze thread for each database, otherwise the thread will quit when one database failed to create jobs and all other databases will not get analyzed.
change FE config item full_auto_analyze_simultaneously_running_task_num to auto_analyze_simultaneously_running_task_num
2023-11-25 10:22:52 +08:00
6142a539f4 [deprecated](external) remove deprecated hudi and iceberg external table (#27456)
The creation of hudi and iceberg table is disallowed since v1.2.
All these features are covered by hudi/iceberg catalog.
We should remove the code in v2.1

The PR mainly changes:
1. remove the code of hudi/iceberg external table.
2. remove code of iceberg database.
3. disallowed hive external table's creation.
4. disabled odbc,mysql,broker external table by default, and add FE config `disable_odbc_mysql_broker_table` to control it
2023-11-24 23:18:07 +08:00
59efb1b238 [fix](planner)join node should output required slot from parent node #27526 2023-11-24 21:50:32 +08:00
eb250bca5c [fix](set) fix error message when set a session only variable (#27554) 2023-11-24 21:48:16 +08:00
d98d2d491f [chore](tablet scheduler) update sched ctx err (#27514) 2023-11-24 18:09:49 +08:00
d769fb69dd NEED (#27531) 2023-11-24 18:04:36 +08:00
553e4a8903 [feature-wip](merge-on-write) MOW table support different primary keys and sort keys (#24788) 2023-11-24 16:37:30 +08:00
Pxl
33b540641e [Bug](materialized-view) add limitation for duplicate expr on materialized view (#27523)
add limitation for duplicate expr on materialized view
2023-11-24 15:47:14 +08:00
674dd98fb2 [fix](Nereids): CustomRewriteJob don't handle condition rule return null (#27516) 2023-11-24 14:38:37 +08:00
540132f656 [fix](profilev2) fix merge profile min is zero #27524 2023-11-24 14:11:11 +08:00
dbff9d7a89 [chore](fuzzy) topn_opt_limit_threshold (#27496) 2023-11-24 14:08:27 +08:00
dbbab63623 [fix](nereids)keep cast operator if cast a varchar to another longer varchar in LogicalSetOperator (#27393) 2023-11-24 14:07:04 +08:00
dfe3a2dd01 [feature](mtmv)(3)Implementing multi table materialized views (#26146)
Introduction to Main Classes:
- MTMVService:MTMV services for other modules to call
- MTMVHookService:All operations that affect the MTMV
  - MTMVJobManager:All operations that affect the MTMV job
  - MTMVCacheManager:All operations that affect the MTMV Cache
- MTMVTask&MTMVJob:Inherit from job framework
2023-11-24 12:34:38 +08:00
c24a33c857 [enhancement](audit) hide password and other sensitive information in log and audit log (#27115)
Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-11-24 10:27:30 +08:00
17ca75f834 [chore](Nereids): add eager aggregate into rules (#27505)
Add `Eager Aggregate` rules into Rewrite rules.
2023-11-24 10:06:04 +08:00
8e74470db9 [fix](statistics)Fix auto analyze remove finished job bug (#27486)
Finished job must be removed from the job list, otherwise the next batch of jobs will not be scheduled.
2023-11-23 23:22:02 +08:00
eb878ad0d2 [fix](Export) add feut for Cancel Export (#27178) 2023-11-23 23:18:30 +08:00
540bce4d1b [typo](log) Let env lock msg more distinct (#27493) 2023-11-23 23:03:06 +08:00
d73b945535 [chore](Nereids): rename pushdown to push_down (#27473) 2023-11-23 21:04:40 +08:00
d04a2de3cc [fix](hms) fix compatibility issue of hive metastore client (#27327)
For hive version lower than 2.3.7, there is no enum ClientCapability.INSERT_ONLY_TABLES.
So if we send this enum to the server side, the server side will get a null,
and this will cause some undefined behavior, eg, failed to get tables infos from hms.
2023-11-23 19:42:46 +08:00
2ea33518b0 [Opt](load) use batching to optimize auto partition (#26915)
use batching to optimize auto partition
2023-11-23 19:12:28 +08:00
511eedb4ff [fix](nereids)select base index if mv's data type is different from base table (#27387)
normally, mv column's data type should be same as base table. This pr plays as a fail-safe, if mv column's data type is different from base table accidentally, fall back to select base table to make the query works.
2023-11-23 18:41:59 +08:00
d9f6e51884 [fix](planner)output slot should be materialized as intermediate slot in agg node (#27282) 2023-11-23 18:41:08 +08:00
1555b11035 [fix](nereids)remove literal partition by and order by expression in window function (#26899) 2023-11-23 18:40:51 +08:00
2ec3395087 [fix](planner)the data type should be the same between input slot and sort slot (#27137) 2023-11-23 18:40:02 +08:00
772f181e94 [fix](stats) Fix thread leaks when doing checkpoint (#27334) 2023-11-23 03:18:19 -06:00
4b22fc14d5 [Feature](update) Support update on current_timestamp (#25884) 2023-11-23 16:23:31 +08:00
5d9c555dcf [minor](stats) Fix potential npe when loading stats (#27200)
Besides, lower down the log level of loading stats from warning to debug, since it doesn't such matter for workflow.
2023-11-23 01:37:58 -06:00
8e3b4e99d9 [improve](move-memtable) add switch for stream load in fe.conf (#27440) 2023-11-23 15:11:17 +08:00
c884e46e6c [regression test](routine test) add case for desired_concurrent_number (#27372) 2023-11-23 15:11:01 +08:00
97932d0381 [fix](export) the label of export should be unique with database scope (#27401)
### How to reproduce
1. create a database db1 and a table tbl1;
2. insert some data and export with label L1;
3. drop the db1 and tbl1, and recreate them with same name.
4. insert some data and export with same label L1;

Expect: export success
Actual: error: Label L1 have already been used.

This PR fix it.
2023-11-23 14:30:57 +08:00
93cfdffb75 [regression test](routine test) add case for exec_mem_limit (#27308) 2023-11-23 14:25:54 +08:00
dbbed113cf [feature](mtmv)(4)MTMV extends Olap (#26645) 2023-11-23 14:10:36 +08:00
33de92cc61 [improve](nereids) support agg function of count(const value) pushdown #26677
support sql: select count(1)-count(not null) from table, the agg of count could push down.
2023-11-23 11:26:06 +08:00
5b8aaf96d2 [fix](planner)scan node should project all required expr from parent node (#26886) 2023-11-23 09:44:21 +08:00
044a295541 [performance](Planner): optimize getStringValue() in DateLiteral (#27363)
- reduce cost of `getStringValue()` 
- original code don't consider `microsecond` part in `getStringValue()`
2023-11-22 22:42:44 +08:00
19c36dcc86 [Fix](statistics)Fix auto job start time incorrect bug (#27402)
Before, the auto analyze job start time was the job creation time, not the start to execute time, which is inaccurate. This pr is to change the start time to the first task start to execute time.
2023-11-22 21:38:08 +08:00
0302a9d026 [fix](fe) slots in having clause should be set to need materialized (#27412) 2023-11-22 19:47:09 +08:00
c7e3d74ffc [minor](stats) Report error with more friendly meesage when timeout (#27197) 2023-11-22 04:50:09 -06:00
cfb6af295f [fix](stats) Fix creating too many tasks on new env #27364
If there exists huge datasets with many database and may tables and many columns, Auto collector might be submit too many jobs which would occupy too much of FE memory.

In this PR, limit job each round could submit up to 5
2023-11-22 16:53:31 +08:00