Commit Graph

18263 Commits

Author SHA1 Message Date
3a752b758a [fix](Nereids) colcoate node attr lost after merge fragment (#30818) 2024-02-05 21:58:08 +08:00
255ca143f8 [fix](chinese) fix the issue where the be crashes due to the missing chinese dict (#30712) 2024-02-05 21:57:29 +08:00
a5d9004974 [fix](Nereids) physical property deriver on some node is not right (#30819) 2024-02-05 21:57:29 +08:00
fc762f426b [enhance](mtmv) mtmv disable hive auto refresh (#30775)
- If the `related table` is `hive`, do not refresh automatically
- If the `related table` is `hive`, the partition col is allowed to be `null`. Otherwise, it must be `not null`
- add more `ut`
2024-02-05 21:56:57 +08:00
8ff8d94697 [fix](ip) change IPv6 to little-endian byte order storage (like IPv4) (#30730) 2024-02-05 21:56:57 +08:00
cd939fcca2 [Enhancement](group commit) Optimize group commit block sink wal disk space log #30811 2024-02-05 21:56:57 +08:00
d1bb63ed67 [fix](arrow-flight) Modify FE Arrow version to 15.0.0 #30824 2024-02-05 21:56:57 +08:00
48aaaa8005 [Enhancement](fuction) change function REPEAT nullable mode (#30743) 2024-02-04 22:21:36 +08:00
aed858a442 [improve](log) print query_id when fold constant on BE (#30802) 2024-02-04 22:21:36 +08:00
88ff9c06cf [test](mtmv)fix table name duplicate (#30808) 2024-02-04 22:21:36 +08:00
27f65f4463 [Feature](executor)Stream load support workload group (#30763)
* Stream load support workload group

* skip mysql load
2024-02-04 22:21:36 +08:00
25f6a733fe [fix](stats) keep threads in pool alive to maintain reasonable parallelism (#30451) 2024-02-04 22:21:16 +08:00
d32292b292 [regression-test][conf] add master_sync_policy = WRITE_NO_SYNC replica_sync_policy = WRITE_NO_SYNC (#30494)
There is no power off scene in regression-test, so add these two configure has no side-effect.
2024-02-04 22:21:16 +08:00
ccbcf879b5 [test](mtmv) Add materialized view availability regression test (#30769)
Add materialized view availability regression test

when mv refresh_time is in the grace_period(unit is second), materialized view will be use to
query rewrite regardless of the base table is update or not
when mv refresh_time is out of the grace_period(unit is second), will check the base table is update or not
if update the materialized view will not be used to query rewrite
2024-02-04 22:21:16 +08:00
9e76592297 Support analyze materialized view. (#30540) 2024-02-04 22:21:16 +08:00
e891a095e7 check segment num when build rowset meta (#30803) 2024-02-04 18:15:12 +08:00
91a669f5fd [chore](mac compile) remove using regex to avoid mac compile failed frequently #30783 2024-02-04 14:28:38 +08:00
Pxl
1d39e16eda [Bug](compaction) pass arena to function->add_batch_range (#30709) 2024-02-04 14:28:38 +08:00
121d52dd37 [test](mtmv) Add mtmv basic one and two dimensional test cases (#30651) 2024-02-04 14:28:38 +08:00
383850ef12 [Opt](multi-catalog) Opt split assignment to resolve uneven distribution. (#30390)
[Opt] (multi-catalog) Opt split assignment to resolve uneven distribution. Currently only for `FileQueryScanNode`.

Referring to the implementation of Trino, 
- Local node soft affinity optimization. Prefer local replication node.
- Remote split will use the consistent hash algorithm is used when the file cache is turned on, and because of the possible unevenness of the consistent hash, the split is re-adjusted so that the maximum and minimum split numbers of hosts differ by at most `max_split_num_variance` split.
- Remote split will use the round-robin algorithm is used when the file cache is turned off.
2024-02-04 14:28:38 +08:00
b275cb0f44 [feature](mtmv) mtmv support workload group (#29595)
MTMV supports controlling the resource usage of refresh tasks by setting the name of workload group
about workload group : https://doris.apache.org/zh-CN/docs/dev/admin-manual/workload-group
2024-02-04 14:28:38 +08:00
e10defeaba [enhancement](plubin)support json format and other options in logstash doris output plugin (#27318) 2024-02-04 14:28:38 +08:00
6442663735 [Function](exec) upport atan2 math function (#30672)
Co-authored-by: Rohit Satardekar <rohitrs1983@gmail.com>
2024-02-04 14:28:38 +08:00
36b2712709 [chore](Nereids) turn on nereids dml when update to 2.1 (#30776) 2024-02-04 14:28:38 +08:00
c9ab243153 [feat-wip](join) support mark join for right semi join(without mark join conjunct) (#30767) 2024-02-04 14:28:38 +08:00
3cc409b14f [bug](function) fix date_sub function failed when arg type is datev2 (#30443)
* [bug](function) fix date_sub function failed when arg type is datev2

* update
2024-02-04 14:28:38 +08:00
d749fc3d27 [improvement](binlog) Change BinlogConfig default TTL_SECONDS to 86400 (1day) (#30771)
* Change BinlogConfig default TTL_SECONDS to 86400 (1day)

Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>

* Fix binlog.ttl_seconds in regression test

Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>

---------

Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2024-02-04 14:28:38 +08:00
5aed3abb8a [Fix](Nereids) Fix rewrite by materialized view fail when join input has agg (#30734)
materialized view definition is as following, and the query sql is the same
when outer group by use the col1 in the inner group, which can be rewritten by materialized view

select
t1.o_orderdate,
t1.o_orderkey,
t1.col1
from
(
select
o_orderkey,
o_custkey,
o_orderstatus,
o_orderdate,
sum(o_shippriority) as col1
from
orders
group by
o_orderkey,
o_custkey,
o_orderstatus,
o_orderdate
) as t1
left join lineitem on lineitem.l_orderkey = t1.o_orderkey
group by
t1.o_orderdate,
t1.o_orderkey,
t1.col1
2024-02-03 20:27:04 +08:00
119615dc50 [Fix](parquet-reader) Fix decimal test case out files. (#30716) 2024-02-03 20:26:52 +08:00
6289f7e605 [Fix](multi-catalog) Fix truncate_char_or_varchar_column crash. (#30731) 2024-02-03 20:26:04 +08:00
d33f3be436 [Doc](auto-inc) Add doc for auto increment start value (#30741) 2024-02-03 20:26:04 +08:00
4f0414d13e [fix](Nereids) date >= simplify to > by mistake (#30765) 2024-02-03 20:26:04 +08:00
f018137500 [doc](jdbc catalog) add connection pool doc (#30641) 2024-02-03 20:26:04 +08:00
d99bb51d36 [fix](legacy-planner) fixed loss of BetweenPredicate rewrite on reanalyze in legacy planner (29798) (#30328) 2024-02-03 20:26:04 +08:00
a3a73162e5 [Fix](Job)Fix One-Time type JOB parameter verification error (#30779) 2024-02-03 20:26:04 +08:00
0c7c8240ee [Fix](ser) serialize bitmap/hll some not fixed length object maybe over INT_MAX (#30773) 2024-02-03 20:26:04 +08:00
8a0ea4b651 [enhancement](Nereids): datetime support microsecond overflow (#30744) 2024-02-03 20:26:04 +08:00
151735748b [test](Nereids): add push_down_distinct_through_join regression test (#30760) 2024-02-03 20:26:03 +08:00
24433710c8 [fix](regression) Make catalog recycle bin regression stable (#30736) 2024-02-03 20:26:03 +08:00
4f8730d092 [improvement](jdbc catalog) Optimize connection pool parameter settings (#30588)
This PR makes the following changes to the connection pool of JDBC Catalog
1. Set the maximum connection survival time, the default is 30 minutes

-   Moreover, one-half of the maximum survival time is the recyclable time,
-   One-tenth is the check interval for recycling connections

2. Keepalive only takes effect on the connection pool on BE, and will be activated based on one-fifth of the maximum survival time.
3. The maximum number of existing connections is changed from 100 to 10
4. Add the connection cache recycling thread on BE, and add a parameter to control the recycling time, the default is 28800 (8 hours)
5. Add CatalogID to the key of the connection pool cache to achieve better isolation, requires refresh catalog to take effect
6. Upgrade druid connection pool to version 1.2.20
7. Added JdbcResource's setting of default parameters when upgrading the FE version to avoid errors due to unset parameters.
2024-02-03 20:26:03 +08:00
ac681e8e8c [ehmancement](binlog) Add show proc '/binlog' impl (#30770)
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2024-02-03 20:26:03 +08:00
e413dbec91 [fix](nereids)need substitute agg function using agg node's output if it's in order by key (#30704) 2024-02-03 20:25:25 +08:00
Pxl
5687ca977d [Bug](java-udf) fix core dump when javaudf input 0 row block (#30720)
fix core dump when javaudf input 0 row block
2024-02-03 20:25:25 +08:00
bc8c1875ac [enhencement](config) change default write buffer size (#30749) 2024-02-03 20:25:25 +08:00
79a83e9e1a [fix](move-memtable) use signed integer when calculating remain ms (#30750) 2024-02-03 20:25:25 +08:00
0e2d2f6169 [Bug](pipeline) fix pipeline task execute without wait second start rpc (#30659) 2024-02-03 20:24:58 +08:00
286f397017 [chore](ci) revert "try to set 'performance (Doris Performance)' to required #30752" (#30755)
* Update .asf.yaml
2024-02-03 20:24:39 +08:00
fd000dbdb3 [chore](ci) try to set 'performance (Doris Performance)' to required (#30752) 2024-02-03 20:24:39 +08:00
Pxl
0f47f7f389 [Feature](runtime filter) normalize ignore runtime filter (#30152)
normalize ignore runtime filter
2024-02-03 20:24:39 +08:00
e5bdc369e2 [runtimefilter](nereids)push down RF into cte producer (#30568)
* push down Rf into CTE
2024-02-03 20:24:39 +08:00