Commit Graph

5614 Commits

Author SHA1 Message Date
00bb099fdd [hotfix](jdbc catalog) fix load table and column names npe (#28865)
This fix is for the npe situation that occurs when FE of non-Master nodes initializes Jdbc Catalog metadata.
2023-12-26 14:11:14 +08:00
509cfea99a [feature](Load)(step2)support nereids load job schedule (#26356)
We will  Integrate new load job manager into  new job scheduling framework so that the insert into task can be scheduled after the broker load sql  is converted to insert into TVF(table value function) sql.

issue: https://github.com/apache/doris/issues/24221

Now support:
1. load data by tvf insert into sql, but just for simple load(columns need to be defined in the table)
2. show load stmt
- job id, label name, job state, time info
- simple progress
3. cancel load from db
4. support that enable new load through Config.enable_nereids_load
5. can replay job after restarting doris

TODO:
- support partition insert job
- support show statistics from BE
- support multiple task and collect task statistic
- support transactional task
- need add ut case
2023-12-26 12:29:05 +08:00
b8fd55b0cf [feature](Nereids) support table valued function http_stream (#29004) 2023-12-26 11:47:04 +08:00
065eb9a72b [feature](nereids)support partition property in nereids (#28982) 2023-12-26 11:19:04 +08:00
7434de9ed8 [improvement](nereids) Get partition related table disable nullable field and complete agg matched pattern mv rules. (#28973)
* [improvement] (nereids) Get partition related table disable nullable field and modify regression test, complete agg mv rules.

* make filed not null to create partition mv
2023-12-26 00:29:42 +08:00
2aea47c0a9 [fix](mtmv)add log for resolve pending task (#28999)
* add lock for resolve pending task

* add lock for resolve pending task
2023-12-26 00:29:28 +08:00
17f3ca7349 [fix](planner)should save original select list item before analyze (#28187)
* [fix](planner)should save original select list item before analyze

* fix test case

* fix failed case
2023-12-25 23:06:45 +08:00
c2c5df9341 [opt](assert_num_rows) support filter in AssertNumRows operator and fix some explain (#28935)
* NEED

* Update pipeline x

* fix pipelinex compile
2023-12-25 22:47:23 +08:00
2dc7d82aff [Chore](Job)print log before task execute (#28962) 2023-12-25 16:55:59 +08:00
17917a0adb [feature](load) enable memtable on sink node by default (#28963) 2023-12-25 16:27:16 +08:00
29d3d5e433 [Test](Job)Add test case (#28481) 2023-12-25 15:12:14 +08:00
1d984e0ebb return residual expr of join (#28760) 2023-12-25 12:53:14 +08:00
e9e1e2894b [performance](variant) support topn 2phase read for variant column (#28318)
[performance](variant) support topn 2phase read for variant column
2023-12-25 11:50:41 +08:00
48f8f8f2ad [fix](nereids)group by expr may be bound twice in bind agg slot (#28771) 2023-12-25 11:23:44 +08:00
c53611dcb3 [feature](nereids)support decimalv2 (#28726) 2023-12-25 11:23:27 +08:00
99755920e2 [fix](nereids)subquery unnest need handle subquery in Not expr correnctly (#28713) 2023-12-25 11:23:00 +08:00
b49671b31e [feature](api) add profile text api (#28697) 2023-12-25 10:17:09 +08:00
d8cb4da73d [chore](prompt) Optimize tablet and replica prompt by pointing out what the numbers mean (#28925) 2023-12-24 22:29:24 +08:00
2f7deb3dea [enhancement](bulk-load) cancel loading tasks directly without retrying when timeout exceeded (#28666) 2023-12-24 21:57:31 +08:00
1e44a4b145 [opt](nereids) convert or to inpredicate and optimize inpredicate partition prune (#28316) 2023-12-24 21:42:03 +08:00
145683ccdb [improvement](group commit) make get column function more reliable when replaying wal (#28900) 2023-12-24 21:17:39 +08:00
b8de5cfc36 [fix] (nereids) Catch exception when mv fail and fix the npe (#28932) 2023-12-24 20:41:24 +08:00
6ea6ff5dee [fix](nereids) Fix query mv rewrite fail when mv cache build quickly (#28876) 2023-12-24 09:00:49 +08:00
5505fa3755 [Fix](statistics) Fix partition name NPE and sample for all table during auto analyze (#28916)
Fix partition name NPE and sample for all table during auto analyze.
Sample for all tables because getData may have latency, which may cause full analyze a huge table and use too much resource. Sample for all tables to avoid this. Will improve the strategy later.
2023-12-24 01:49:49 +08:00
13a3550d8e [nereids] fix join fd computing bug (#28849) 2023-12-24 01:41:43 +08:00
37777dcdb5 [fix](mtmv) Related partition exclude null generate column when increment build materialized view (#28855)
Infer partition column by materialized view partition column, exclude null generate column in join when increment build materialized view
2023-12-24 01:37:55 +08:00
1d8822b2b7 [fix](paimon)fix like predicate (#28803)
fix like predict
2023-12-23 22:25:55 +08:00
66b14f4db1 [fix](mtmv)fix can not create mtmv all use default value (#28922) 2023-12-23 21:27:01 +08:00
0a1d9f4cbc [feature](mtmv)add more test case1 (#28910) 2023-12-23 14:39:44 +08:00
623257d02b [feature](mtmv)MTMV pause and resume (#28887)
- PAUSE MATERIALIZED VIEW JOB ON mv1
- RESUME MATERIALIZED VIEW JOB ON mv1
- fix when drop db,not drop job
- add lock for one materialized view can only run one task at a time
2023-12-23 14:30:54 +08:00
3d2b4ae244 [fix](mtmv) fix failed to specify the number of buckets when bucket auto (#28854)
Issue Number: close #xxx

- fix failed to specify the number of buckets when bucket auto
- delete unused SessionVariable
- if mtmv used external table ,check `isMaterializedViewRewriteEnableContainForeignTable`
2023-12-23 09:26:16 +08:00
ae64afd112 [fix](pipelineX) fix cannot runtime obtain profile on pipelineX #28795 2023-12-23 08:24:04 +08:00
e0bb7fa935 [exec](load) change default parallel num from 1 to 8 in no pipeline exec engine (#28864) 2023-12-23 00:05:44 +08:00
966766f3b7 [enhancement](broker-load) fix-move-memtable-session-var-for-s3 (#28894) 2023-12-22 23:25:06 +08:00
3eec62e1d1 [fix](multi-catalog)filter impala generated path (#28786)
file impala generated dir _imapala_insert_staging
2023-12-22 21:58:50 +08:00
fa0ad56817 [exec](compress) use FragmentTransmissionCompressionCodec control the exchange compress behavior (#28818) 2023-12-22 19:50:57 +08:00
b2b209e938 [feature](Nereids) Pull up join from union all (#28682) 2023-12-22 19:35:20 +08:00
3ed82bcee2 [Feature](inverted index) add lowercase option for inverted index analyzer (#28704) 2023-12-22 18:22:44 +08:00
cb61a0727f [bug](sharedscan) Fix shared scan bug (#28841) 2023-12-22 15:13:14 +08:00
7f310cec9c [opt](sessionVar)show changed sessoin var first #28840
“show variables” command list changed vars before not changed vars,
2023-12-22 14:45:33 +08:00
007f498f3b (enhance)(InternalQuery) Support to collect profile for intenal query (#28762) 2023-12-22 14:03:48 +08:00
fb52e111c5 [Enhancement](job) No need to query some backends which are not alive. (#28608)
No need to execute some jobs if backend is not alive
2023-12-22 13:14:30 +08:00
dc786114a2 [Fix](multi-catalog) skip hms events if hms table is not supported. (#28644)
Co-authored-by: wangxiangyu <wangxiangyu@360shuke.com>
2023-12-22 13:07:58 +08:00
e4c7d4d03a [chore](error msg) print type info when colocate with ddl failed due to type mismatch (#28773) 2023-12-22 12:48:20 +08:00
012e66729a [improvement](executor) Add tvf and regression test for Workload Scheduler (#28733)
1 Add select workload schedule policy tvf
2 Add reg test
2023-12-22 12:09:51 +08:00
13ccfa06a7 [Feature](Variant) Implement variant new sub column access method (#28484)
* [Feature](Variant) Implement variant new sub column access method

The query SELECT v["a"]["b"] from simple_var WHERE cast(v["a"]["b"] as int) = 1 encompasses three primary testing scenarios:

```
1. A basic test involving the variant data type.
2. A scenario dealing with GitHub event data in the context of a variant.
3. A case related to the TPC-H benchmark using a variant.
```
2023-12-22 11:59:37 +08:00
f6b6180462 [Fix](Variant) fix variant predicate rewrite OrToIn with wrong plan (#28695)
using the name without paths info will lead to wrong In plan, e.g.
```
where cast(v:a as text) = 'hello' or cast(v:b as text) = 'world'
```
will be rewrite to:
```
where cast(v as text) in ('hello', 'world')
``
This is wrong, because they are different slots
2023-12-22 11:51:36 +08:00
0af6bd6390 [fix](group-commit) check if wal need recovery is abnormal (#28769) 2023-12-22 11:06:11 +08:00
c1457f97a4 [improvement](external catalog)Change log level to debug for getRowCountFromExternalSource. (#28801) 2023-12-21 22:09:50 +08:00
65e3da255f [enhancement](audit-log) add status code and error msg to audit log for proxy stmt (#27607) 2023-12-21 21:16:36 +08:00