Commit Graph

18429 Commits

Author SHA1 Message Date
3cb0deae9c [opt](ranger) modify and enhance the feature of ranger access controller (#34392) (#34426)
bp #34392
2024-05-06 17:08:47 +08:00
ab5ee81811 [fix](memory) Fix page cache memory tracker consumption in prune (#34320) 2024-05-06 12:53:11 +08:00
7ae5de316b [feature](Nereids) support set and use statement syntax only (#33979) (#34409)
pick from master #33979
commit id 65fb7d43b7e838c48502d4e8a69e2541dc73aa88

This PR:
1. add a new Command type: UnsupportedCommand to handle the statement only support parse but could not execute.
2. support syntax about set and use
3. add keyword VAULT to follow legacy planner

TODO
1. support all statment syntax in Nereids
2024-05-06 11:36:01 +08:00
d4c2f70673 [fix](regression_test) failed in distinct_streaming_agg (#34404) 2024-05-06 10:11:39 +08:00
85ae773996 [fix](spill) incorrect revocable mem size of hash join (#34379) 2024-05-06 06:53:12 +08:00
7248420cfd [chore](session_variable) Add 'data_queue_max_blocks' to prevent the DataQueue from occupying too much memory. (#34017) (#34395) 2024-05-05 21:20:33 +08:00
c3096cabe2 [Fix](executor)normal group not auth #34377 2024-05-02 15:17:19 +08:00
Pxl
0d106fe4c2 [Bug](runtime-filter) release rf count dependency when query canceled (#34367)
* release rf count dependency when query canceled

* update

* update
2024-05-02 09:56:17 +08:00
8da260ee0d [fix](hdfs)read 'fs.defaultFS' from core-site.xml for hdfs load which has no default fs (#34217) (#34372)
bp #34217
Co-authored-by: slothever <18522955+wsjz@users.noreply.github.com>
2024-05-01 00:31:49 +08:00
581e168ee1 [Fix](executor)Fix normal workload group alter may failed #34356 2024-04-30 22:17:40 +08:00
5fc1f11cf1 [improvement](hive)add the queryid to the temporary file path (#34278) (#34368)
bp #34278

Co-authored-by: wuwenchi <wuwenchihdu@hotmail.com>
2024-04-30 22:00:05 +08:00
8abd136ba2 [Improvement](executor)Refactor Workload group memory GC (#33797)
* just gc group's overcommit query when minor gc

* add process usage
2024-04-30 19:34:31 +08:00
50855f01c7 [fix](nereids) when runtimefilter target is null, skip the rf #34358 2024-04-30 18:48:50 +08:00
35f8563a75 [feature](iceberg) support iceberg equality delete (#34223) (#34327)
bp #34223

Co-authored-by: Ashin Gau <AshinGau@users.noreply.github.com>
2024-04-30 11:51:29 +08:00
7d77fd0286 [fix](profile) Fix reporting the profile while building the pipeline profile. (#34215) (#34326) 2024-04-30 11:38:03 +08:00
843c89f109 [fix] fix nullptr when clear cache due to move (#34323) 2024-04-30 09:52:41 +08:00
6f873c5907 [improvement](join) Avoid merging blocks more than once on the build side (#34291) 2024-04-30 08:37:53 +08:00
53c06ad9d2 [fix](spill) handel canceled status in spill (#34268) 2024-04-30 08:35:52 +08:00
75470ede1a [fix](test) Fix some testcases #34203 2024-04-30 08:35:03 +08:00
b15fc2a906 [Cherry-pick](branch-2.1) Pick #34043 and #34112 (#34318)
* [Enhancement](full compaction) Add run status support for full compaction (#34043)

* The usage is `curl http://{ip}:{host}/api/compaction/run_status?tablet_id={tablet_id}`
e.g. `curl http://127.0.0.1:8040/api/compaction/run_status?tablet_id=10084`

If full compaction is running, the output will be
```
{
"status" : "Success",
"run_status" : true,
"msg" : "compaction task for this tablet is running",
"tablet_id" : 10084,
"compact_type" : "full"
}
```
else the ouput will be
```
{
"status" : "Success",
"run_status" : false,
"msg" : "compaction task for this tablet is not running",
"tablet_id" : 10084,
"compact_type" : "full"
}
```

* 2

* 2

* [Fix](partial update) Fix rowset not found error when doing partial update (#34112)

Cause: In the logic of partial column updates, the existing data columns are read first, and then the data is supplemented and written back. During the reading process, initialization involves initially fetching rowset IDs, and the actual rowset object is fetched only when needed later. However, between fetching the rowset IDs and the rowset object, compaction may occur, turning the old rowset into a stale rowset. If too much time passes, the stale rowset might be directly deleted. Thus, when the rowset object is needed for an update, it cannot be found. Although the update operation with partial column logic should be able to read all keys and should not encounter new keys, if the rowset disappears, the Backend (BE) will consider these keys as missing. Consequently, it will check whether other columns have default values or are nullable. If this check fails, the aforementioned error is thrown.

Solution: To avoid such issues during partial column updates, the initialization step should involve fetching both the rowset IDs and the shared pointer to the rowset object simultaneously. This ensures that the rowset can always be found during data retrieval.
2024-04-30 07:26:23 +08:00
d1df0b8878 [fix](mtmv)Solving the problem of calling each other in toString() loops (#34277) (#34317)
bp #34277
2024-04-29 21:46:29 +08:00
f76bf66903 [chore](third-party) Fix compilation errors on arm platform (#34261) 2024-04-29 21:26:56 +08:00
996222c30e [fix](test) let test_ntile_function happy with Nereids (#34294) 2024-04-29 20:59:48 +08:00
a173513e27 [fix](pipelinex) exchange sink not set ready when source limit #34241 2024-04-29 20:58:50 +08:00
3495ed58e0 [Enhancement](jdbc catalog) Change Jdbc connection pool to hikari (#34045) (#34310) 2024-04-29 20:22:48 +08:00
7cb00a8e54 [Feature](hive-writer) Implements s3 file committer. (#34307)
Backport #33937.
2024-04-29 19:56:49 +08:00
1bfe0f0393 [feature](iceberg)support read iceberg complex type,iceberg.orc format and position delete. (#33935) (#34256)
master #33935
2024-04-29 14:40:12 +08:00
9b7e007ef6 [Bug](union) fix union operator set eos is not incorrect (#34250)
* [test](case) fix unstable case without order by distinct row

* [Bug](union) fix union operator set eos is not incorrect
2024-04-29 13:38:03 +08:00
20bd0c2987 [FIX](cases )fix ipv6 value for regress case 2024-04-29 13:37:29 +08:00
222289697d [improve](regression) Support qt_target_sql (#34236) (#34270) 2024-04-29 11:50:35 +08:00
5277a55791 (pick 34003) release fd for shutdown tablets (#34224) 2024-04-29 10:51:19 +08:00
946d28646a [fix](outfile)Fixed orcOutputStream.close() throwing an exception during destruction causing the program to hang. (#34254)
bp #34243
2024-04-28 19:54:34 +08:00
417431fd83 [Enhancement](hdfs-file-system) Change fs_handler ptr to shared_ptr and remove ref count operations. (#34049)
Backport #33959.
2024-04-28 19:45:30 +08:00
99af54f779 [Fix](orc-reader) Fix the issue when string col has mixed plain and dict encoding in different stripes. (#34146) (#34248)
backport #34146
2024-04-28 19:43:57 +08:00
11039ade7b [opt](paimon) support mapping Paimon column type "Row" to Doris type "Struct" (#34239)
backport: #33786
2024-04-28 19:38:50 +08:00
1fda68f738 [feature](planner) Support select constant from dual syntax sugar (#34200) (#34232)
In MySQL, it's common to use a simplified syntax like `SELECT constant FROM dual`
which is equivalent to just `SELECT constant`.
This syntax is often used by BI tools when utilizing MySQL connectors to verify connection validity.
To enhance compatibility and ensure seamless integration with such tools,
we have now implemented this feature in Doris.

### Key Changes:
- Doris now interprets `SELECT constant FROM dual` as `SELECT constant`, aligning with MySQL's behavior.
- This update ensures that BI tools can use standard MySQL connectors without modifications or errors when connecting to Doris.
2024-04-28 15:56:16 +08:00
341f5cd7a3 [fix](branch-2.1) Fix streamload profile not set (#34221) 2024-04-28 14:36:58 +08:00
45556686ea [fix](test) fix some external test cases (#34209)
Fix some test cases and enable `test_information_schema_external` suite
2024-04-27 23:25:33 +08:00
a6bf35efdf 2.1.3-rc03 2024-04-27 20:54:06 +08:00
7ab425ee4b [improve](move-memtable) reduce default load stream per node to 2 for stream load (#34065) (#34205)
Co-authored-by: Kaijie Chen <ckj@apache.org>
2024-04-27 18:20:57 +08:00
cd1c9edd71 [fix](pipeline-load) fix no error url when data quality error and total rows is negative (#34072) (#34204)
Co-authored-by: HHoflittlefish777 <77738092+HHoflittlefish777@users.noreply.github.com>
2024-04-27 18:19:08 +08:00
36e80af327 [fix](schema change) fix the defineName field is not the same when copying column (#34201)
* [fix](schema change) fix the defineName field is not the same when copying column

* fix
2024-04-27 11:59:07 +08:00
cf700a62b6 [test](case) fix unstable case without order by distinct row (#34167) 2024-04-27 11:20:36 +08:00
30a68c1240 [fix](spill) use different algorithm to avoid partition data skew (#34162) 2024-04-27 11:20:36 +08:00
4b9772062b [refactor](statistic) fetch statistic data with catalog and database id (#33928) (#34202)
bp #33928
2024-04-27 09:38:41 +08:00
c998e2f714 [Enhancement](planner) Support string input for sql_select_limit (#34177) 2024-04-27 02:29:47 +08:00
414fbd353e [fix](ES catalog)Make col != '' behavior consistent with SQL (#34151)
In SQL syntax, `col != ''` equals `col.length() > 0`.
It means that this column must exist in ES doc fields and its content is not empty.
In this PR, we make a special translation for this binary predicate to keep the behavior of both consistent.

---------

Co-authored-by: Luennng <luennng@gmail.com>
2024-04-27 02:29:33 +08:00
3ba42a7823 [improvement](mtmv) Optimize the nested materialized view performance (#34163)
Record increase refersh version more accurately.
The refreshVersion in the memo will increase when mv rewrite successfully.
In query rewrite, if refresh version is different from the current struct info map in group,
will refresh the group struct info or not.
2024-04-27 02:29:33 +08:00
970d0c80df [Improvement](agg) Improve count distinct distribute keys (#33167) 2024-04-27 02:29:33 +08:00
c125148deb [opt](Nereids) bucket shuffle downgrade expansion (#34088)
Expand bucket shuffle downgrade condition, which originally requiring a single partition after pruning, basic table and bucket number < para number. Currently, we expect this option can be used for disabling bucket shuffle more efficiently, without above restrictions.

Co-authored-by: zhongjian.xzj <zhongjian.xzj@zhongjianxzjdeMacBook-Pro.local>
2024-04-27 02:29:33 +08:00