Commit Graph

2923 Commits

Author SHA1 Message Date
5e9e199ca0 [fix](doc) Add the usage example of bos to the documentation of s3 tvf (#28899) 2023-12-23 20:47:50 +08:00
43776465d9 [fix](segcompaction) disable segcompaction by default (#28906)
Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>
2023-12-23 07:43:41 +08:00
3ed82bcee2 [Feature](inverted index) add lowercase option for inverted index analyzer (#28704) 2023-12-22 18:22:44 +08:00
7710c85904 [fix](stream load)add test case and doc for arrow type of stream load (#28098)
add test case and doc for arrow type of stream load
2023-12-22 13:18:44 +08:00
453e3c18f4 [refactor](buffer) remove download buffer since it is no longer useful (#28832)
remove download buffer since it is no longer useful
2023-12-22 11:53:31 +08:00
e2941aa9c8 [improve](config) set mutable and masterOnly in FE config stream_load_default_memtable_on_sink_node (#28835) 2023-12-22 10:58:43 +08:00
172f68480b [Enhancement](load) Limit the number of incorrect data drops and add documents (#27727)
In the load process, if there are problems with the original data, we will store the error data in an error_log file on the disk for subsequent debugging. However, if there are many error data, it will occupy a lot of disk space. Now we want to limit the number of error data that is saved to the disk.

Be familiar with the usage of doris' import function and internal implementation process
Add a new be configuration item load_error_log_limit_bytes = default value 200MB
Use the newly added threshold to limit the amount of data that RuntimeState::append_error_msg_to_file writes to disk
Write regression cases for testing and verification

Co-authored-by: xy720 <22125576+xy720@users.noreply.github.com>
2023-12-22 10:43:18 +08:00
0b9b1be1f1 [fix](function) Fix from_second functions overflow and wrong result (#28685) 2023-12-22 10:22:49 +08:00
4ee661202e [improve](transaction) extend abort transaction time (#28662) 2023-12-21 14:01:05 +08:00
2b2d3d0eb1 [fix](meta_scanner) fix meta_scanner process ColumnNullable (#28711) 2023-12-20 17:41:38 +08:00
08066a55f1 [Enhancement](auto-partition) change the behaviour when insert overwrite an auto partition table #28683
If we specific target partition(s) when inserting overwrite an auto partition table,
before:
could create new partition
now:
behalf just like non-auto partition table
2023-12-20 17:19:21 +08:00
9c9249e911 [feature](inverted index) add ignore_above property to prevent long s… (#28585)
When string is too long, clucene will throw an error. 
And the string is too long to analyze. So we ignore the string in index process when the string is longer than 256 bytes by default.
We add an poperty `ignore_above` for user to customize.
2023-12-19 18:54:36 +08:00
15e31d74e3 [feature](expr) Support kill query by query_id (#28530)
Issue Number: open #28517
2023-12-19 18:18:40 +08:00
d24a1645d9 [improvement](http) add show_table_data http api (#28380)
In some cases, users need to get the data size of single replica of a table, and evaluate certain actions based on this, such as estimating the precise backup size.

Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-12-19 11:44:33 +08:00
6503aaf7db [feature](planner) allow HLL and QUANTILE_STATE types on duplicate and unique table (#28546) 2023-12-19 09:54:24 +08:00
03e989b342 [Doc] Update flink-doris-connector.md (#27329) 2023-12-17 10:39:21 +08:00
6f3fb81965 [fix](doc) spell errors fixes multi-tenant.md (#28436) 2023-12-15 12:21:46 +08:00
8ca7bd8f98 [enhancement](bitmap)support bitmap type for non-key column in duplicate table (#28392) 2023-12-14 23:59:12 +08:00
4300fdce43 [feature](merge-on-write) enable merge-on-write by default again (#28105)
fix #27188, #28096
2023-12-14 19:35:52 +08:00
e1f7b6cff4 [typo](doc) update spark connector doc for version 1.3.0 (#28239)
Co-authored-by: gnehil <gnehil489@github>
2023-12-14 16:48:10 +08:00
79f03d6fde [docs](releasenote)2.0.3 release note (#28373) 2023-12-14 13:07:24 +08:00
524bce3c07 [doc](auditlog) update description of field sql_digest (#28340) 2023-12-13 19:27:48 +08:00
805673dd53 [typo](doc)Add spark load faq (#28015) 2023-12-13 09:45:15 +08:00
cdb452ff3d [typo](docs)update jdk download version to 8u391 #28216
Co-authored-by: Yijia Su <suyijia@selectdb.com>
2023-12-12 21:20:54 +08:00
c1f666c497 [doc] fix typo (#28245) 2023-12-11 18:09:54 +08:00
53802fe0da [doc] document desc param is incorrect #26063 (#26064) 2023-12-11 10:33:07 +08:00
287bd87a4f [typo](docs)add some faq for flink-connector-doris (#26309)
* add flink-connector-doris faq

* add faq
2023-12-09 02:19:49 +08:00
bd8130154a [fix](doc) spell errors fixes hardware-info-action (#28154) 2023-12-09 01:47:19 +08:00
b6e72d57c5 [Improvement](hms catalog) support show_create_database for hms catalog (#28145)
* [Improvement](hms catalog) support show_create_database for hms catalog

* update
2023-12-09 01:34:21 +08:00
baf85547ae [feature](jdbc) support call function to pass sql directly to jdbc catalog #26492
Support a new stmt in Nereids:
`CALL EXECUTE_STMT("jdbc", "stmt")`

So that we can pass the origin stmt directly to the datasource of a jdbc catalog.

show case:
```
mysql> select * from mysql_catalog.db1.tbl1;
+------+------+
| k1   | k2   |
+------+------+
|  111 | 222  |
+------+------+
1 row in set (0.63 sec)

mysql> call execute("mysql_catalog", "insert into db1.tbl1 values(1,'abc')");
Query OK, 0 rows affected (0.01 sec)

mysql> select * from mysql_catalog.db1.tbl1;
+------+------+
| k1   | k2   |
+------+------+
|  111 | 222  |
|    1 | abc  |
+------+------+
2 rows in set (0.03 sec)

mysql> call execute_stmt("mysql_catalog", "delete from db1.tbl1 where k1=111");
Query OK, 0 rows affected (0.01 sec)

mysql> select * from mysql_catalog.db1.tbl1;
+------+------+
| k1   | k2   |
+------+------+
|    1 | abc  |
+------+------+
1 row in set (0.03 sec)
```
2023-12-08 23:06:05 +08:00
573b594df3 [improvement](Variant Type) Support displaying subcolumns expanded for the variant column (#27764) 2023-12-08 20:34:58 +08:00
0931eb536c Revert "[Improvement](auditlog) add column catalog for audit log and audit log table (#26403)" (#28177)
This reverts commit daea751a986823bf5858704663d58f49fd5dfb39.
2023-12-08 18:46:59 +08:00
cd108688c1 [Chore](docs)Fix job error docs (#28127) 2023-12-08 10:24:21 +08:00
25b90eb782 [Feature](function) support random int from specific range (#28076)
mysql> select rand(-20, -10);
+------------------+
| random(-20, -10) |
+------------------+
|              -13 |
+------------------+
1 row in set (0.10 sec)
2023-12-08 10:15:25 +08:00
81a0f8c041 [Feature](function) support generating const values from tvf numbers (#28051)
If specified, got a column of constant. otherwise an incremental series like it always be.

mysql> select * from numbers("number" = "5", "const_value" = "-123");
+--------+
| number |
+--------+
|   -123 |
|   -123 |
|   -123 |
|   -123 |
|   -123 |
+--------+
5 rows in set (0.11 sec)
2023-12-07 22:26:43 +08:00
a2d66911cd [chore](docs) Fix partition cache design principles #28110 2023-12-07 22:23:46 +08:00
b1c5519aa8 [doc](statistics)Update external catalog statistics doc (#28123) 2023-12-07 21:33:05 +08:00
8c79b86f5b Revert "[feature](merge-on-write) enable merge-on-write by default (#27188)" (#28096)
This reverts commit 00c8bab84de8154052f9d323800b436cd0ad36e5.
2023-12-07 11:31:36 +08:00
3a7a8bb107 [opt](resource-tag) root and admin user can use any resource tag by default (#28088)
In #25331, I change the behavior of user's default resource tag, that is, if a user does not set resource tag,
it can only use default resource tag.
This PR change this logic. The normal user can only use default resource tag if resource tag is not set,
but root and admin user can use any resource tag if resource tag is not set.
2023-12-07 11:22:30 +08:00
00bcf0d762 [doc](paimon) add more example for paimon catalog (#28087) 2023-12-07 10:21:02 +08:00
00c8bab84d [feature](merge-on-write) enable merge-on-write by default (#27188) 2023-12-06 21:06:58 +08:00
0ff5a1cc25 [fix](doc) spell error and aligned with code (#27609) 2023-12-06 20:58:39 +08:00
fa5096f510 [feature](analyze_cmd) add show-tablets-belong stmt for analyzing a batch of tablet-ids (#27994) 2023-12-06 15:59:00 +08:00
cbf1f8620a [Feature](job)support cancel task and fix log invalid (#27703)
- Running task can be show and fix cancel fail
- When the insert task scheduling cycle is reached, if there are still tasks running, the scheduling of this task will be canceled at this time.
- refactor job status changes SQL
- Fix timer job window error
- Support cancel task
2023-12-06 10:44:09 +08:00
e427ac9338 [typo](doc) update default value of compaction_promotion_min_size_mbytes (#25839)
Co-authored-by: gnehil <adamlee489@gamil.com>
2023-12-04 17:58:46 +08:00
27e1b2ac2d [docs](fix) fix document error (#27612) 2023-12-04 14:30:41 +08:00
80f528bf26 [enhancement](backup-restore) add config for upload/download task num per be (#27772)
set upload/download task num per be, and improve the overall speed of upload/download, enhance the performance of backup and recovery.

---------

Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-12-04 11:19:45 +08:00
e9827f443f [typo](docs)add some tips for mc and add dlf.catalog.id info to dlf doc (#27754)
add doc for max compute and dlf
2023-12-04 10:03:21 +08:00
2e1ce758f1 [feature](function) support ip function ipv6numtostring(alias inet6_ntoa) (#27342) 2023-12-02 11:48:19 +08:00
fcfd0aa8e0 [fix](doc) spell error (#27079)
fixed Spelling errors in metadata-operation and cold-hot-separation
2023-12-01 21:30:50 +08:00