Commit Graph

2495 Commits

Author SHA1 Message Date
7be349a10b [opt](inverted index) add session variable enable_inverted_index_query to control whether query with inverted index (#22255) 2023-07-28 12:43:26 +08:00
313d4fd257 [doc](fix) Improve the document example of alter table replace (#21958)
Improve the document example of alter table replace
The old doc example was wrong
2023-07-28 10:21:47 +08:00
e87174dd6b [feature](planner) modify multi partition prefix value (#22098)
modify multi partition prefix value: 'p_'
2023-07-28 10:21:32 +08:00
a87d34b19b [Fix](multi catalog statistics)Improve external table statistics collection (#22224)
Improve external table statistics collection, including log, observability and fix some bugs.
1. Add Running state for statistics job.
2. Add progress for show analyze job. (n/m tasks finished, n/m tasks failed and so on)
3. Add analyze time cost for show analyze task.
4. Make task failure message more clear.
5. Synchronize the job status updating code in updateTaskStatus.
6. Fix NPE in HMSAnalyzeTask. (Avoid refreshing statistics cache if the collection sql failed)
7. Return an error message when a `with sync` collection times out.
8. Log level improvement
9. Fix misuse of logCreateAnalysisJob for tasks.
2023-07-27 20:01:14 +08:00
391131c7fd [typo](docs)correct describe for grant (#20910) 2023-07-27 20:00:01 +08:00
e29c703ce4 [typo](docs) fix some typo (#21031) 2023-07-27 16:55:48 +08:00
94d2ba5ba0 [Docs](community) Add Join Community Files of EN & CN Version (#22252) 2023-07-27 16:47:10 +08:00
b597eef436 [fix](doc)description of check_rpc_channel api in http-actions-be (#21568) 2023-07-27 16:09:20 +08:00
31c856351a [enhancement](default_config) change default value of rpc related (#22149)
configs

Bdbje elect timeout is 30 seconds, so we enlarge thrift_rpc_timeout_ms
and txn_commit_rpc_timeout_ms to 60s.

BTW: enlarge bdbje_lock_timeout_second from 1 to 5.
2023-07-27 11:12:26 +08:00
2c7d7937e5 [doc](fe) Add split-brain influence for metadata_failure_recovery (#21801) 2023-07-26 18:21:29 +08:00
f621d8ea65 [doc](stats) full auto analyze docs #21918
Add description for some fe config about analyze
2023-07-26 17:30:10 +08:00
ea2a7a8e56 [Docs](docs) Rename Release Note Title and File name of CN & EN Version (#22157) 2023-07-26 09:20:06 +08:00
e8f4323e0f [Fix](jdbcCatalog) fix typo of some variable #22214 2023-07-26 08:34:45 +08:00
cf677b327b [fix](jdbc catalog) Fixed mappings with type errors for bool and tinyint(1) (#22089)
MySQL does not have a boolean type; its boolean type is actually tinyint(1). In the previous logic, we forced tinyint(1) to be a boolean by passing tinyInt1isBit=true, which caused an error whenever a tinyint(1) value was not 0 or 1. Therefore, we now match tinyint(1) as tinyint instead of boolean. This change does not affect the correctness of where k = 1 or where k = true queries.
2023-07-25 22:45:22 +08:00
527547b4ed [catalog](faq) add jdbc catalog faq (#22129) 2023-07-25 15:59:16 +08:00
1e8ae7ad16 [doc](flink-connector)improve flink connector doc (#22143) 2023-07-25 15:58:35 +08:00
28b714c371 [feature](executor) using fe version to set instance_num (#22047) 2023-07-25 14:37:42 +08:00
b41fcbb783 [feature](agg) add the aggregation function 'map_agg' (#22043)
New aggregation function: map_agg.

This function requires two arguments: a key and a value, which are used to build a map.

select map_agg(column1, column2) from t group by column3;
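
The grouping behavior described above can be sketched in Python. This is only an illustration of the semantics, not the Doris implementation: the helper name `map_agg`, the row layout, and the handling of repeated keys are all assumptions made for the example.

```python
from collections import defaultdict


def map_agg(rows, key, value, group_by):
    """Collect (key, value) pairs into one dict per group.

    Hypothetical helper mirroring:
        select map_agg(column1, column2) from t group by column3;
    """
    groups = defaultdict(dict)
    for row in rows:
        # Assumption: if a key repeats within a group, the last value wins.
        # The commit message does not say how Doris treats duplicate keys.
        groups[row[group_by]][row[key]] = row[value]
    return dict(groups)


# Sample rows standing in for table t (made-up data).
rows = [
    {"column1": "a", "column2": 1, "column3": "g1"},
    {"column1": "b", "column2": 2, "column3": "g1"},
    {"column1": "c", "column2": 3, "column3": "g2"},
]
result = map_agg(rows, "column1", "column2", "column3")
print(result)
```

Each distinct value of column3 yields one map built from that group's column1/column2 pairs.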
2023-07-25 11:21:03 +08:00
ba2eb4d788 [typo](docs) add jdbc catalog error handling methods (#22160) 2023-07-25 10:45:29 +08:00
fc67929e34 [improvement](catalog) optimize ldap and support more character in user and table name (#21968)
- common name supports `-` (reason: MySQL's db name supports `-`)
- table name supports `-`
- username supports `.` (reason: LDAP's username supports `.`)
- ldap doc
- ldap supports rbac
2023-07-24 22:04:37 +08:00
0205f540ac [enhancement](config) Enlarge broker scanner bytes conf to 500G, 5G is still not enough (#22126) 2023-07-24 19:49:39 +08:00
8fa41e4973 [fix](docs) update version since of stream load enable_profile (#21786)
Stream load enable_profile is not supported in 1.2.4.1, update the docs.

Currently it's supported in 2.0.0-rc01.

#21784 has backported this feature into branch 1.2-lts, and it will be included in the 1.2.7 release.
2023-07-24 09:22:12 +08:00
22aa54e335 [enhancement](config) enlarge max_bytes_per_broker_scanner to 5G #22099 2023-07-23 12:00:32 +08:00
eceb30f47e [doc](catalog)paimon doc (#21966)
code pr: #21910
2023-07-23 11:24:40 +08:00
3d0f952934 [FIX](complex-type)delete enable_map/struct_type switch #21957 2023-07-22 15:29:32 +08:00
50c8563f35 [fix](partial update) fix some bugs of sequence column (#21896) 2023-07-22 15:26:48 +08:00
32fce013f7 [feature](docs) add docs dbt-doris adapter (#22067) 2023-07-21 23:34:47 +08:00
e489b60ea3 [feature](load) support line delimiter for old broker load (#22030) 2023-07-21 19:31:19 +08:00
63b17bc7ba [typo](docs) fix some mistake in Doris & Spark Column Type Mapping (#19998) 2023-07-21 16:37:51 +08:00
67a3f37779 [doc](routineload)add routine load ssl example for access ali-kafka (#21877) 2023-07-21 16:03:10 +08:00
732e0d14ff [Enhancement](window-funnel)add different modes for window_funnel() function (#20563) 2023-07-21 13:57:27 +08:00
74313c7d54 [feature-wip](autoinc)(step-3) add auto increment support for unique table (#22036) 2023-07-21 13:24:41 +08:00
ab11dea98d [Enhancement](config) optimize behavior of default_storage_medium (#20739) 2023-07-20 22:00:11 +08:00
7d488688b4 [fix](multi-catalog)fix minio default region and throw minio error msg, support s3 bucket root path (#21994)
1. check minio region, set default region if user region is not provided, and throw minio error msg
2. support read root path s3://bucket1
3. fix max compute public access
2023-07-20 20:48:55 +08:00
367ad9164a [feature-wip](auto-inc)(step-2) support auto-increment column for duplicate table (#19917) 2023-07-20 18:03:39 +08:00
c31e826756 [opt](config) rename alter_inverted_index_worker_count to alter_index_worker_count, and add docs (#21985) 2023-07-20 17:50:04 +08:00
2ae9bfa3b2 [typo](docs) add oracle jdbc catalog FAQ of orai18n.jar (#22016) 2023-07-20 14:10:58 +08:00
1afe090486 [improvement](memory) modify jemalloc conf in be.conf (#21943)
modify jemalloc conf in be.conf
    disable je_purge_all_arena_dirty_pages
2023-07-20 10:34:31 +08:00
2daad2151d [enhancement](jdbc catalog) Add mysql jdbc catalog function to filter push-down identification (#21745) 2023-07-19 23:48:23 +08:00
845cf94a7a [feature](function) support time_to_sec (#21722)
mysql >select sec_to_time(time_to_sec(cast('16:32:18' as time)));
+----------------------------------------------------+
| sec_to_time(time_to_sec(CAST('16:32:18' AS TIME))) |
+----------------------------------------------------+
| 16:32:18                                           |
+----------------------------------------------------+
1 row in set (0.53 sec)

mysql [test]>select sec_to_time(59538);
+--------------------+
| sec_to_time(59538) |
+--------------------+
| 16:32:18           |
+--------------------+
1 row in set (0.03 sec)
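
The arithmetic behind the two queries above can be checked with a short Python sketch. The function names simply mirror the SQL functions; this is not the Doris code, and it assumes non-negative `HH:MM:SS` values only.

```python
def time_to_sec(hms: str) -> int:
    """Convert an 'HH:MM:SS' string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def sec_to_time(total: int) -> str:
    """Convert a non-negative number of seconds back to 'HH:MM:SS'."""
    hours, remainder = divmod(total, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


# 16 * 3600 + 32 * 60 + 18 = 59538, matching the second query above.
print(time_to_sec("16:32:18"))
print(sec_to_time(59538))
```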
2023-07-19 01:09:48 +08:00
1c149439d7 [docs](map)Add map and struct type support parameters (#21802) 2023-07-19 01:06:23 +08:00
c6063ed92f [Revert](lazy open) revert lazy open and add case (#21821) 2023-07-18 19:41:33 +08:00
e24867e138 [typo][docs] Modify the description of CREATE-TABLE (#21858) 2023-07-18 10:29:47 +08:00
ebc1e9e9f9 [docs](releasenote)add 1.2.6 release note (#21875) 2023-07-17 17:56:08 +08:00
1c36b77024 [typo][docs] Modify a typo in the aggr_type description for CREATE-TABLE (#21861)
Modify a typo in the CREATE-TABLE's aggr_type description to change "后倒入" to "后导入".
2023-07-17 17:02:39 +08:00
4cea785f13 [typo][docs] Delete the extra characters in the tablet-local-debug Chinese document. (#21846) 2023-07-17 17:02:16 +08:00
03b575842d [Feature](table function) support explode_json_array_json (#21795) 2023-07-17 11:40:02 +08:00
ca6e33ec0c [feature](table-value-functions)add catalogs table-value-function (#21790)
mysql> select * from catalogs() order by CatalogId;
2023-07-14 10:25:16 +08:00
4158253799 [feature](hudi) support hudi time travel in external table (#21739)
Support hudi time travel in external table:
```
select * from hudi_table for time as of '20230712221248';
```
PR (https://github.com/apache/doris/pull/15418) supports using either a timestamp or a version as the snapshot ID in iceberg, but hudi only has timestamp as the snapshot ID. Therefore, querying a hudi table with `for version as of` throws an error like:
```
ERROR 1105 (HY000): errCode = 2, detailMessage = Hudi table only supports timestamp as snapshot ID
```
The supported timestamp formats in hudi are: 'yyyy-MM-dd HH:mm:ss[.SSS]', 'yyyy-MM-dd' or 'yyyyMMddHHmmss[SSS]', which is consistent with hudi's [time travel query](https://hudi.apache.org/docs/quick-start-guide#time-travel-query).
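
As a rough illustration of the three timestamp formats listed above, the following Python sketch accepts each of them. It is not how Doris or hudi parse timestamps; the function name `parse_snapshot_timestamp` and the error message are made up for the example.

```python
from datetime import datetime, timedelta

# Dashed forms: 'yyyy-MM-dd HH:mm:ss[.SSS]' and 'yyyy-MM-dd'.
# Note: Python's %f accepts 1 to 6 fractional digits, so this sketch is
# slightly more lenient than the exact '[.SSS]' pattern.
DASHED_FORMATS = ("%Y-%m-%d %H:%M:%S.%f", "%Y-%m-%d %H:%M:%S", "%Y-%m-%d")


def parse_snapshot_timestamp(text: str) -> datetime:
    """Parse a time travel timestamp in one of the three listed formats."""
    # Compact form 'yyyyMMddHHmmss[SSS]': 14 digits, optionally followed
    # by exactly 3 millisecond digits.
    if text.isdigit() and len(text) in (14, 17):
        base = datetime.strptime(text[:14], "%Y%m%d%H%M%S")
        millis = int(text[14:]) if len(text) == 17 else 0
        return base + timedelta(milliseconds=millis)
    for fmt in DASHED_FORMATS:
        try:
            return datetime.strptime(text, fmt)
        except ValueError:
            continue
    raise ValueError(f"unsupported snapshot timestamp: {text!r}")


print(parse_snapshot_timestamp("20230712221248"))
print(parse_snapshot_timestamp("2023-07-12 22:12:48.123"))
```

The compact form is dispatched by length rather than by trial parsing, because a trailing fractional-seconds directive could otherwise absorb one of the seconds digits.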

## Partitioning Strategies
Before this PR, hudi's partitions needed to be synchronized to hive through [hive-sync-tool](https://hudi.apache.org/docs/syncing_metastore/#hive-sync-tool), or by setting very complex synchronization parameters in [spark conf](https://hudi.apache.org/docs/syncing_metastore/#sync-template). These processes are exceptionally complex, and they are unnecessary unless you want to query hudi data through hive.

In addition, partitions can change between time travel snapshots, so we cannot guarantee the correctness of time travel through partition synchronization.

So this PR obtains partitions directly by reading hudi meta information, caches and updates table partition information based on the hudi instant timestamp, and reuses Doris' partition pruning.
2023-07-13 22:30:07 +08:00
23272abf48 [chore](docs)Removed documentation related to dynamic tables (#21803)
since the feature was reworked
2023-07-13 22:20:20 +08:00