Commit Graph

1757 Commits

Author SHA1 Message Date
af3416ede0 [docs] Update be-vscode-dev.md (#15800)
Fix some syntax errors, making it more comfortable for developers to read.
2023-01-11 12:30:52 +08:00
4be54cfcac [deps](hdfs) update libhdfs3 to v2.3.5 to support KMS (#15770)
Support KMS in libhdfs3: apache/doris-thirdparty#22
2023-01-10 23:21:53 +08:00
a67cea2d27 [Enhancement](metric) add current edit log metric (#15657) 2023-01-10 18:46:57 +08:00
ec0a9647f1 [typo](docs)Update sequence-column-manual.md #15727
创建unique模型的test_table数据表,并指定指定sequence列映射到表中的modify_date列。 重复
2023-01-10 14:54:57 +08:00
fd7d13d4c0 [typo](docs)Update dynamic-partition.md #15734
拼写错误
2023-01-10 10:14:44 +08:00
3990a44aba [typo](doc) add since dev lable to field function doc (#15648) 2023-01-10 09:52:37 +08:00
67a6ad648e [typo](doc) command of manually trigger compaction incorrect (#15709) 2023-01-10 09:50:47 +08:00
9e3a61989b [refactor](es) remove BE generated dsl for es query #15751
remove fe config enable_new_es_dsl and all related code.
Now the DSL for es is always generated on FE side.
2023-01-10 08:40:32 +08:00
211cc66d02 [fix](multi-catalog) fix image loading failture when create catalog with resource (#15692)
Bug fix
fix image loading failture when create catalog with resource
When creating jdbc catalog with resource, the metadata image will failed to be loaded.
Because when loading jdbc catalog image, it will try to get resource from ResourceMgr,
but ResourceMgr has not been loaded, so NPE will be thrown.

This PR fix this bug, and refactor some logic about catalog and resource.

When loading jdbc catalog image, it will not get resource from ResourceMgr.
And now user can create catalog with resource and properties, like:

create catalog jdbc_catalog with resource jdbc_resource
properites("user" = "user1");
The properties in "properties" clause will overwrite the properties in "jdbc_resource".

force adding tinyInt1isBit=false to jdbc url
The default value of tinyInt1isBit is true, and it will cause tinyint in mysql to be bit type.
force adding tinyInt1isBit=false to jdbc url so that the tinyint in mysql will be tinyint in Doris.

Avoid calculate checksum of jdbc driver jar multiple times
Refactor
Refactor the notification logic when updating properties in resource.
When updating properties in resource, it will notify the corresponding catalog to update its own properties.
This PR change this logic. After updating properties in resource, it will only uninitialize the catalog's internal
objects such "jdbc client" or "hms client". And this objects will be re-initialized lazily.

And all properties will be got from Resource at runtime, so that it will always get the latest properties

Regression test cases
Because we add tinyInt1isBit=false to jdbc url, some of cases need to be changed.
2023-01-09 09:56:26 +08:00
wxy
6829d361cb [Feature](audit) add errorCode and errorMessage in audit log (#14925)
* [feat] add errorCode and errorMessage in audit log.

* [Feature](audit) add errorCode and errorMessage in audit log

Co-authored-by: wangxiangyu@360shuke.com <wangxiangyu@360shuke.com>
2023-01-09 08:47:57 +08:00
wxy
fb1f6bdd82 [doc](export) add docs for cancel-export. (#15682)
Co-authored-by: wangxiangyu@360shuke.com <wangxiangyu@360shuke.com>
2023-01-09 08:38:45 +08:00
36590da24b [fix](regression p0) add the alias function hist to histogram and fix p0 (#15708)
add the alias function hist to histogram and fix p0
2023-01-08 11:31:23 +08:00
500c7fb702 [improvement](multi-catalog) support unsupported column type (#15660)
When creating an external catalog, Doris will automatically sync the schema of table from external catalog.
But some of column type are not supported by Doris now, such as struct, map, etc.

In previous, when meeting these unsupported column, Doris will throw an exception, and the corresponding
table can not be synced. But user may just want to query other supported columns.

In this PR, I add a new column type: UNSUPPORTED. And now it is just used for external table schema sync.
When meeting unsupported column, it will be synced as column with UNSUPPORTED type.

When query this table, there are serval situation:

select * from table: throw error Unsupported type 'UNSUPPORTED_TYPE' xxx
select k1 from table: k1 is with supported type. query OK.
select * except(k2): k2 is with unsupported type. query OK
2023-01-08 10:07:10 +08:00
ae1a77e034 add Q&A to jdbc external table (#15680) 2023-01-07 20:04:02 +08:00
054af036fe [typo](doc) fix Chinese describe (#15683) 2023-01-07 20:02:44 +08:00
76ad599fd7 [enhancement](histogram) optimise aggregate function histogram (#15317)
This pr mainly to optimize the histogram(👉🏻 https://github.com/apache/doris/pull/14910)  aggregation function. Including the following:
1. Support input parameters `sample_rate` and `max_bucket_num`
2. Add UT and regression test
3. Add documentation
4. Optimize function implementation logic
 
Parameter description:
- `sample_rate`:Optional. The proportion of sample data used to generate the histogram. The default is 0.2.
- `max_bucket_num`:Optional. Limit the number of histogram buckets. The default value is 128.

---

Example:

```
MySQL [test]> SELECT histogram(c_float) FROM histogram_test;
+-------------------------------------------------------------------------------------------------------------------------------------+
| histogram(`c_float`)                                                                                                                |
+-------------------------------------------------------------------------------------------------------------------------------------+
| {"sample_rate":0.2,"max_bucket_num":128,"bucket_num":3,"buckets":[{"lower":"0.1","upper":"0.1","count":1,"pre_sum":0,"ndv":1},...]} |
+-------------------------------------------------------------------------------------------------------------------------------------+

MySQL [test]> SELECT histogram(c_string, 0.5, 2) FROM histogram_test;
+-------------------------------------------------------------------------------------------------------------------------------------+
| histogram(`c_string`)                                                                                                               |
+-------------------------------------------------------------------------------------------------------------------------------------+
| {"sample_rate":0.5,"max_bucket_num":2,"bucket_num":2,"buckets":[{"lower":"str1","upper":"str7","count":4,"pre_sum":0,"ndv":3},...]} |
+-------------------------------------------------------------------------------------------------------------------------------------+
```

Query result description:

```
{
    "sample_rate": 0.2, 
    "max_bucket_num": 128, 
    "bucket_num": 3, 
    "buckets": [
        {
            "lower": "0.1", 
            "upper": "0.2", 
            "count": 2, 
            "pre_sum": 0, 
            "ndv": 2
        }, 
        {
            "lower": "0.8", 
            "upper": "0.9", 
            "count": 2, 
            "pre_sum": 2, 
            "ndv": 2
        }, 
        {
            "lower": "1.0", 
            "upper": "1.0", 
            "count": 2, 
            "pre_sum": 4, 
            "ndv": 1
        }
    ]
}
```

Field description:
- sample_rate:Rate of sampling
- max_bucket_num:Limit the maximum number of buckets
- bucket_num:The actual number of buckets
- buckets:All buckets
    - lower:Upper bound of the bucket
    - upper:Lower bound of the bucket
    - count:The number of elements contained in the bucket
    - pre_sum:The total number of elements in the front bucket
    - ndv:The number of different values in the bucket

> Total number of histogram elements = number of elements in the last bucket(count) + total number of elements in the previous bucket(pre_sum).
2023-01-07 00:50:32 +08:00
a6773417ef [Doc] Add sidebars for split_by_string function and delete split_by_char builtins code (#15679) 2023-01-06 21:14:26 +08:00
df2da89b89 [feature](multi-catalog) support postgresql jdbc catalog (#15570)
support postgresql jdbc catalog
2023-01-06 11:00:59 +08:00
e67ea1ddb7 [fix](doc): catalog use resource doc error (#15607) 2023-01-04 19:53:25 +08:00
36e43c2677 fix 1.2.1 release notes (#15590) 2023-01-04 13:26:54 +08:00
e5397efb67 [docs](releasenotes)release 1.2.1 (#15583)
* release 1.2.1
2023-01-04 10:12:46 +08:00
caaae28b50 [docs](export) fix export data to object storage docs (#15563) 2023-01-03 16:45:02 +08:00
8a5f1351e2 [typo](doc) add be max jvm heap size config description (#15561) 2023-01-03 16:04:18 +08:00
9ab663212b [docs](muti-catalog) update external iceberg system doc (#15556)
Tell users how to solve the problem "fail to read schema from table xx or Storage schema reading not supported"
when doris access hive metastore.
2023-01-03 13:58:16 +08:00
5062c62ee1 [chore](script) add build-for-release.sh (#15545) 2023-01-02 22:50:36 +08:00
c47bdf6606 [vectorized](jdbc) fix external table of oracle have keyworld column (#15487)
if column name is keyword of oracle, the query will report error
2022-12-31 12:48:26 +08:00
9bba2f4cde [typo](docs) array function doc fix (#15449) 2022-12-30 23:00:48 +08:00
aacd11336a [typo](docs)update java udf demo (#15521) 2022-12-30 21:12:34 +08:00
8e58d92e77 [typo](docs) fix document info missing in SHOW-TABLETS.md (#15488) 2022-12-30 18:39:21 +08:00
084eec87ee [docs](docs)update en docs (#15470)
* Update basic-summary.md
2022-12-30 18:38:26 +08:00
34d7eeb571 [doc](session variable) add doc content for adding variables called rewrite_or_to_in_predicate_threshold (#15513)
Co-authored-by: wuhangze <wuhangze@jd.com>
2022-12-30 17:11:45 +08:00
08d4dcefff [typo](doc)data partition doc including en and zh-CN #15379
Co-authored-by: Chen Jinquan 陈金泉 (690) <chenjinq@haier.com>
2022-12-30 15:38:25 +08:00
9a517d6a8f [DataType](Deciamlv3) change the avg function scale of decimalv3 (#15445) 2022-12-30 00:27:51 +08:00
73f7ccb58f [typo](docs) fix document display error in SHOW-ALTER.md and SHOW-PARTITION-ID.md and SHOW-PARTITIONS.md (#15453) 2022-12-30 00:27:22 +08:00
e2603ca883 [fix](docs) fix some docs about stream load and select. (#15372)
* [fix](docs) fix some docs about stream load and select.

* update
2022-12-29 14:50:06 +08:00
2ae28ea9dd [typo](docs)fix-doc #15438 2022-12-29 14:19:24 +08:00
4179ea31bd [typo](docs) fix typo in SHOW-ALTER.md and SHOW-LOAD-WARNINGS.md (#15431) 2022-12-29 14:19:05 +08:00
298c0a2391 [typo](docs)fix be dynamic configuration doc #15443 2022-12-29 14:18:14 +08:00
f5b4faf682 fix compile doc (#15454) 2022-12-29 14:17:53 +08:00
ffef81a6ab [feature](BE)pad missed version with empty rowset (#15030)
If all replicas of one tablet are broken, user can use this http api to pad the missed version with empty rowset.
2022-12-29 11:20:44 +08:00
a22ee89431 [Enhancement](jemalloc):support heap dump by http request at runtime (#15429) 2022-12-28 20:10:50 +08:00
75aa00d3d0 [Feature](NGram BloomFilter Index) add new ngram bloom filter index to speed up like query (#11579)
This PR implement  the new bloom filter index: NGram bloom filter index, which was proposed in  #10733.
The new index can improve the like query performance greatly, from our some test case , can  get order of magnitude  improve.
For how to use it you can check the docs in this PR, and the index based on the ```enable_function_pushdown```,
you need set it to ```true```, to make the index work for like query.
2022-12-28 18:01:50 +08:00
3aae27634a [doc](flink-connector) update flink connector faq (#15405) 2022-12-28 16:15:49 +08:00
8342691b62 [feature](remote)Add drop storage policy (#15364)
* add drop storage policy

* add drop storage policy

* add drop storage policy

* add drop storage policy
2022-12-28 16:04:30 +08:00
28bb13a026 [feature](light-schema-change) enable light schema change by default (#15344) 2022-12-28 09:29:26 +08:00
aad53d37c7 [typo](docs)fix doris docs 404 link (#15400) 2022-12-27 22:57:40 +08:00
b3f77a2e00 [feature](Show) add one show type cast command (#15137) 2022-12-27 14:19:04 +08:00
6d851b1fc9 [Doc](Flink) update flink connector doc add new version #15365
Co-authored-by: wudi <>
2022-12-27 14:15:49 +08:00
777b0b94bb [typo](docs) fix wrong date format (#15363)
fix wrong date format
2022-12-27 11:45:05 +08:00
c3d0e2931a [typo](docs) fix version tag for docs of s3 token (#15362) 2022-12-26 19:23:43 +08:00