Commit Graph

1910 Commits

Author SHA1 Message Date
c39914c0a0 [feature](partition)add default list partition (#15509)
This PR implements the default list partition referred to in the related #15507.
It is similar to Greenplum's default partition: it stores all rows that do not satisfy the constraints of any
other partition key. The optimizer does not prune the default partition, so it is scanned
every time you select data from a table that has one.

Users can either create a table with a default partition or add one to an existing table with ALTER TABLE.

```sql
PARTITION LIST(key) {
PARTITION p1 values in (xx,xx),
PARTITION DEFAULT
}

ALTER TABLE XXX ADD PARTITION DEFAULT
```

Data in the default partition that satisfies the constraint of a newly added partition is not migrated
automatically when that partition is added. Users should select those rows from the default partition, using the new
constraint as the predicate, and insert them into the new partition.

```sql
insert into tbl select * from tbl partition default where partition_key=xx;
```
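
The routing and pruning behaviour described above can be sketched in Python. This is only an illustration under stated assumptions, not the Doris implementation: the names `route`, `partitions_to_scan` and `DEFAULT` are hypothetical, and partitions are modelled as plain sets of key values.

```python
from typing import Dict, Hashable, List, Optional, Set

DEFAULT = "default"  # hypothetical label for the default partition


def route(key: Hashable, partitions: Dict[str, Set[Hashable]], has_default: bool) -> Optional[str]:
    """Return the partition that would store a row with this partition key."""
    for name, values in partitions.items():
        if key in values:
            return name
    # No explicit list partition accepts the key: fall back to the default one.
    return DEFAULT if has_default else None


def partitions_to_scan(key: Hashable, partitions: Dict[str, Set[Hashable]], has_default: bool) -> List[str]:
    """Partitions a query filtering on `key` must read.

    Explicit partitions are pruned by their value lists; the default
    partition is never pruned, so it is always included when present.
    """
    selected = [name for name, values in partitions.items() if key in values]
    if has_default:
        selected.append(DEFAULT)
    return selected
```

With `partitions = {"p1": {1, 2}}`, a row with key `9` is routed to the default partition, and a query on key `1` still has to read both `p1` and the default partition.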
2023-02-24 15:24:59 +08:00
7470198df6 [Docs](docs) Organize http documents (#16618)
1.  Organize http documents
2. Add http interface authentication for FE
3. Support https interface for FE
4. Provide authentication interface
5. Add http interface authentication for BE
6. Support https interface for BE
2023-02-24 15:17:01 +08:00
Pxl
03f4c7a94d [Doc](Materialized-View) update documentation about materialized view enhancement (#17025)
update documentation about materialized view enhancement
2023-02-24 10:06:35 +08:00
37b9b038c4 [typo](docs) fix Fix incorrect url address in export-manual.md. (#17072) 2023-02-24 09:42:28 +08:00
1cce5782a0 [typo](docs) collect doc md language annotation (#17090) 2023-02-24 09:41:54 +08:00
c416bfbaef [typo](docs)fix disk format (#17050)
* change docker compose to 'docker-compose'

* modify sql of mysql

* fix docker start and stop cmd

* new commit

* markdown format adjust
2023-02-23 20:32:05 +08:00
92ecd16573 (feature)[DOE]Support array for Doris on ES (#16941)
* (feature)[DOE]Support array for Doris on ES
2023-02-23 19:31:18 +08:00
526a66e9fb [Function](array-type) support array_apply (#17020)
Filter array to match specific binary condition

```
mysql> select array_apply([1000000, 1000001, 1000002], '=', 1000002);
+-------------------------------------------------------------+
| array_apply(ARRAY(1000000, 1000001, 1000002), '=', 1000002) |
+-------------------------------------------------------------+
| [1000002]                                                   |
+-------------------------------------------------------------+
```
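
A rough Python model of the filtering semantics shown above. The operator set is an assumption (the commit message only says "binary condition"), and NULL handling is ignored.

```python
import operator
from typing import Callable, Dict, List

# Assumed set of comparison operators; not confirmed by the commit message.
_OPS: Dict[str, Callable[[int, int], bool]] = {
    "=": operator.eq,
    "!=": operator.ne,
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
}


def array_apply(arr: List[int], op: str, val: int) -> List[int]:
    """Keep the elements of `arr` for which `element op val` holds."""
    if op not in _OPS:
        raise ValueError(f"unsupported operator: {op!r}")
    compare = _OPS[op]
    return [x for x in arr if compare(x, val)]
```

Calling `array_apply([1000000, 1000001, 1000002], "=", 1000002)` returns `[1000002]`, matching the query result above.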
2023-02-23 17:38:16 +08:00
51bbae27b8 [feature-wip](iceberg) add dlf and glue catalog impl for iceberg catalog (#16602)
iceberg catalog supports
DLF on Alibaba Cloud and AWS Glue Catalog
2023-02-23 14:02:41 +08:00
2e1ed384fd [typo](docs) add split_by_string function 1.2.2 label (#17057) 2023-02-23 11:17:25 +08:00
8eeb435963 [improvement](meta) Enhance Doris's fault tolerance to disk error (#16472)
Detect I/O errors.
Retry the query when an I/O error occurs.
Greylist: when a disk is found to be completely broken, or the difference between the tablet counts in the BE and the FE meta is too large, reduce the query priority of that BE.
2023-02-23 08:40:45 +08:00
a1c0054b4c [fix](memory) fix memory GC details and join probe catch bad_alloc (#16989)
On Red Hat 4.x, /proc/meminfo has no MemAvailable field, so MemAvailable is no longer used to control memory there.
Record vm_rss_str and mem_available_str at the moment GC is triggered, so that memory changes during GC do not make the logs inaccurate.
Catch bad_alloc in join probe, which may allocate 64G of memory at a time, to avoid OOM.
Update the names of doris_be_all_segments_num and doris_be_all_rowsets_num in the documentation.
2023-02-23 08:33:30 +08:00
d7d82f26af [typo](docs) add date_trunc function 1.2 label (#17037) 2023-02-22 22:42:18 +08:00
8dd1a12ea6 [typo](docs)Add upgrade precautions #17027 2023-02-22 19:27:20 +08:00
e48d9c9d62 [doc](typo)update datax.md #17009 2023-02-22 19:27:03 +08:00
e65a061256 [Enhancement](datetimev2-enhance) support 'microseconds_add' function for datetimev2 (#16970)
support 'microseconds_add' function for datetimev2
2023-02-22 17:49:41 +08:00
51eb147711 fix inverted index doc typo and reorganize index related docs (#16915) 2023-02-22 15:15:10 +08:00
0e3be4eff5 [Improvement](brpc) Using a thread pool for RPC service avoiding std::mutex block brpc::bthread (#16639)
Main changes:
- The brpc service adds two types of thread pools, "light" and "heavy", with different numbers of threads.
- Classify the BE interfaces: those related to data transmission are heavy interfaces, and the others are light interfaces.
- Add monitoring to the thread pools, including the queue size and the number of active threads. Use these indicators to guide the configuration of the number of threads.
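
The light/heavy split and its two metrics could look roughly like this in Python. This is a sketch only: the real change is in the BE's C++ brpc service, and the interface names, pool sizes and class name here are hypothetical.

```python
import threading
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Any, Callable, Dict

# Hypothetical interface names; the commit only says data-transmission
# interfaces are "heavy" and everything else is "light".
HEAVY_INTERFACES = {"transmit_data", "transmit_block"}


class RpcDispatcher:
    """Route each RPC to a light or heavy pool and track simple metrics."""

    def __init__(self, light_threads: int = 2, heavy_threads: int = 4) -> None:
        self._pools = {
            "light": ThreadPoolExecutor(max_workers=light_threads),
            "heavy": ThreadPoolExecutor(max_workers=heavy_threads),
        }
        self._lock = threading.Lock()
        self._queued = {"light": 0, "heavy": 0}
        self._active = {"light": 0, "heavy": 0}

    @staticmethod
    def classify(interface: str) -> str:
        return "heavy" if interface in HEAVY_INTERFACES else "light"

    def submit(self, interface: str, fn: Callable[..., Any], *args: Any) -> Future:
        kind = self.classify(interface)
        with self._lock:
            self._queued[kind] += 1

        def run() -> Any:
            # Move the task from "queued" to "active" once a worker picks it up.
            with self._lock:
                self._queued[kind] -= 1
                self._active[kind] += 1
            try:
                return fn(*args)
            finally:
                with self._lock:
                    self._active[kind] -= 1

        return self._pools[kind].submit(run)

    def metrics(self) -> Dict[str, Dict[str, int]]:
        """Snapshot of queue size and active thread count per pool."""
        with self._lock:
            return {"queued": dict(self._queued), "active": dict(self._active)}

    def shutdown(self) -> None:
        for pool in self._pools.values():
            pool.shutdown(wait=True)
```

Because the RPC handler only enqueues work and returns, a slow heavy request occupies a heavy worker rather than the calling thread, and the `metrics()` snapshot gives the two numbers the commit suggests using to tune pool sizes.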
2023-02-22 14:15:47 +08:00
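16c4e42f42 [typo](doc) Field descriptions are inconsistent with those in the CREATE TABLE SQL (#16270)
* Field descriptions are inconsistent with those in the CREATE TABLE SQL

* 1. In the English docs, change `key_desc` to `keys_type`.

* 1. In the English docs, change `partition_desc` to `partition_info`.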

---------

Co-authored-by: unicornlee@dingtalk.com <lxb@201092104>
2023-02-21 23:00:26 +08:00
085f0826f6 update (#16975)
Co-authored-by: wudi <>
2023-02-21 22:53:49 +08:00
004872c99a [fix](doc) fix invalid urls in tpch.md (#16949) 2023-02-21 15:45:31 +08:00
246dd65435 [fix](doc) fix export-manual.md (#16969) 2023-02-21 15:44:41 +08:00
13ae8cd6c6 [doc](point query) add row cache doc for hight-concurrent-point-query (#16972)
This code in VCollectIterator::build_heap can cause a double free if cumu_iter->init() fails and returns early, because some LevelIterator* exist in both VCollectIterator::_children and cumu_iter::_children.
2023-02-21 14:18:37 +08:00
491d269412 [fix](tvf) fix bug that failed to get schema of tvf when file is empty (#16928)
In previous implementation, when querying tvf, FE will get schema from BE.
And BE will try to open the first file to get its schema info, but for orc or parquet format,
if the file is empty, it will return error.
But even for an empty file, we can still get schema info from file's footer.
So we should handle the empty file to get schema info correctly.

Also modify the catalog doc to add some FAQ.
2023-02-21 14:14:32 +08:00
c0bb2e33a8 [improvement](scan) separate scanner into local and remote scanner pool (#16891)
There are two kinds of scanner thread pool, local and remote.
The local pool is for local file reads, specifically for the olap scanner.
The remote pool is for other external data sources, such as the file scanner and the jdbc scanner.

This PR mainly changes:

- For the olap scanner, use whether the rowset is cold or hot to decide between the local and the remote pool.
- For other scanners, use the remote pool by default.
- Add a new BE config, doris_max_remote_scanner_thread_pool_thread_num (default 512), which sets
  the maximum number of threads in the remote scanner thread pool.

This alleviates the interference between olap queries with load jobs and external queries.
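
A minimal Python sketch of the pool selection rule above. It assumes that a cold rowset maps to the remote pool and a hot rowset to the local pool, which the commit message implies but does not state; `Scanner` and `choose_pool` are hypothetical names.

```python
from dataclasses import dataclass

# Default value stated in the commit message for the new BE config.
DORIS_MAX_REMOTE_SCANNER_THREAD_POOL_THREAD_NUM = 512


@dataclass
class Scanner:
    kind: str                     # e.g. "olap", "file", "jdbc"
    rowset_is_cold: bool = False  # only meaningful for olap scanners


def choose_pool(scanner: Scanner) -> str:
    """Pick the scanner thread pool for one scanner.

    Assumption: cold rowsets are read from remote storage, so they use the
    remote pool; hot rowsets are read from local disk and use the local pool.
    """
    if scanner.kind == "olap" and not scanner.rowset_is_cold:
        return "local"
    return "remote"
```

Under this rule, `choose_pool(Scanner("olap"))` gives `"local"`, while a cold olap rowset, a file scanner and a jdbc scanner all land in the remote pool.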
2023-02-21 14:13:09 +08:00
113023fb86 (Enhancement)[load-json] support simdjson in new json reader (#16903)
be config:
enable_simdjson_reader=true

related PR #11665
2023-02-21 11:31:00 +08:00
c618e69f59 [typo](docs)supplement the document content for grouping_id.md. (#16926)
* [typo](docs)supplement the document content for grouping_id.md.

* Update grouping_id.md

* Update grouping_id.md
2023-02-21 08:27:25 +08:00
66e283ac7f [improvement](doc) change some version from dev to 1.2.2 (#16907) 2023-02-20 14:48:12 +08:00
21a9f5102f [doc](typo) Update spark-load-manual.md (#16911) 2023-02-20 13:22:21 +08:00
5291f14aff [vectorized](udf) java udf support array type (#16841) 2023-02-20 10:00:25 +08:00
1ac5b23e40 Update doris-join-optimization.md (#15818)
Fix documentation errors
2023-02-18 22:24:51 +08:00
4bf778c6cd [typo](docs)fix dynamic Table version label (#16895) 2023-02-18 20:39:14 +08:00
2d7d8102c7 [fix](doc) fix mal-format doc #16898
SQL reference docs must be written according to this guide:
https://doris.apache.org/zh-CN/community/how-to-contribute/contribute-doc/#%E5%A6%82%E4%BD%95%E7%BC%96%E5%86%99%E5%91%BD%E4%BB%A4%E5%B8%AE%E5%8A%A9%E6%89%8B%E5%86%8C
2023-02-18 14:30:54 +08:00
070f42c463 [Enhancement](Es): Support config like whether push down to es (#16800)
Add a config option that controls whether LIKE predicates are pushed down to ES, and refactor some code.
A LIKE predicate is transformed into a wildcard query and pushed down to ES, which increases CPU consumption on ES,
so this adds a switch to control it.
2023-02-17 21:56:11 +08:00
d5c393f413 [docs](docs)Fix FE config max_running_txn_num_per_db default value (#16877) 2023-02-17 20:55:52 +08:00
90ae8dcf01 [typo](docs)supplement the document content (#16884)
* [typo](docs)supplement the document content

* Update grouping.md

Add space before and after English letters in CN docs and keep the English case consistent.

* Update grouping.md

Change the Chinese title to English
2023-02-17 20:55:34 +08:00
adc42600b4 [typo](docs)Modify some document label errors (#16866)
* [typo](docs)Modify some document label errors

* fix
2023-02-17 20:55:17 +08:00
ea0e090a77 collect_set function documentation added 1.2 label (#16868) 2023-02-17 19:05:44 +08:00
fe4ef23489 [fix](doc) add essential property for hive catalog on Kerberosied hms (#16781)
The property `hive.metastore.kerberos.principal` is essential when the principal of the HMS you are connecting to is not the
default value: hive-metastore/_HOST@your_realms.
Otherwise, you will get the error: Failure unspecified at GSS-API level (Mechanism level: Checksum failed)
2023-02-17 09:54:29 +08:00
262a2ea10d [doc](point query) modify and refine docs (#16735) 2023-02-16 17:36:32 +08:00
2f3ce39857 fix broken graph in broker-load-manual.md (#16768) 2023-02-16 16:54:55 +08:00
a707bd4e57 [fix](doc)fix release 1.2.2 document (#16809) 2023-02-16 15:01:40 +08:00
0f7ebaf157 [doc](enhancement) fix some style (#16798) 2023-02-16 15:01:16 +08:00
bd3ea233f8 [doc](filecache) add file cache document (#16649) 2023-02-16 10:26:48 +08:00
3b26235e7e [typo](docs)Modified the description of preparing for the upgrade in the upgrade document (#16801) 2023-02-16 08:38:18 +08:00
5c29a9ef1a [community](team) add new PMC members and committers (#16796) 2023-02-15 23:07:16 +08:00
0664186efe fix doc (#16769) 2023-02-15 13:25:31 +08:00
040a715251 [docs](releasenote)1.2.2 release note (#16766) 2023-02-15 08:57:52 +08:00
5e80823c86 [improvement](dynamic-partition) add storage_medium property for dynamic partition (#16648) 2023-02-14 15:14:52 +08:00
fc1544701f [typo](docs) fix fe config description (#16686)
* Adjust
2023-02-14 10:59:24 +08:00