Support HDFS in the `SELECT INTO OUTFILE` clause without a broker.
This PR implements an HDFS writer in BE which is used to write HDFS files directly without using a broker.
The HDFS outfile clause syntax check has also been added in FE.
The syntax:
```
select * from xx into outfile "hdfs://user/outfile_" format as csv
properties ("hdfs.fs.dafultFS" = "xxx", "hdfs.hdfs_user" = "xxx");
```
Note that all HDFS configurations need to carry the prefix `hdfs.`.
1. Fix a memory leak in `collect_iterator.cpp` (Fix #6700)
2. Add a new BE config `max_segment_num_per_rowset` to limit the number of segments in a newly created rowset. (Fix #6701)
3. Make the error message of stream load more friendly.
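A minimal `be.conf` sketch for the new config; the value shown is only an example, not the documented default:
```
# be.conf: cap the number of segments a single newly written rowset may contain.
max_segment_num_per_rowset = 200
```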
This PR mainly supports:
1. Exporting query result sets concurrently
2. Exporting query result sets via the S3 protocol
There are several preconditions for exporting query result sets concurrently:
1. The concurrent export session variable is enabled
2. The query itself can be exported concurrently
(some queries containing sort nodes at the top level cannot be exported concurrently)
3. The export uses the S3 protocol instead of a broker
When the result set is exported concurrently,
the file name prefix becomes outfile_{query_instance_id}_filenumber.{file_format}
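A hedged usage sketch; the session variable name `enable_parallel_outfile` and the AWS_* property names are assumptions from typical Doris usage, not confirmed by this PR:
```sql
-- Enable concurrent result-set export for this session (assumed variable name).
SET enable_parallel_outfile = true;

-- Export via the S3 protocol instead of a broker; property names are illustrative.
SELECT * FROM tbl
INTO OUTFILE "s3://bucket/outfile_"
FORMAT AS CSV
PROPERTIES (
    "AWS_ENDPOINT" = "http://s3.example.com",
    "AWS_ACCESS_KEY" = "xxx",
    "AWS_SECRET_KEY" = "xxx",
    "AWS_REGION" = "region"
);
```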
The original stream load column order transformation was unclear; a user struggled with this part for a long time, so I revised some expressions to make it clearer.
For issue #6474
```sql
create table test.table1 like test.table with rollup r1,r2 -- copy the specified rollups
create table test.table1 like test.table with rollup all   -- copy all rollups
create table test.table1 like test.table                   -- copy only the base table
```
Fix #6447
1. The FE master regularly triggers the remove operation.
2. After the master completes the removal of DeleteInfo, the removal is synchronized to the Followers through the editlog.
3. A DeleteInfo is cleaned up when the time since its creation exceeds the threshold determined by the `delete_info_keep_max_second` configuration.
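A hedged example of adjusting the threshold at runtime, assuming `delete_info_keep_max_second` is a mutable FE config (otherwise it must be set in `fe.conf`):
```sql
-- Keep delete info for one day; the value is in seconds.
ADMIN SET FRONTEND CONFIG ("delete_info_keep_max_second" = "86400");
```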
Implement the lower_case_table_names variable of MySQL. The values mean the following:
0: table names are stored as given and compared case-sensitively.
1: table names are stored in lowercase and compared case-insensitively.
2: table names are stored as given but compared case-insensitively.
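A small behavioral sketch, assuming the variable is set to 1 (names stored in lowercase, matched case-insensitively); the table and database names are hypothetical:
```sql
-- With lower_case_table_names = 1 the name is stored as "mytable",
-- so both statements below resolve to the same table.
CREATE TABLE db1.MyTable (k1 INT) DISTRIBUTED BY HASH(k1) BUCKETS 1
PROPERTIES ("replication_num" = "1");
SELECT * FROM db1.mytable;
```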
Encapsulate some HTTP interfaces for better management and maintenance of Doris clusters.
The HTTP interfaces include getting cluster connection information, node information, and node configuration information, batch-modifying node configurations, and getting query profiles.
For details, please refer to the document:
`docs/zh-CN/administrator-guide/http-actions/fe/manager/`
- Make load_parallelism configurable.
- Different clusters should be configured with different load_parallelism values.
- Some users don't know how to set load_parallelism, or don't know the best value for it.
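A hedged sketch of overriding the value for a single job; treating `load_parallelism` as a broker load property is an assumption about how this PR exposes it, and the paths and credentials are illustrative:
```sql
LOAD LABEL example_db.label1
(
    DATA INFILE("hdfs://host:port/path/to/file")
    INTO TABLE tbl1
)
WITH BROKER "broker_name"
("username" = "xxx", "password" = "xxx")
PROPERTIES ("load_parallelism" = "4");
```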
Add `SHOW DATA SKEW FROM tbl PARTITION(p1)`
to view the data distribution of a specified partition:
```
mysql> admin show data skew from tbl1 partition(tbl1);
+-----------+-------------+-------+---------+
| BucketIdx | AvgDataSize | Graph | Percent |
+-----------+-------------+-------+---------+
| 0         | 0           |       | 100.00% |
+-----------+-------------+-------+---------+
1 row in set (0.01 sec)
```
Also modify the result of `admin show replica distribution`, adding the replica size distribution:
```
mysql> admin show replica distribution from tbl1 partition(tbl1);
+-----------+------------+-------------+----------+------------+-----------+-------------+
| BackendId | ReplicaNum | ReplicaSize | NumGraph | NumPercent | SizeGraph | SizePercent |
+-----------+------------+-------------+----------+------------+-----------+-------------+
| 10002     | 1          | 0           | >        | 100.00%    |           | 100.00%     |
+-----------+------------+-------------+----------+------------+-----------+-------------+
```
Support modifying the kafka broker list and topic of a routine load job with `ALTER ROUTINE LOAD`:
```
alter routine load for cmy2 from kafka("kafka_broker_list" = "ip2:9094", "kafka_topic" = "my_topic");
```
This is useful when the kafka broker list or topic has changed.
Also modify `show create routine load` to support showing "kafka_partitions" and "kafka_offsets".
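A hedged usage flow; the need to pause the job before altering it is an assumption based on how routine load jobs are typically modified, not stated by this PR:
```sql
PAUSE ROUTINE LOAD FOR cmy2;
ALTER ROUTINE LOAD FOR cmy2
FROM kafka("kafka_broker_list" = "ip2:9094", "kafka_topic" = "my_topic");
RESUME ROUTINE LOAD FOR cmy2;
-- Verify the new source settings, including kafka_partitions and kafka_offsets.
SHOW CREATE ROUTINE LOAD FOR cmy2;
```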
[Update] Support update syntax
The current update syntax only supports updating the filtered data of a single table.
Syntax:
```
UPDATE table_reference
    SET assignment_list
    [WHERE where_condition]

value:
    {expr}

assignment:
    col_name = value

assignment_list:
    assignment [, assignment] ...
```
Example:
```
UPDATE unique_table SET v1 = 1 WHERE k1 = 1;
```
New Frontend Config: enable_concurrent_update
This configuration controls whether multiple update statements can be executed concurrently on one table.
The default value is false, which means a table can only have one update task executing at a time.
Users who want to update the same table concurrently
need to set this value to true and restart the master frontend.
Concurrent updates may cause write conflicts with uncertain results, so please be careful.
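A minimal `fe.conf` sketch; as noted above, it takes effect only after restarting the master FE:
```
# fe.conf: allow multiple update statements to run concurrently on one table.
# Beware: concurrent updates may conflict and produce uncertain results.
enable_concurrent_update = true
```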
The main implementation principle:
1. Read the rows that meet the conditions set by the where clause.
2. Modify those rows according to the set clause.
3. Write the modified rows back to the table.
Restrictions on the use of the update syntax:
1. Only unique tables can be updated.
2. Only the value columns of a unique table can be updated.
3. The where clause currently only supports a single table.
Possible risks:
1. Since the current implementation updates row by row,
concurrent updates to the same table may conflict and produce incorrect results.
2. If the conditions of the where clause are unsuitable, the update is likely to cause a full table scan and affect query performance.
Please pay attention to whether the columns in the where clause can match an index when using it.
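A small illustrative example (table and column names are hypothetical): filtering on a key column lets the scan prune data instead of reading the whole table.
```sql
-- k1 is a key column of the unique table, so the predicate can use the sort key;
-- filtering only on value columns would force a full table scan.
UPDATE unique_table SET v1 = 2 WHERE k1 = 10;
```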
[Docs][Update] Add update document and sql-reference
Fixed #6229