The regression tests may occasionally fail because of heartbeat failures.
So I added 2 new FE configs to relax this limit:
1. `disable_backend_black_list`
Set to true to avoid adding a Backend to the black list even if we fail to send a task to it. Default is false.
2. `max_backend_heartbeat_failure_tolerance_count`
A Backend is marked as dead only if the number of heartbeat failures exceeds this config. Default is 1.
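A minimal sketch of adjusting these at runtime, assuming both configs are mutable via `ADMIN SET FRONTEND CONFIG` (they can otherwise be set in fe.conf); the values are only examples:
```
ADMIN SET FRONTEND CONFIG ("disable_backend_black_list" = "true");
ADMIN SET FRONTEND CONFIG ("max_backend_heartbeat_failure_tolerance_count" = "3");
```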
1. Modify default behavior of `build.sh`
`BUILD_JAVA_UDF` now defaults to ON, so a JVM is needed for both compilation and runtime.
2. Add docker-compose for MySQL 5.7, PostgreSQL 14 and Hive 2
See `docker/thirdparties/docker-compose`.
3. Add some regression test cases for JDBC queries on MySQL, PG and Hive Catalogs
These tests are disabled by default (`false`); if set to `true`, you need to first start the Docker containers for MySQL/PG/Hive.
4. Support `if not exists` and `if exists` for create/drop resource and create/drop encryptkey
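For illustration, a sketch of the new syntax; the resource/key names and properties are hypothetical:
```
CREATE EXTERNAL RESOURCE IF NOT EXISTS "mysql_odbc" PROPERTIES (
    "type" = "odbc_catalog",
    "host" = "127.0.0.1",
    "port" = "3306",
    "user" = "root",
    "password" = ""
);
DROP RESOURCE IF EXISTS "mysql_odbc";

CREATE ENCRYPTKEY IF NOT EXISTS my_key AS "ABCD123456789";
DROP ENCRYPTKEY IF EXISTS my_key;
```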
1. Remove FE config `enable_array_type`
2. Limit the nesting depth of arrays on the FE side.
3. Fix a bug where, when loading arrays from Parquet, the decimal type was treated as bigint
4. Fix loading arrays from CSV (vec engine), handling both null and the literal string "null"
5. Change the CSV array loading behavior: if the array string format in CSV is invalid, it will be converted to null (see the sketch after this list).
6. Remove `check_array_format()`, because its logic is wrong and meaningless
7. Add stream load CSV test cases and more Parquet broker load tests
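A minimal sketch of the CSV array behavior described in item 5; the table, data, and separators are hypothetical and only illustrate the intended handling:
```
CREATE TABLE IF NOT EXISTS array_demo (
    id INT,
    arr ARRAY<INT>
) DUPLICATE KEY(id)
DISTRIBUTED BY HASH(id) BUCKETS 1
PROPERTIES ("replication_num" = "1");

-- For a CSV field mapped to `arr` (tab-separated columns):
--   [1,2,3]   -> loads as the array [1, 2, 3]
--   null      -> loads as NULL
--   [1,2      -> invalid array format, converted to NULL instead of failing the load
```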
1. Refactor the file reader creation in FileFactory, for simplicity.
Previously, FileFactory had too many `create_file_reader` interfaces.
They are now unified into two categories: the interface used by the old BrokerScanNode,
and the interface used by the new FileScanNode.
The creation methods for readers that read from `StreamLoadPipe` are also separated from those for readers that read files.
2. Modify the StreamLoadPlanner on the FE side to support using ExternalFileScanNode
3. For the generic reader, the file reader is now created inside the reader rather than passed in from the outside.
4. Add some test cases for CSV stream load; the behavior is the same as with the old broker scanner.
1. This PR updates the JSON load docs for importing data into array columns.
When we use JSON to import data into an array column, RapidJSON can cause precision problems,
so we update the json-load docs to explain how to avoid these problems.
Issue Number: #7570
Co-authored-by: hucheng01 <hucheng01@baidu.com>
### Motivation
TABLESAMPLE allows you to limit the number of rows from a table in the FROM clause.
It is used for data exploration, quick verification of SQL correctness, and table statistics collection.
### Grammar
```
[TABLET(tids)] TABLESAMPLE(n [ROWS | PERCENT]) [REPEATABLE seek]
```
This limits the number of rows read from the table in the FROM clause:
a number of Tablets is pseudo-randomly selected from the table according to the specified number of rows or percentage,
and the seed given in REPEATABLE makes the same sample be returned again.
In addition, Tablet IDs can also be specified manually.
Note that this can only be used for OLAP tables.
### Example
Q1:
```
SELECT * FROM t1 TABLET(10001,10002) limit 1000;
```
explain:
```
partitions=1/1, tablets=2/12, tabletList=10001,10002
```
Q1 reads only the specified Tablet IDs of t1.
Q2:
```
SELECT * FROM t1 TABLESAMPLE(1000000 ROWS) REPEATABLE 1 limit 1000;
```
explain:
```
partitions=1/1, tablets=3/12, tabletList=10001,10002,10003
```
Q3:
```
SELECT * FROM t1 TABLESAMPLE(1000000 ROWS) REPEATABLE 2 limit 1000;
```
explain:
```
partitions=1/1, tablets=3/12, tabletList=10002,10003,10004
```
Q2 and Q3 pseudo-randomly sample rows from t1 with different REPEATABLE seeds.
Note that the Tablets are actually selected according to the statistics of the table,
and the total number of rows in the selected Tablets may be greater than the requested sample size,
so if you want to return exactly 1000 rows, you still need to add LIMIT.
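The grammar also allows sampling by percentage; as a sketch (not one of the explain examples above):
```
SELECT * FROM t1 TABLESAMPLE(10 PERCENT) limit 1000;
```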
### Design
First, determine how many rows to sample from each partition according to the number of partitions.
Then determine the number of Tablets to select in each partition according to the average number of rows per Tablet.
If seek is not specified, the required number of Tablets is pseudo-randomly selected from each partition.
If seek is specified, Tablets are selected sequentially starting from the seek-th Tablet of the partition.
Finally, any manually specified Tablet IDs are added to the selected Tablets.
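For illustration, suppose `TABLESAMPLE(1000000 ROWS)` hits a table with 2 partitions and the sample is split evenly: each partition is asked for roughly 500000 rows, and if its Tablets hold about 200000 rows each on average, about 3 Tablets are selected per partition. These numbers are only an assumed example.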
#12716 removed the memory limit for a single load task. In this PR I propose to remove the session variable `load_mem_limit` to avoid confusion.
For compatibility, `load_mem_limit` in Thrift is not removed; its value is set equal to `exec_mem_limit` on the FE side.
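A sketch of the intended usage after this change (the value is only an example): load memory is now controlled through `exec_mem_limit`:
```
SET exec_mem_limit = 8589934592;
```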
Add a new restore property `reserve_dynamic_partition_enable`, which lets you
restore a table whose `dynamic_partition_enable` property has the same value
as before the backup. Before this commit, a restored table always came back
with the property `dynamic_partition_enable=false`.
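As a sketch of how the property would be used (repository, snapshot, table names and timestamp are hypothetical):
```
RESTORE SNAPSHOT example_db.snapshot_1
FROM example_repo
ON (example_tbl)
PROPERTIES
(
    "backup_timestamp" = "2022-01-01-12-00-00",
    "reserve_dynamic_partition_enable" = "true"
);
```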