doris

Author	SHA1	Message	Date
EmmyMiao87	b28f4242c3	Add config max_concurrent_task_num_per_be (#1693 ) This config controls the maximum number of concurrent tasks per BE. The cluster-wide maximum number of concurrent tasks = max_concurrent_task_num_per_be * number of BEs.	2019-08-24 00:56:40 +08:00
kangkaisen	c73b3f15a4	Update tablet-repair-and-balance doc (#1692 )	2019-08-22 21:31:56 +08:00
EmmyMiao87	978b1ee1af	Add strict mode in Routine load, Stream load and Mini load (#1677 )	2019-08-20 21:56:45 +08:00
Mingyu Chen	176e185e18	Add broker doc (#1662 ) This broker document introduces the properties for different broker types.	2019-08-20 17:18:54 +08:00
Mingyu Chen	8e6814cfcd	Support setting timeout for stream load (#1670 )	2019-08-20 15:43:03 +08:00
Ethan	ccaf39c48f	Fix spelling mistake (#1676 )	2019-08-20 12:16:55 +08:00
yuanli	ba6d728f26	Enable parsing columns from file path for Broker Load (#1582 ) (#1635 ) Currently, we do not support parsing columns encoded in the file path, e.g. extracting column k1 from the file path /path/to/dir/k1=1/xxx.csv. This patch parses columns from the file path, as Spark does (Partition Discovery). It parses partition columns in BrokerScanNode.java and saves the parsing result of each file path as a property of TBrokerRangeDesc, so that the broker reader of BE can read the value of the specified partition column.	2019-08-19 09:39:21 +08:00
Mingyu Chen	6d73658207	Support checking error data row when doing INSERT (#1597 ) If strict mode is true and at least one row is filtered, the insert operation fails and a url is given to get the error rows. ``` ERROR 1064 (HY000): all partitions have no load data. url: http://host:ip/api/_load_error_log?file=__shard_2/error_log_insert_stmt_e0a620e93dc54461-b89ec64768367d25_e0a620e93dc54461_b89ec64768367d25 ``` If all rows are good, insert returns OK with affected rows: ``` Query OK, 1 row affected (0.26 sec) ``` If strict mode is false and at least one row is good, the insert operation returns OK with affected rows and warnings. If there are error rows, a label is returned: ``` Query OK, 1 row affected, 1 warning (0.32 sec) {'label':'7d66c457-658b-4a3e-bdcf-8beee872ef2c'} ```	2019-08-16 21:40:29 +08:00
Mingyu Chen	82d0afc1ba	FROM_UNIXTIME should only convert timestamps from 0 to 253402271999 (#1658 ), which is between 1970-01-01 00:00:00 ~ 9999-12-31 23:59:59; otherwise, it returns null	2019-08-16 18:29:57 +08:00
manannan2017	0e6560ceca	Fix document typo (#1657 )	2019-08-16 14:52:32 +08:00
wkhappy1	1ed25ad83d	Add kafka_default_offsets for when no partition is specified. Support reading Kafka partitions from the start (#1642)	2019-08-16 13:30:26 +08:00
HangyuanLiu	a551abba58	Modify timediff documents (#1600 )	2019-08-15 12:45:53 +08:00
HangyuanLiu	199ff968dc	Fix time zone compatibility (#1631 )	2019-08-13 18:44:35 +08:00
kangpinghuang	1e2a4c3b9b	Fix tablet restore api in BE(#1623 ) (#1624 )	2019-08-13 09:34:24 +08:00
HangyuanLiu	69af50aa8c	Time zone related BE function (#1598 ) Details can be found in time-zone.md document	2019-08-12 20:57:59 +08:00
EmmyMiao87	add6266c71	Broker load supports function (#1592 ) This commit supports column functions in broker load. The grammar of LoadStmt has not been changed. Example: columns terminated by ',' (tmp_c1, tmp_c2) set (c1=tmp_c1+tmp_c2) The old functions, such as default_value and strftime, remain compatible. After this commit, there is no difference in column functions between stream load and broker load, except for the old functions.	2019-08-09 13:27:31 +08:00
lichaoyong	326d765c64	Add doc of modify replication num upon partition (#1611 )	2019-08-08 16:47:32 +08:00
Mingyu Chen	fd2accbcf9	Modify some docs' format to make it work with document website (#1604 )	2019-08-08 14:47:38 +08:00
xy720	4c2a3d6da4	Merge Help document to documentation (#1586 ) Help document collation (integration of help and documentation documents)	2019-08-07 21:31:53 +08:00
Mingyu Chen	93a3577baa	Support multi partition column when creating table (#1574 ) When creating a table with the OLAP engine, users can specify multiple partition columns, e.g.: PARTITION BY RANGE(`date`, `id`) ( PARTITION `p201701_1000` VALUES LESS THAN ("2017-02-01", "1000"), PARTITION `p201702_2000` VALUES LESS THAN ("2017-03-01", "2000"), PARTITION `p201703_all` VALUES LESS THAN ("2017-04-01") ) Notice that load by hadoop cluster does not support tables with multiple partition columns.	2019-08-05 16:16:43 +08:00
Mingyu Chen	cefe1794d4	Fix bug that replicas of a tablet may be located on same host (#1517 ) Doris supports deploying multiple BEs on one host, so when allocating BEs for the replicas of a tablet, we should select different hosts. But there is a bug in the tablet scheduler where the same host may be selected for one tablet. This patch fixes the problem. Several places are related to this problem: 1. Create Table: There is no bug in the Create Table process. 2. Tablet Scheduler: Fixed when selecting BE for REPLICA_MISSING and REPLICA_RELOCATING. Fixed when balancing the tablet. 3. Colocate Table Balancer: Fixed when selecting BE for repairing colocate backend sequence. Not fixed in colocate group balance; this is left to colocate repairing. 4. Tablet report: Tablet report may add a replica to the catalog. The host is not checked here; Tablet Scheduler will fix it.	2019-08-01 10:26:06 +08:00
Mingyu Chen	99836f0d7c	Modify load docs (#1558 ) Make it work with documentation website	2019-07-29 15:48:59 +08:00
EmmyMiao87	000e9cf53c	Add administrator guide of load (#1488 ) The catalogue of load docs: ---- load-manual.md ---- broker-load-manual.md ---- insert-into-manual.md ---- stream-load-manual.md This commit also changes max/min_stream_load_timeout to max/min_load_timeout. The old config named stream_load_timeout means the max timeout suited for all types of load. So the config name has been changed.	2019-07-25 21:02:32 +08:00
EmmyMiao87	473d69e8f8	Fix the mistake in docs of rollup (#1551 )	2019-07-25 20:53:41 +08:00
HangyuanLiu	4aedaea84e	Support TIME type and timediff function (#1505 )	2019-07-23 13:42:39 +08:00
Mingyu Chen	6e1ccbc542	Fix index.rst file for aggregation-function SQL reference docs (#1518 )	2019-07-19 18:16:50 +08:00
Mingyu Chen	2551248a52	Support grant GRANT_PRIV on database or table level (#1472 ) Currently, GRANT_PRIV can only be granted on global level, which means it can only be granted on *.*. Granting it on db.* or db.tbl is not allowed. This cannot meet the requirement to create a user who has the privilege to grant privileges to other users on a specified database or table, such as: GRANT SELECT_PRIV ON db1.* TO cmy@'%'; So I extend the range of GRANT_PRIV. Users can now grant GRANT_PRIV on database or even table level, such as: GRANT GRANT_PRIV ON db1.* TO cmy@'%'; After being granted, the user cmy@'%' can grant GRANT_PRIV on db1.* to other users.	2019-07-16 19:25:18 +08:00
HangyuanLiu	2fd2b714c1	Add aggregate function doc (#1434 )	2019-07-11 16:45:45 +08:00
HangyuanLiu	941dec215b	Add utc_timestamp function (#1456 )	2019-07-11 11:09:08 +08:00
Candy	98bd4b4565	Add string function split_part (#1451 )	2019-07-10 09:47:33 +08:00
Youngwb	4989f7bfe3	Fix spelling mistake in docs (#1435 )	2019-07-07 11:55:51 +08:00
Mingyu Chen	8db97998ba	Collect all documents to Doris code base (#1414 )	2019-07-01 09:23:13 +08:00
Mingyu Chen	756a680143	Add a website builder of Doris documentations (#1396 ) The build script is located in docs/website. Built with Sphinx using a theme provided by Read the Docs.	2019-06-26 19:10:39 +08:00
Mingyu Chen	566e122c0d	Optimize Export feature (#1378 ) 1. Add 'timeout' properties in Export stmt. 2. Add more info in 'show export' stmt. 3. Add more logs for debugging.	2019-06-26 00:20:53 +08:00
Mingyu Chen	e807064a88	Modify colocation creation logic (#1289 )	2019-06-25 21:20:18 +08:00
HangyuanLiu	51b2c1d5b2	Add some function doc (#1377 )	2019-06-25 21:02:42 +08:00
EmmyMiao87	322de9cd8e	Add sql-function doc of cast_to_bigint (#1370 )	2019-06-24 19:40:57 +08:00
Mingyu Chen	5c2cf9f2ce	Handle the situation when there are not enough backends for tablet repair (#1299 ) Suppose there are only 3 backends and the replication num is 3. If one replica of a tablet is bad, there is no 4th backend for tablet repair, so we need to delete a bad replica first to make room for the new replica.	2019-06-14 20:28:29 +08:00
EmmyMiao87	5dea4fb414	Add description of strict mode in decimal type (#1288 )	2019-06-12 16:03:57 +08:00
kangpinghuang	9d7f99a669	Add new file format design markdown (#1267 )	2019-06-11 09:34:06 +08:00
EmmyMiao87	53062122ea	Change strategy of incorrect data (#1255 ) This change adds a load property named strict_mode, which is used to reject incorrect data. When it is set to false, incorrect data is loaded as NULL, just like before. When it is set to true, incorrect data which belongs to a column without expr is filtered. The strict_mode is supported in broker load v2 now. It will be supported in stream load later.	2019-06-10 20:39:45 +08:00
Mingyu Chen	ff0dd0d2da	Support SSL authentication with Kafka in routine load job (#1235 )	2019-06-07 16:29:01 +08:00
kevin	cb91e15f1e	Modify UDF docs (#1260 )	2019-06-06 15:47:10 +08:00
ZHAO Chun	7cdaba66dc	Add spatial func (#1213 ) Support some spatial functions, such as ST_Contains.	2019-05-31 14:23:09 +08:00
HangyuanLiu	5ca2805701	Add some date time function doc (#1206 )	2019-05-27 17:36:09 +08:00
EmmyMiao87	85b4619d54	Change insert into to streaming (#1191 ) The non-streaming hint of insert into will use the streaming plan, which is the same as the plan of stream insert. It will also record the load info and return the label of the insert stmt. Partitions are supported in the insert into stmt. The results which meet the target partitions will be loaded. The introduction of the examples has been changed, especially for non-streaming insert. Also, the param partition_names is added in the sql syntax, which is used to declare the target partition_names in the target table. Change META_VERSION to 50	2019-05-23 20:53:30 +08:00
HangyuanLiu	cde315c9e9	Add date-function doc (#1190 )	2019-05-23 15:29:08 +08:00
Mingyu Chen	722a9e71c7	Optimize json functions (#1177 ) 1. get_json_xxx() now supports using quotes to escape dots 2. Implement json_path_prepare() function to preprocess json_path. Run time of get_json_string() on 1000000 rows drops from 2.27s to 0.27s	2019-05-21 09:13:12 +08:00
Yunfeng,Wu	76a8093c70	Add documentation for doris on es (#1151 )	2019-05-13 21:58:05 +08:00
ZHAO Chun	debb58c278	Add SHOW FUNCTION and update docs for UDF (#1140 )	2019-05-11 21:46:37 +08:00
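
Note on ba6d728f26 (parsing columns from file path): the behaviour described in that commit message can be sketched roughly as follows. This is only an illustrative Python sketch, not the Doris implementation (which, per the message, lives in BrokerScanNode.java). The function name parse_columns_from_path and its error handling are hypothetical; only the example path /path/to/dir/k1=1/xxx.csv and column k1 come from the commit message.

```python
from typing import Dict, List


def parse_columns_from_path(file_path: str, columns: List[str]) -> Dict[str, str]:
    """Extract `key=value` directory segments of file_path for the requested columns.

    Hypothetical helper for illustration; the name and the choice to raise on
    a missing column are assumptions, not Doris behaviour.
    """
    segments = file_path.strip("/").split("/")
    found: Dict[str, str] = {}
    # The last segment is the file name, so only directory segments are scanned.
    for segment in segments[:-1]:
        key, sep, value = segment.partition("=")
        if sep and key in columns:
            found[key] = value
    missing = [c for c in columns if c not in found]
    if missing:
        raise ValueError(f"columns not found in path {file_path!r}: {missing}")
    return found


if __name__ == "__main__":
    # Example path taken from the commit message.
    print(parse_columns_from_path("/path/to/dir/k1=1/xxx.csv", ["k1"]))
```

Each file path yields one mapping of column name to string value, which matches the idea of storing the parse result per file so that a reader can fill in the partition column for every row of that file.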

63 Commits