doris

Author	SHA1	Message	Date
yuanfeng0905	ebb6506924	Fix doc (#2449 )	2019-12-12 20:56:25 +08:00
yangzhg	bf31bd238b	Change default storage model from aggregate to duplicate(#2318 ) (#2412 ) change default storage model from aggregate to duplicate for sql `create table t (k1 int) DISTRIBUTED BY HASH(k1) BUCKETS 10 PROPERTIES("replication_num" = "1");` before: ``` CREATE TABLE `t` ( `k1` int(11) NULL COMMENT "" ) ENGINE=OLAP AGGREGATE KEY(`k1`) COMMENT "OLAP" DISTRIBUTED BY HASH(`k1`) BUCKETS 10 PROPERTIES ( "storage_type" = "COLUMN" ); ``` after: ``` CREATE TABLE `t` ( `k1` int(11) NULL COMMENT "" ) ENGINE=OLAP DUPLICATE KEY(`k1`) COMMENT "OLAP" DISTRIBUTED BY HASH(`k1`) BUCKETS 10 PROPERTIES ( "storage_type" = "COLUMN" ); ``` #2318	2019-12-12 14:30:30 +08:00
kangpinghuang	c07f37d78c	[Segment V2] Add a control framework between FE and BE through heartbeat #2247 (#2364 ) The control framework is implemented through heartbeat message. Use uint64_t as flags to control different functions. Now add a flag to set the default rowset type to beta.	2019-12-12 12:18:32 +08:00
WingC	5951a0eaea	Add more schema change docs (#2411 ) Add explanation about converting: DATE -> DATETIME DATETIME -> DATE INT->DATE	2019-12-10 16:46:41 +08:00
Mingyu Chen	a46bf1ada3	[Authorization] Modify the authorization checking logic (#2372) Authorization checking logic: There are some problems with the current password and permission checking logic. For example: First, we create a user by: `create user cmy@"%" identified by "12345";` Then 'cmy' can log in with password '12345' from any host. Second, we create another user by: `create user cmy@"192.168.%" identified by "abcde";` "192.168.%" has a higher priority in the permission table than "%", so when "cmy" tries to log in with password "12345" from host "192.168.1.1", the login should match the second permission entry and be rejected because of the invalid password. But in the current implementation, Doris continues to check the password against the first entry and then lets it pass. So we should change it. Permission checking logic: After a user logs in, the user should have a unique identity that is obtained from the permission table. For example, when "cmy" logs in from host "192.168.1.1", its identity should be `cmy@"192.168.%"`. Doris should use this identity to check other permissions, rather than the user's real identity, which is `cmy@"192.168.1.1"`. Black list: Functionally speaking, Doris only supports adding a WHITE LIST, which allows a user to log in from the hosts in the white list. But in some cases, we do need a BLACK LIST function. Fortunately, by changing the logic described above, we can simulate the effect of a BLACK LIST. For example, first we add a user by: `create user cmy@'%' identified by '12345';` Now user 'cmy' can log in from any host. If we don't want 'cmy' to log in from host A, we can add a new user by: `create user cmy@'A' identified by 'other_passwd';` Because "A" has a higher priority in the permission table than "%", if 'cmy' tries to log in from A using password '12345', it will be rejected.	2019-12-06 17:45:56 +08:00
HaiBo Li	9fbc1c7ee6	Support WHERE/ORDER BY/LIMIT after the "SHOW ALTER TABLE COLUMN" syntax (#2380) Features: 1. Support WHERE/ORDER BY/LIMIT. 2. Columns: TableName, CreateTime, FinishTime, State. 3. Only "AND" is allowed between conditions. 4. The TableName and State columns only support the "=" operator. 5. The CreateTime and FinishTime columns support the "=", ">=", "<=", ">", "<", and "!=" operators. 6. The CreateTime and FinishTime columns support Date and DateTime strings, e.g. "2019-12-04" or "2019-12-04 17:18:00". Test case: MySQL [haibotest]> show alter table column where State='FINISHED' and CreateTime > '2019-12-03' order by FinishTime desc limit 0,2; +-------+---------------+---------------------+---------------------+---------------+---------+---------------+---------------+---------------+----------+------+----------+---------+ | JobId | TableName | CreateTime | FinishTime | IndexName | IndexId | OriginIndexId | SchemaVersion | TransactionId | State | Msg | Progress | Timeout | +-------+---------------+---------------------+---------------------+---------------+---------+---------------+---------------+---------------+----------+------+----------+---------+ | 11134 | test_schema_2 | 2019-12-03 19:21:42 | 2019-12-03 19:22:11 | test_schema_2 | 11135 | 11059 | 1:192010000 | 3 | FINISHED | | N/A | 86400 | | 11096 | test_schema_3 | 2019-12-03 19:21:31 | 2019-12-03 19:21:51 | test_schema_3 | 11097 | 11018 | 1:2063361382 | 2 | FINISHED | | N/A | 86400 | +-------+---------------+---------------------+---------------------+---------------+---------+---------------+---------------+---------------+----------+------+----------+---------+ 2 rows in set (0.00 sec)	2019-12-06 16:24:44 +08:00
EmmyMiao87	d937f7b51e	Fix the error of stream load doc (#2340 )	2019-11-29 16:33:08 +08:00
caiconghui	8bf00afa25	Create table with nullable column for default (#2256 ) Change the default column null property to nullable	2019-11-29 11:11:31 +08:00
HangyuanLiu	e7b05f7eb3	Date format support java date style "yyyy-MM-dd HH:mm:ss" (#2309 )	2019-11-28 14:34:31 +08:00
jiangshouzhuang	c33789ee49	Update insert-into-manual_EN.md (#2323 ) Modify the description error for the enable_insert_strict parameter	2019-11-28 14:00:33 +08:00
jiangshouzhuang	02d7c486e1	Update insert-into-manual.md (#2322 ) Modify the description error for the enable_insert_strict parameter	2019-11-28 13:59:54 +08:00
Mingyu Chen	a2d7c42042	Add a variable to specifically limit the memory usage of the load part in the insert operation (#2305 ) This variable is mainly for INSERT operation, because INSERT operation has both query and load part. Using only the exec_mem_limit variable does not make a good distinction of memory limit between the two parts.	2019-11-28 13:03:11 +08:00
Mingyu Chen	d5aeb9a6b7	Add document for session variables. (#2284 ) Also make the variable effective in current session when setting it globally.	2019-11-24 22:47:05 +08:00
Mingyu Chen	46181c0880	Fix some bugs about load label (#2241 )	2019-11-23 00:04:45 +08:00
xy720	79ff0ad2a4	Add pipes_as_concat_mode (#2252) This commit adds a new SQL mode named MODE_PIPES_AS_CONCAT. Description: 1. If this mode is active, '||' is no longer handled in the original way (by default, '||' and 'or' are treated as the same symbol in Doris); instead, it concatenates two expressions and returns a new string. For example, 'a' || 'b' = 'ab' and 1 || 0 = '10'. 2. Users can activate this mode with "SET sql_mode = PIPES_AS_CONCAT" and deactivate it with "SET sql_mode = ''".	2019-11-22 15:01:53 +08:00
kangkaisen	d8cfbbedf7	Support bitmap_empty function (#2227 )	2019-11-18 20:37:00 +08:00
Mingyu Chen	84c1fa88b8	Add node dead num metrics for all types of node (#2191 ) Following metrics will show the number of nodes which are down. frontend_down_num backend_down_num broker_down_num	2019-11-13 23:25:51 +08:00
Mingyu Chen	9eaba67606	Limit the FE log file number (#2163 ) 1. upgrade log4j to 2.12.1 2. Add 2 new FE config: 'sys_log_delete_age' and default is '7d', for sys log. 'audit_log_delete_age' and default is '30d', for audit log. it means if a log's last modification time is 7/30 days ago, it will be deleted.	2019-11-11 09:12:57 +08:00
xy720	6759e83a07	Add license header for md files and fix some translation's error (#2137 )	2019-11-06 21:35:07 +08:00
ZHAO Chun	65c3b0907a	Support aggregation type of REPLACE_IF_NOT_NULL (#2127) Some users have the requirement that only some of the columns are updated in one load operation, while the others retain their original values. However, Doris can't handle this situation, because the user must specify a value for every column. If a column's aggregation method is REPLACE, the user must query the original value in order to overwrite it, which often means extra work for the user. If this CL is applied, the user can use REPLACE_IF_NOT_NULL instead of REPLACE. Then, when loading data into the table, a user who doesn't intend to change the value of this column can specify NULL for it, and Doris will retain the column's original value.	2019-11-05 18:08:34 +08:00
xy720	ac5dd0c9f2	Support sql mode (#2083) At present, we do not support an SQL MODE similar to MySQL's. In MySQL, SQL MODE is stored in the global session and the session with a 64-bit address, and every bit (0 or 1) at this address represents a mode state. Besides, MySQL supports combined modes, each composed of several modes. We should support SQL MODE to deal with SQL dialect problems. We can heuristically use the MySQL way to store SQL MODE in the session and parse it into a string when we need to return it to the client. This commit suggests a solution to support SQL MODE. But it's just a sample, and the mode types in SqlModeHelper.java are not really meaningful for now.	2019-11-01 23:21:00 +08:00
shengyunyao	713e04624f	Modify the lower bound of percentile_approx compression param to 2048 (#2111 )	2019-11-01 13:07:39 +08:00
Mingyu Chen	45df6aae08	Fix some routine load bugs (#2093) Mainly fix the following issues: 1. A null pointer exception is raised when a database or table is dropped. The expected behavior is that the routine load job is stopped. 2. Memory leaks. Batch routine load task submissions are no longer performed, and modifications are submitted separately for each task. 3. Unreasonable task timeout. Routine load tasks should not be queued in the BE thread pool for execution. A task sent to the BE should be executed immediately; otherwise the task times out in the FE first, which eventually leads to constant timeouts for all subsequent tasks. 4. Every routine load job should be scheduled as soon as it is submitted, without waiting for an available BE slot. Otherwise, later submitted jobs may never be scheduled.	2019-10-31 21:53:03 +08:00
kangkaisen	95a3b4ccfe	Add object type (#1948) Add a new type: Object. Currently, it's mainly for complex aggregate metrics (HLL, Bitmap). The Object type has the following constraints: 1. The Object type cannot be used as a key column type. 2. The Object type doesn't support any of the indices (BloomFilter, short key, zone map, invert index). 3. The Object type doesn't support filter and group by. In the implementation: The Object type reuses StringValue and StringVal, because in the storage engine the Object type is binary: it has a pointer and a length.	2019-10-31 21:42:58 +08:00
yangzhg	03d384ac51	Add .rat_excludes file, and modify related documents (#2031 ) (#2105 )	2019-10-31 10:34:22 +08:00
Seaven	5287bc2231	Replace DISCLAIMER with DISCLAIMER-WIP (#2100 )	2019-10-30 19:06:21 +08:00
zhouhaibing089	8d2cc71934	Format markdown of docker section (#2098) [DOC] This change corrects the format so that it's easier to view.	2019-10-30 16:52:45 +08:00
EmmyMiao87	ebdcfc21df	Multi distinct + no group by + big data is stuck (#2079) ISSUE-2069: This kind of query could get stuck. The sender fails to send the last packet to the receiver. The failure is also not reported to the FE, so the query is not cancelled. The error log looks like "body_size=xxxx from xxx:xxx is too large". The reason is that the packet of the query is too big: it is larger than the max_body_size of brpc. This commit adds a config named brpc_max_body_size, which is used to change the max_body_size of brpc. Users can also change the max_body_size directly on the fly via "http://host:brpc_port/flags".	2019-10-28 18:51:05 +08:00
kangkaisen	1859819aa7	Update doc for FE metadata recover (#2073 )	2019-10-25 22:27:41 +08:00
ZHAO Chun	06fe8579d2	Update release process documents (#2008 )	2019-10-23 16:20:46 +08:00
ZHAO Chun	109eb79f19	Add help doc for debug tool (#2019 )	2019-10-20 22:58:03 +08:00
EmmyMiao87	d2bc47d2cc	Add introduction of label_keep_max_second (#1993 ) [Docs]	2019-10-16 16:05:13 +08:00
Mingyu Chen	41e55cfca9	Modify fixed partition feature (#1989) 1. MAXVALUE is not supported in multi-column partitions. 2. Fix the incorrect show create table stmt.	2019-10-16 16:03:46 +08:00
xy720	63fa260d3f	Support prepare/close in UDF (#1985) The prepare/close step of scalar functions is already supported in the execution framework; we only need to support it in the syntax and meta in the frontend. In addition, the 'Hive' binary type of scalar function does NOT support the prepare/close step, so we need to make it support it.	2019-10-16 07:19:20 +08:00
worker24h	ec7c8a2c6f	Support adding fixed range partition eg: ALTER TABLE test_table ADD PARTITION p0125 VALUES [("20190125"), ("20190126"));	2019-10-15 09:50:30 +08:00
Mingyu Chen	62acf5d098	Limit the memory usage of Loading process (#1954 )	2019-10-15 09:26:20 +08:00
EmmyMiao87	b84ef013eb	Fix the mistake for HLL in mini load (#1981 ) [Docs] Fix mistakes for HLL column in mini load	2019-10-14 19:46:23 +08:00
Mingyu Chen	ccc236484b	Fix bug that failed to add KEY column to DUPLICATE KEY table (#1973 )	2019-10-14 16:40:34 +08:00
zhongyun2019	a323a190a2	Update monitor-alert.md (#1975 )	2019-10-14 12:22:51 +08:00
shengyunyao	4a17152f40	Add tdigest compression param for percentile_approx function (#1939)	2019-10-11 18:56:59 +08:00
ZHAO Chun	024348d74b	Enable auto convert when checking in (#1926) Leverage gitattributes to automatically convert end-of-line to LF when checking in. Convert existing CRLF to LF by removing all files and checking them out with the new .gitattributes file. Except for .gitattributes, all files are modified only at the end of line.	2019-10-09 22:31:27 +08:00
HangyuanLiu	ec3aa03c45	Add more routine load example (#1902 )	2019-09-27 20:42:52 +08:00
yangzhg	2ea7de8b5e	Update some docs (#1882 )	2019-09-26 14:43:55 +08:00
HangyuanLiu	40b9c3571b	Support hll_empty function (#1825 )	2019-09-25 09:28:02 +08:00
Mingyu Chen	e8da855cd2	Support setting timezone for stream load and routine load (#1831 )	2019-09-20 07:55:05 +08:00
lichaoyong	d1676c3c3d	Check file descriptor number is larger than 65536 upon start (#1819 )	2019-09-19 12:48:36 +08:00
Mingyu Chen	e70e48c01e	Add an ALTER operation to change distribution type from RANDOM to HASH (#1823) Random distribution is no longer supported since version 0.9, so we need a way to convert random distribution to hash distribution: ALTER TABLE db.tbl SET ("distribution_type" = "hash");	2019-09-18 14:16:26 +08:00
Mingyu Chen	714dca8699	Support table comment and column comment for view (#1799 )	2019-09-18 09:45:28 +08:00
EmmyMiao87	054a3f48bc	Add where expr in broker load (#1812 ) The where predicate in broker load is responsible for filtering transformed data. The docs of help and operator has been changed.	2019-09-17 11:32:40 +08:00
WingC	973eff26cd	Fix tablet meta tool command argument bug (#1810 )	2019-09-16 17:40:23 +08:00
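
The host-priority matching described in the authorization change (#2372) can be sketched roughly as follows. This is only an illustration, not Doris's implementation: the `Entry` type, the `specificity` ordering rule (more literal characters in the host pattern ranks higher), and the `login` function are all hypothetical names and assumptions made for this example.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase
from typing import List, Optional


@dataclass(frozen=True)
class Entry:
    """One hypothetical row of the permission table."""
    user: str
    host_pattern: str  # '%' is the wildcard, as in the commit message
    password: str


def host_matches(pattern: str, host: str) -> bool:
    # Translate the SQL-style '%' wildcard into the '*' understood by fnmatch.
    return fnmatchcase(host, pattern.replace("%", "*"))


def specificity(entry: Entry) -> int:
    # Assumed priority rule: more literal (non-wildcard) characters wins.
    return len(entry.host_pattern.replace("%", ""))


def login(entries: List[Entry], user: str, host: str, password: str) -> Optional[str]:
    """Return the matched identity, or None if the login is rejected."""
    candidates = [
        e for e in entries
        if e.user == user and host_matches(e.host_pattern, host)
    ]
    if not candidates:
        return None
    best = max(candidates, key=specificity)
    if best.password != password:
        # Do not fall through to lower-priority entries.
        return None
    return f'{best.user}@"{best.host_pattern}"'
```

Under these assumptions, adding a more specific entry with a different password acts like a black list for that host, because the less specific `%` entry is never consulted once a higher-priority entry matches.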

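The difference between REPLACE and REPLACE_IF_NOT_NULL described in #2127 can be illustrated with a small sketch. It models a stored row and a loaded row as plain Python dictionaries and uses `None` for NULL; the function names `replace`, `replace_if_not_null`, and `apply_load` are hypothetical and do not correspond to Doris internals.

```python
from typing import Any, Dict


def replace(old: Any, new: Any) -> Any:
    # REPLACE: the loaded value always overwrites the stored one, even if it is NULL.
    return new


def replace_if_not_null(old: Any, new: Any) -> Any:
    # REPLACE_IF_NOT_NULL: a NULL (None) loaded value keeps the stored value.
    return old if new is None else new


def apply_load(stored: Dict[str, Any], loaded: Dict[str, Any]) -> Dict[str, Any]:
    """Merge a loaded row into a stored row, column by column."""
    return {
        col: replace_if_not_null(value, loaded.get(col))
        for col, value in stored.items()
    }


stored_row = {"city": "Beijing", "age": 30}
# Only 'age' should change, so 'city' is loaded as NULL.
merged_row = apply_load(stored_row, {"city": None, "age": 31})
```

With plain `replace`, the same load would have overwritten `city` with NULL, which is why the user previously had to query the original value first.
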
127 Commits