doris

Author	SHA1	Message	Date
WingC	3022955f32	Fix recover time in docs (#2588 )	2019-12-27 14:51:41 +08:00
xy720	1113f951c3	Alter view stmt (#2522 ) This commit adds a new statement named alter view, like ALTER VIEW view_name ( col_1, col_2, col_3, ) AS SELECT k1, k2, SUM(v1) FROM exampleDb.testTbl GROUP BY k1,k2	2019-12-27 14:02:56 +08:00
Mingyu Chen	1421a9be41	[Compaction] Support compact only one rowset (#2558 ) Support compaction operation to compact only one rowset. After the modification, the last rowset of the tablet will also be compacted. At the same time, we added a `segments_overlap_pb` field to the rowset meta. Used to describe whether the segment data in the rowset overlaps. This field is set by `rowset_writer`. Initially UNKNOWN for compatibility with existing data. In addition, the version hash of the rowset generated after compaction is directly set to the version hash of last rowset participating in compaction, to ensure that the tablet's version hash remains unchanged after compaction.	2019-12-27 10:08:41 +08:00
WingC	f7032b07f3	Support more schema change from VARCHAR type (#2501 )	2019-12-26 22:38:53 +08:00
Mingyu Chen	6f3c50a95c	[Document] Add example for using CTE in INSERT operation (#2572 )	2019-12-26 10:00:34 +08:00
firetree01	e7be52fa58	Update basic-usage_EN.md (#2530 )	2019-12-23 16:04:27 +08:00
Lishi	20abfc5f6f	Modify stream-load-manual_EN.md (#2528 )	2019-12-23 15:34:19 +08:00
frwrdt	008e59476d	Add curdate function doc (#2520 )	2019-12-20 21:24:56 +08:00
kangkaisen	6815979ba5	Fix invalid to_bitmap input lead to BE core (#2510 )	2019-12-19 21:28:00 +08:00
kangpinghuang	63ea05f9c7	Add convert tablet rowset type (#2294 ) to solve the issue #2246. scheme is as following: add a optional preferred_rowset_type in TabletMeta for V2 format rollup index tablet add a boolean session variable use_v2_rollup, if set true, the query will v2 storage format rollup index to process the query. test queries will be sent to online service to verify the correctness of segment-v2 by send the the same queries to fe with use_v2_rollup set or not to check whether the returned results are the same.	2019-12-18 18:49:47 +08:00
Mingyu Chen	222f8390c7	[Compaction] Fix the bug that cumulative point grows unreasonably (#2490 ) When there are to many segment in one rowset, which is larger than BE config 'max_cumulative_compaction_num_singleton_deltas', the cumulative compaction will not work and just increase the cumulative point, because there is only once rowset being selected. So when selecting rowset for cumulative compaction, we should meet 2 requirments before finishing the selection logic: 1. compaction score is larger than 'max_cumulative_compaction_num_singleton_deltas' 2. at least 2 rowsets are selected.	2019-12-18 12:59:17 +08:00
WingC	c81b1db406	Support convert VARCHAR type to DATE type (#2489 )	2019-12-18 12:58:47 +08:00
WingC	89003b774b	Support Convert Varchar to INT (#2481 )	2019-12-17 22:02:28 +08:00
Mingyu Chen	e1ba0efbc7	Optimize compaction strategy of tablet on BE (#2473 ) The current compaction selection strategy and cumulative point update logic will cause the cumulative compaction to not work, and all compaction tasks will be completed only by the base compaction. This can cause a large number of data versions to pile up. In the current cumulative point update logic, when a cumulative cannot select enough number of rowsets, it will directly increase the cumulative point. Therefore, when the data version generates the same speed as the cumulative compaction polling, it will cause the cumulative point to continuously increase without triggering the cumulative compaction. The new strategy mainly modifies the update logic of cumulative point to ensure that the above problems do not occur. At the same time, the new strategy also takes into account the problem that compaction cannot be performed if cumulative points stagnate for a long time. Cumulative points will be forced to increase through threshold settings to ensure that compaction has a chance to execute. Also add a new HTTP API to view the compaction status of specified tablet. See `compaction-action.md` for details.	2019-12-17 10:30:43 +08:00
landon-dai	55cb1cd1f1	Update date_format.md (#2476 )	2019-12-16 20:43:55 +08:00
landon-dai	b20a76163b	Update from_unixtime.md (#2475 )	2019-12-16 19:39:54 +08:00
kangkaisen	9244db40f7	Update bitmap doc (#2467 )	2019-12-16 18:56:53 +08:00
yuanfeng0905	ebb6506924	Fix doc (#2449 )	2019-12-12 20:56:25 +08:00
yangzhg	bf31bd238b	Change default storage model from aggregate to duplicate(#2318 ) (#2412 ) change default storage model from aggregate to duplicate for sql `create table t (k1 int) DISTRIBUTED BY HASH(k1) BUCKETS 10 PROPERTIES("replication_num" = "1");` before: ``` CREATE TABLE `t` ( `k1` int(11) NULL COMMENT "" ) ENGINE=OLAP AGGREGATE KEY(`k1`) COMMENT "OLAP" DISTRIBUTED BY HASH(`k1`) BUCKETS 10 PROPERTIES ( "storage_type" = "COLUMN" ); ``` after: ``` CREATE TABLE `t` ( `k1` int(11) NULL COMMENT "" ) ENGINE=OLAP DUPLICATE KEY(`k1`) COMMENT "OLAP" DISTRIBUTED BY HASH(`k1`) BUCKETS 10 PROPERTIES ( "storage_type" = "COLUMN" ); ``` #2318	2019-12-12 14:30:30 +08:00
kangpinghuang	c07f37d78c	[Segment V2] Add a control framework between FE and BE through heartbeat #2247 (#2364 ) The control framework is implemented through heartbeat message. Use uint64_t as flags to control different functions. Now add a flag to set the default rowset type to beta.	2019-12-12 12:18:32 +08:00
WingC	5951a0eaea	Add more schema change docs (#2411 ) Add explanation about converting: DATE -> DATETIME DATETIME -> DATE INT->DATE	2019-12-10 16:46:41 +08:00
Mingyu Chen	a46bf1ada3	[Authorization] Modify the authorization checking logic (#2372 ) Authorization checking logic There are some problems with the current password and permission checking logic. For example: First, we create a user by: `create user cmy@"%" identified by "12345";` And then 'cmy' can login with password '12345' from any hosts. Second, we create another user by: `create user cmy@"192.168.%" identified by "abcde";` Because "192.168.%" has a higher priority in the permission table than "%". So when "cmy" try to login in by password "12345" from host "192.168.1.1", it should match the second permission entry, and will be rejected because of invalid password. But in current implementation, Doris will continue to check password on first entry, than let it pass. So we should change it. Permission checking logic After a user login, it should has a unique identity which is got from permission table. For example, when "cmy" from host "192.168.1.1" login, it's identity should be `cmy@"192.168.%"`. And Doris should use this identity to check other permission, not by using the user's real identity, which is `cmy@"192.168.1.1"`. Black list Functionally speaking, Doris only support adding WHITE LIST, which is to allow user to login from those hosts in the white list. But is some cases, we do need a BLACK LIST function. Fortunately, by changing the logic described above, we can simulate the effect of the BLACK LIST. For example, First we add a user by: `create user cmy@'%' identified by '12345';` And now user 'cmy' can login from any hosts. and if we don't want 'cmy' to login from host A, we can add a new user by: `create user cmy@'A' identified by 'other_passwd';` Because "A" has a higher priority in the permission table than "%". If 'cmy' try to login from A using password '12345', it will be rejected.	2019-12-06 17:45:56 +08:00
HaiBo Li	9fbc1c7ee6	Support where/orderby/limit after “SHOW ALTER TABLE COLUMN“ syntax (#2380 ) Features： 1、Support WHERE/ORDER BY/LIMIT 2、Columns：TableName、CreatTime、FinishTime、State 3、Only “And” between conditions 4、TableName and State column only support "=" operator 5、CreateTime and FinishTime column support “=”,“>=”,"<=",">","<","!=" operators 6、CreateTime and FinishTime column support Date and DateTime string, eg:"2019-12-04" or "2019-12-04 17:18:00" TestCase: MySQL [haibotest]> show alter table column where State='FINISHED' and CreateTime > '2019-12-03' order by FinishTime desc limit 0,2; +-------+---------------+---------------------+---------------------+---------------+---------+---------------+---------------+---------------+----------+------+----------+---------+ \| JobId \| TableName \| CreateTime \| FinishTime \| IndexName \| IndexId \| OriginIndexId \| SchemaVersion \| TransactionId \| State \| Msg \| Progress \| Timeout \| +-------+---------------+---------------------+---------------------+---------------+---------+---------------+---------------+---------------+----------+------+----------+---------+ \| 11134 \| test_schema_2 \| 2019-12-03 19:21:42 \| 2019-12-03 19:22:11 \| test_schema_2 \| 11135 \| 11059 \| 1:192010000 \| 3 \| FINISHED \| \| N/A \| 86400 \| \| 11096 \| test_schema_3 \| 2019-12-03 19:21:31 \| 2019-12-03 19:21:51 \| test_schema_3 \| 11097 \| 11018 \| 1:2063361382 \| 2 \| FINISHED \| \| N/A \| 86400 \| +-------+---------------+---------------------+---------------------+---------------+---------+---------------+---------------+---------------+----------+------+----------+---------+ 2 rows in set (0.00 sec)	2019-12-06 16:24:44 +08:00
EmmyMiao87	d937f7b51e	Fix the error of stream load doc (#2340 )	2019-11-29 16:33:08 +08:00
caiconghui	8bf00afa25	Create table with nullable column for default (#2256 ) Change the default column null property to nullable	2019-11-29 11:11:31 +08:00
HangyuanLiu	e7b05f7eb3	Date format support java date style "yyyy-MM-dd HH:mm:ss" (#2309 )	2019-11-28 14:34:31 +08:00
jiangshouzhuang	c33789ee49	Update insert-into-manual_EN.md (#2323 ) Modify the description error for the enable_insert_strict parameter	2019-11-28 14:00:33 +08:00
jiangshouzhuang	02d7c486e1	Update insert-into-manual.md (#2322 ) Modify the description error for the enable_insert_strict parameter	2019-11-28 13:59:54 +08:00
Mingyu Chen	a2d7c42042	Add a variable to specifically limit the memory usage of the load part in the insert operation (#2305 ) This variable is mainly for INSERT operation, because INSERT operation has both query and load part. Using only the exec_mem_limit variable does not make a good distinction of memory limit between the two parts.	2019-11-28 13:03:11 +08:00
Mingyu Chen	d5aeb9a6b7	Add document for session variables. (#2284 ) Also make the variable effective in current session when setting it globally.	2019-11-24 22:47:05 +08:00
Mingyu Chen	46181c0880	Fix some bugs about load label (#2241 )	2019-11-23 00:04:45 +08:00
xy720	79ff0ad2a4	Add pipes_as_concat_mode (#2252 ) This commit will add a new sql mode named MODE_PIPES_AS_CONCAT: Description: 1、If this mode is active, '\|\|' will be handled different from the original way ('\|\|' and 'or' are seen as the same symbols in Doris) that it can be used to concat two exps and returns a new string. For example, 'a' \|\| 'b' = 'ab' and 1 \|\| 0 = '10'. 2. User can active this mode by "SET sql_mode = PIPES_AS_CONCAT", and deactive it by "SET sql_mode = '' ".	2019-11-22 15:01:53 +08:00
kangkaisen	d8cfbbedf7	Support bitmap_empty function (#2227 )	2019-11-18 20:37:00 +08:00
Mingyu Chen	84c1fa88b8	Add node dead num metrics for all types of node (#2191 ) Following metrics will show the number of nodes which are down. frontend_down_num backend_down_num broker_down_num	2019-11-13 23:25:51 +08:00
Mingyu Chen	9eaba67606	Limit the FE log file number (#2163 ) 1. upgrade log4j to 2.12.1 2. Add 2 new FE config: 'sys_log_delete_age' and default is '7d', for sys log. 'audit_log_delete_age' and default is '30d', for audit log. it means if a log's last modification time is 7/30 days ago, it will be deleted.	2019-11-11 09:12:57 +08:00
xy720	6759e83a07	Add license header for md files and fix some translation's error (#2137 )	2019-11-06 21:35:07 +08:00
ZHAO Chun	65c3b0907a	Support aggregation type of REPLACE_IF_NOT_NULL (#2127 ) Some use has the requirment that only some of columns will be update in one load operation, and others will retain as original. However, Doris can't handle this situation, because user must specify value for all columns. Then if a column aggregation method is REPLACE, use must query original value to overwrite it. This often needs some work for user to do. If this CL is applied, user can use REPLACE_IF_NOT_NULL instead of REPLACE. Then when load data to table, if user don't intent to change value of this column, user can specify NULL for this column. Doris will retain original value for this column.	2019-11-05 18:08:34 +08:00
xy720	ac5dd0c9f2	Support sql mode (#2083 ) At present, we do not support SQL MODE which is similar to MySQL. In MySQL, SQL MODE is stored in global session and session with a 64 bit address，and every bit 0 or 1 on this address represents a mode state. Besides, MySQL supports combine mode which is composed of several modes. We should support SQL MODE to deal with sql dialect problems. We can heuristically use the MySQL way to store SQL MODE in session and parse it into string when we need to return it back to client. This commit suggests a solution to support SQL MODE. But it's just a sample, and the mode types in SqlModeHelper.java are not really meaningful from now on.	2019-11-01 23:21:00 +08:00
shengyunyao	713e04624f	Modify the lower bound of percentile_approx compression param to 2048 (#2111 )	2019-11-01 13:07:39 +08:00
Mingyu Chen	45df6aae08	Fix some routine load bugs (#2093 ) Mainly fix the following issues: 1. A null pointer exception is raised when a database or table is dropped. The expected behavior is that the routine load job is stopped. 2. Memory leaks. Batch routine load task submissions are no longer performed, and modifications are submitted separately for each task. 3. Unreasonable task timeout. Routine load tasks should not be queued in the BE thread pool for execution. The task sent to the BE should be executed immediately, otherwise the task in the FE will be timeout first. Eventually leads to constant timeout for all subsequent tasks. 4. All routine load job should be scheduled once it being submitted. Not waiting the available BE slot. Otherwise, all later submitted jobs may not be scheduled forever.	2019-10-31 21:53:03 +08:00
kangkaisen	95a3b4ccfe	Add object type (#1948 ) Add a new type: Object. Currently, it's mainly for complex aggregate metrics(HLL , Bitmap). The Object type has the following constraints： 1 Object type could not as key column type 2 Object type doesn't support all indices (BloomFilter, short key, zone map, invert index) 3 Object type doesn't support filter and group by In the implementation： The Object type reuse the StringValue and StringVal, because in storage engine, the Object type is binary, it has a pointer and length.	2019-10-31 21:42:58 +08:00
yangzhg	03d384ac51	Add .rat_excludes file, and modify related documents (#2031 ) (#2105 )	2019-10-31 10:34:22 +08:00
Seaven	5287bc2231	Replace DISCLAIMER with DISCLAIMER-WIP (#2100 )	2019-10-30 19:06:21 +08:00
zhouhaibing089	8d2cc71934	Format markdown of docker section (#2098 ) [DOC] This change makes the format correct so that's easier to view.	2019-10-30 16:52:45 +08:00
EmmyMiao87	ebdcfc21df	Multi distinct + no group by + big data is stuck (#2079 ) ISSUE-2069: This kind of query could be stuck. The sender failed to send the last packet to receiver. Also, the failure does not be reportted to FE , so the query is not cancelled. The error log sames as "body_size=xxxx from xxx:xxx is too large". The reason of the socket is that the packet of the query is too big which is more then the max_body_size of brpc. This commit add a config named brpc_max_body_size whcih is used to change the max_body_size of brpc. Also, user can change the max_body_size directly on-the-fly by "http://host:brpc_port/flags".	2019-10-28 18:51:05 +08:00
kangkaisen	1859819aa7	Update doc for FE metadata recover (#2073 )	2019-10-25 22:27:41 +08:00
ZHAO Chun	06fe8579d2	Update release process documents (#2008 )	2019-10-23 16:20:46 +08:00
ZHAO Chun	109eb79f19	Add help doc for debug tool (#2019 )	2019-10-20 22:58:03 +08:00
EmmyMiao87	d2bc47d2cc	Add introduction of label_keep_max_second (#1993 ) [Docs]	2019-10-16 16:05:13 +08:00
Mingyu Chen	41e55cfca9	Modify fixed partition feature (#1989 ) 1. Not support MAVALUE in multi partition column. 2. Fix the incorrect show create table stmt.	2019-10-16 16:03:46 +08:00

1 2 3 4 5

227 Commits