doris

Author	SHA1	Message	Date
xy720	6759e83a07	Add license header for md files and fix some translation's error (#2137 )	2019-11-06 21:35:07 +08:00
xy720	ac5dd0c9f2	Support sql mode (#2083 ) At present, we do not support SQL MODE which is similar to MySQL. In MySQL, SQL MODE is stored in global session and session with a 64 bit address，and every bit 0 or 1 on this address represents a mode state. Besides, MySQL supports combine mode which is composed of several modes. We should support SQL MODE to deal with sql dialect problems. We can heuristically use the MySQL way to store SQL MODE in session and parse it into string when we need to return it back to client. This commit suggests a solution to support SQL MODE. But it's just a sample, and the mode types in SqlModeHelper.java are not really meaningful from now on.	2019-11-01 23:21:00 +08:00
Mingyu Chen	45df6aae08	Fix some routine load bugs (#2093 ) Mainly fix the following issues: 1. A null pointer exception is raised when a database or table is dropped. The expected behavior is that the routine load job is stopped. 2. Memory leaks. Batch routine load task submissions are no longer performed, and modifications are submitted separately for each task. 3. Unreasonable task timeout. Routine load tasks should not be queued in the BE thread pool for execution. The task sent to the BE should be executed immediately, otherwise the task in the FE will be timeout first. Eventually leads to constant timeout for all subsequent tasks. 4. All routine load job should be scheduled once it being submitted. Not waiting the available BE slot. Otherwise, all later submitted jobs may not be scheduled forever.	2019-10-31 21:53:03 +08:00
zhouhaibing089	8d2cc71934	Format markdown of docker section (#2098 ) [DOC] This change makes the format correct so that's easier to view.	2019-10-30 16:52:45 +08:00
EmmyMiao87	ebdcfc21df	Multi distinct + no group by + big data is stuck (#2079 ) ISSUE-2069: This kind of query could be stuck. The sender failed to send the last packet to receiver. Also, the failure does not be reportted to FE , so the query is not cancelled. The error log sames as "body_size=xxxx from xxx:xxx is too large". The reason of the socket is that the packet of the query is too big which is more then the max_body_size of brpc. This commit add a config named brpc_max_body_size whcih is used to change the max_body_size of brpc. Also, user can change the max_body_size directly on-the-fly by "http://host:brpc_port/flags".	2019-10-28 18:51:05 +08:00
kangkaisen	1859819aa7	Update doc for FE metadata recover (#2073 )	2019-10-25 22:27:41 +08:00
EmmyMiao87	d2bc47d2cc	Add introduction of label_keep_max_second (#1993 ) [Docs]	2019-10-16 16:05:13 +08:00
worker24h	ec7c8a2c6f	Support adding fixed range partition eg: ALTER TABLE test_table ADD PARTITION p0125 VALUES [("20190125"), ("20190126"));	2019-10-15 09:50:30 +08:00
Mingyu Chen	62acf5d098	Limit the memory usage of Loading process (#1954 )	2019-10-15 09:26:20 +08:00
zhongyun2019	a323a190a2	Update monitor-alert.md (#1975 )	2019-10-14 12:22:51 +08:00
EmmyMiao87	054a3f48bc	Add where expr in broker load (#1812 ) The where predicate in broker load is responsible for filtering transformed data. The docs of help and operator has been changed.	2019-09-17 11:32:40 +08:00
WingC	973eff26cd	Fix tablet meta tool command argument bug (#1810 )	2019-09-16 17:40:23 +08:00
Mingyu Chen	9aa2045987	Refactor alter job (#1695 )	2019-09-12 16:31:29 +08:00
Mingyu Chen	044489b92f	Optimize some kinds of load jobs (#1762 ) 1. Support specifying label to Insert Into stmt. INSERT INTO tbl1 WITH LABEL label1 ...; 2. Return job' state corresponding to the existing label in result of stream load. ... "Status": "Label Already Exists", "ExistingJobStatus": "FINISHED" ... 3. Return the recent 2000 transactions in SHOW PROC '/transactions'	2019-09-09 22:11:12 +08:00
Mingyu Chen	7e981b2b14	Limit the disk usage to avoid running out of disk capacity (#1702 ) Set high watermark and flood stage of disk used capacity. And forbid some operations if disk usage is too high.	2019-08-27 22:18:17 +08:00
Mingyu Chen	a1b92768dd	Add a loaded rows in SHOW LOAD result (#1686 ) Loaded rows will be updated periodically by query report. So that user can see that a load job is still running or being blocked.	2019-08-27 14:13:47 +08:00
EmmyMiao87	b28f4242c3	Add config max_concurrent_task_num_per_be (#1693 ) This config is used to control the max concurrent task num per be. The cluster max concurrent task num = max_concurrent_task_num_per_be * number of be.	2019-08-24 00:56:40 +08:00
kangkaisen	c73b3f15a4	Update tablet-repair-and-balance doc (#1692 )	2019-08-22 21:31:56 +08:00
EmmyMiao87	978b1ee1af	Add strict mode in Routine load, Stream load and Mini load (#1677 )	2019-08-20 21:56:45 +08:00
Mingyu Chen	176e185e18	Add broker doc (#1662 ) This broker document introduces the properties for different broker types.	2019-08-20 17:18:54 +08:00
Mingyu Chen	8e6814cfcd	Support setting timeout for stream load (#1670 )	2019-08-20 15:43:03 +08:00
Mingyu Chen	6d73658207	Support checking error data row when doing INSERT (#1597 ) If strict mode is true, and at least one row is filtered, the insert operation will fail and a url will be given to get the error rows. ``` ERROR 1064 (HY000): all partitions have no load data. url: http://host:ip/api/_load_error_log?file=__shard_2/error_log_insert_stmt_e0a620e93dc54461-b89ec64768367d25_e0a620e93dc54461_b89ec64768367d25 ``` If all rows are good, insert will return OK with affected rows: ``` Query OK, 1 row affected (0.26 sec) ``` If strict mode is false, and at least one row is good, the insert operation will return OK with affected rows and warnings. If has error row num, a label will be returned: ``` Query OK, 1 row affected, 1 warning (0.32 sec) {'label':'7d66c457-658b-4a3e-bdcf-8beee872ef2c'} ```	2019-08-16 21:40:29 +08:00
HangyuanLiu	199ff968dc	Fix time zone compatibility (#1631 )	2019-08-13 18:44:35 +08:00
kangpinghuang	1e2a4c3b9b	Fix tablet restore api in BE(#1623 ) (#1624 )	2019-08-13 09:34:24 +08:00
HangyuanLiu	69af50aa8c	Time zone related BE function (#1598 ) Details can be found in time-zone.md document	2019-08-12 20:57:59 +08:00
EmmyMiao87	add6266c71	Broker load supports function (#1592 ) * Broker load supports function The commit support the column function in broker load. The grammar of LoadStmt has not been changed. Example: columns terminated by ',' (tmp_c1, tmp_c2) set (c1=tmp_c1+tmp_c2) Also, the old function is compatible such as default_value, strftime etc. After this commit, there are no difference in column function between stream load and broker load except old function.	2019-08-09 13:27:31 +08:00
Mingyu Chen	fd2accbcf9	Modify some docs' format to make it work with document website (#1604 )	2019-08-08 14:47:38 +08:00
Mingyu Chen	cefe1794d4	Fix bug that replicas of a tablet may be located on same host (#1517 ) Doris support deploy multi BE on one host. So when allocating BE for replicas of a tablet, we should select different host. But there is a bug in tablet scheduler that same host may be selected for one tablet. This patch will fix this problem. There are some places related to this problem: 1. Create Table There is no bug in Create Table process. 2. Tablet Scheduler Fixed when selecting BE for REPLICA_MISSING and REPLICA_RELOCATING. Fixed when balance the tablet. 3. Colocate Table Balancer Fixed when selecting BE for repairing colocate backend sequence. Not fix in colocate group balance. Leave it to colocate repairing. 4. Tablet report Tablet report may add replica to catalog. But I did not check the host here, Tablet Scheduler will fix it.	2019-08-01 10:26:06 +08:00
Mingyu Chen	99836f0d7c	Modify load docs (#1558 ) Make it work with documentation website	2019-07-29 15:48:59 +08:00
EmmyMiao87	000e9cf53c	Add administrator guide of load (#1488 ) The catalogue of load docs: ---- load-manual.md ---- broker-load-manual.md ---- insert-into-manual.md ---- stream-load-manual.md This commit also changes max/min_stream_load_timeout to max/min_load_timeout. The old config named stream_load_timeout means the max timeout suited for all types of load. So the config name has been changed.	2019-07-25 21:02:32 +08:00
Mingyu Chen	2551248a52	Support grant GRANT_PRIV on database or table level (#1472 ) Currently, GRANT_PRIV can only be granted on global level, which means it can only be granted on .. Grant it on db.* or db.tbl are not allowed. This will not be able to meet the requirement to create a user who has privilege to grant privileges to other users on specified database or table, such as: GRANT SELECT_PRIV ON db1.* TO cmy@'%'; So I extend the range of GRANT_PRIV. User can now grant GRANT_PRIV on database or even table level, such as: GRANT GRANT_PRIV ON db1.* TO cmy@'%'; And after being granted, the user cmy@'%' can now grant GRANT_PRIV on db1.* to other users.	2019-07-16 19:25:18 +08:00
Mingyu Chen	8db97998ba	Collect all documents to Doris code base (#1414 )	2019-07-01 09:23:13 +08:00
Mingyu Chen	756a680143	Add a website builder of Doris documentations (#1396 ) The build script locates in docs/website. Built with Sphinx using a theme provided by Read the Docs.	2019-06-26 19:10:39 +08:00
Mingyu Chen	566e122c0d	Optimize Export feature (#1378 ) 1. Add 'timeout' properties in Export stmt. 2. Add more infos in 'show export' stmt. 3. Add more logs for debug.	2019-06-26 00:20:53 +08:00
Mingyu Chen	e807064a88	Modify colocation creation logic (#1289 )	2019-06-25 21:20:18 +08:00
Mingyu Chen	5c2cf9f2ce	Handle the situation when there is no enough backends for tablet repair (#1299 ) If there are only 3 backends and replication num is 3. If one replica of a tablet is bad, there is no 4th backend for tablet repair. So we need to delete a bad replica first to make room for new replica.	2019-06-14 20:28:29 +08:00
EmmyMiao87	5dea4fb414	Add description of strict mode in decimal type (#1288 )	2019-06-12 16:03:57 +08:00
EmmyMiao87	53062122ea	Change strategy of incorrect data (#1255 ) This change adds a load property named strict_mode which is used to prohibit the incorrect data. When it is set to false, the incorrect data will be loaded by NULL just like before. When it is set to true, the incorrect data which belongs to a column without expr will be filtered. The strict_mode is supported in broker load v2 now. It will be supported in stream load later.	2019-06-10 20:39:45 +08:00
Mingyu Chen	ff0dd0d2da	Support SSL authentication with Kafka in routine load job (#1235 )	2019-06-07 16:29:01 +08:00
Mingyu Chen	4039985729	Fix some bugs about decommission (#1138 ) 1. Print the last few tablets of decommission backend in fe.log for debug. 2. OlapTableSink should get replica on alive Backends, not only available Backends. 3. When decommission multi Backends, we should drop the redundant replicas before creating a new one. 4. Replicas on decommissioning Backends should be not added to catalog again. 5. Decommissioning Backends should not be chosen as destination of tablet repairing.	2019-05-10 17:41:48 +08:00
mengqinghuan	e5a5201626	Update routine-load-manual.md (#1133 ) edit some descriptions about “max_error_number”	2019-05-10 14:38:28 +08:00
Mingyu Chen	ba78adae94	Fix bugs when using function in both stream load request and routine load job (#1091 )	2019-05-05 20:51:30 +08:00
Mingyu Chen	5e36a769a0	Change the way to calculate task num (#1049 )	2019-04-28 10:33:50 +08:00
Mingyu Chen	9cd090c96a	Modify routine load doc (#1016 ) Add config specification	2019-04-28 10:33:50 +08:00
EmmyMiao87	a79bd0c771	Add doc of auto creator of kafka topic (#985 ) * Add annotation of show routine load	2019-04-28 10:33:50 +08:00
Mingyu Chen	1b5643c6fb	Fix some bugs (#979 ) 1. Add Config.max_routine_load_concurrent_task_num instead of the old one 2. Fix a bug that SHOW ALTER TABLE COLUMN may throw Nullpointer exception 3. Fix some misspelling of docs	2019-04-28 10:33:50 +08:00
Mingyu Chen	56bec6f22a	Add routine load manual (#967 )	2019-04-28 10:33:50 +08:00

47 Commits