doris

Author	SHA1	Message	Date
lide	e135e3d41e	Add an example of help load (#584 )	2019-01-25 11:18:10 +08:00
Mingyu Chen	079141e14a	Add disk usage percent in SHOW BACKEND stmt (#571 )	2019-01-23 14:08:33 +08:00
Mingyu Chen	54e98f6964	Auto fix missing version replica (#560 )	2019-01-21 08:56:43 +08:00
Mingyu Chen	d15bc83de0	Fix some bugs of alter table operation (#550 ) 1. Fix bug that failed to query restored table after schema change. 2. Fix bug that failed to add rollup to restored table. 3. Optimize the info of SHOW ALTER TABLE stmt. 4. Optimize the info of some PROCs. 5. Optimize the tablet checker to avoid adding too much task to scheduler.	2019-01-17 15:17:51 +08:00
Mingyu Chen	798a66e6a0	Implement new tablet repair and balance framework (#336 ) More detail, see issue #540	2019-01-16 13:29:17 +08:00
ZHAO Chun	9bfd8d818a	Add md5 property for UDF create statement (#500 )	2019-01-06 19:45:04 +08:00
Mingyu Chen	a51ce03595	Enhance the usability of Load operation (#490 ) 1. Add broker load error hub A broker load error hub will collect error messages in load process and saves them as a file to the specified remote storage via broker. In case that in broker/min/streaming load process, user may not be able to access the error log file in Backend directly. We also add a new header option: 'enable_hub' in streaming load request, and default is false. Because if we enable the broker load error hub, it will significantly slow down the processing speed of streaming load, due to the visit of remote storage via broker. So use can disable the error load hub using this header option, to avoid slowing down the load speed. 2. Show load error logs by using SHOW LOAD WARNINGS stmt We also provide a more easy way to get load error logs. We implement 'SHOW LOAD WARNINGS ON 'url'' stmt to show load error logs directly. The 'url' in stmt is provided in 'SHOW LOAD' stmt. eg: show load warnings on "http://192.168.1.1:8040/api/_load_error_log?file=__shard_2/error_log_xxx"; 3. Support now() function in broker load User can mapping a column to now() in broker load stmt, which means this column will be filled with time when the ETL started. 4. Support more types of wildcard in broker load Currently, we only support wildcard '' to match the file names. wildcard like '/path/to/20190[1-4]' is not support.	2019-01-03 19:07:27 +08:00
ZHAO Chun	7380483394	Support UDF (#468 ) Now, user can create UDF with CREATE FUNCTION statement. Doris only support UDF in this version, it will support UDAF/UDTF later.	2018-12-29 09:13:04 +08:00
Mingyu Chen	5b1e3d3f40	Optimize backup & restore process (#460 ) 1. Print broker address for debug. 2. Do not letting backup job cancelled if it already in state UPLOAD_INFO. 3. Cancel task on Backends when job is cancelled. 4. Show detail progress of backup and restore job. 5. Make 'show snapshot' result more readable. 6. Change upload and download thread num of backup and restore in Backend to 1.	2018-12-24 16:49:16 +08:00
Mingyu Chen	dc4cbab11e	Report error when loading decimal value with scientific notation (#428 ) Currently we do not support scientific notation of decimal value.	2018-12-17 21:04:18 +08:00
Mingyu Chen	45e42bd003	Redesign the access to meta version (#436 ) Because the meta version is only be used in catalog saving and loading. So currently this version is a field of Catalog class. And we can get this version only by calling Catalog.getCurrentCatalogJournalVersion(). But in restore process, we need to read the meta data which is saved with a specified meta version. So we need a flexible way to read a meta data using a specified meta version, not only the version from Catalog. So we create a new class called MetaContext. Currently it only has one field, 'journalVersion', to save the current journal version. And it is a thread local variable, so that we can create a MetaContext anywhere we want, and setting the 'journalVersion' which we want to use for reading meta. Currently, there are 4 threads which is related to meta data saving and loading. The Frontend starting thread, which will call Catalog.initialize() to load the image. the Frontend state listener thread, which will listen the state changing, and call transferToMaster() or transferToNonMaster(). Edit log replayed thread, which is created when calling transferToNonMaster(). It will replay edit log Checkpoint thread, which is created when calling transferToMaster(). It will do the checkpoint periodically. Notice that we get the 'current meta version' only when 'READING' the meta (not WRITING). So we only need to take care of all 'READING' threads. We create MetaContext thread local variable for these 4 threads, and thread 2,3,4's meta context inherit from thread 1's meta context. Because thread 1 will load the origin image file and get the very first meta version. And we leave the Catalog.getCurrentCatalogJournalVersion()'s name unchanged, just change its content, because we don't want change a lot codes this time. On the other hand, we add the current meta version in backup job info file when doing backup job. So that when restoring from a backup snapshot, we can know which meta version we should use for read the meta. And also , we add a new property "meta_version" for Restore stmt, so that we can specify the meta version used for reading backup meta. It is for those old backup snapshots which do not has meta version saving in backup job info file.	2018-12-17 10:05:16 +08:00
kangkaisen	088a914e11	Support Colocate Join (#245 ) (#246 ) * Support colocate join Colocate join means two table are distributed by the columns being joined, then we can join them locally on each backend. Colocate join no data movement and has more concurrency.	2018-12-06 18:59:17 +08:00
Mingyu Chen	c556ed13f6	Support TRUNCATE TABLE stmt (#377 ) * Support TRUNCATE TABLE stmt User can use TRUNCATE TABLE stmt to empties a table or partitions completely. Unlike DELETE, it will drop the tablets directly, and without any performance impact. * Fix bugs that new partition should use new ID * Use equals() to compare Integer * Fix compile bug * Fix bug on single range parititon * Check table's state again after creating partition	2018-12-01 21:18:27 +08:00
kangpinghuang	d7ee57e881	Optimize quota unit (#309 ) Originally, we can only set quota in bytes unit. This commit add quota unit:K/KB/M/MB/G/GB/T/TB/P/PB for convenience.	2018-11-15 14:03:52 +08:00
lide	6f206ae9c6	Fix some license (#290 )	2018-11-09 14:30:09 +08:00
morningman	faeb472909	Tidy up the docs and gensrc directory (#263 ) 1. Remove all design docs. They will be pushed again after modification. 2. Add streaming load and privilege help docs. 3. Rename palo.py to doris.py in gensrc/script/.	2018-11-01 10:37:30 +08:00
morningman	65fe7f65c1	Fixed: privilege logic error: 1. No one can set root password expect for root user itself 2. NODE_PRIV cannot be granted. 3. ADMIN_PRIV and GRANT_PRIV can only be granted or revoked on . 4. No one can modifly privs of default role 'operator' and 'admin'. 5. No user can be granted to role 'operator'. Fixed: the running load limit should not be applied to replay logic. It will cause replay or loading image fail. Changed: optimize the problem of too many directories under mini load directory. Fixed: missing password and auth check when handling mini load request in Frontend. Fixed: DomainResolver should start after Frontends transfer to a certain ROLE, not in Catalog construction methods. Fixed: a stupid bug that no one can set password for root user... fix it: only root user can set password for root. Fixed: read null data twice When reading data with a null value, in some cases, the same data will be read twice by the storage engine, resulting in a wrong result.The reason for this problem is that when splitting, and the start key is the minimum value, the data with null is read. Fixed: add a flag to prevent DomainResovler thread start twice. Fixed: fixed a mem leak of using ByteBuf when parsing auth info of http request. Fixed: add a new config 'disable_hadoop_load', default is false, set to true to disable hadoop load. Changed: add detail error msg of submitting hadoop load job in show load result. Fixed: Backend process should be crashed if failed to saving header. Added: exposure backend info to user when encounter error on Backend. for debugging it more convenient. Fixed: Should remove fd from map when inputstream or outputstream is closed in Broker process. Fixed: Change all files' LF to unix format. Internal commit id: merge from dfcd0aca18eed9ff99d188eb3d01c60d419be1b8	2018-10-01 19:58:41 +08:00
lide	bea10e4f06	1. hide password and other sensitive information in log and audit log 2. add 2 new proc '/current_queries' and '/current_backend_instances' to monitor the current running queries. 3. add a manual compaction api on Backend to trigger cumulative or base compaction manually. 4. add Frontend config 'max_bytes_per_broker_scanner' to limit to bytes per one broker scanner. This is to limit the memory cost of a single broker load job 5. add Frontend config 'max_unfinished_load_job' to limit load job number: if number of running load jobs exceed the limit, no more load job is allowed to be submmitted. 6. a log of bug fixed	2018-09-19 20:04:01 +08:00
morningman	cc74efb3c5	merge to ddb65b69f9c788e359e191889cb31f15279c41ec (#224 ) 1. Apache HDFS broker support HDFS HA and Hadoop kerberos authentication. 2. New Backup and Restore function. Use Fs Broker to backup your data to HDFS or restore them from HDFS. 3. Table-Level Privileges. Grant fine-grained privileges on table-level to specified user. 4. A lot of bugs fixed. 5. Performance improvement.	2018-08-24 17:12:26 +08:00
morningman	a7ffd0bf81	merge to 9625ef157dd44c58802d63cb7547f037b75fd710 add monitor_arch.png	2018-07-17 09:36:30 +08:00
morningman	b07b3dc34d	merge to 9625ef157dd44c58802d63cb7547f037b75fd710 add png files in monitor and alert document	2018-07-17 09:28:24 +08:00
morningman	19997510a6	merge to 9625ef157dd44c58802d63cb7547f037b75fd710 (#208 ) 1. Implement Backend http server using libevent instead of mongoose. 2. Remove Old Hypertable rpc framework, use brpc instead. 3. Change rpc from FE to BE to brpc. 4. Fs broker support HDFS HA. 5. add more metrics to monitor. 6. Lots of bug fixed.	2018-07-17 09:20:30 +08:00
morningman	2419384e8a	push 3.3.19 to github (#193 ) * push 3.3.19 to github * merge to 20ed420122a8283200aa37b0a6179b6a571d2837	2018-05-15 20:38:22 +08:00
morningman	5de798fdd6	Merge code to github (#187 ) * merge to 95787f8be1fd0ff215708fb0f49997b632876586 * Bugs fixed	2018-03-23 14:04:55 +08:00
morningman	765f7f53d4	Update manipulation_stmt.md add mini load timeout help	2017-12-15 17:56:17 +08:00
morningman	56138b9d35	move docs to wiki (#150 ) move these 4 docs to wiki docs/admin_guide/install.md docs/user_guide/FAQ.md docs/user_guide/sql_reference.md docs/user_guide/tutorial.md	2017-12-06 13:58:22 +08:00
morningman	51beba6796	Add images for deploy doc	2017-12-03 16:19:29 +08:00
morningman	45571becdd	Update install.md add how to install lsb_release cmd if missing.	2017-12-03 16:03:56 +08:00
morningman	6d438cb257	Update install.md update pre-compilation binaries (11-30)	2017-11-30 17:36:53 +08:00
morningman	34379105f8	Update tutorial.md fix mismatch example	2017-11-28 09:05:45 +08:00
morningman	a033451367	modify: (#137 ) 1. remove libunwind deps 2. modify install in build.sh 3. add SHOW USER stmt to show user privilege	2017-11-13 19:49:40 -06:00
morningman	05b86e82b2	fix bugs: (#136 ) 1. rename database does not modify the coresponding names in cluster. 2. Result of SHOW BROKERS does not match that of in SHOW PROC '/brokers'. 3. Bugs when set property for other user using admin user.	2017-11-11 20:26:29 -06:00
chenhao7253886	e619631201	add des cluster when perform 'allter system add backend' (#134 ) add des cluster when perform 'allter system add backend'	2017-11-03 04:37:16 -05:00
morningman	cc64875e6b	Fix database quota check bug. Modify load help doc (#130 )	2017-10-25 20:40:11 -05:00
yubingpeng	fe38773c3e	update sql_reference.md (#129 ) * fix escape character in like predicate * multi tenant doc * modify error prompt when insert * update sql_reference.md	2017-10-20 01:47:53 -05:00
李超勇	57c9da619b	fix error in tutorial.md (#121 )	2017-09-29 15:51:34 +08:00
chenhao7253886	d77d8a4748	update doc about cluster (#119 )	2017-09-28 12:11:09 +08:00
morningman	1c0462568c	Update install.md update pre-compile binaries of centos and ubuntu to 20170926	2017-09-27 11:28:48 +08:00
morningman	b2188a9c33	Update install.md update pre-compile binary of centos and ubuntu to 0922.	2017-09-22 09:45:35 +08:00
morningman	3227d371e2	add metadata design doc (#103 ) * add metadata design doc * change required cmake version to 2.8.10	2017-09-18 18:29:41 +08:00
Zuyu ZHANG	cfb8fc991a	Added the fix to mysql build error. (#101 )	2017-09-18 09:55:08 +08:00
morningman	cba91bac3f	sort out the fs_brokers dir (#98 ) * sort out the fs_broker dir * modify install.md	2017-09-15 11:22:02 +08:00
morningman	b48dbecd2e	Update install.md update pre-compile binaries of ubuntu and centos	2017-09-14 17:20:11 +08:00
yubingpeng	e82a06ba2b	multi tenant doc (#83 ) multi tenant doc	2017-09-08 18:21:41 +08:00
morningman	0475394879	add new stmt ALTER SYSTEM ADD FREE BACKEND to add a backend not belongs to any cluster	2017-09-04 15:43:19 +08:00
李超勇	2a31e01ff1	add thirdparty.tar.gz suffix (#56 )	2017-08-30 09:10:39 +08:00
李超勇	7c278c9f92	fix #49 add .tar.gz suffix (#53 )	2017-08-29 11:25:13 +08:00
morningman	fb45b606e3	Update install.md fix some misspellings	2017-08-28 09:48:50 +08:00
李超勇	5342c6dbc8	add start_broker to install.md (#46 )	2017-08-25 16:51:03 +08:00
morningman	6a87b0e464	Update install.md Fix link of pre-compilation binaries	2017-08-23 11:06:13 +08:00

1 2

57 Commits