doris

Author	SHA1	Message	Date
EmmyMiao87	845f931098	[fix](select outfile) Remove optional properties check of hdfs storage (#7272 )	2021-12-03 13:42:56 +08:00
Xinyi Zou	fc9e502b51	[improvement](brpc)(config) Support transfer RowBatch in Controller Attachment (#7164 ) Transfer RowBatch in Protobuf Request to Controller Attachment, when the maximum length of the RowBatch in the Protobuf Request is exceeded. This can avoid reaching the upper limit of the Protobuf Request length (2G), and it is expected that performance can be improved.	2021-12-02 11:41:38 +08:00
xinghuayu007	dd36ccc3bf	[feature](storage-format) Z-Order Implement (#7149 ) Support sort data by Z-Order: ``` CREATE TABLE table2 ( siteid int(11) NULL DEFAULT "10" COMMENT "", citycode int(11) NULL COMMENT "", username varchar(32) NULL DEFAULT "" COMMENT "", pv bigint(20) NULL DEFAULT "0" COMMENT "" ) ENGINE=OLAP DUPLICATE KEY(siteid, citycode) COMMENT "OLAP" DISTRIBUTED BY HASH(siteid) BUCKETS 1 PROPERTIES ( "replication_allocation" = "tag.location.default: 1", "data_sort.sort_type" = "ZORDER", "data_sort.col_num" = "2", "in_memory" = "false", "storage_format" = "V2" ); ```	2021-12-02 11:39:51 +08:00
caiconghui	fbab8afe24	[feature] Support disable query and load for backend to make Doris more robust and set default value to 1 for max_query_retry_time (#7155 ) ALTER SYSTEM MODIFY BACKEND "host1:9050" SET ("disable_query" = "true"); ALTER SYSTEM MODIFY BACKEND "host1:9050" SET ("disable_load" = "true");	2021-11-30 22:08:32 +08:00
董伟召	904a32c758	[docs] fix 0.14 release date in download page (#7253 ) The release date of 0.14 in download page is wrong	2021-11-30 15:00:36 +08:00
Mingyu Chen	9b3c834396	[docs](release) Update download page to add release 0.15 (#7244 ) Also modify some steps in release processing document	2021-11-29 16:06:32 +08:00
Shuo Wang	178fda593d	[docs] Refine documents for commit message tags. (#7215 )	2021-11-26 10:14:39 +08:00
Mingyu Chen	3fd8148100	[doc] Add build-dev image 1.4.2 to compilation document (#7174 ) Add build-dev image 1.4.2 to compilation document	2021-11-24 10:42:52 +08:00
Mingyu Chen	5a8591aaf0	[doc] add FAQ document (#7173 ) From Apache Doris wechat count, authorized.	2021-11-24 10:42:33 +08:00
Mingyu Chen	e74bfea8e4	[chore](clang-format)(license-eye) Add Clang Format/Skywalking eyes github action (#7132 ) 1. The clang format action will be triggered when a PR is submitted. 2. Skywalking eyes actions will be triggered when a PR is submitted and after merging to master branch.	2021-11-24 10:41:02 +08:00
xu20160924	3b988204fc	[doc] Modify the wrong comment of the ScanTime (#7109 ) Modify the wrong comment of the ScanTime.	2021-11-24 10:40:00 +08:00
tianhui5	d3c020b3cb	[feat-opt](fe-config) Add tablets number limit to void wrong usage (#7025 ) 1. Add new FE config `default_db_replica_quota_size` 2. Check replica quota after create table/partition	2021-11-24 10:37:54 +08:00
luzhijing	4b45b806da	[doc] Created commit-format-specification.md (#7190 ) We found that many commit messages submitted at present have ambiguous information. Clear commit messages can help developers submit pull requests more readable, committers merge easily and Release Manager easy to release. Therefore, we have sorted out a version of the commit format specification. We hope that subsequent contributors can sort out the commit messages according to the specification when submitting Pull Request.	2021-11-24 10:30:54 +08:00
renzhimin7	ce7fa5d6d9	[typo] Update multi-tenant.md (#7162 ) A double quote is missing	2021-11-22 14:47:00 +08:00
Mingyu Chen	a81f4da4e4	[feat](minidump) Add minidump support (#7124 ) Now minidump file will be created when BE crashes. And user can manually trigger a minidump by sending SIGUSR1 to BE process. More details can be found in minidump.md documents	2021-11-20 21:41:26 +08:00
tianhui5	143d3769b1	[feat](config) add FE config to limit the replica num per tablet (#7087 )	2021-11-20 21:40:23 +08:00
lihuigang	e9282205f1	[feat-opt](spark-load) support bitmap binary data from hive in spark load (#6883 ) Support to load the binary data of bitmap value from Hive into Doris. fix #6461	2021-11-20 21:38:38 +08:00
Mingyu Chen	be89f0f77e	[feat-opt](routine-load) Support show offset lag in show routine load stmt (#7114 ) Add a new field `Lag` in result of `show routine load` stmt. `Lag: {"0":10, "1":0}` means kafka partition 0 has 10 msg behind and partition 1 is update-to-date.	2021-11-18 14:31:16 +08:00
luzhijing	9487899047	Update members.md (#7115 ) Update the Committer List	2021-11-17 14:38:48 +08:00
luzhijing	01c5ef2f05	[Doc]Update memmber.md (#7133 ) Add some Committers at the list , they are voted as Committers in the last half a year. and the Chinese Documents has been updated at #7115	2021-11-17 14:38:15 +08:00
Mingyu Chen	dcad6ff5e5	[License] Add License header for missing files (#7130 ) 1. Add License header for missing files 2. Modify the spark pom.xml to correct the location of `thrift`	2021-11-16 18:37:54 +08:00
jiafeng.zhang	5710cf8feb	[Blog] Example of binlog load usage (#7080 ) Example of binlog load usage	2021-11-16 12:12:44 +08:00
qiye	5b01f7bba2	[Feature] Support query hive table (#6569 ) Users can directly query the data in the hive table in Doris, and can use join to perform complex queries without laboriously importing data from hive. Main changes list below: FE: Extend HiveScanNode from BrokerScanNode HiveMetaStoreClientHelper communicate with HIVE and HDFS. BE: Treate HiveScanNode as BrokerScanNode, treate HiveTable as BrokerTable. broker_scanner.cpp: suppot read column from HDFS path. orc_scanner.cpp: support read hdfs file. POM: Add hive.version=2.3.7, hive-metastore and hive-exec Add hadoop.version=2.8.0, hadoop-hdfs Upgrade commons-lang to fix incompatiblity of Java 9 and later. Thrift: Add THiveTable Add read_by_column_def in TBrokerRangeDesc	2021-11-16 11:59:07 +08:00
jiafeng.zhang	85fd05a8ae	website bugfix (#7103 ) website bugfix	2021-11-13 18:10:22 +08:00
jiafeng.zhang	d4c0156e0f	[Doc] REPLACE_IF_NOT_NULL document modification (#7100 ) REPLACE_IF_NOT_NULL document modification	2021-11-13 17:11:20 +08:00
Universe	7db90cb6ac	[Build] Openssl development package (#7088 ) Ubuntu: libssl-dev RedHat/CentOS: openssl-devel	2021-11-13 17:11:05 +08:00
EmmyMiao87	11cca0b15d	[JoinReorder] Add session variable to close join order (#7076 ) The new session variable 'close_join_reorder' is used to turn off all automatic join reorder algorithms. If close_join_reorder is true, the Doris will execute query by the order in the original query.	2021-11-13 17:10:44 +08:00
曹建华	93ccef4ec7	[Feature] Add degradate strategy for local_replica_selection. (#7064 ) When local_replica_selection is turned on, support select a non-local BE to service the query when the local be is unavailable	2021-11-13 17:09:25 +08:00
Mingyu Chen	3d8166504a	[Alter] Support alter table engine type from MySQL to ODBC (#6993 ) Support alter table engine type from MySQL to ODBC: ``` ALTER TABLE tbl MODIFY ENGINE TO odbc PROPERTIES("driver" = "odbc"); ```	2021-11-12 15:12:41 +08:00
jiafeng.zhang	f93dae98e4	[Doc] Reorganize documents (#7093 ) 1. Migrate some of the best practice articles to the Blog 2. Changed the names of performance tests and best practices to performance tests and examples	2021-11-12 12:05:10 +08:00
Mingyu Chen	58804d3570	[Colocate] Fix bug that colocate group can not be redistributed after dropping a backend (#7020 ) Mainly changes: 1. Fix [Bug] Colocate group can not redistributed after dropping a backend #7019 2. Add detail msg about why a colocate group is unstable. 3. Add more suggestion when upgrading Doris cluster.	2021-11-11 15:41:49 +08:00
wangyongfeng	c47beb4d3a	[Website][Docs]Add author field to blog (#7086 ) * Add author field to blog Co-authored-by: 943155336 <wangyongfeng> Co-authored-by: jiafeng.zhang <zhangjf1@gmail.com>	2021-11-11 10:23:44 +08:00
wangyongfeng	4c6cbdf463	[Bug] Fix version nav button loaded multiple times in docs website header (#7062 ) * Fix version nav button loaded multiple times Co-authored-by: 943155336 <wangyongfeng> Co-authored-by: jiafeng.zhang <zhangjf1@gmail.com>	2021-11-09 18:23:44 +08:00
wangyongfeng	906c305a19	[Bug] Fix docs website home page last news icon loading failure (#7057 ) * Fix last news icon loading failure Co-authored-by: 943155336 <wangyongfeng> Co-authored-by: jiafeng.zhang <zhangjf1@gmail.com>	2021-11-09 17:34:42 +08:00
EmmyMiao87	5d946ccd5e	[Docs] Add hdfs outfile example (#7052 )	2021-11-09 10:02:28 +08:00
wangyongfeng	34637589c5	[Website][Doc] Add the sharing blog function to the document site (#7047 ) Add the sharing blog function to the document site, including the blog list and detail page. At the same time, a guide on how to share blogs has been added to the developer guide.	2021-11-09 10:01:23 +08:00
jiafeng.zhang	31f3eb4a3c	[Doc] Use Flink CDC to realize real-time MySQL data into Apache Doris (#6933 ) * Best Practices ，Use Flink CDC to realize real-time MySQL data into Apache Doris	2021-11-06 16:18:19 +08:00
Xinyi Zou	e69249c082	sub_bitmap (#6977 ) Starting from the offset position, intercept the specified limit bitmap elements and return a bitmap subset. Types of chang	2021-11-06 13:31:03 +08:00
Zhengguo Yang	5ca271299a	[refactor] set `forward_to_master` true by default (#7017 ) * ot set forward_to_master true by default * Update docs/zh-CN/administrator-guide/variables.md	2021-11-06 13:27:26 +08:00
Zhengguo Yang	760fc02bfe	Added bprc stub cache check and reset api, used to test whether the bprc stub cache is available, and reset the bprc stub cache (#6916 ) Added bprc stub cache check and reset api, used to test whether the bprc stub cache is available, and reset the bprc stub cache add a config used for auto check and reset bprc stub	2021-11-05 09:45:37 +08:00
Mingyu Chen	29838f07da	[HTTP][API] Add backends info API for spark/flink connector (#6984 ) Doris should provide a http api to return backends list for connectors to submit stream load, and without privilege checking, which can let common user to use it	2021-11-05 09:43:06 +08:00
pengxiangyu	599ecb1f30	[Function] Add bitmap function bitmap_subset_limit (#6980 ) Add bitmap function bitmap_subset_limit. This function will return subset in specified index.	2021-11-04 12:14:47 +08:00
xy720	aeec9c45e6	[Function] Add bitmap-xor-count function for doris (#6982 ) Add bitmap-xor-count function for doris relate to #6875	2021-11-02 16:37:00 +08:00
wei zhao	f39a5bc1d0	[Feature] Spark connector supports to specify fields to write (#6973 ) 1. By default , Spark connector must write all fields value to `Doris` table . In this feature , user can specify part of fields to write , even specify the order of the fields to write. eg: I have a table named `student` which has three columns (name,gender,age) , creating table sql as following: ```sql create table student (name varchar(255), gender varchar(10), age int) duplicate key (name) distributed by hash(name) buckets 2; ``` Now , I just want to write values to two columns : name , gender. The code as following: ```scala val df = spark.createDataFrame(Seq( ("m", "zhangsan"), ("f", "lisi"), ("m", "wangwu") )) df.write .format("doris") .option("doris.fenodes", dorisFeNodes) .option("doris.table.identifier", dorisTable) .option("user", dorisUser) .option("password", dorisPwd) //specify your fields or the order .option("doris.write.field", "gender,name") .save() ```	2021-11-02 16:35:29 +08:00
zhangstar333	1ff3d708ca	[Function] add functions of bitmap_and/or_count (#6912 ) issue #6875 add bitmap_and_count/ bitmap_or_count	2021-11-01 14:00:07 +08:00
luozenglin	c7a3116f98	[Function] add bitmap function of bitmap_has_all (#6918 ) The 'bitmap_has_all' function returns true if the first bitmap contains all the elements of the second bitmap.	2021-11-01 12:50:47 +08:00
wei zhao	210625b358	[Doc] Update fe-idea developer guide for latest version (#6963 )	2021-11-01 11:42:13 +08:00
qiye	65ded82778	[Function] add BE bitmap function bitmap_subset_in_range (#6917 ) Add bitmap function bitmap_subset_in_range. This function will return subset in specified range (not include the range_end).	2021-11-01 11:05:19 +08:00
Mingyu Chen	db1c281be5	[Enhance][Load] Reduce the number of segments when loading a large volume data in one batch (#6947 ) ## Case In the load process, each tablet will have a memtable to save the incoming data, and if the data in a memtable is larger than 100MB, it will be flushed to disk as a `segment` file. And then a new memtable will be created to save the following data/ Assume that this is a table with N buckets(tablets). So the max size of all memtables will be `N * 100MB`. If N is large, it will cost too much memory. So for memory limit purpose, when the size of all memtables reach a threshold(2GB as default), Doris will try to flush all current memtables to disk(even if their size are not reach 100MB). So you will see that the memtable will be flushed when it's size reach `2GB/N`, which maybe much smaller than 100MB, resulting in too many small segment files. ## Solution When decide to flush memtable to reduce memory consumption, NOT to flush all memtable, but to flush part of them. For example, there are 50 tablets(with 50 memtables). The memory limit is 1GB, so when each memtable reach 20MB, the total size reach 1GB, and flush will occur. If I only flush 25 of 50 memtables, then next time when the total size reach 1GB, there will be 25 memtables with size 10MB, and other 25 memtables with size 30MB. So I can flush those memtables with size 30MB, which is larger than 20MB. The main idea is to introduce some jitter during flush to ensure the small unevenness of each memtable, so as to ensure that flush will only be triggered when the memtable is large enough. In my test, loading a table with 48 buckets, mem limit 2G, in previous version, the average memtable size is 44MB, after modification, the average size is 82MB	2021-11-01 10:51:50 +08:00
jiafeng.zhang	80f61c823b	Docker 1.4.1 Compile Environment, First Compile Description (#6943 )	2021-11-01 10:49:45 +08:00

1 2 3 4 5 ...

817 Commits