Commit Graph

763 Commits

Author SHA1 Message Date
df43752257 [Docs] Fix error KEY url (#6955) 2021-10-29 12:07:44 +08:00
4170aabf83 [Optimize] Optimize some session variables and profile (#6920)
1. Optimize the error message when using batch delete.
2. Rename the session variable `is_report_success` to `enable_profile` (see the sketch below).
3. Add the table name to the OlapScanner profile.
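A minimal sketch of the renamed switch (table name illustrative):
```
-- Formerly: SET is_report_success = true;
SET enable_profile = true;
-- Run a query; its profile can then be inspected on the FE web UI.
SELECT COUNT(*) FROM example_table;
```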
2021-10-27 18:03:12 +08:00
00fe9deaeb [Benchmark] Add star schema benchmark tools (#6925)
This CL mainly changes:

1. Add star schema benchmark tools in `tools/ssb-tools`, making it easy for users to load and test the SSB data set.
2. Disable the segment cache for some read scenarios such as compaction and alter operations. (Fix #6924)
3. Fix a bug where `max_segment_num_per_rowset` did not work. (Fix #6926)
4. Set `enable_batch_delete_by_default` to true by default.
2021-10-27 09:55:36 +08:00
9d4e6d8362 [Spark-Doris-Connector] Fix some spark-doris-connector doc typos 2021-10-26 18:23:53 +08:00
ed7a873a44 [Memory Usage] Implement segment lru cache to save memory of BE (#6829) 2021-10-25 10:07:15 +08:00
ebb4c282b1 [Flink]Simplify the use of flink connector (#6892)
1. Simplify the use of the Flink connector, making it work like other stream sinks via GenericDorisSinkFunction.
2. Add a usage example for the Flink connector.

## Use case
```
env.fromElements("{\"longitude\": \"116.405419\", \"city\": \"北京\", \"latitude\": \"39.916927\"}")
     .addSink(
          DorisSink.sink(
             DorisOptions.builder()
                   .setFenodes("FE_IP:8030")
                   .setTableIdentifier("db.table")
                   .setUsername("root")
                   .setPassword("").build()
                ));
```
2021-10-23 18:10:47 +08:00
090d99b690 [Docs] fix urls and format in routine load docs (#6896)
fix urls and format in routine load docs
2021-10-23 16:52:33 +08:00
7b50409ada [Bug][Binlog] Fix an issue where the number of versions may exceed the limit during data synchronization (#6889)
Bug detail: #6887 

To solve this problem, a transaction commit must meet at least one of the following conditions, to avoid committing too frequently:

1. The accumulated event count is greater than `min_sync_commit_size`.
2. The accumulated data size is greater than `min_bytes_sync_commit`.

In addition, when the accumulated data size exceeds `max_bytes_sync_commit`, the transaction is committed immediately.

Before:
![a5e0a2ba01ec4935144253fe0a364af7](https://user-images.githubusercontent.com/22125576/137933545-77018e89-fa2e-4d45-ae5d-84638cc0506a.png)

After:
![4577ec53afa47452c847bd01fa7db56c](https://user-images.githubusercontent.com/22125576/137933592-146bef90-1346-47e4-996e-4f30a25d73bc.png)
2021-10-23 16:47:32 +08:00
3267455eca Replace replica_allocation with replication_allocation (#6870)
Fix #6869
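A hedged sketch of the renamed property (table, column, and tag values illustrative):
```
CREATE TABLE example_tbl (k1 INT)
DISTRIBUTED BY HASH(k1) BUCKETS 8
PROPERTIES ("replication_allocation" = "tag.location.default: 3");
```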
2021-10-20 15:32:35 +08:00
51e210869a [ARM64] Fix some problem when compiling on ARM64 platform (#6836) (#6872)
Bump the thirdparty packages from 1.4.0 to 1.4.1:

1. Add a patch for aws-c-cal-0.4.5.
2. Add workarounds for `undefined reference` errors against libpsl.
3. Reorder libgsasl to fix a libcurl link problem.
4. Downgrade OpenSSL to 1.0.2k to fix problems with old glibc versions.
2021-10-19 13:26:02 +08:00
bd25d1a828 [Doc] Add documents for MySQL Binlog Load (#6859)
* add zh-CN docs

* add en docs and image

* fix

* fix
2021-10-19 10:25:42 +08:00
e96882f6c5 Update materialized_view.md (#6867) 2021-10-19 10:24:38 +08:00
fbd75c88d0 [Docs] Fix exporter document error (#6864)
* fix exporter document error

* update en doc
2021-10-19 10:24:08 +08:00
bb2b29c64f [Doc] Add type BOOLEAN when enter 'help create table' in mysql client (#6852)
Some users do not know that Doris supports the BOOLEAN type, so they use TINYINT instead;
this change adds BOOLEAN to the output of 'help create table' in the MySQL client.

Currently a BOOLEAN value occupies 1 byte even though it can only be 0 or 1,
which wastes some memory; the implementation may be changed to 1 bit in the future.
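A minimal sketch using the documented type (names illustrative):
```
CREATE TABLE bool_demo (
    id INT,
    is_active BOOLEAN
)
DISTRIBUTED BY HASH(id) BUCKETS 1
PROPERTIES ("replication_num" = "1");
```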
2021-10-17 22:54:12 +08:00
59017cebe6 [ARM64] Fix some problem when compiling on ARM64 platform (#6836)
1. Refactor the create method of the HDFS reader & writer.

    libhdfs3 does not support ARM64, so the HDFS reader & writer are not supported on ARM64.

2. Fix a macro issue in `LowerUpperImpl`.
2021-10-16 21:56:49 +08:00
607eef8d4d [Doc] Update compile docs add 0.15 build support. (#6850) 2021-10-15 18:37:24 +08:00
fcd15edbf9 [Export] Support export job with label (#6835)
```
EXPORT TABLE xxx
...
PROPERTIES
(
    "label" = "mylabel",
    ...
);
```

The user can then use the label to get job info via the SHOW EXPORT statement:
```
show export from db where label="mylabel";
```

For compatibility, a random label will be used if none is specified. For historical jobs, the label will be "export_job_id".

Unlike the LOAD statement, the label is specified in `properties` here because this does not cause grammatical conflicts,
and there is no need to bump the metadata version.
2021-10-15 10:18:11 +08:00
ad949c2f65 Optimize Hex and add related Doc (#6697)
Tested hex() in a loop of 10 million iterations with random numbers:
the old hex averaged 4.92 s, the optimized hex averages 0.46 s, nearly 10x faster.
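For reference, hex() returns the hexadecimal representation of its argument; expected results shown as comments:
```
SELECT HEX(255);    -- FF
SELECT HEX('abc');  -- 616263
```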
2021-10-13 11:36:14 +08:00
6cbefa9f10 [Docs] Update materialized view document (#6710)
* [Docs] Update materialized view document
2021-10-13 11:35:23 +08:00
30bf6c0d1d [DOC] minor update (#6820) 2021-10-13 09:14:56 +08:00
f439e5e533 [Doc] Documentation error (#6797)
Documentation error
2021-10-10 23:08:16 +08:00
bd19491b5b [Doc] Modify the description of dynamic partition hot partition (#6764)
Modify the description of dynamic partition hot partition
2021-10-10 23:06:14 +08:00
675aef7d75 [AliasFunction] Add support for cast in alias function (#6754)
support #6753
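A sketch of an alias function that uses CAST, following the pattern in the alias-function docs (function name illustrative):
```
CREATE ALIAS FUNCTION id_masking(BIGINT) WITH PARAMETER(id)
    AS CONCAT(LEFT(CAST(id AS STRING), 3), '****', RIGHT(CAST(id AS STRING), 4));
```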
2021-10-10 23:05:44 +08:00
4232f787ad [Doc] datax doriswriter use case (#6612)
datax doriswriter use case
2021-10-10 23:03:12 +08:00
237a8ae948 [Feature] support spark connector sink data using sql (#6796)
Co-authored-by: wei.zhao <wei.zhao@aispeech.com>
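A hedged sketch of sinking data through Spark SQL, assuming the connector's `doris` data source (connection values illustrative):
```
CREATE TEMPORARY VIEW spark_doris
USING doris
OPTIONS(
  "table.identifier" = "db.table",
  "fenodes" = "FE_IP:8030",
  "user" = "root",
  "password" = ""
);

INSERT INTO spark_doris VALUES (1, "a"), (2, "b");
```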
2021-10-09 15:47:36 +08:00
7a20d6d4c2 [Doc] Modify document of resource tag (#6778)
Fix typo
2021-10-03 11:37:45 +08:00
e7707c8180 [FOLLOWUP] create table like clause support copy rollup (#6580)
* Remove the `ALL` keyword to make the grammar clearer.

Co-authored-by: qzsee <shizhiqiang03@meituan.com>
2021-09-30 18:26:21 +08:00
ad3c9390a2 [Bug] Fix bdbje getDatabaseNames() bug and scan node close bug (#6769)
1. This bug was introduced by #6582.
2. Optimize the error message for the "Address already in use" error.
3. Add some documentation about compilation:
    1. Add a custom thirdparty download URL.
    2. Add a custom com.alibaba maven jar package for DataX.
4. Fix a bug where BE crashes when closing a scan node, introduced by #6622.
2021-09-29 11:11:28 +08:00
8d471007a6 [Feature] support spark connector sink stream data to doris (#6761)
* [Feature] support spark connector sink stream data to doris

* [Doc] Add spark-connector batch/stream writing instructions

* add license and remove meaningless blank code

Co-authored-by: wei.zhao <wei.zhao@aispeech.com>
2021-09-28 17:46:19 +08:00
df5ba6b5a2 [Fix] Flink connector support json import and use httpclient to streamlaod (#6740)
* [Bug]: fix NullPointerException thrown when data is null

* [Bug]:Distinguish between null and empty string

* [Feature]:flink-connector supports streamload parameters

* [Fix]:code style

* [Fix]: support json format import and use httpclient to streamload

* [Fix]:remove System out

* [Fix]: upgrade httpclient version

* [Doc]: add json format import doc

Co-authored-by: wudi <wud3@shuhaisc.com>
2021-09-28 17:37:03 +08:00
cdf9f9e980 [Dynamic Partition] reserve specific history periods by dynamic partition. (#6554)
Add RESERVED_HISTORY_STARTS and RESERVED_HISTORY_ENDS.
Fixes #6514
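A loosely sketched example; the property names below are hypothetical, derived from the constants in the commit message, so check the dynamic partition docs for the exact spelling:
```
-- Hypothetical property names; dates illustrative.
ALTER TABLE example_tbl SET (
    "dynamic_partition.reserved_history_starts" = "2021-01-01",
    "dynamic_partition.reserved_history_ends"   = "2021-01-31"
);
```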
2021-09-28 11:39:35 +08:00
982b76c3c0 [Bug] Fix resource tag bug, add documents and some other bug fix (#6708)
1. Fix the "UNKNOWN Operation Type 91" bug
2. Support the user property `resource_tag` to limit which BEs a user can use (see the sketch after this list)
3. Add new FE config `disable_tablet_scheduler` to disable tablet scheduler.
4. Add documents for resource tag.
5. Modify the default value of FE config `default_db_data_quota_bytes` to 1PB.
6. Add a new BE config `disable_compaction_trace_log` to disable the trace log of compaction time cost.
7. Modify the default value of BE config `remote_storage_read_buffer_mb` to 16MB
8. Fix incorrect `show backends` results
9. Add new BE config `external_table_connect_timeout_sec` to set the timeout when connecting to odbc and mysql table.
10. Modify issue template to enable blank issue, for release note or other specific usage.
11. Fix a bug in alpha_row_set split_range() function.
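A minimal sketch of item 2, assuming the user property is set via SET PROPERTY (user and tag names illustrative):
```
SET PROPERTY FOR 'example_user' 'resource_tags.location' = 'group_a';
```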
2021-09-28 10:37:42 +08:00
a121124fb2 [Doc] Update doris-on-es.md (#6734)
Typo
2021-09-25 12:28:03 +08:00
f3d4c475b1 [DOC] Add connection reset exception solution (#6733)
Add solution for connection reset exception when doing stream load.
2021-09-25 12:27:35 +08:00
ec777aa122 [DOCS] improve docs (#6718) 2021-09-25 12:26:41 +08:00
e5a4172b27 [Bug][Docs]Fix outfile docs for parquet (#6709)
Update outfile documents for parquet.
2021-09-25 12:24:52 +08:00
36d6788bc3 [Optimize] Use compact mode to send query plan thrift data structure. (#6702)
In some cases, the query plan thrift structure of a query may be very large
(for example, when there are many columns in SQL), resulting in a large number
of "send fragment timeout" errors.

This PR adds an FE config to control whether to transmit the query plan in a compressed format.

Using the compressed format reduces the size by ~50%, but may reduce
concurrency by ~10%. Therefore, in high-concurrency small-query scenarios,
you can choose to turn off compression.
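A hedged sketch of turning it off, assuming the switch is exposed as a mutable FE config named `use_compact_thrift_rpc` (the name is an assumption; if it is not mutable at runtime, set it in fe.conf and restart):
```
-- Config name is an assumption; verify against the FE config docs.
ADMIN SET FRONTEND CONFIG ("use_compact_thrift_rpc" = "false");
```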
2021-09-25 12:13:29 +08:00
56031cbbe1 [Doc] Change CN/EN sql-functions single quote in markdown (#6698) 2021-09-24 21:42:52 +08:00
f73af475ce [HTTP API] Add aggregation type information in table schema api (#6686)
```
{
	"msg": "success",
	"code": 0,
	"data": {
		"properties": [{
			"type": "INT",
			"name": "k1",
			"comment": "",
			"aggregation_type":""
		}, {
			"type": "INT",
			"name": "k2",
			"comment": "",
			"aggregation_type":"MAX"
		}],
		"status": 200
	},
	"count": 0
}
```
2021-09-24 21:42:24 +08:00
e03b74ebc1 [Doc] Add documentation for the error codes returned by OLAP functions on the BE side (#6666) 2021-09-24 21:40:20 +08:00
bdc8c98008 [Outfile] Support hdfs in select outfile clause (#6644)
Support HDFS in the SELECT INTO OUTFILE clause without a broker.
This PR implements an HDFS writer in BE that writes HDFS files directly instead of going through a broker.
A syntax check for the HDFS outfile clause has also been added in FE.
The syntax:
```
select * from xx into outfile "hdfs://user/outfile_" format as csv
properties ("hdfs.fs.dafultFS" = "xxx", "hdfs.hdfs_user" = "xxx");
```
Note that all HDFS configurations must carry the prefix `hdfs.`.
2021-09-24 10:07:11 +08:00
840a7ef3a8 Fix a typo (#6688)
Fix a typo
2021-09-23 09:44:46 +08:00
521fb15a9b [Bug] Fix some memory bugs (#6699)
1. Fix a memory leak in `collect_iterator.cpp` (Fix #6700)
2. Add a new BE config `max_segment_num_per_rowset` to limit the number of segments in a new rowset. (Fix #6701)
3. Make the error msg of stream load more friendly.
2021-09-22 12:30:14 +08:00
085942b30f [Doc] Download hashes and signatures use "downloads.apache.org" (#6677)
The latest release should use https://www.apache.org/dyn/closer.lua
The latest hashes and signatures should use https://downloads.apache.org/
Old releases should use http://archive.apache.org/dist
2021-09-16 18:09:08 +08:00
7ee39743de [Doc] Fix tabletScore expression in be_config.md (#6638)
Co-authored-by: Geoffrey <gaofeng01@rd.netease.com>
2021-09-16 10:24:46 +08:00
225bdb1fda [Bug] fix replace function bug (#6605)
* fix replace function bug

* fix replace docs
2021-09-14 09:59:13 +08:00
5d3c7fbd80 add doc for storage_root_path (#6593)
* add doc for storage_root_path

* Maintain consistency in both Chinese and English documents

Co-authored-by: Geoffrey <gaofeng01@rd.netease.com>
2021-09-10 09:52:58 +08:00
b3f02955d3 [Doc] modify irregular documents (like/ not like/ regexp.md) (#6572) 2021-09-09 14:11:37 +08:00
9469b2ce1a [Outfile] Support concurrent export of query results (#6539)
This PR mainly supports:
1. Exporting query result sets concurrently.
2. Exporting query result sets via the S3 protocol.

There are several preconditions for exporting a query result set concurrently (see the sketch below):
1. The concurrent-export session variable is enabled.
2. The query itself can be exported concurrently
    (queries containing a sort node at the top level cannot be exported concurrently).
3. The export uses the S3 protocol instead of a broker.

With concurrent export, the file name prefix is changed to outfile_{query_instance_id}_filenumber.{file_format}.
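A hedged sketch of a concurrent export, assuming the switch is the session variable `enable_parallel_outfile` (endpoint and credentials illustrative):
```
SET enable_parallel_outfile = true;

SELECT k1, k2 FROM example_tbl
INTO OUTFILE "s3://bucket/outfile_"
FORMAT AS CSV
PROPERTIES (
    "AWS_ENDPOINT" = "http://s3.example.com",
    "AWS_ACCESS_KEY" = "your_ak",
    "AWS_SECRET_KEY" = "your_sk",
    "AWS_REGION" = "your_region"
);
```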
2021-09-07 11:53:32 +08:00
79fd117d60 Update load-json-format.md (#6546)
change stripe_outer_array to strip_outer_array
2021-09-02 16:08:09 +08:00