doris

Author	SHA1	Message	Date
HappenLee	1a423350f8	[Interface](exec) Add interface for multi cast data sink (#19372 )	2023-05-10 10:29:33 +08:00
Xinyi Zou	cf8ceb8586	[fix](scan) fix scanner mem tracker (#19354 )	2023-05-10 09:56:41 +08:00
amory	b2371c1246	[Refact](Literal)refact literal get field and value (#19351 )	2023-05-10 09:01:17 +08:00
luozenglin	03538381a3	[enhancement](memory) MemCounter supports lock-free thread safety (#19256 ) make try_add() and update_peak() thread-safe.	2023-05-10 02:24:07 +08:00
Ashin Gau	68eb420cab	[fix](MySQL) the way Doris handles boolean type is consistent with MySQL (#19416 )	2023-05-10 00:58:09 +08:00
Adonis Ling	f8eb08252c	[chore](workflows) Disable PCH in GitHub workflows by default (#19447 ) PCH slows the BE UT workflows down. Disable it by default in workflows.	2023-05-10 00:05:32 +08:00
Qi Chen	096aa25ca6	[improvement](orc-reader) Implements ORC lazy materialization (#18615 ) - Implements ORC lazy materialization, integrate with the implementation of https://github.com/apache/doris-thirdparty/pull/56 and https://github.com/apache/doris-thirdparty/pull/62. - Refactor code: Move `execute_conjuncts()` and `execute_conjuncts_and_filter_block()` in `parquet_group_reader `to `VExprContext`, used by parquet reader and orc reader. - Add session variables `enable_parquet_lazy_materialization` and `enable_orc_lazy_materialization` to control whether enable lazy materialization. - Modify `build.sh` to update apache-orc submodule or download package every time.	2023-05-09 23:33:33 +08:00
Pxl	dfad7b6b38	[Feature](generic-aggregation) some prowork of generic aggregation (#19343 ) some prowork of generic aggregation	2023-05-09 21:42:21 +08:00
yongkang.zhong	7c7db9ce93	[typo](docs) Add an open page cache hint to the benchmark (#19449 )	2023-05-09 21:28:39 +08:00
yongkang.zhong	1bc405c06f	[fix](catalog) fix doris jdbc catalog largeint select error (#19407 ) when I use mysql-jdbc 5.1.47 create a doris jdbc catalog, the largeint cannot select When mysql-jdbc reads largeint, it will convert the format to string because it is too long mysql> select `largeint` from type3; ERROR 1105 (HY000): errCode = 2, detailMessage = (127.0.0.1)[INTERNAL_ERROR]Fail to convert jdbc type of java.lang.String to doris type LARGEINT on column: largeint. You need to check this column type between external table and doris table.	2023-05-09 17:34:48 +08:00
lihangyu	b07053f47d	[chore](simdjson reader) default enable simdjson for json reader (#19375 )	2023-05-09 16:53:21 +08:00
chenlinzhong	aeb3450151	[feature](graph)Support querying data from the Nebula graph database (#19209 ) Support querying data from the Nebula graph database This feature comes from the needs of commercial customers who have used Doris and Nebula, hoping to connect these two databases changes mainly include: * add New Graph Database JDBC Type * Adapt the type and map the graph to the Doris type	2023-05-09 15:30:11 +08:00
yiguolei	1424fb96ca	[bugfix](regression-test) disable string column length too large test and disable auto statistics collector and disable window function test (#19428 )	2023-05-09 14:57:02 +08:00
airborne12	2504b243f0	[Fix](build) fix clucene build type (#19376 ) RelWithDebInfo default uses O2 as compile flags which hurt performance for clucene	2023-05-09 14:29:04 +08:00
Ashin Gau	e3d4723849	[fix](JDBC) set jdbc parameters to compatible with both MySQL and Doris when reading boolean type (#19399 ) Fix errors when read boolean type from external doris cluster by jdbc catalog: ``` ERROR 1105 (HY000): errCode = 2, detailMessage = (172.16.10.11)[INTERNAL_ERROR]Fail to convert jdbc type of java.lang.Integer to doris type BOOL on column: deleted. You need to check this column type between external table and doris table. ``` MySQL Types and Return Values for GetColumnTypeName and GetColumnClassName are presented in https://dev.mysql.com/doc/connector-j/8.0/en/connector-j-reference-type-conversions.html. However when tinyInt1isBit=false, GetColumnClassName of MySQL returns java.lang.Boolean, while that of Doris returns java.lang.Integer. In order to be compatible with both MySQL and Doris, Jdbc params should set tinyInt1isBit=true&transformedBitIsBoolean=true	2023-05-09 13:53:17 +08:00
AlexYue	729cd319f1	[enhance](regression) add timeout for cold&heat case (#19360 )	2023-05-09 13:08:40 +08:00
AlexYue	d8dd0536e0	[enhance](S3FileWriter) sync when s3 file writer early quits #19393	2023-05-09 11:02:58 +08:00
ZashJie	4302ceaee8	[Improvement](data types) enhance show data types stmt (#18831 )	2023-05-09 09:42:44 +08:00
hechao	f23e6dd8c1	[typo](doc)fix invalid url (#19398 )	2023-05-08 22:59:50 +08:00
yagagagaga	b11e937778	[typo](docs) fix the wrong description about cte with (#19403 )	2023-05-08 22:59:34 +08:00
DeadlineFen	e08de52ee7	[chore](compile) using PCH for compilation acceleration under clang (#19303 )	2023-05-08 19:51:06 +08:00
lvshaokang	af04c3acab	[fix](sequence-column) Fix sequence_col column used default expr insert failed (#18933 )	2023-05-08 17:18:25 +08:00
yongkang.zhong	bc1bf420d1	[typo](docs) fix err to dynamic schema table doc (#19380 )	2023-05-08 16:35:57 +08:00
catpineapple	4a65f8cad9	[doc](fix) compaction config param (#19368 )	2023-05-08 16:34:47 +08:00
Kang	6ebcbcd0dd	[bugfix](testcase) fix test_query_sys_tables decimal type precision (#19230 )	2023-05-08 15:57:58 +08:00
ZashJie	e5a6b002a3	fixed (#19329 )	2023-05-08 14:24:40 +08:00
airborne12	f199860dea	[Improvement](inverted index) Enhance compaction performance through direct inverted index merging (#19207 )	2023-05-08 14:07:32 +08:00
yongkang.zhong	c7a04fa05a	[improvement](JDBC Catalog)Added Presto connection to Presto/Trino (#19307 )	2023-05-08 14:05:56 +08:00
yongkang.zhong	1e5000c9b2	[typo](docs) fix err to mac local dev doc (#19367 )	2023-05-08 14:05:33 +08:00
yongkang.zhong	7f0d6eb644	[log](fe)add log partitionInfo is null, fe not start service (#19143 )	2023-05-08 14:04:16 +08:00
Tiewei Fang	e78149cb65	[Enhencement](Export) add property for outfile/export and add test (#18997 ) This pr does three things: 1. add `delete_existing_files` property for outfile/export. If `delete_existing_files = true`, export/outfile will delete all files under file_path first. 2. add p2 test for export 3. modify docs	2023-05-08 14:02:20 +08:00
Adonis Ling	8c4f3d4126	[chore](macOS) Fix JAVA_OPTS in start_be.sh (#19267 ) We should set -XX:-MaxFDLimit on macOS if we enable java support for BE otherwise BE may fail to start up.	2023-05-08 14:01:10 +08:00
Ashin Gau	05c5c5949c	[refactor](FileCache) set FE session variable enable_file_cache=false as default (#19327 ) Users should set `enable_file_cache=true` in FE session variables and BE configuration to enable file cache.	2023-05-08 13:53:51 +08:00
zclllyybb	bb462202dc	[Exec] log the fuzzy config of be (#19349 )	2023-05-08 11:01:54 +08:00
Adonis Ling	673cbe3317	[chore](build) Porting to GCC-13 (#19293 ) Support using GCC-13 to build the codebase.	2023-05-08 10:42:06 +08:00
Mingyu Chen	fb5b3029a7	[fix](meta) fix image file checksum error (#19363 )	2023-05-08 10:00:09 +08:00
yiguolei	1f6898a091	[refactor](remove unused file) remove progresss updater (#19332 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-05-08 09:27:52 +08:00
yongkang.zhong	32273a7a9b	[improvement](backend)Optimized error messages for insufficient replication (#19211 ) optimized the error message for creating insufficient table replications	2023-05-07 20:45:21 +08:00
Mingyu Chen	abc73ac1eb	[refactor](cluster)(step-1) remove cluster related stmt (#19355 ) * [refactor](cluster)(step-1) remove cluster stmt	2023-05-07 18:44:42 +08:00
Qi Chen	b50e2a8c08	[Fix](parquet-reader) Fix dict cols not be converted back to string type in some cases. (#19348 ) Fix dict cols not be converted back to string type in some cases, which includes introduced by #19039. For dict cols, we will convert dict cols to int32 type firstly, then convert back to string type after read block. The block will be reuse it, so it is necessary to convert it back.	2023-05-07 10:05:23 +08:00
airborne12	ed368d7f6c	[chore](build) Ignore clucene checks (#19353 )	2023-05-07 09:38:44 +08:00
Dongyang Li	6c21df6324	[tools](tpch) run mode like clickbench (#19339 )	2023-05-06 23:33:26 +08:00
yongkang.zhong	9203e0392f	[typo](docs) add mac local dev docs (#19342 ) * [typo](docs) add mac local dev docs	2023-05-06 22:58:40 +08:00
奕冷	5bf1396efe	[enhancement](load) merge single-replica related services as non-standalone (#18421 )	2023-05-06 22:54:56 +08:00
Yusheng Xu	9edbfa37cd	[Enhancement](Broker Load) New progress manager for showing loading progress status (#19170 ) This work is in the early stage, current progress is not accurate because the scan range will be too large for gathering information, what's more, only file scan node and import job support new progress manager ## How it works for example, when we use the following load query: ``` LOAD LABEL test_broker_load ( DATA INFILE("XXX") INTO TABLE `XXX` ...... ) ``` Initial Progress: the query will call `BrokerLoadJob` to create job, then `coordinator` is called to calculate scan range and its location. Update Progress: BE will report runtime_state to FE and FE update progress status according to jobID and fragmentID we can use `show load` to see the progress PENDING: ``` State: PENDING Progress: 0.00% ``` LOADING: ``` State: LOADING Progress: 14.29% (1/7) ``` FINISH: ``` State: FINISHED Progress: 100.00% (7/7) ``` At current time, full output of `show load\G` looks like: ``` ************************* 1. row ************************* JobId: 25052 Label: test_broker State: LOADING Progress: 0.00% (0/7) Type: BROKER EtlInfo: NULL TaskInfo: cluster:N/A; timeout(s):250000; max_filter_ratio:0.0 ErrorMsg: NULL CreateTime: 2023-05-03 20:53:13 EtlStartTime: 2023-05-03 20:53:15 EtlFinishTime: 2023-05-03 20:53:15 LoadStartTime: 2023-05-03 20:53:15 LoadFinishTime: NULL URL: NULL JobDetails: {"Unfinished backends":{"5a9a3ecd203049bc-85e39a765c043228":[10080]},"ScannedRows":39611808,"TaskNumber":1,"LoadBytes":7398908902,"All backends":{"5a9a3ecd203049bc-85e39a765c043228":[10080]},"FileNumber":1,"FileSize":7895697364} TransactionId: 14015 ErrorTablets: {} User: root Comment: ``` ## TODO: 1. The current partition granularity of scan range is too large, resulting in an uneven loading process for progress." 2. Only broker load supports the new Progress Manager, support progress for other query	2023-05-06 22:44:40 +08:00
yongkang.zhong	2fe9ba7c2a	[fix](jdbc catalog) fix trino jdbc catalog varchar type err (#19298 )	2023-05-06 17:16:28 +08:00
Gabriel	4c6ca88088	Revert "[refactor](function) ignore DST for function `from_unixtime` (#19151 )" (#19333 ) This reverts commit 9dd6c8f87b73db238bfd38fb1d76f3796910f398.	2023-05-06 16:33:58 +08:00
15767714253	f584ad52ca	[UDF](demo) add new demo code for java udf (#19276 )	2023-05-06 16:17:54 +08:00
HappenLee	626a4c2ab0	[RegressionTest](pipeline) coredump when run regression test in pipeline engine (#19306 )	2023-05-06 14:54:17 +08:00
ElvinWei	3f6e5118e6	[enchancement](statistics) support periodic collection of statistics (#19247 ) This PR enables periodic collection of statistics and is a precursor to automatic statistics collection. It mainly includes the following contents： support periodic collection of statistics. Change the type of Date in statistics p0 to DateV2(see [Enhancement](data-type) add FE config to prohibit create date and decimalv2 type #19077) for test locally. complement cases(remove Chinese characters, optimize code, etc) , improve stability. Supports setting whether to keep records of statistics synchronization job info, convenient for use in p0 testing. The statistics job table was modified, and some auxiliary judgments were added to avoid the user perceiving the modification. This function was removed when the table schema is stable.	2023-05-06 14:53:06 +08:00

1 2 3 4 5 ...

10361 Commits