doris

Author	SHA1	Message	Date
zy-kkk	39ded1f649	[branch-2.1][improvement](jdbc catalog) Change JdbcExecutor's error reporting from UDF to JDBC (#37635 ) pick (#35692) In the initial version, JdbcExecutor directly used UdfRuntimeException, which could lead to misunderstanding of the exception. Therefore, I created a separate Exception for JdbcExecutor to help us view the exception more clearly.	2024-07-11 15:11:41 +08:00
zy-kkk	ef754487d9	[branch-2.1][improvement](jdbc catalog) Catch `AbstractMethodError` in `getColumnValue` Method and Suggest Updating to ojdbc8+ (#37634 ) pick (#37608) Catch AbstractMethodError in getColumnValue method. Provide a clear error message suggesting the use of ojdbc8 or higher versions to avoid compatibility issues.	2024-07-11 15:10:47 +08:00
zy-kkk	62c4451c97	[branch-2.1][improvement](jdbc catalog) Modify the maximum number of connections in the connection pool to 30 by default (#37023 ) pick (#36720) In many cases, we found that users would use JDBC Catalog to perform a large number of queries, which resulted in the maximum of 10 connections being insufficient, so I adjusted it to 30, which covered most needs.	2024-07-01 12:22:20 +08:00
zy-kkk	e25b0d7c37	[branch-2.1][improvement](mysql catalog) disable mysql AbandonedConnectionCleanupThread (#36970 ) pick (#36655)	2024-06-29 18:35:41 +08:00
wuwenchi	4008a04da7	[bugfix](paimon)Fix field case issues for 2.1 (#36288 ) bp: #36239	2024-06-17 18:38:00 +08:00
wuwenchi	bd6b913e00	[bugfix](paimon)paimon's field length judgment error for 2.1 (#36049 ) bp #35981	2024-06-07 21:13:08 +08:00
Ashin Gau	4f0365e0bf	[fix](s3) move s3 providers to fe-common to be accessible for jni reader (#35779 ) backport: #35690 `PropertyConverter.setS3FsAccess` has add customized s3 providers: ``` public static final List<String> AWS_CREDENTIALS_PROVIDERS = Arrays.asList( DataLakeAWSCredentialsProvider.class.getName(), TemporaryAWSCredentialsProvider.class.getName(), SimpleAWSCredentialsProvider.class.getName(), EnvironmentVariableCredentialsProvider.class.getName(), IAMInstanceCredentialsProvider.class.getName()); ``` And these providers are set as configuration value of `fs.s3a.aws.credentials.provider`, which will be used as configuration to build s3 reader in JNI readers. However, `DataLakeAWSCredentialsProvider` is in `fe-core`, that is not dependent by JNI readers, so we have to move s3 providers to `fe-common'.	2024-06-03 14:04:39 +08:00
Ashin Gau	9dc1196f1c	[update](hudi) update hudi-spark bundle to 3.4.3 (#35013 ) (#35718 ) backport: #35013	2024-05-31 20:51:28 +08:00
苏小刚	72a27a0938	[fix](paimon)fix paimon cache bug (#35309 ) Issue Number: close #35024 This bug is because the fe incorrectly sets the update time of paimon catalog, causing the be to be unable to update paimon's schema in time. ```c++ private void initTable() { PaimonTableCacheKey key = new PaimonTableCacheKey(ctlId, dbId, tblId, paimonOptionParams, dbName, tblName); TableExt tableExt = PaimonTableCache.getTable(key); if (tableExt.getCreateTime() < lastUpdateTime) { LOG.warn("invalidate cache table:{}, localTime:{}, remoteTime:{}", key, tableExt.getCreateTime(), lastUpdateTime); PaimonTableCache.invalidateTableCache(key); tableExt = PaimonTableCache.getTable(key); } this.table = tableExt.getTable(); paimonAllFieldNames = PaimonScannerUtils.fieldNames(this.table.rowType()); if (LOG.isDebugEnabled()) { LOG.debug("paimonAllFieldNames:{}", paimonAllFieldNames); } } ```	2024-05-28 18:52:51 +08:00
zy-kkk	2e20e38523	[improvement](jdbc catalog) remove useless jdbc catalog code (#34986 ) (#35418 )	2024-05-27 14:25:26 +08:00
zy-kkk	3a5fb6265a	[refactor](jdbc catalog) split trino jdbc executor (#34932 ) (#35176 ) pick #34932	2024-05-22 19:09:57 +08:00
zy-kkk	05a390e050	[refactor](jdbc catalog) split oceanbase jdbc executor (#34869 ) (#35175 ) pick #34869	2024-05-22 19:09:35 +08:00
zy-kkk	24990383ff	[refactor](jdbc catalog) split clickhouse jdbc executor (#34794 ) (#35174 ) pick master #34794	2024-05-22 19:09:05 +08:00
zy-kkk	22d4543346	[refactor](jdbc catalog) split sap_hana jdbc executor (#34772 )	2024-05-13 22:36:52 +08:00
zy-kkk	fcbdd77f78	[fix](jdbc catalog) Fix ClassLoader Scope in JdbcExecutor Initialization (#34620 )	2024-05-10 11:24:39 +08:00
zy-kkk	3495ed58e0	[Enhancement](jdbc catalog) Change Jdbc connection pool to hikari (#34045 ) (#34310 )	2024-04-29 20:22:48 +08:00
苏小刚	11039ade7b	[opt](paimon) support mapping Paimon column type "Row" to Doris type "Struct" (#34239 ) backport: #33786	2024-04-28 19:38:50 +08:00
Mingyu Chen	fbd2c9db2d	[upgrade](hive-shade)(paimon) upgrade hive shade to 2.0.0 and paimon to 0.7 (#34085 ) * Adapt paimon 0.6.0 (#33943) Version 2.0.0 of the shade package eliminates potential jar conflicts, resolves dependency component issues, and significantly reduces package size. Utilize the directly-dependent guava library instead of relying on transitively included libraries. * [chore](dependencies)Upgrade paimon to 0.7.0 (#33987) --------- Co-authored-by: Calvin Kirs <kirs@apache.org>	2024-04-25 12:01:44 +08:00
zy-kkk	f6af79c0ed	[fix](catalog) Remove unexpected cleanup when reading jdbc data (#33529 )	2024-04-17 23:42:12 +08:00
wuwenchi	e9b67bc82d	[bugfix](paimon)merge meta-inf/services for paimon FileIOLoader (#33166 ) We introduced paimon's oss and s3 packages, but did not register them in meta-info/service. As a result, when be used the s3 or oss interface, an error was reported and the class could not be found(`Could not find a file io implementation for scheme 's3' in the classpath.`). FYI: https://stackoverflow.com/questions/47310215/merging-meta-inf-services-files-with-maven-assembly-plugin https://stackoverflow.com/questions/1607220/how-can-i-merge-resource-files-in-a-maven-assembly	2024-04-07 22:13:00 +08:00
morningman	9c6180d9ba	[revert](jni) revert part of #32455 #32904	2024-03-27 20:45:44 +08:00
Mingyu Chen	c0d7a5660e	[fix](paimon) support paimon with hive2 (#32455 ) In order to support paimon with hive2, we need to modify the origin HiveMetastoreClient.java to let it compatible with both hive2 and hive3. And this modified HiveMetastoreClient should be at the front of the CLASSPATH, so that it can overwrite the HiveMetastoreClient in hadoop jar. This PR mainly changes: 1. Copy HiveMetastoreClient.java in FE to BE's preload jar. 2. Split the origin `preload-extensions-jar-with-dependencies.jar` into 2 jars 1. `preload-extensions-project.jar`, which contains the modified HiveMetastoreClient. 2. `preload-extensions-jar-with-dependencies.jar`, which contains other dependency jars. 3. Modify the `start_be.sh`, to let `preload-extensions-project.jar` be loaded first. 4. Change the way the assemble the jni scanner jar Only need to assemble the project jar, without other dependencies. Because actually we only use classed under `org.apache.doris` package. So remove other unused dependency jars can also reduce the output size of BE. 5. fix bug that the prefix of paimon properties should be `paimon.`, not `paimon` 6. Support paimon with hive2 User can set `hive.version` in paimon catalog properties to specify the hive version.	2024-03-26 15:31:07 +08:00
Ashin Gau	ec43f65235	[feature](hudi) support hudi incremental read (#32052 ) * [feature](hudi) support incremental read for hudi table * fix jdk17 java options	2024-03-26 15:31:07 +08:00
zy-kkk	a10466598b	[fix](jdbc catalog) Fix query errors without jdbc pool default value on only BE upgrade (#32618 )	2024-03-22 16:36:22 +08:00
Mingyu Chen	4c8aaa156a	[fix](jni) remove 'push_down_predicates' and fix BE crash with decimal predicate (#32253 ) (#32599 )	2024-03-21 14:07:50 +08:00
zy-kkk	e952b5ef5b	[opt](jdbc catalog) Refine the jdbc_connector close logic and actively clear the jvm occupied by jdbcexecutor (#32300 )	2024-03-21 14:07:23 +08:00
yiguolei	85b2c42f76	[Enhancement](jdbc catalog) Add a property to test the connection when creating a Jdbc catalog (#32125 ) (#32531 )	2024-03-21 14:05:59 +08:00
zy-kkk	cf6b22c621	[fix](jdbc catalog) fix type conversion error in MySQL JDBC Driver 5.x (#31880 )	2024-03-12 14:07:57 +08:00
zy-kkk	0f1cbcc86a	[refactor](jdbc catalog) split postgresql jdbc Executor (#31730 )	2024-03-06 13:08:04 +08:00
zy-kkk	2e9bd268cd	[improvement](jdbc catalog) support sqlserver timestamp type read (#31805 )	2024-03-06 13:08:04 +08:00
zy-kkk	11903d29a1	[fix](jdbc catalog) fix close abort in sqlserver (#31718 )	2024-03-06 13:07:49 +08:00
zy-kkk	a5b9127656	[refactor](jdbc catalog) split sqlserver jdbc executor (#31679 )	2024-03-06 13:04:29 +08:00
zy-kkk	07224686ef	[feature](jdbc catalog) support db2 jdbc catalog (#31627 )	2024-03-01 14:19:28 +08:00
slothever	0b5b7175d6	[fix](multi-catalog) add max compute custom odps and tunnel url (#31390 ) add max compute custom odps and tunnel url	2024-02-29 16:44:40 +08:00
Mingyu Chen	450422556a	[fix](paimon) auto deplay paimon oss/s3 jar file (#31568 ) No need to deploy paimon oss/s3 jar files manually. Include them in preload-extensions-jar-with-dependencies.jar	2024-02-29 16:44:40 +08:00
zy-kkk	3c37fb085c	[refactor](jdbc catalog) split jdbc executor for different data sources (step-1) (#31406 )	2024-02-29 12:38:03 +08:00
Mingyu Chen	883d022f84	[fix](paimon) fix hadoop.username does not take effect in paimon catalog (#31478 )	2024-02-28 13:08:41 +08:00
zy-kkk	18f9ca9242	[fix](jdbc catalog) Fix Resource Closing Logic After Connection Abort (#31219 ) * [improvement](jdbc catalog) Optimize the closing logic of Jdbc connection after abort * [improvement](jdbc catalog) Optimize the closing logic of Jdbc connection after abort * fix	2024-02-22 20:35:10 +08:00
Ashin Gau	260568db17	[update](hudi) update hudi version to 0.14.1 and compatible with flink hive catalog (#31181 ) 1. Update hudi version from 0.13.1 to .14.1 2. Compatible with the hudi table created by flink hive catalog	2024-02-22 19:51:20 +08:00
Calvin Kirs	02bded2688	[Improve](common)Optimize logging performance with LOG.isDebugEnabled() (#31091 ) * [Improve](common)Optimize logging performance with LOG.isDebugEnabled() * fix error ut	2024-02-20 09:16:14 +08:00
slothever	4a33d9820a	[fix](multi-catalog)fix getting ugi methods and unify them (#30844 ) put all ugi login methods to HadoopUGI	2024-02-20 09:12:38 +08:00
slothever	bc2e8ac8f9	[fix](kerberos) fix kerberos ugi login method (#30766 )	2024-02-06 08:35:54 +08:00
zy-kkk	4f8730d092	[improvement](jdbc catalog) Optimize connection pool parameter settings (#30588 ) This PR makes the following changes to the connection pool of JDBC Catalog 1. Set the maximum connection survival time, the default is 30 minutes - Moreover, one-half of the maximum survival time is the recyclable time, - One-tenth is the check interval for recycling connections 2. Keepalive only takes effect on the connection pool on BE, and will be activated based on one-fifth of the maximum survival time. 3. The maximum number of existing connections is changed from 100 to 10 4. Add the connection cache recycling thread on BE, and add a parameter to control the recycling time, the default is 28800 (8 hours) 5. Add CatalogID to the key of the connection pool cache to achieve better isolation, requires refresh catalog to take effect 6. Upgrade druid connection pool to version 1.2.20 7. Added JdbcResource's setting of default parameters when upgrading the FE version to avoid errors due to unset parameters.	2024-02-03 20:26:03 +08:00
Tiewei Fang	9b7c6af581	[Fix](JDK17) Fixed that BE could not be started using JDK17 (#30286 ) Issue Number: #30484 This is because hadoop-client-api relies on hadoop-common. In the case of JDK17, it will still include hadoop-common.	2024-02-01 23:14:14 +08:00
wuwenchi	11f7b36dab	[bugfix](paimon)add class loader (#30483 )	2024-01-30 15:33:40 +08:00
slothever	b1a9370004	[fix](glue)support access glue iceberg with credential list (#30473 ) merge from #30292	2024-01-28 18:23:07 +08:00
zy-kkk	bc03354be8	[improvement](jdbc catalog) Optimize the Close logic of JDBC client (#30236 ) Optimize the Close logic of the JDBC client so that the Jdbc Catalog can correctly cancel the running query when the query is cancelled.	2024-01-23 13:22:14 +08:00
zy-kkk	0ccd706a30	[Enhancement](Jdbc Catalog) Map Jdbc Catalog JSON Type to String for Improved Performance and Compatibility (#30035 ) This PR proposes mapping external catalog JSON types to String instead of JsonB in Apache Doris. This change is motivated by the realization that JDBC retrieves JSON data as a String JSON string, regardless of its storage format (Json(String) or Json(Binary)). Mapping to String streamlines data retrieval, simplifies write-backs, and ensures compatibility with all JSON(String) and JSON(Binary) functions, despite potentially misleading displays of JSON data as Strings in Doris. This approach avoids the performance overhead and complexity of converting each row of data from JsonB to String, making the process more efficient and elegant. About Upgrade To ensure query compatibility with existing Catalogs in the upgraded version,we currently still retain the capability to query external JSON types as JSONB. However, once you upgrade to the new version and either refresh the Catalog or create a new one, all external JSON types will be treated as Strings. To ensure consistent behavior,and possible future removal of support for JSON as JSONB query code, it is highly recommended that you manually refresh your Catalog as soon as possible after upgrading to the new version.	2024-01-18 12:03:07 +08:00
Mingyu Chen	12af86176a	[fix](class-loader) fix class loader conflict on BE side (#29942 ) 1. make `hadoop-common` in be java extension as `provided`. 2. must load be java extension jars before hadoop jars	2024-01-14 15:53:33 +08:00
zy-kkk	8fc9c18c85	[improvement](jdbc catalog) Put the jdbc connection pool parameters into catalog properties (#29195 )	2024-01-12 11:40:28 +08:00

1 2 3

129 Commits