doris

Author	SHA1	Message	Date
Qi Chen	99af54f779	[Fix](orc-reader) Fix the issue when string col has mixed plain and dict encoding in different stripes. (#34146 ) (#34248 ) backport #34146	2024-04-28 19:43:57 +08:00
苏小刚	11039ade7b	[opt](paimon) support mapping Paimon column type "Row" to Doris type "Struct" (#34239 ) backport: #33786	2024-04-28 19:38:50 +08:00
Mingyu Chen	45556686ea	[fix](test) fix some external test cases (#34209 ) Fix some test cases and enable `test_information_schema_external` suite	2024-04-27 23:25:33 +08:00
qiye	414fbd353e	[fix](ES catalog)Make col != '' behavior consistent with SQL (#34151 ) In SQL syntax, `col != ''` equals `col.length() > 0`. It means that this column must exist in ES doc fields and its content is not empty. In this PR, we make a special translation for this binary predicate to keep the behavior of both consistent. --------- Co-authored-by: Luennng <luennng@gmail.com>	2024-04-27 02:29:33 +08:00
苏小刚	0f0c0a266b	[opt](parquet)Skip page with offset index (#33082 ) Make skip_page() in ColumnChunkReader more efficient. No more reading page headers if there are pagelocations in chunk.	2024-04-26 15:06:16 +08:00
Qi Chen	acc2b532e7	[Test](hive-writer) Adjust test_hive_write_partitions regression test to resolve special characters issue with git on windows. (#34026 )	2024-04-26 15:05:47 +08:00
Mingyu Chen	50f9d47e96	[test](hive) run suite cases both in hive2 and hive3 (#33874 ) (#34156 ) bp #33874 Co-authored-by: 苏小刚 <suxiaogang223@icloud.com>	2024-04-26 13:48:09 +08:00
wangbo	03c3419265	[Refactor](executor)Add workload schedule policy table (#33729 )	2024-04-20 20:06:34 +08:00
Mingyu Chen	0e3ad5cd9d	[fix](parquet) fix time zone error(isAdjustedToUTC=true) in parquet reader (#33675 ) (#33924 ) bp (#33675) Co-authored-by: Ashin Gau <AshinGau@users.noreply.github.com>	2024-04-20 19:06:54 +08:00
苏小刚	1c025c0488	[docker](hive) add hive3 docker compose and modify scripts (#33115 ) add hive3 docker compose from: big-data-europe/docker-hive#56	2024-04-17 23:42:13 +08:00
zy-kkk	1be753ed75	[enhancement](mysql compatible) add user and procs_priv tables to mysql db in all catalogs (#33058 ) Issue Number: close #xxx This PR aims to enhance the compatibility of BI tools (such as Dbeaver, DataGrip) when using the mysql connector to connect to Doris, because some BI tools query some tables in the mysql database. In our tests, the user and procs_priv tables were mainly queried. This PR adds these two tables and adds actual data to the user table. However, please note that most of the fields in the user table are in Doris' own format rather than mysql format, so it can only ensure that the BI tool is querying No error is reported when accessing these tables, which does not guarantee that the data is completely displayed, and the tables under Doris's mysql database do not support data modification. Thanks to @liujiwen-up for assisting in testing	2024-04-17 23:42:12 +08:00
zy-kkk	b035c7ceb4	[fix](catalog) fix resource is not reopen when rename catalog (#33432 ) During the renaming of `JdbcCatalog`, I noticed that the `jdbcClient` was being closed, resulting in exceptions during subsequent queries. This happens because the `removeCatalog` method is invoked when changing the name, which in turn calls the `onClose` method of the catalog. Ideally, the client should not be closed when renaming the catalog. However, to avoid extra checks in the `removeCatalog` method, we can simply execute `onRefresh` in the `addCatalog` method to address this issue.	2024-04-12 15:09:25 +08:00
slothever	18fb8407ae	[feature](insert)use optional location and add hive regression test (#33153 )	2024-04-12 10:38:54 +08:00
slothever	07f296734a	[regression](insert)add hive DDL and CTAS regression case (#32924 ) Issue Number: #31442 dependent on #32824 add ddl(create and drop) test add ctas test add complex type test TODO: bucketed table test truncate test add/drop partition test	2024-04-12 10:24:23 +08:00
slothever	716c146750	[fix](insert)fix hive external return msgs and exception and pass all columns to BE (#32824 ) [fix](insert)fix hive external return msgs and exception and pass all columns to BE	2024-04-12 10:23:52 +08:00
Tiewei Fang	61e214c327	[Fix](Hive-Metastore) fix that if JDBC reads the NULL value, it will cause NPE (#32831 )	2024-04-10 11:55:17 +08:00
Qi Chen	5116724494	[Fix](hive-writer) Fix the issue of block was not copied to do filtering when hive partition writer write block to file. (#32775 ) (#33447 ) backport #32775	2024-04-10 11:42:23 +08:00
Qi Chen	4963d60a07	[Fix](multi-catalog)Fix the issue of not initializing the writer caused by refactoring and add hive writing regression test. (#32721 ) (#33446 ) backport #32721.	2024-04-10 11:42:22 +08:00
zy-kkk	fae55e0e46	[Feature](information_schema) add processlist table for information_schema db (#32511 )	2024-04-07 23:24:22 +08:00
Ashin Gau	29556f758e	[fix](parquet) fix time zone error in parquet reader (#33217 ) `isAdjustedToUTC` is exactly the opposite in parquet reader(https://github.com/apache/parquet-format/blob/master/LogicalTypes.md), resulting the time with `isAdjustedToUTC=true` has increased by eight hours(UTC8). The parquet with `isAdjustedToUTC=true` can be produced by spark-sql with the following configuration: ``` --conf spark.sql.session.timeZone=UTC --conf spark.sql.parquet.outputTimestampType=TIMESTAMP_MICROS ``` However, using the following configuration, there's no logical and convert type in parquet meta data, so the time read by doris will also increase by eight hours(UTC8). Users need to set their own UTC time zone in doris(https://doris.apache.org/docs/dev/advanced/time-zone/) ``` --conf spark.sql.session.timeZone=UTC --conf spark.sql.parquet.outputTimestampType=INT96 ```	2024-04-07 23:24:22 +08:00
feiniaofeiafei	b5a1914740	[Fix](nereids) Fix deletestmt getting catalog (#32701 )	2024-03-26 20:29:03 +08:00
yiguolei	7b94cfdba1	Revert "[Fix](tests) add regression tests for trino-connector (#32552 )" This reverts commit 3fc3a4650681cb519405730899a2f22f268b38c1.	2024-03-25 22:38:21 +08:00
Tiewei Fang	3fc3a46506	[Fix](tests) add regression tests for trino-connector (#32552 )	2024-03-25 22:31:55 +08:00
Tiewei Fang	d7a3ff1ddf	[Fix](Outfile) Fix the column type mapping in the orc/parquet file format (#32281 ) \| Doris Type \| Orc Type \| Parquet Type \| \|---------------------\|--------------------\|------------------------\| \| Date \| Long (logical: DATE) \| int32 (Logical: Date) \| \| DateTime \| TIMESTAMP (logical: TIMESTAMP) \| int96 \|	2024-03-22 08:52:16 +08:00
zy-kkk	dea6859e0d	[refactor](jdbc catalog) refactor jdbc catalog get databases logic (#32579 )	2024-03-21 14:07:50 +08:00
zy-kkk	26ed4b69b1	[opt](jdbc catalog) filter jdbc datasource internal database (#32294 )	2024-03-21 14:07:23 +08:00
Mryange	8bd101129a	[behavior change](output) change float output format (#32049 )	2024-03-21 14:07:22 +08:00
yiguolei	85b2c42f76	[Enhancement](jdbc catalog) Add a property to test the connection when creating a Jdbc catalog (#32125 ) (#32531 )	2024-03-21 14:05:59 +08:00
wangbo	258dcfca97	[Refactor](executor)Add information_schema.workload_groups (#32195 ) (#32314 )	2024-03-15 20:46:54 +08:00
wangbo	df5ec16d7c	[Refactor](exectuor)Add schema type table active_queries (#32057 ) * Add schema type table active_queries	2024-03-15 17:57:28 +08:00
zy-kkk	31ee448c87	[test](fix) Fix one missing line of output in out file (#32036 )	2024-03-12 14:17:55 +08:00
zy-kkk	cf6b22c621	[fix](jdbc catalog) fix type conversion error in MySQL JDBC Driver 5.x (#31880 )	2024-03-12 14:07:57 +08:00
wangbo	c5390d00bb	[Improvement]Add schema table backend_active_tasks (#31945 )	2024-03-09 19:55:48 +08:00
zy-kkk	5b00f4fbeb	[improvement](jdbc catalog) opt get db2 schema list & xml type mapping (#31856 ) 1. Trim Schema Names: Adapted the system to remove trailing spaces from DB2 schema names, ensuring compatibility without affecting query operations. 2. XML Mapping: Implemented a feature to directly map XML types to String.	2024-03-07 16:53:19 +08:00
zy-kkk	2e9bd268cd	[improvement](jdbc catalog) support sqlserver timestamp type read (#31805 )	2024-03-06 13:08:04 +08:00
zy-kkk	07224686ef	[feature](jdbc catalog) support db2 jdbc catalog (#31627 )	2024-03-01 14:19:28 +08:00
Tiewei Fang	4636b6195b	[Fix](JNI) fix BE core when using JNI to query the empty `map` type value (#31502 )	2024-02-29 14:03:38 +08:00
zy-kkk	3c37fb085c	[refactor](jdbc catalog) split jdbc executor for different data sources (step-1) (#31406 )	2024-02-29 12:38:03 +08:00
zy-kkk	b3ac2128dd	[Refactor](catalog) Refactor Jdbc Catalog external name case mapping rules (#28414 )	2024-02-19 17:22:03 +08:00
Guangdong Liu	8db2824c44	[bugfix](es catalog) add constant_keyword wildcard data type (#30947 )	2024-02-19 17:20:21 +08:00
zy-kkk	6e4f76de54	[improvement](jdbc catalog) Delete unnecessary schema and optimize insert logic (#30880 ) In the previous design, we were compatible with MySQL's auto-increment column and default value to bypass the null value check when writing back Jdbc External Table. However, because MySQL's default value is not completely unified with Doris, this resulted in The unsuitable default value is wrong. In response to this situation, I made the following optimizations 1. For JDBC External Table, we always allow certain columns to be missing during insertion. Even if these columns are not allowed to be empty at the source end, the error should be generated by the source end, not Doris herself. 2. When the target column is non-nullable and the insertion is done via `INSERT INTO tbl VALUES()` or `INSERT INTO tbl SELECT constants`, Doris should verify any inconsistency between them and throw an exception. This check is not applied for `INSERT INTO tbl SELECT ... FROM tbl` operations.	2024-02-19 17:20:21 +08:00
abmdocrt	2cb46eed94	[Feature](auto-inc) Add start value for auto increment column (#30512 )	2024-02-16 10:12:23 +08:00
zy-kkk	92226c986a	[fix](catalog) fix data_sub/data_add func pushdown in jdbcscan (#30807 )	2024-02-06 08:35:54 +08:00
Qi Chen	9100fba47e	[Fix](parquet-reader) Fix decimal test case out files. (#30715 )	2024-02-01 21:17:17 +08:00
Qi Chen	92cad69fc4	[Fix](parquet-reader) Fix reading fixed length byte array decimal in parquet reader. (#30535 )	2024-01-31 23:53:40 +08:00
wuwenchi	7d037c12bf	[bugfix](paimon)fix paimon testcases (#30514 ) 1. set default timezone 2. not supported `char` type to pushdown	2024-01-31 23:53:39 +08:00
amory	0f81d2d533	[FIX](complextype)fix complex type nested version type but not hide version (#30419 )	2024-01-29 19:03:47 +08:00
morrySnow	779a9a1fbb	[opt](planner) use string for varchar in ctas if original table is not olap (#30323 )	2024-01-29 19:03:47 +08:00
qiye	fac0580eae	[opt](docker)optimize ES docker compose (#30068 ) 1. add volume for es logs 2. optimize health check, waiting for es status to be green 3. fix es6 valume path error 4. optimize disk watermark to avoid es disk watermark error 5. fix es6 create index error 6. add custom elasticsearch.yml for es6 7. add log4j2.properties for es6, es7, es8	2024-01-19 15:48:56 +08:00
zy-kkk	0ccd706a30	[Enhancement](Jdbc Catalog) Map Jdbc Catalog JSON Type to String for Improved Performance and Compatibility (#30035 ) This PR proposes mapping external catalog JSON types to String instead of JsonB in Apache Doris. This change is motivated by the realization that JDBC retrieves JSON data as a String JSON string, regardless of its storage format (Json(String) or Json(Binary)). Mapping to String streamlines data retrieval, simplifies write-backs, and ensures compatibility with all JSON(String) and JSON(Binary) functions, despite potentially misleading displays of JSON data as Strings in Doris. This approach avoids the performance overhead and complexity of converting each row of data from JsonB to String, making the process more efficient and elegant. About Upgrade To ensure query compatibility with existing Catalogs in the upgraded version,we currently still retain the capability to query external JSON types as JSONB. However, once you upgrade to the new version and either refresh the Catalog or create a new one, all external JSON types will be treated as Strings. To ensure consistent behavior,and possible future removal of support for JSON as JSONB query code, it is highly recommended that you manually refresh your Catalog as soon as possible after upgrading to the new version.	2024-01-18 12:03:07 +08:00

1 2 3

148 Commits