doris

Author	SHA1	Message	Date
zhangdong	9c7854f1ff	[Enhancement](k8s) Add k8s yaml demo (#17281 )	2023-03-11 10:56:57 +08:00
huanghaibin	697cba9a85	[fix](broker-load) fix broker's Dockerfile (#17657 ) there is some spelling mistake in broker's Dockerfile and need to fix it.	2023-03-11 10:43:09 +08:00
huangzhaowei	4ba93efc98	[Enhance](DOE)Support parse default es iso datetime string (#17412 ) * support parse default es iso datetime string	2023-03-10 09:59:20 +08:00
huangzhaowei	9bcc3ae283	[Fix](DOE)Fix be core dump when parse es epoch_millis date format (#17100 )	2023-02-28 20:09:35 +08:00
Tiewei Fang	3a9aa03aab	[BugFix](oracle-catalog) Modify the doris data type mapping of oracle `NUMBER(p,s)` type (#17051 ) The data type `NUMBER(p,s)` of oracle has some different of doris decimal type in semantics. For Oracle Number(p,s) type： 1. if s<0 , it means this is an Interger. This `NUMBER(p,s)` has (p+\|s\| ) significant digit, and rounding will be performed at s position. eg: if we insert 1234567 into `NUMBER(5,-2)` type, then the oracle will store 1234500. In this case, Doris will use int type (`TINYINT/SMALLINT/INT/.../LARGEINT`). 2. if s>=0 && s<p , it just like doris Decimal(p,s) behavior. 3. if s>=0 && s>p, it means this is a decimal(like 0.xxxxx). p represents how many digits can be left to the left after the decimal point, the figure after the decimal point s will be rounded. eg: we can not insert 0.0123456 into `NUMBER(5,7)` type, because there must be two zeros on the right side of the decimal point, we can insert 0.0012345 into `NUMBER(5,7)` type. In this case, Doris will use `DECIMAL(s,s)` 4. if we don't specify p and s for `NUMBER(p,s)` like `NUMBER`, the p and s of `NUMBER` are uncertain. In this case, doris can not determine p and s, so doris can not determine data type.	2023-02-26 09:05:41 +08:00
qiye	92ecd16573	(feature)[DOE]Support array for Doris on ES (#16941 ) * (feature)[DOE]Support array for Doris on ES	2023-02-23 19:31:18 +08:00
FreeOnePlus	6012fc3605	[feature](docker)Fe docker init script add new interface option (#16846 ) add interface BUILD_TYPE, Values only one "k8s". e.g. docker run -itd \ --name=fe-02 \ --env BUILD_TYPE="k8s" -p 8032:8030 \ -p 9032:9030 \ --network=doris-network \ --ip=172.20.80.4 \ freeoneplus/doris:1.2.2-fe-x86_64 add interface group FE_MASTER_IP & FE_MASTER_PORT & FE_CURRENT_IP & FE_CURRENT_PORT docker run -itd \ --name=fe-02 \ --env FE_MASTER_IP="172.20.80.2" \ --env FE_MASTER_PORT=9010 \ --env FE_CURRENT_IP="172.20.80.4" \ --env FE_CURRENT_PORT=9010 \ -p 8032:8030 \ -p 9032:9030 \ --network=doris-network \ --ip=172.20.80.4 \ freeoneplus/doris:1.2.2-fe-x86_64 --------- Co-authored-by: Yijia Su <suyijia@selectdb.com>	2023-02-17 08:41:38 +08:00
FreeOnePlus	ce7791c362	[fix](docker)Fix Dockerfile logic (#16791 ) Error log logic fix. Remove Chinese annotations.	2023-02-16 16:14:43 +08:00
FreeOnePlus	611d9aca10	[feature](docker)Add Docker Broker Init Script (#16733 ) Add Docker Broker Init Script	2023-02-15 19:26:01 +08:00
FreeOnePlus	4a6fd7cc30	[feature](docker) Add Docker BE computer node Interface (#16630 )	2023-02-14 15:44:26 +08:00
FreeOnePlus	274016f50e	[fix](docker)Fix Docker init_be script (#16629 ) docker_process_sql function have error output.	2023-02-11 16:15:48 +08:00
Tiewei Fang	3c3110b253	[Fix](Jdbc Catalog) jdbc catalog support to connect to doris database (#16527 ) Doris can use mysql-jdbc-jar to connect doris database, but doris has some data type that mysql without. Such as DecimalV3 and Date/DatetimeV2 I add some case judgments in `Mysql Catalog` , so that Jdbc catalog can identify the data type of DORIS	2023-02-10 20:24:40 +08:00
FreeOnePlus	1cc735f20b	[feature](docker)Refactor Image build script (#16528 ) Co-authored-by: Yijia Su <suyijia@selectdb.com>	2023-02-10 18:30:54 +08:00
FreeOnePlus	05103d88b2	[feature](docker)Add Doris Docker Build Script (#16522 ) Add 3FE & 3BE Build Script	2023-02-10 17:18:26 +08:00
Tiewei Fang	557159d3ce	[feature](JdbcExternalCatalog) support insert data in JdbcExternalCatalog (#16271 )	2023-02-02 17:31:33 +08:00
Mingyu Chen	c9f66250a8	[docker](iceberg) add iceberg docker compose and modify scripts (#16175 ) Add iceberg docker compose Rename start-thirdparties-docker.sh to run-thirdparties-docker.sh and support start to stop specified components.	2023-01-29 14:31:27 +08:00
Tiewei Fang	1638936e3f	[fix](oracle catalog) oracle catalog support `TIMESTAMP` dateType of oracle (#16113 ) `TIMESTAMP` dateType of Oracle will map to `DateTime` dateType of Doris	2023-01-20 14:47:58 +08:00
Tiewei Fang	ba71516eba	[feature](jdbc catalog) support SQLServer jdbc catalog (#16093 )	2023-01-20 12:37:38 +08:00
Tiewei Fang	2580c88c1b	[feature](multi-catalog) support oracle jdbc catalog (#15862 )	2023-01-14 00:01:33 +08:00
Mingyu Chen	500c7fb702	[improvement](multi-catalog) support unsupported column type (#15660 ) When creating an external catalog, Doris will automatically sync the schema of table from external catalog. But some of column type are not supported by Doris now, such as struct, map, etc. In previous, when meeting these unsupported column, Doris will throw an exception, and the corresponding table can not be synced. But user may just want to query other supported columns. In this PR, I add a new column type: UNSUPPORTED. And now it is just used for external table schema sync. When meeting unsupported column, it will be synced as column with UNSUPPORTED type. When query this table, there are serval situation: select * from table: throw error Unsupported type 'UNSUPPORTED_TYPE' xxx select k1 from table: k1 is with supported type. query OK. select * except(k2): k2 is with unsupported type. query OK	2023-01-08 10:07:10 +08:00
Tiewei Fang	df2da89b89	[feature](multi-catalog) support postgresql jdbc catalog (#15570 ) support postgresql jdbc catalog	2023-01-06 11:00:59 +08:00
Tiewei Fang	e7a077a81f	[fix](jdbc catalog) fix bugs of jdbc catalog and table valued function (#15216 ) * fix bugs * add `desc function` test * add test * fix	2022-12-23 16:46:39 +08:00
Tiewei Fang	7627defc88	[fix](regression-test) Add test data for test_mysql_jdbc_catalog and fix mysql-5.7.yaml about UTF8 (#14749 ) Fix two things: 1. Fix that the MySQL table displays the garbled code even if the UTF8 is specified for table. 2. Fix that `test_mysql_jdbc_catalog.out` lack of returned data for table `ex_tb13`.	2022-12-02 11:58:11 +08:00
lsy3993	ae6a007c4e	[test](jdbc)add new extremum case (#14692 )	2022-12-02 11:28:11 +08:00
Tiewei Fang	9272680d00	[feature](multi-catalog) support Jdbc catalog (#14527 ) Issue Number: close #xxx I add jdbc catalog for doris multi-catalog feature. Currently, the jdbc catalog only supports MYSQL DBMS. TODO: support for postgre DB Support for other databases. Problem summary For jdbc catalog, we can create catalog like: CREATE CATALOG jdbc4 PROPERTIES ( "type"="jdbc", "jdbc.user"="root", "jdbc.password"="123456", "jdbc.jdbc_url" = "jdbc:mysql://127.0.0.1:13396/demo?yearIsDateType=false", "jdbc.driver_url" = "file:/mnt/disk2/ftw/tools/jar/mysql-connector-java-5.1.47/mysql-connector-java-5.1.47.jar", "jdbc.driver_class" = "com.mysql.jdbc.Driver" ); Note: yearIsDateType is a param of jdbc: If yearIsDateType configuration property is set to false, then the returned object type is java.sql.Short. If set to true (the default), then the returned object is of type java.sql.Date with the date set to January 1st, at midnight. To compat with mysql, we force the use of yearIsDateType=false in FE. if user sets yearIsDateType=true, doris FE will force to change yearIsDateType=false.	2022-11-30 11:28:08 +08:00
Mingyu Chen	dd7ec8f4ca	[improvement](test) add tpch1 orc for hive catalog and refactor some test dir (#14669 ) Add tpch 1g orc test case in hive docker Refactor some suites dir of catalog test cases. And "-internal" for dlf endpoint, to support access oss with aliyun vpc.	2022-11-30 10:03:58 +08:00
FreeOnePlus	03aa5572da	[feature](docker)Add Broker Docker image related files (#14621 ) Add Broker Docker image related files	2022-11-29 18:34:10 +08:00
Mingyu Chen	064b8d2aa6	[fix](multi-catalog) fix coredump when querying partitioned hive table with text format (#14604 ) BE will crash when querying partitioned hive table with text format and put partition column at first of select items. 1. FE should use file slots to set the column mapping index of csv file. 2. BE should use `get_by_name` of block to get right column in a block in csv reader.	2022-11-26 11:42:40 +08:00
FreeOnePlus	724e57bb87	[feature](docker)Add runtime docker image related files (#14436 )	2022-11-23 23:58:44 +08:00
lsy3993	6fcffd041c	[test](jdbc)add new mysql jdbc case from other source (#14495 )	2022-11-23 16:23:42 +08:00
lsy3993	1fe9bced25	[test](jdbc)add more mysql jdbc test case (#14475 )	2022-11-22 21:14:10 +08:00
lsy3993	5dfe5ef965	[test](hive catalog)add hive catalog test case (#14217 )	2022-11-19 17:26:18 +08:00
Mingyu Chen	512b787559	[fix](parquet-reader) fix stack-use-after-return error (#14411 )	2022-11-19 10:52:50 +08:00
lsy3993	02372ca2ea	[test](jdbc external table) add new jdbc mysql external table (#14323 )	2022-11-19 09:46:48 +08:00
Tiewei Fang	a1d02f36ac	[feature](table-valued-function) support `hdfs()` tvf (#14213 ) This pr does two things: 1. support `hdfs()` table valued function. 2. add regression test	2022-11-18 14:17:02 +08:00
Ashin Gau	44ee4386f7	[test](multi-catalog)Regression test for external hive orc table (#13762 ) Add regression test for external hive orc table. This PR has generated all basic types support by hive orc, and create a hive external table to touch them in docker environment. Functions to be tested: 1. Ensure that all types are parsed correctly 2. Ensure that the null map of all types are parsed correctly 3. Ensure that the `SearchArgument` of `OrcReader` works well 4. Only select partition columns	2022-11-17 20:36:02 +08:00
Mingyu Chen	7182f14645	[improvement][fix](multi-catalog) speed up list partition prune (#14268 ) In previous implementation, when doing list partition prune, we need to generation `rangeToId` every time we doing prune. But `rangeToId` is actually a static data that should be create-once-use-every-where. So for hive partition, I created the `rangeToId` and all other necessary data structures for partition prunning in partition cache, so that we can use it directly. In my test, the cost of partition prune for 10000 partitions reduce from 8s -> 0.2s. Aslo add "partition" info in explain string for hive table. ``` \| 0:VEXTERNAL_FILE_SCAN_NODE \| \| predicates: `nation` = '0024c95b' \| \| inputSplitNum=1, totalFileSize=4750, scanRanges=1 \| \| partition=1/10000 \| \| numNodes=1 \| \| limit: 10 \| ``` Bug fix: 1. Fix bug that es scan node can not filter data 2. Fix bug that query es with predicate like `where substring(test2,2) = "ext2";` will fail at planner phase. `Unexpected exception: org.apache.doris.analysis.FunctionCallExpr cannot be cast to org.apache.doris.analysis.SlotRef` TODO: 1. Some problem when quering es version 8: ` Unexpected exception: Index: 0, Size: 0`, will be fixed later.	2022-11-17 08:30:03 +08:00
Jibing-Li	30f36070b5	[test](multi-catalog)Regression test for external hive parquet table (#13611 )	2022-11-14 14:10:10 +08:00
Stalary	23a8c7eeb6	(fix)(multi-catalog)(es) Fix error result because not used fields_context (#14229 ) Fix error result because not used fields_context	2022-11-14 14:00:55 +08:00
lsy3993	082028b2a2	[test](jdbc postgresql case)add jdbc test case for postgresql (#14162 )	2022-11-12 20:43:13 +08:00
lsy3993	78fa167b0a	[test](jdbc external table) add jdbc regression test case (#14086 )	2022-11-12 20:42:57 +08:00
Tiewei Fang	c418bbd2d1	[feature-wip](new-scan) support Json reader (#13546 ) Issue Number: close #12574 This pr adds `NewJsonReader` which implements GenericReader interface to support read json format file. TODO: 1. modify `_scann_eof` later. 2. Rename `NewJsonReader` to `JsonReader` when `JsonReader` is deleted.	2022-10-26 12:52:21 +08:00
Mingyu Chen	847b80ebfa	[test](jdbc) add jdbc and hive regression test (#13143 ) 1. Modify default behavior of `build.sh` The `BUILD_JAVA_UDF` is default ON, so that jvm is needed for compilation and runtime. 2. Add docker-compose for MySQL 5.7, PostgreSQL 14 and Hive 2 See `docker/thirdparties/docker-compose`. 3. Add some regression test cases for jdbc query on MySQL, PG and Hive Catalog The default is `false`, if set to true, you need first start docker for MySQL/PG/Hive. 4. Support `if not exists` and `if exists` for create/drop resource and create/drop encryptkey	2022-10-21 15:29:27 +08:00
Mingyu Chen	353bb6fdfb	[doc] update docs (#12615 )	2022-09-15 11:07:34 +08:00
Stalary	5f255af065	[Enhancement](docker): Add elasticsearch docker file (#12377 )	2022-09-07 08:47:10 +08:00
FreeOnePlus	dfaed52d32	[docker] Update compile Dockerfile in developer (#10339 ) Co-authored-by: manyi <fop@freeoneplus.com>	2022-07-28 10:35:56 +08:00
FreeOnePlus	85d7b3089c	[docker] ADD Arm Compile Dockerfile (#10600 ) Add the Dockerfile file of the Docker compiled image under the ARM architecture	2022-07-27 19:55:42 +08:00
Mingyu Chen	67f341f44e	[TLP](step-1) Remove incubator prefix (#10230 ) Remove some `incubator-` prefix in source code. The document is not modified, will be done in next PR.	2022-06-19 19:34:52 +08:00
Zhengguo Yang	f3c44bcd75	[chore][fix](librdkafka) disable librdkafka assert and update some thirdparty (#8425 ) 1. comment librdkafka `rd_assert(thrd_is_current(rkb->rkb_thread));` to avoid core dump 2. upgrade arrow to 7.0.0 3. upgrade aws sdk to 1.9 4. upgrade orc to 1.7.2	2022-03-12 22:09:06 +08:00
Zhengguo Yang	09bfb8b9d3	[fix] (rpc-udf) Fixed the problem that the query could not be interrupted (#8248 ) if an error occurred in the rpc server during the execution of rpc-udf. Add java,cpp,python demo of rpc-udf server	2022-03-03 09:30:03 +08:00

1 2

75 Commits