1. mysql-connector-java
mysql-connector-java is under the GPLv2 license, which is not compatible with APLv2, and Doris does not use it.
2. org.json
org.json is under the JSON license, which is not compatible with APLv2. I replaced it with `json-simple`.
This reverts commit df7e848cbbc8170c7bd83d812d7cac58b5574570.
Reverts apache/incubator-doris#8218
When using grpc 1.44.1, the corresponding `protoc-gen-grpc-java` plugin
requires GLIBC_2.14, which is not available on CentOS 6.
So I suggest reverting this commit for now, and considering an upgrade of this component
after most systems have reached glibc version 2.14.
For Mac M1, you may have to change this version manually for now.
This bug was introduced by #8112.
Also, I changed the `grpc-netty` dependency to `grpc-netty-shaded` to avoid a dependency conflict:
```
java.lang.NoSuchMethodError: io.netty.buffer.PooledByteBufAllocator.
```
Hive Bitmap UDF provides UDFs for generating bitmaps and performing bitmap operations in Hive tables.
The bitmap in Hive is exactly the same as the Doris bitmap,
so bitmaps built in Hive can be imported into Doris through Spark bitmap load (a usage sketch follows).
Close related #7389
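As an illustration, here is a minimal usage sketch in Hive; the UDF names `to_bitmap` and `bitmap_count`, and the table layout, are assumptions based on the PR description, not a confirmed API:
```sql
-- Hypothetical sketch: aggregate user ids into one bitmap per day in Hive.
-- The resulting binary column uses the same format as the Doris bitmap,
-- so it can later be imported via Spark bitmap load.
CREATE TABLE hive_bitmap_table (
    dt      STRING,
    user_bm BINARY
);

INSERT INTO hive_bitmap_table
SELECT dt, to_bitmap(user_id)        -- assumed bitmap-building UDAF
FROM hive_user_table
GROUP BY dt;

-- Assumed bitmap operation: count distinct users per day.
SELECT dt, bitmap_count(user_bm) FROM hive_bitmap_table;
```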
Support create Iceberg external table in Doris.
This is the first step to support Iceberg external table.
### Create Iceberg external table
This PR provides two ways to create Iceberg external tables. Neither requires explicitly specifying column definitions; Doris automatically converts them based on Iceberg's column definitions.
1. Create an Iceberg external table directly
```sql
CREATE [EXTERNAL] TABLE table_name
ENGINE = ICEBERG
[COMMENT "comment"]
PROPERTIES (
"iceberg.database" = "iceberg_db_name",
"iceberg.table" = "icberg_table_name",
"iceberg.hive.metastore.uris" = "thrift://192.168.0.1:9083",
"iceberg.catalog.type" = "HIVE_CATALOG"
);
```
2. Create an Iceberg database and automatically create all the tables under that db.
```sql
CREATE DATABASE db_name
[COMMENT "comment"]
PROPERTIES (
"iceberg.database" = "iceberg_db_name",
"iceberg.hive.metastore.uris" = "thrift://192.168.0.1:9083",
"iceberg.catalog.type" = "HIVE_CATALOG"
);
```
### Show table creation
1. For an individual table, you can view it with `show create table` (see `help show create table`).
```sql
mysql> show create table iceberg_db.logs_1;
+--------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| Table | Create Table |
+--------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| logs_1 | CREATE TABLE `logs_1` (
`level` varchar(-1) NOT NULL COMMENT "null",
`event_time` datetime NOT NULL COMMENT "null",
`message` varchar(-1) NOT NULL COMMENT "null"
) ENGINE=ICEBERG
COMMENT "ICEBERG"
PROPERTIES (
"iceberg.database" = "doris",
"iceberg.table" = "logs_1",
"iceberg.hive.metastore.uris" = "thrift://10.10.10.10:9087",
"iceberg.catalog.type" = "HIVE_CATALOG"
) |
+--------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```
2. For an Iceberg database, you can view the creation records with `show table creation` (see `help show table creation`).
```sql
mysql> show table creation from iceberg_db;
+--------+---------+---------------------+---------------------------------------------------------+
| Table | Status | Create Time | Error Msg |
+--------+---------+---------------------+---------------------------------------------------------+
| logs | fail | 2021-12-14 13:50:10 | Cannot convert unknown type to Doris type: list<string> |
| logs_1 | success | 2021-12-14 13:50:10 | |
+--------+---------+---------------------+---------------------------------------------------------+
2 rows in set (0.00 sec)
```
This is a new syntax for showing table creation records in an Iceberg database.
Syntax:
```sql
SHOW TABLE CREATION [FROM db] [LIKE mask]
```
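For example, to list creation records for tables matching a pattern (a hypothetical invocation against the `iceberg_db` from the example above):
```sql
SHOW TABLE CREATION FROM iceberg_db LIKE "%logs%";
```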
Increase compatibility with MySQL
1. Added two system tables: `files` and `partitions`
2. Improved the MySQL error code return logic to make the error codes more compatible with MySQL
3. Added `LOCK TABLES`/`UNLOCK TABLES` and `SHOW COLUMNS` statements for compatibility with mysqldump (see the sketch below)
4. Compatible with the mysqldump tool; you can now use mysqldump to dump data and table structures from Doris
Currently, using mysqldump may print an error message like:
```
$ mysqldump -h127.0.0.1 -P9130 -uroot test_query_qa > a
mysqldump: Error: 'errCode = 2, detailMessage = select list expression not produced by aggregation output (missing from GROUP BY clause?): `EXTRA`' when trying to dump tablespaces
```
This error does not affect the exported file; you can add `--no-tablespaces` to avoid it.
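For reference, a minimal sketch of the new compatibility statements from item 3; how strictly Doris enforces the locks is an assumption here, since they exist mainly so that mysqldump's usual sequence succeeds:
```sql
LOCK TABLES t1 READ;     -- issued by mysqldump before dumping a table
SHOW COLUMNS FROM t1;    -- used to fetch the table structure
UNLOCK TABLES;
```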
Users can directly query data in Hive tables from Doris, and can use joins to perform complex queries, without laboriously importing data from Hive.
The main changes are listed below:
FE:
Extend HiveScanNode from BrokerScanNode.
Add HiveMetaStoreClientHelper to communicate with Hive and HDFS.
BE:
Treat HiveScanNode as BrokerScanNode, and HiveTable as BrokerTable.
broker_scanner.cpp: support reading columns from the HDFS path.
orc_scanner.cpp: support reading HDFS files.
POM:
Add hive.version=2.3.7, hive-metastore and hive-exec
Add hadoop.version=2.8.0, hadoop-hdfs
Upgrade commons-lang to fix incompatibility with Java 9 and later.
Thrift:
Add THiveTable
Add read_by_column_def in TBrokerRangeDesc
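For context, a sketch of what creating and querying a Hive external table might look like; the property names (`database`, `table`, `hive.metastore.uris`) are assumptions modeled on the Iceberg example above, and the column definitions must match the Hive table:
```sql
CREATE EXTERNAL TABLE hive_tbl (
    k1 INT,
    k2 VARCHAR(64)
) ENGINE = HIVE
PROPERTIES (
    "database" = "hive_db",
    "table" = "hive_table",
    "hive.metastore.uris" = "thrift://192.168.0.1:9083"
);

-- Query it like any other table, including joins with local Doris tables:
SELECT h.k1, d.v1
FROM hive_tbl h JOIN doris_tbl d ON h.k1 = d.k1;
```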
Add a use_path_style property for S3.
Upgrade hadoop-common and hadoop-aws to 2.8.0 to support the path style property.
Fix some S3 URI bugs.
Add some logs for tracing the load process.
1. Add license, total lines, and release badges.
2. Add monthly active contributor and contributor growth graphs.
3. Fix a pom.xml bug.
4. Modify some routine load logs on the BE side.
1. /api/cluster_overview to view some statistics of the cluster
2. /api/meta/ to view the database/table schema
3. /api/import/file_review to preview file content in CSV or PARQUET format
This is part of the array type support and has not been fully completed.
The following functions are implemented:
1. FE array type support and implementation of array functions; support for array syntax analysis and planning
2. Support importing array type data through INSERT INTO
3. Support selecting array type data
4. The array type is only supported on the value columns of duplicate tables (see the sketch below)
This PR merges some code from #4655, #4650, #4644, #4643, #4623 and #2979.
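To make the current scope concrete, a usage sketch under the constraints above; the table layout and the `[1, 2, 3]` array literal syntax are assumptions:
```sql
-- Array column on the value side of a duplicate-key table (item 4):
CREATE TABLE array_tbl (
    k1 INT,
    v1 ARRAY<INT>
) DUPLICATE KEY(k1)
DISTRIBUTED BY HASH(k1) BUCKETS 1;

-- Import array data through INSERT INTO (item 2):
INSERT INTO array_tbl VALUES (1, [1, 2, 3]);

-- Select array data (item 3):
SELECT k1, v1 FROM array_tbl;
```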
Add command:
1. EXPLAIN GRAPH SELECT ...
2. SHOW QUERY PROFILE "..."
Documentation will be added in the next PR.
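A brief usage sketch; since the documentation lands in a later PR, the query shape and the profile path argument `"/"` are assumptions:
```sql
-- Render the query plan as a graph instead of the usual text form:
EXPLAIN GRAPH SELECT k1, count(*) FROM t GROUP BY k1;

-- Browse runtime profiles of finished queries, starting from the root:
SHOW QUERY PROFILE "/";
```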
Change-Id: Ifd9365e10b1f9ff4fdf8ae0556343783d97545f0
* [doris-1008] support backup and restore directly to cloud storage via aws s3 protocol
* [Internal][S3DirectAccess] Support backup, restore, load, and export connecting directly to S3
1. Support loading and exporting data from/to S3 directly.
2. Add a config to automatically convert broker access to S3 access when available.
Change-Id: Iac96d4b3670776708bc96a119ff491db8cb4cde7
(cherry picked from commit 2f03832ca52221cc7436069b96c45c48c4bc7201)
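As an illustration of the direct S3 load, a hypothetical sketch; the label, bucket paths, and the exact `WITH S3` property names are assumptions:
```sql
LOAD LABEL db1.label_s3_demo
(
    DATA INFILE("s3://my-bucket/path/file.csv")
    INTO TABLE t1
    COLUMNS TERMINATED BY ","
)
WITH S3
(
    "AWS_ENDPOINT" = "http://s3.us-east-1.amazonaws.com",
    "AWS_ACCESS_KEY" = "xxxx",
    "AWS_SECRET_KEY" = "yyyy",
    "AWS_REGION" = "us-east-1"
);
```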
* [Internal][S3DirectAccess] File path glob compatible with broker
Change-Id: Ie55e07a547aa22c6fa8d432ca926216c10384e68
(cherry picked from commit d4fb25544c0dc06d23e1ada571ec3f8edd4ba56f)
* [internal] [doris-1008] fix log4j class not found
Change-Id: I468176aca0d821383c74ee658d461aba9e7d5be3
(cherry picked from commit 029adaa9d6ded8503acbd6644c1519456f3db232)
* add poms
Co-authored-by: yangzhengguo01 <yangzhengguo01@baidu.com>
The IO-related code may be used by new modules, so it's better to move it to fe-common.
Modifications to fe-core are frequent, but the many Java files generated by thrift
slow down compilation, so it's better to move the thrift generation process to fe-common as well.
Currently both log4j1 and log4j2 are used, which leads to logs being written to the wrong files.
Our modification removes log4j1 from the dependencies and uses slf4j routed to log4j2 instead.
This CL mainly changes:
1. Add 2 new FE modules
1. fe-common
Holds all common classes for other modules; currently only `jmockit`.
2. spark-dpp
The Spark DPP application for Spark Load. I moved all DPP-related classes into this module, including unit tests.
2. Change the `build.sh`
Add a new param `--spark-dpp` to compile `spark-dpp` alone, while `--fe` compiles all FE modules.
The output of the `spark-dpp` module is `spark-dpp-1.0.0-jar-with-dependencies.jar`, which will be installed to `output/fe/spark-dpp/`.
3. Fix some bugs in Spark Load
Fix #3403
The new version of Guava has moved `toStringHelper` from `Objects` to `MoreObjects`.
This CL has passed our test environment and appears to run well.
The index name in MaterializedViewMeta still carries the `__doris_shadow` prefix
after a schema change finishes.
In this CL, I remove the index name field from MaterializedViewMeta,
which makes managing name changes less error-prone.