doris

Author	SHA1	Message	Date
Stalary	f11d320213	[feature] support row policy filter (#9206 )	2022-05-11 22:11:10 +08:00
leo65535	d1b85d51a0	[code style](fe) Include test sources (#9366 ) Include test sources, we also need to check them.	2022-05-09 09:40:44 +08:00
leo65535	c5941fd166	[FE Code Style][sub] Adjust some check rules (#9345 ) Adjust `RedundantImport`,`UnusedImports`,`EmptyStatement`,`NewlineAtEndOfFile`,`UpperEll`, `AvoidStarImport`, `MissingOverride` rules.	2022-05-04 23:34:55 +08:00
qiye	bca121333e	[feature](cold-hot) support s3 resource (#8808 ) Add cold hot support in FE meta, support alter resource DDL in FE	2022-04-13 09:52:03 +08:00
Mingyu Chen	18098c5ceb	[fix](fe-ut) Fix FE unit test (#8293 ) Fix following ut: 1. GlobalTransactionMgrTest 2. BackupJobTest 3. ReplicaTest 4. SparkLoadJobTest Also remove old FE Meta version	2022-03-03 09:30:17 +08:00
yiguolei	f7c18d300c	[Improvement] Add minimum fe meta version check (#8203 ) There are many old codes in FE for old FE meta version such as `if (FeMetaVersion < VERSION_45) xxxxx`, but the latest FE meta version is 107, these code maybe never reached, but we do not remove these code because "sometimes" there are old code. Add minimum required version check to allow us remove these old codes.	2022-02-25 11:14:00 +08:00
qiye	3b8d48f08b	[feature-wip](iceberg) Step1: Support create Iceberg external table (#7391 ) Close related #7389 Support create Iceberg external table in Doris. This is the first step to support Iceberg external table. ### Create Iceberg external table This pr describes two ways to create Iceberg external tables. Both ways do not require explicitly specifying column definitions, Doris automatically converts them based on Iceberg's column definitions. 1. Create an Iceberg external table directly ```sql CREATE [EXTERNAL] TABLE table_name ENGINE = ICEBERG [COMMENT "comment"] PROPERTIES ( "iceberg.database" = "iceberg_db_name", "iceberg.table" = "icberg_table_name", "iceberg.hive.metastore.uris" = "thrift://192.168.0.1:9083", "iceberg.catalog.type" = "HIVE_CATALOG" ); ``` 2. Create an Iceberg database and automatically create all the tables under that db. ```sql CREATE DATABASE db_name [COMMENT "comment"] PROPERTIES ( "iceberg.database" = "iceberg_db_name", "iceberg.hive.metastore.uris" = "thrift://192.168.0.1:9083", "iceberg.catalog.type" = "HIVE_CATALOG" ); ``` ### Show table creation 1. For individual tables you can view them with `help show create table`. ```sql mysql> show create table iceberg_db.logs_1; +--------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ \| Table \| Create Table \| +--------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ \| logs_1 \| CREATE TABLE `logs_1` ( `level` varchar(-1) NOT NULL COMMENT "null", `event_time` datetime NOT NULL COMMENT "null", `message` varchar(-1) NOT NULL COMMENT "null" ) ENGINE=ICEBERG COMMENT "ICEBERG" PROPERTIES ( "iceberg.database" = "doris", "iceberg.table" = "logs_1", "iceberg.hive.metastore.uris" = "thrift://10.10.10.10:9087", "iceberg.catalog.type" = "HIVE_CATALOG" ) \| +--------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ ``` 2. For Iceberg database, you can view it with `help show table creation`. ```sql mysql> show table creation from iceberg_db; +--------+---------+---------------------+---------------------------------------------------------+ \| Table \| Status \| Create Time \| Error Msg \| +--------+---------+---------------------+---------------------------------------------------------+ \| logs \| fail \| 2021-12-14 13:50:10 \| Cannot convert unknown type to Doris type: list<string> \| \| logs_1 \| success \| 2021-12-14 13:50:10 \| \| +--------+---------+---------------------+---------------------------------------------------------+ 2 rows in set (0.00 sec) ``` This is a new syntax. Show table creation records in Iceberg database: Syntax: ```sql SHOW TABLE CREATION [FROM db] [LIKE mask] ```	2022-01-27 10:22:47 +08:00
Xiang Wei	ab60c5eb59	[fix](spark-load) fix Roaring64Map big-endian read/write in de/serialization (#7480 ) See #7479 This bug is triggered when the bitmap exceeds 32 bits.	2021-12-26 11:09:50 +08:00
Zhengguo Yang	926540c561	[feature] Support return bitmp/hll data in select statement (#7276 ) Support return bitmp/hll data in select statement, this can be used when set show_object_data=true;	2021-12-15 09:48:27 +08:00
xy720	b1f5979103	[New Feature][Meta][Image] Add file header and footer for image (#6207 ) #6206 At present, our image file does not have file header/footer. When we need to change the image format (such as adding different journal versions to the image), there is no way to distinguish different image formats. Therefore, we suggest adding file header and footer to the image. By the new image format, we can freely distinguish and define different image reading ways. The format of the image is as follows: ``` /** * Image Format: * \|- Image --------------------------------------\| * \| - Magic String (4 bytes) \| * \| - Header Length (4 bytes) \| * \| \|- Header -----------------------------\| \| * \| \| \|- Json Header ---------------\| \| \| * \| \| \| - version \| \| \| * \| \| \| - other key/value(undecided)\| \| \| * \| \| \|-----------------------------\| \| \| * \| \|--------------------------------------\| \| * \| \| * \| \|- Image Body -------------------------\| \| * \| \| Object a \| \| * \| \| Object b \| \| * \| \| ... \| \| * \| \|--------------------------------------\| \| * \| \| * \| \|- Footer -----------------------------\| \| * \| \| \| - Checksum (8 bytes) \| \| * \| \| \|- object index --------------\| \| \| * \| \| \| - index a \| \| \| * \| \| \| - index b \| \| \| * \| \| \| ... \| \| \| * \| \| \|-----------------------------\| \| \| * \| \| - other value(undecided) \| \| * \| \|--------------------------------------\| \| * \| - Footer Length (8 bytes) \| * \| - Magic String (4 bytes) \| * \|----------------------------------------------\| */ ``` 1. Magic Number One image format is identified by one magic string and one version field. The magic string is save in the first 4 bytes and last 4 bytes in the images. 2. Image Header: The version is save in the header with json format now. 3. Image Body: Equal to the original image. 4.Image Footer: Image footer stores the file offset(index) of many image objects. If necessary, we can read some objects in the image by the footer.	2021-07-27 13:36:53 +08:00
Stephen-Robin	40bc3fed53	[Code] basic property related classes supports create, query, read, write, etc. (#6153 ) Provides basic property related classes supports create, query, read, write, etc. Currently, Doris FE mostly uses `if` statement to check properties in SQL. There is a lot of redundancy in the code. The `PropertySet` class can be used in the analysis phase of `Statement`. The validation and correctness of the input properties are automatic verified. It can simplify the code and improve the readability of the code. Usage: 1. Create a custom class that implements `SchemaGroup` interface. 2. Define the properties to be used. If it's a required parameter, there is no need to set the default value. 3. According the the requirements, in the logic called `readFromStrMap` and other functions to check and obtain parameters. Demo: Class definition ``` public class FileFormat implements PropertySchema.SchemaGroup { public static final PropertySchema<FileFormat.Type> FILE_FORMAT_TYPE = new PropertySchema.EnumProperty<>("type", FileFormat.Type.class).setDefauleValue(FileFormat.Type.CSV); public static final PropertySchema<String> RECORD_DELIMITER = new PropertySchema.StringProperty("record_delimiter").setDefauleValue("\n"); public static final PropertySchema<String> FIELD_DELIMITER = new PropertySchema.StringProperty("field_delimiter").setDefauleValue("\|"); public static final PropertySchema<Integer> SKIP_HEADER = new PropertySchema.IntProperty("skip_header", true).setMin(0).setDefauleValue(0); private static final FileFormat INSTANCE = new FileFormat(); private ImmutableMap<String, PropertySchema> schemas = PropertySchema.createSchemas( FILE_FORMAT_TYPE, RECORD_DELIMITER, FIELD_DELIMITER, SKIP_HEADER); public ImmutableMap<String, PropertySchema> getSchemas() { return schemas; } public static FileFormat get() { return INSTANCE; } } ``` Usage ``` public class CreateXXXStmt extends DdlStmt { private PropertiesSet<FileFormat> analyzedFileFormat = PropertiesSet.empty(FileFormat.get()); private final Map<String, String> fileFormatOptions; ... public void analyze(Analyzer analyzer) throws UserException { ... if (fileFormatOptions != null) { try { analyzedFileFormat = PropertiesSet.readFromStrMap(FileFormat.get(), fileFormatOptions); } catch (IllegalArgumentException e) { ... } } // 1. Get property value String recordDelimiter = analyzedFileFormat.get(FileFormat.RECORD_DELIMITER) // 2. Check the validity of parameters PropertiesSet.verifyKey(FileFormat.get(), fileFormatOptions); ... } } ```	2021-07-08 09:55:07 +08:00
copperybean	77485521d3	[Enhancement] move FeMetaVersion.java from fe-common to fe-core #5426 (#5427 ) Currently, FeMetaVersion.java is in fe-common, users may forget to copy fe-common.jar when upgrading the service. It's really dangerous because the data may be corrupted and can not be recovered.	2021-03-04 22:25:03 +08:00
copperybean	d8202ca9cc	[Enhancement] move common codes from fe-core to fe-common and remove log4j1 (#5317 ) (#5318 ) The io related codes may be used by new modules, so It's better to move them to fe-common. The modification to fe-core is frequent, but there are many generated java files by thrift will slow down the compilation, so It's better to move thrift generation process to fe-common. Currently both log4j1 and log4j2 are used, which leads to logs are written to wrong files. Our modification will remove log4j1 from dependency, use slf4j + slf4j -> log4j2 instead.	2021-02-04 13:41:03 +08:00
Mingyu Chen	0e79f6908b	[CodeRefactor] Modify FE modules (#4146 ) This CL mainly changes: 1. Add 2 new FE modules 1. fe-common save all common classes for other modules, currently only `jmockit` 2. spark-dpp The Spark DPP application for Spark Load. And I removed all dpp related classes to this module, including unit tests. 2. Change the `build.sh` Add a new param `--spark-dpp` to compile the `spark-dpp` alone. And `--fe` will compile all FE modules. the output of `spark-dpp` module is `spark-dpp-1.0.0-jar-with-dependencies.jar`, and it will be installed to `output/fe/spark-dpp/`. 3. Modify some bugs of spark load	2020-07-29 16:18:05 +08:00

14 Commits