doris

Author	SHA1	Message	Date
Mingyu Chen	3f202477ec	[minor](import) modify some imports (#28206 )	2023-12-12 11:39:54 +08:00
morrySnow	32367c6d97	[chore](checkstyle): forbid lombok in Nereids (#27700 )	2023-11-29 14:06:20 +08:00
Mingyu Chen	efd1aa3016	[Revert](code-style) revert FE code-format #25033 and #26488 (#26505 )	2023-11-07 16:37:24 +08:00
Guangdong Liu	65304ba216	[fix](code-style) Adapt to checkstyle and spotless (#26488 )	2023-11-07 00:23:39 +08:00
Guangdong Liu	d088cba2b1	[feature](code-style)add spotless plugin (#25033 )	2023-11-06 14:01:39 +08:00
JingDas	e3d0e55794	[feature-wip] (Nereids) Support transforming trino dialect SQL to logical plan (#21855 ) Support transforming trino dialect SQL to logical plan (#21854) ## Proposed changes Issue Number: #21854 Use io.trino.sql.tree.AstVisitor as vistor, visit coorresponding trino node and transform it to doris logical plan. ## Further comments Here are some examples for function transforming as following: ascii('a') function is in doris and codepoint('a') funtion in trino, they have the same feature and have the same method signature, so we can use [TrinoFnCallTransformer](`3b37b76886/fe/fe-core/src/main/java/org/apache/doris/nereids/parser/trino/TrinoFnCallTransformer.java`) to handle them. another example for ComplexTransformer as following: date_diff('second', TIMESTAMP '2020-12-25 22:00:00', TIMESTAMP '2020-12-25 21:00:00')" fuction in trino and seconds_diff(2020-12-25 22:00:00, 2020-12-25 21:00:00)") fuction in doris. They have different method signature, we cant not handle it by TrinoFnCallTransformer simply and we should handle it by individual complex transformer [DateDiffFnCallTransformer](`3b37b76886/fe/fe-core/src/main/java/org/apache/doris/nereids/parser/trino/DateDiffFnCallTransformer.java`).	2023-10-16 05:10:55 -05:00
Mingyu Chen	6384198136	[minor](fe) optimize some log info and imports issue (#24138 )	2023-09-11 16:16:58 +08:00
Mingyu Chen	0c98355fff	[fix](catalog) fix create catalog with resource replay issue and kerberos auth issue (#20137 ) 1. Fix create catalog with resource replay bug. If user create catalog using `create catalog hive with resource xxx`, when replaying edit log, there is a bug that resource may be dropped, causing NPE and FE will fail to start. In this PR, I add a new FE config `disallow_create_catalog_with_resource`, default is true. So that `with resource` will not be allowed, and it will be deprecated later. And also fix the replay bug to avoid NPE. 2. Fix issue when creating 2 hive catalogs to connect with and without kerberos authentication. When user create 2 hive catalogs, one use simple auth, the other use kerberos auth. The query may fail with error like: `Server asks us to fall back to SIMPLE auth, but this client is configured to only allow secure connections.` So I add a default property for hive catalog: `"ipc.client.fallback-to-simple-auth-allowed" = "true"`. Which means this property will be added automatically when user creating hive catalog, to avoid such problem. 3. Fix calling `hdfsExists()` issue When calling `hdfsExists()` with non-zero return code, should check if it encounters error or is file not found. 3. Some code refactor Avoid import `org.apache.parquet.Strings`	2023-05-30 16:57:39 +08:00
Chuang Li	a041f8eabe	[fix](fe) Fx SimpleDateFormatter thread unsafe issue by replacing to DateTimeFormatter. (#19265 ) DateTimeFormatter replace SimpleDateFormat in fe module because SimpleDateFormat is not thread-safe.	2023-05-11 22:50:24 +08:00
Calvin Kirs	575c1620c2	[Improve](fe)Use commons-lang3 uniformly and refactor PatternGenerator#generateTypePattern (#18666 ) `commons-lang`(1and2) is no longer maintained since 2011, and the official recommendation is `commons-lang3`, which can be smoothly upgraded to be compatible with `commons-lang`. We use both dependencies in `fe`, which can be completely unified. `PatternGenerator#generateTypePattern` has many meaningless loops, and IntegerRange is introduced for, which is unnecessary. So I refactored it.	2023-04-17 20:15:17 +08:00
WenYao	c3fe113894	rename PaloFe to DorisFE (#18167 )	2023-03-29 00:30:16 +08:00
Mingyu Chen	39f59f554a	[improvement](dry-run)(tvf) support csv schema in tvf and add "dry_run_query" variable (#16983 ) This CL mainly changes: Support specifying csv schema manually in s3/hdfs table valued function s3 ( 'URI' = 'https://bucket1/inventory.dat', 'ACCESS_KEY'= 'ak', 'SECRET_KEY' = 'sk', 'FORMAT' = 'csv', 'column_separator' = '\|', 'csv_schema' = 'k1:int;k2:int;k3:int;k4:decimal(38,10)', 'use_path_style'='true' ) Add new session variable dry_run_query If set to true, the real query result will not be returned, instead, it will only return the number of returned rows. mysql> select * from bigtable; +--------------+ \| ReturnedRows \| +--------------+ \| 10000000 \| +--------------+ This can avoid large result set transmission time and focus on real execution time of query engine. For debug and analysis purpose.	2023-03-02 16:51:27 +08:00
huangzhaowei	4e92f63d7b	[Fix](Load) Disable for the developer to import fast json in fe (#16235 )	2023-02-01 16:32:11 +08:00
Mingyu Chen	726427b795	[refactor](fe) refactor and upgrade dependency tree of FE and support AWS glue catalog (#16046 ) 1. Spark dpp Move `DppResult` and `EtlJobConfig` to sparkdpp package in `fe-common` module. So taht `fe-core` is longer depends on `spark-dpp` module, so that the `spark-dpp.jar` will not be moved into `fe/lib`, which reduce the size of FE output. 2. Modify start_fe.sh Modify the CLASSPATH to make sure that doris-fe.jar is at front, so that when loading classes with same qualified name, it will be got from doris-fe.jar firstly. 3. Upgrade hadoop and hive version hadoop: 2.10.2 -> 3.3.3 hive: 2.3.7 -> 3.1.3 4. Override the IHiveMetastoreClient implementations from dependency `ProxyMetaStoreClient.java` for Aliyun DLF. `HiveMetaStoreClient.java` for origin Apache Hive metastore. Because I need to modified some of their method to make them compatible with different version of Hive. 5. Exclude some unused dependencies to reduce the size of FE output Now it is only 370MB (Before is 600MB) 6. Upgrade aws-java-sdk version to 1.12.31 7. Support AWS Glue Data Catalog 8. Remove HudiScanNode(no longer support)	2023-01-20 14:42:16 +08:00
Mingyu Chen	32b1456b28	[feature-wip](array) remove array config and check array nested depth (#13428 ) 1. remove FE config `enable_array_type` 2. limit the nested depth of array in FE side. 3. Fix bug that when loading array from parquet, the decimal type is treated as bigint 4. Fix loading array from csv(vec-engine), handle null and "null" 5. Change the csv array loading behavior, if the array string format is invalid in csv, it will be converted to null. 6. Remove `check_array_format()`, because it's logic is wrong and meaningless 7. Add stream load csv test cases and more parquet broker load tests	2022-10-20 15:52:31 +08:00
morrySnow	f1507f93ee	[enhancement](chore)add single empty line rule to fe check style for Nereids (#12365 )	2022-09-06 14:19:59 +08:00
morrySnow	190717dbcc	[enhancement](chore)add single space separator rule to fe check style (#12354 ) Some times, our code use more than one space as separator by mistake. This PR add a CheckStyle rule SingleSpaceSeparator to check that for Nereids.	2022-09-05 21:59:58 +08:00
morrySnow	dac0883635	[chore](checkstyle)forbidden import all kind of relocated guava (#12018 )	2022-08-24 08:47:13 +08:00
morrySnow	7c950c7cd5	[feature](Nereids) support cross join in Nereids (#11502 ) support cross join in Nereids 1. add PhysicalNestedLoopJoin 2. Translate PhysicalNestedLoopJoin to CrossJoinNode in PhysicalPlanTranslator	2022-08-08 22:14:27 +08:00
morrySnow	642499265c	[fe-package]reject illegal import (#11311 )	2022-07-29 14:22:23 +08:00
morrySnow	d17c906eb7	[chore](FE)add license header check in fe's checkstyle (#11076 ) Add license header check in fe's checkstyle	2022-07-22 18:37:32 +08:00
morrySnow	c62c2e308f	[chore]replace checkstyle action with mvn checkstyle:check (#10474 )	2022-06-30 11:20:50 +08:00
morrySnow	f06a06d623	[chore](fe)remove java doc period end check in checkstyle (#10329 ) We do not generate real java doc. All java doc comments is used to help to understand the code logic more easily. So we need loose java doc style check. Remove period character check in summary java doc check rule.	2022-06-24 08:55:53 +08:00
morrySnow	b7b78ae707	[style](fe)the last step of fe CheckStyle (#10134 ) 1. fix all checkstyle warning 2. change all checkstyle rules to error 3. remove some java doc rules a. RequireEmptyLineBeforeBlockTagGroup b. JavadocStyle c. JavadocParagraph 4. suppress some rules for old codes a. all java doc rules only affect on Nereids b. DeclarationOrder only affect on Nereids c. OverloadMethodsDeclarationOrder only affect on Nereids d. VariableDeclarationUsageDistance only affect on Nereids e. suppress OneTopLevelClass on org/apache/doris/load/loadv2/dpp/ColumnParser.java f. suppress OneTopLevelClass on org/apache/doris/load/loadv2/dpp/SparkRDDAggregator.java g. suppress LineLength on org/apache/doris/catalog/FunctionSet.java h. suppress LineLength on org/apache/doris/common/ErrorCode.java	2022-06-17 21:02:45 +08:00
Kikyou1997	67e95276fb	[fix](optimizer) Fix the default join reorder algorithm (#10174 ) Default join reorder algorithm not working for the most cases.	2022-06-17 10:59:33 +08:00
morrySnow	e701c057dc	[style](fe) wrap and whitespace rules (#9764 ) change below rules' severity to error and fix original code error: - EmptyBlock - EmptyCatchBlock - LeftCurly - RightCurly - IllegalTokenText - MultipleVariableDeclarations - OneStatementPerLine - StringLiteralEquality - UnusedLocalVariable - Indentation - OuterTypeFilename - MethodParamPad - GenericWhitespace - NoWhitespaceBefore - OperatorWrap - ParenPad - WhitespaceAfter - WhitespaceAround	2022-05-26 16:56:20 +08:00
morrySnow	235d586f11	[style](fe) code correct rules and name rules (#9670 ) * [style](fe) code correct rules and name rules * revert some change according to comments	2022-05-19 16:36:03 +08:00
morrySnow	8a0097cfb9	[style](java) format fe code with some check rules (#9460 ) Issue Number: close #9403 set below rules' severity to error and format code according check info. a. Merge conflicts unresolved b. Avoid using corresponding octal or Unicode escape c. Avoid Escaped Unicode Characters d. No Line Wrap e. Package Name f. Type Name g. Annotation Location h. Interface Type Parameter i. CatchParameterName j. Pattern Variable Name k. Record Component Name l. Record Type Parameter Name m. Method Type Parameter Name n. Redundant Import o. Custom Import Order p. Unused Imports q. Avoid Star Import r. tab character in file s. Newline At End Of File t. Trailing whitespace found	2022-05-12 20:14:38 +08:00
morrySnow	122cc3b772	[chore](fe code style)add suppressions to fe check style (#9429 ) Current fe check style check all files. But some rules should be only applied on production files. Add suppressions to suppress some rules on test files.	2022-05-12 12:16:55 +08:00
leo65535	c5941fd166	[FE Code Style][sub] Adjust some check rules (#9345 ) Adjust `RedundantImport`,`UnusedImports`,`EmptyStatement`,`NewlineAtEndOfFile`,`UpperEll`, `AvoidStarImport`, `MissingOverride` rules.	2022-05-04 23:34:55 +08:00
morrySnow	784681f106	[FE Code Style][step 0]add github action to check incremental code in pr (#9328 ) 1. add rules to checkstyle 2. add github action to check incremental code in pr	2022-05-01 17:30:29 +08:00

31 Commits