doris

Author	SHA1	Message	Date
Mingyu Chen	2810029d24	[fix](multi-catalog) fix bug that replay init catalog may happen after catalog is dropped (#15919 )	2023-01-14 09:41:37 +08:00
airborne12	be110ffaf6	[thirdparty](clucene) add clucene deps for doris inverted index (#15807 ) As part of Inverted Index DSIP steps, we'd like to contribute our inverted index implementations step by step. First of all we need to introduce clucene to doris thirdparty libs, because inverted index implementations are based on lucence API and index file format, also we add our features and performance improvements base on clucene, so we need to maintain the repo ourselves	2023-01-12 21:59:19 +08:00
Mingyu Chen	89c21af87d	[chore](fe) update fe snapshot to 1.2 and fix auditloader compile error (#15787 ) This PR #14925 change some field of AuditEvent, so we need to upgrade the fe-core's SNAPSHOT to 1.2 because auditloader depends on fe-core Already push the 1.2-SNAPSHOT to https://repository.apache.org/content/repositories/snapshots/org/apache/doris/fe-core/1.2-SNAPSHOT/	2023-01-11 08:46:48 +08:00
Adonis Ling	95f2f43c02	[fix](macOS) Failed to run BE UT due to syscall to map cache into shared region failed (#15641 ) According to the post https://developer.apple.com/forums/thread/676684, the executable whose size is bigger than 2G may fail to start. The size of the executable `doris_be_test` generated by run-be-ut.sh is 2.1G (> 2G) now and we can't run it on macOS (arm64). We can separate the debug info from the executable `doris_be_test` to reduce the size. After that, we can run `doris_be_test` successfully.	2023-01-06 01:23:37 +08:00
plat1ko	f3aea7f0f0	[Enhancement](status) Unify error code and enable customed err msg for BE internal errors (#14744 )	2022-12-11 23:33:18 +08:00
Pxl	c804024e5d	[Chore](workflow) add clang-tidy workflow (#14737 ) add clang-tidy workflow	2022-12-02 14:10:29 +08:00
Adonis Ling	0daebde223	[fix](java-udf) Disable the corresponding configuration if building BE without Java UDF support (#14303 )	2022-11-29 10:12:00 +08:00
Jeffrey	2bc43f9757	[fix](ui) clean npm cache before install (#14629 ) npm ERR! Unexpected end of JSON input while parsing near '...ih/ae0E6HfGdwwO/\r\na'	2022-11-28 12:12:51 +08:00
Adonis Ling	249b688663	[chore](github) Add a workflow to check BE UT on macOS (#14506 )	2022-11-23 08:38:28 +08:00
Xinyi Zou	21416f9947	[enhancement](memory) Support Jemalloc metrics and default allocator changed to Jemalloc (#14384 )	2022-11-18 21:02:54 +08:00
Zhengguo Yang	12652ebb0e	[UDF](java udf) using config to enable java udf instead of macro at compile time (#14062 ) * [UDF](java udf) useing config to enable java udf instead of macro at compile time	2022-11-11 09:03:52 +08:00
Mingyu Chen	1c07a01038	[feature](multi-catalog) Support data on s3-compatible oss and support aliyun DLF (#13994 ) Support Aliyun DLF Support data on s3-compatible object storage, such as aliyun oss. Refactor some interface of catalog, to make it more tidy. Fix bug that the default text format field delimiter of hive should be \x01 Add a new class PooledHiveMetaStoreClient to wrap the IMetaStoreClient.	2022-11-08 14:02:41 +08:00
Adonis Ling	2ef8f3f6f4	[enhancement](java-udf) Support loading libjvm at runtime (#13660 )	2022-10-28 08:45:12 +08:00
Adonis Ling	f51464af59	[chore](macOS) Support Java UDF (#13714 )	2022-10-28 08:40:56 +08:00
morrySnow	b13061360f	[enhancement](chore) build fe-common when build java-udf (#13647 ) * [enhancement](chore) Enhance build script compatibility * remove duplicate fix	2022-10-26 09:23:21 +08:00
Pxl	bd884d3298	[Chore](build) add a environment variable DISABLE_JAVA_UDF (#13588 )	2022-10-25 17:46:56 +08:00
Gabriel	78278f5943	[chore](be version) Check BE version by script (#13594 ) Check BE version by script	2022-10-25 16:20:38 +08:00
Adonis Ling	b042ef9765	[chore](macOS) Fix the issues with protoc and protoc-gen-grpc-java on M1 (#13571 ) There are some errors occur when building FE by JDK (arm64) on M1 because the dependencies protoc and grpc-java doesn't support M1. #13563 modified the build.sh to fix this issues by adding -Dos.arch=x86_64 to build command. However, if some one executes `mvn clean package -DskipTests=true` under the folder fe, the errors will occur again. This PR introduces a better way to fix them.	2022-10-23 14:10:46 +08:00
Adonis Ling	20ade4ae96	[chore](macOS) Disable JAVA UDF temporarily (#13563 ) Fail to start BE (ASAN) if it was built with JAVA UDF on macOS.	2022-10-22 01:05:45 +08:00
HappenLee	f0b608018b	[config](tpch) Disable jemalloc and change the hint of tpch q22 (#13555 )	2022-10-21 21:35:43 +08:00
Mingyu Chen	847b80ebfa	[test](jdbc) add jdbc and hive regression test (#13143 ) 1. Modify default behavior of `build.sh` The `BUILD_JAVA_UDF` is default ON, so that jvm is needed for compilation and runtime. 2. Add docker-compose for MySQL 5.7, PostgreSQL 14 and Hive 2 See `docker/thirdparties/docker-compose`. 3. Add some regression test cases for jdbc query on MySQL, PG and Hive Catalog The default is `false`, if set to true, you need first start docker for MySQL/PG/Hive. 4. Support `if not exists` and `if exists` for create/drop resource and create/drop encryptkey	2022-10-21 15:29:27 +08:00
Adonis Ling	125def5102	[enhancement](macOS M1) Support building from source on macOS (M1) (#13195 ) # Proposed changes This PR fixed lots of issues when building from source on macOS with Apple M1 chip. ## ATTENTION The job for supporting macOS with Apple M1 chip is too big and there are lots of unresolved issues during runtime: 1. Some errors with memory tracker occur when BE (RELEASE) starts. 2. Some UT cases fail. ... Temporarily, the following changes are made on macOS to start BE successfully. 1. Disable memory tracker. 2. Use tcmalloc instead of jemalloc. This PR kicks off the job. Guys who are interested in this job can continue to fix these runtime issues. ## Use case ```shell ./build.sh -j 8 --be --clean cd output/be/bin ulimit -n 60000 ./start_be.sh --daemon ``` ## Something else It takes around _10+_ minutes to build BE (with prebuilt third-parties) on macOS with M1 chip. We will improve the development experience on macOS greatly when we finish the adaptation job.	2022-10-18 13:10:13 +08:00
Xinyi Zou	8dc09ad05c	[enhancement](memory) Default Jemalloc as generic memory allocator #13367 gperftools/tcmalloc[https://github.com/gperftools/gperftools] is outdated, there are no new features for many years, only fix bugs. doris is currently used by default. google/tcmalloc[https://github.com/google/tcmalloc], very active recently, has many new features, and is expected to perform better than jemalloc, but there is currently no stable version. Moreover, the compilation dependencies are complex and difficult to integrate, and are incompatible with gperftools/tcmalloc, and there are few reference documents. jemalloc[https://github.com/jemalloc/jemalloc] performs better than gperftools/tcmalloc under high concurrency, and is mature and stable, looking forward to being the default memory allocator. Tested in Doris: #12496	2022-10-14 09:54:54 +08:00
yiguolei	15c7c0b754	[chore](release build) copy license and notice file to output folder and strip debug info from meta tool (#13222 ) * [chore](release build) copy license and notice file to output folder and strip debug info from meta tool Co-authored-by: yiguolei <yiguolei@gmail.com>	2022-10-10 08:31:34 +08:00
Jeffrey	1cb43b7f38	[fix](frontend) fix peerDependencies error (#12373 ) ```npm install``` problem with peer dependencies in the latest version of npm (v7+) Use ```npm install --legacy-peer-deps``` to fix it. Reference: https://blog.npmjs.org/post/626173315965468672/npm-v7-series-beta-release-and-semver-major	2022-09-23 21:54:52 +08:00
Zhengguo Yang	8fcd8ed8b3	[chore](build) add option to disable -frecord-gcc-switches (#12846 )	2022-09-22 15:38:14 +08:00
luozenglin	b619bb2000	[enhancement](ldap) optimize LDAP authentication. (#11948 ) * [enhancement](ldap) optimize LDAP authentication. 1. Support caching LDAP user information. 2. HTTP authentication supports LDAP. 3. LDAP temporary users support default user property. 4. LDAP configuration supports the `admin show config` and `admin set config` commands.	2022-08-24 17:08:14 +08:00
Adonis Ling	4fa53b4cdb	[chore](workflow) Add shellcheck to check shell scripts (#11744 )	2022-08-18 16:07:28 +08:00
caiconghui	411254c128	[Enhancement](hdfs) Support loading hdfs config from hdfs-site.xml (#11571 )	2022-08-08 14:18:28 +08:00
Adonis Ling	573ebf235e	[enhancement](build) Support customizing extra compile flags (#11444 )	2022-08-03 11:02:17 +08:00
jiafeng.zhang	ac62c9507e	[improvement](script)Audit build script (#11411 )	2022-08-02 12:06:44 +08:00
Adonis Ling	5215d95064	[enhancement](workflow) Use ccache to speed the BE UT (Clang) up (#11339 )	2022-07-29 21:19:26 +08:00
Xinyi Zou	d9095922d9	[Enhancement] [Memory] add strict memory usage compile option STRICT_MEMORY_USE (#10936 ) In the strict memory usage mode of STRICT_MEMORY_USE=ON, when the capacity of the vectorized Hash Table is greater than 2G, it starts to grow when 75% of the capacity is satisfied, the memory usage of the vectorized Join becomes 50% of the previous value. STRICT_MEMORY_USE=ON` expects BE to use less memory, and gives priority to ensuring stability when the cluster memory is limited.	2022-07-18 16:16:43 +08:00
lihangyu	b04a791895	[Enhancement] support compile with jemalloc (#10542 ) A test feature to use jemalloc as default malloc.	2022-07-11 12:15:35 +08:00
Gabriel	b11e72b76b	[chore] turn off java-udf by default when compiling in parallel (#10569 )	2022-07-03 23:24:49 +08:00
Mingyu Chen	498a80547c	[fix](fe-ut) fix fe ut and build.sh bug (#10432 )	2022-06-27 19:01:05 +08:00
Pxl	4750e94746	set default do not build benchmark-tool && and use lld/gold (#10215 )	2022-06-25 22:31:11 +08:00
Mingyu Chen	8a49c7ef04	[chore] Rename Doris binary output format	2022-06-24 15:30:05 +08:00
Mingyu Chen	67f341f44e	[TLP](step-1) Remove incubator prefix (#10230 ) Remove some `incubator-` prefix in source code. The document is not modified, will be done in next PR.	2022-06-19 19:34:52 +08:00
Zhengguo Yang	39a2785ce2	[enhancement] support simd instructions on arm cpus through sse2neon (#10068 ) * [enhancement] support simd instructions on arm cpus through sse2neon	2022-06-14 09:17:09 +08:00
Zhengguo Yang	e0cf2677a0	[dependency][enhancement] support build libhdfs in arm cpus (#10018 ) Supports native hdfs functionality on arm cpu This pr mainly upgrades libdfs3 and supports running on arm，and make libhdfs3 with kerberos as default	2022-06-10 19:40:41 +08:00
Xinyi Zou	ca05d1ee01	[fix](memory tracker) Fix lru cache, compaction tracker, add USE_MEM_TRACKER compile (#9661 ) 1. Fix Lru Cache MemTracker consumption value is negative. 2. Fix compaction Cache MemTracker has no track. 3. Add USE_MEM_TRACKER compile option. 4. Make sure the malloc/free hook is not stopped at any time.	2022-05-25 08:56:17 +08:00
gtchaos	b3a2a92bf5	[deps] libhdfs3 build enable kerberos support (#9524 ) Currently, the libhdfs3 library integrated by doris BE does not support accessing the cluster with kerberos authentication enabled, and found that kerberos-related dependencies（gsasl and krb5） were not added when build libhdfs3. so, this pr will enable kerberos support and rebuild libhdfs3 with dependencies gsasl and krb5: - gsasl version: 1.8.0 - krb5 version: 1.19	2022-05-22 20:58:19 +08:00
Adonis Ling	119ff2c02d	[enhancement] Improve debugging experience. (#9677 )	2022-05-19 16:36:37 +08:00
Dongyang Li	8c166d747c	Clean the version.sh file before build, otherwise the version information in the binary package produced by this compilation is still the commit id of the last time. (#9534 ) Co-authored-by: stephen <hello-stephen@qq.com>	2022-05-13 10:23:44 +08:00
Zhengguo Yang	290366787c	[refactor] refactor code, replace some file with stl libs (#8759 ) 1. replace ConditionVariables with std::condition_variable 2. repalace Mutex with std::mutex 3. repalce MonoTime with std::chrono	2022-04-13 09:55:29 +08:00
HappenLee	ce6b5169c2	[fix](join) Fix error bucket num get in bucket shuffle join in dynamic partition (#8891 )	2022-04-09 19:11:44 +08:00
Pxl	03c5d5d677	fix some error on build.sh && fix build fail with clang on runtime_profile (#8748 )	2022-04-05 15:52:53 +08:00
Mingyu Chen	22cf6ea17c	[chore] Modify build.sh and refactor dependency of FE submodules (#8732 ) This PR fixes the #8731 and refactor the `build.sh` script. The build.sh script is currently responsible for the compilation of the following Doris components. 1. FE - fe-common - fe-core - spark-dpp - hive-udf - java-udf - ui 2. BE - palo_be - meta_tool 3. broker In the FE module. - The 4 submodules `fe-common, fe-core, spark-dpp and ui` together form Frontend. - `spark-dpp, hive-udf and java-udf` can be compiled separately to produce jar packages for individual use. In the BE module. - `palo_be` can start the BE process separately. - `meta_tool` can be compiled separately to produce binaries. The modified build.sh script has the following changes: 1. there is no longer an option to compile `ui` separately, build together with `--fe`. 2. `fe/be/spark-dpp/hive-udf/java-udf/palo_be/meta_tool` can be compiled separately. 3. all components except `java-udf` will be compiled by default (`java-udf` is in development) Remaining issues: Several submodules of FE have messy dependencies. For example, `java-udf` depends on `fe-core`, and `fe-core` depends on `spark-dpp`, resulting in a large binary jar of `java-udf`. It needs to be reorganized afterwards.	2022-03-30 00:13:24 +08:00
Mingyu Chen	f3659c87c1	[fix][chore](repository)(fe) check reponame when creating repository and modify build.sh (#8671 ) 1. We need to check repo name when creating repository 2. modify build.sh to not install spark-dpp when spark-dpp is not compiled	2022-03-29 11:32:52 +08:00

1 2 3

124 Commits