doris

Files

Jibing-Li 9c6c2f736e [Improvement](statistics)Improve stats sample strategy (#26435 )

Improve the accuracy of sample stats collection. For non distribution columns, use 
`n*d / (n - f1 + f1*n/N)`

where `f1` is the number of distinct values that occurred exactly once in our sample of n rows (from a total of N),
and `d` is the total number of distinct values in the sample.

For distribution columns, use `ndv(n) * fraction of tablets sampled` for NDV.

For very large tablet to sample, use limit to control the total lines to scan (for non key column only, because key column is sorted and will be inaccurate using limit).

2023-11-13 15:52:21 +08:00

.idea

[chore](idea) add back .idea dir under fe (#23821 )

2023-09-04 14:01:00 +08:00

be-java-extensions

[refactor](jni) unified jni framework for jdbc catalog (#26317 )

2023-11-13 14:28:15 +08:00

check/checkstyle

[Revert](code-style) revert FE code-format #25033 and #26488 (#26505 )

2023-11-07 16:37:24 +08:00

fe-common

[improve](group commit) Add a swicth to wait internal group commit lo… (#26734 )

2023-11-13 10:35:35 +08:00

fe-core

[Improvement](statistics)Improve stats sample strategy (#26435 )

2023-11-13 15:52:21 +08:00

hive-udf

[Improvement](hive-udf)(doc) minimize hive-udf and add some docs. (#24786 )

2023-10-16 16:47:21 +08:00

spark-dpp

[feature](fe) Add coverage tool for FE UT (#26203 )

2023-11-11 19:54:04 +08:00

pom.xml

[feature](fe) Add coverage tool for FE UT (#26203 )

2023-11-11 19:54:04 +08:00

README

…

README

# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements.  See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership.  The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License.  You may obtain a copy of the License at
#
#   http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied.  See the License for the
# specific language governing permissions and limitations
# under the License.

# fe-common

This module is used to store some common classes of other modules.

# spark-dpp

This module is Spark DPP program, used for Spark Load function.
Depends: fe-common

# fe-core

This module is the main process module of FE.
Depends: fe-common, spark-dpp