Commit Graph

9976 Commits

Author SHA1 Message Date
c323bc44ff [feature](docker)add be init script option (#16909) 2023-04-18 20:03:18 +08:00
380bc16595 fix doc (#18772) 2023-04-18 19:54:28 +08:00
79c446c89f [enhancement](exception) Column filter/replicate supports exception safety (#18503) 2023-04-18 19:23:09 +08:00
4a16eff16d [fix](merge-on-write) enable_unique_key_merge_on_write property should only be used for unique table (#18734) 2023-04-18 18:40:01 +08:00
031d35d4a1 [fix](stats) Stats still in cache after user dropped it (#18720)
1. Evict the dropped stats from cache
2. Remove codes for the partition level stats collection
3. Disable analyze whole database directly
4. Fix the potential death loop in the stats cleaner
5. Sleep thread in each loop when scanning stats table to avoid excessive IO usage by this task.
2023-04-18 16:41:10 +08:00
c3f808cc06 Revert "[enhancement](Nereids) optimize bloom filter size reducing strategy (#18596)" (#18768)
This reverts commit 3eac53f75d5f3eb05e958403efeb7578ad86e438.
2023-04-18 15:37:19 +08:00
1b4be46ce5 [typo](docs) optimization Monitoring and alarming doc (#18767)
* [typo](docs) optimization Monitoring and alarming doc

* fix
2023-04-18 14:14:29 +08:00
564446e52f [Refact](type system) refact serde for type system and pb serde impl (#18627) 2023-04-18 14:13:56 +08:00
18898db09d [feature](function) Add new parameters to 'trim'. (#18580) 2023-04-18 14:13:30 +08:00
62e4140d17 [fix](olap) fix lost disable_auto_compaction info when fe restart (#18757)
Signed-off-by: nextdreamblue <zxw520blue1@163.com>
2023-04-18 14:11:40 +08:00
0b074ade02 [fix](const column) fix coredump caused by const column for some functions (#18737) 2023-04-18 13:57:55 +08:00
6b351a2818 [vectorzied](function) fix array_map function analyzed failed with order by clause (#18676)
* [vectorzied](function) fix array_map function analyzed failed with order by clause

* add test
2023-04-18 12:01:44 +08:00
3a6eae0ec5 [feature](Nereids): infer not null from Agg Count(distinct). (#18599) 2023-04-18 11:22:36 +08:00
7fe281b7a6 [docs](docs)Modify the wrong information about the concept description in the document (#18760) 2023-04-18 10:43:53 +08:00
7b0e5ad54d [enhance](buffered reader)add bvar to detect download bytes and download speed (#18736) 2023-04-18 10:14:07 +08:00
3de4d64657 [chore](hashtable) Use doris' Allocator to replace std::allocator in phmap (#18735) 2023-04-18 09:58:28 +08:00
98b8efc2c2 [fix](multi-catalog)fix old s3 properties check (#18430)
fix old s3 properties check
fix for #18005 (comment)
2023-04-18 09:58:13 +08:00
0753dc2cc1 [fix](merge-on-write) the mow keyset check is quite slow when loading large data set, remove it from release version (#18750) 2023-04-18 09:51:25 +08:00
10b252856d [feature](Nereids): pullup semiJoin through aggregate. (#18669) 2023-04-18 09:31:07 +08:00
b68857902e [Compile](BE) Fix compile failed with tcmalloc (#18748) 2023-04-18 09:26:45 +08:00
86b8e95045 [fix](Nereids): when GroupExpr already exists, we need to remove ParentExpression (#18749) 2023-04-17 23:12:26 +08:00
b59c4b4702 [fix](build) Fix missing header files (#18740) 2023-04-17 21:22:15 +08:00
c904384672 Revert "[refactor](planner) using crchash replace murmurhash in the runtime filter (#18472)" (#18730)
This reverts commit a8315b86ca5543a6cc5b3eab97e4f0953b984247.
2023-04-17 20:25:18 +08:00
575c1620c2 [Improve](fe)Use commons-lang3 uniformly and refactor PatternGenerator#generateTypePattern (#18666)
`commons-lang`(1and2) is no longer maintained since 2011, and the official recommendation is `commons-lang3`, which can be smoothly upgraded to be compatible with `commons-lang`.
We use both dependencies in `fe`, which can be completely unified.

`PatternGenerator#generateTypePattern` has many meaningless loops, and IntegerRange is introduced for,
which is unnecessary. So I refactored it.
2023-04-17 20:15:17 +08:00
b458c14d31 [improvement](merge-on-write) move segment check to delta writer (#18643) 2023-04-17 19:59:44 +08:00
16cdd9e95a [feature](segcompaction) enable segcompaction by default (#18722)
Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>
2023-04-17 18:00:49 +08:00
74d424e6d4 [Bug](DECIMAL) Fix bug for arithmatic expr DECIMALV2 / DECIMALV3 (#18723) 2023-04-17 16:43:36 +08:00
d61f52d277 [fix](Nereids): fix sum func in eager agg (#18675) 2023-04-17 15:06:28 +08:00
1e06763366 [fix](bitmap) fix bitmap_count errors to set nullable to non-nullable bitmap col (#18689) 2023-04-17 13:23:27 +08:00
5300b21db7 [Bug](DECIMALV3) report failure if a decimal value is overflow (#18336) 2023-04-17 13:18:14 +08:00
eb128753ac [Opt](pipeline) opt pipeline shared scan (#18715) 2023-04-17 13:06:39 +08:00
9e960f4c4f [chore](build) Use include-what-you-use to optimize includes (#18681)
Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.
2023-04-17 11:44:58 +08:00
a2278dbc6c [opt](nereids) optimize filter estimation for pattern "col=col" #18716
Tpc-h q10 and q5 benefit from this optimization.

For a given hash join condition, A=B, sometimes both A and B are reduced by filters. In this pr, both reductions are counted in join estimation.
2023-04-17 11:44:35 +08:00
b5b0148010 [feature](Nereids): when cost time > 5s, throw timeout Exception (#18316) 2023-04-17 11:21:54 +08:00
32c1d0acea [typo](docs) Add directory level for ST doc (#18719) 2023-04-17 11:15:37 +08:00
a8a133b497 [typo](docs)Optimize Monitoring and alarming documentation (#18718) 2023-04-17 11:01:54 +08:00
3eac53f75d [enhancement](Nereids) optimize bloom filter size reducing strategy (#18596) 2023-04-17 10:50:08 +08:00
ac0b382fed [improvement](executor) Priority Queue support vruntime (#18635)
* 1 rename some class
2 mfqs support vruntime

* fix const

* as sugguestion

* fix const
2023-04-17 10:17:28 +08:00
9c7a69ff09 Fix update submodule everytime (#18671) 2023-04-17 10:13:31 +08:00
092d81f88a [BugFix](functions) fix multi_search_all_positions #18682 2023-04-17 08:32:57 +08:00
afbbf84675 [chore](build) add apache-orc git submodule path (#18695)
1. Add apache-orc git submodule update path, not update all modules
When sh build.sh, update all modules will fails serveral times because of unstable github network.
It wastes many time.

2. Add gitignore for  be/src/apache-orc/   to avoid mistake commits.
2023-04-17 00:08:25 +08:00
ddbff2aa39 [feature](jni) map c++ block to java vector table (#18566)
PR(#17960) has introduced vector table which can map java table to c++ block.
In some cases(java udf & jdbc exector), we should map c++ block to java table. This PR implements this function.

The memory structure of java vector table and c++ block is consistent,
so the implementation doesn't copy the block, just passes the memory address.
2023-04-17 00:04:53 +08:00
0788ff115c [Fix][Docs]: from ARRAY\<T\> to ARRAY<T>(#18713) 2023-04-16 22:36:48 +08:00
b7d7525e9d [doc][multi-catalog] add timed refresh en (#18705)
For a catalog, we can set the metadata_refresh_interval_sec parameter to specify a timed refresh
2023-04-16 22:36:16 +08:00
dc6a3e088e [doc](multi-catalog)added timing refresh (#18704)
For a catalog, we can set the metadata_refresh_interval_sec parameter to specify a timed refresh
2023-04-16 22:35:59 +08:00
57982ddc46 [Fix](catalog)Fix hudi-catalog get file split error (#18644) (#18673)
`hudi-common` depends on `parque-avro`, but the dependency scope is `provide`. 
When we use `hudi-catalog`, `HoodieAvroWriteSupport` will be called. This method depends on `parque-avro`, so it will generate ClassNotFound
Describe your changes.
2023-04-16 21:56:14 +08:00
e6884a3768 [log](fe) add more detail log for master transfer (#17350) (#17485) 2023-04-16 18:35:06 +08:00
1cbbc60822 [feature](config) support "experimental" prefix for FE config (#18699)
For each release of Doris, there are some experimental features.
These feature may not stable or qualified enough, and user need to use it by setting config or session variables,
eg, set enable_mtmv = true, otherwise, these feature is disable by default.

We should explicitly tell user which features are experimental, so that user will notice that and decide whether to
use it.

Changes
In this PR, I support the experimental_ prefix for FE config and session variables.

Session Variable

Given enable_nereids_planner as an example.

The Nereids planner is an experimental feature in Doris, so there is an EXPERIMENTAL annotation for it:

@VariableMgr.VarAttr(..., expType = ExperimentalType.EXPERIMENTAL)
private boolean enableNereidsPlanner = false;
And for compatibility, user can set it by:

set enable_nereids_planner = true;
set experimental_enable_nereids_planner = true;
And for show variables, it will only show experimental_enable_nereids_planner entry.

And you can also see all experimental session variables by:

show variables like "%experimental%"
Config

Same as session variable, give enable_mtmv as an example.

@ConfField(..., expType = ExperimentalType.EXPERIMENTAL)
public static boolean enable_mtmv = false;
User can set it in fe.conf or ADMIN SET FRONTEND CONFIG stmt with both names:

enable_mtmv
experimental_enable_mtmv
And user can see all experimental FE configs by:

ADMIN SHOW FRONTEND CONFIG LIKE "%experimental%";
TODO
Support this feature for BE config

Only add experimental for:

enable_pipeline_engine
enable_nereids_planner
enable_single_replica_insert
and FE config:

enable_mtmv
enabel_ssl
enable_fqdn_mode
Should modify other config and session vars
2023-04-16 18:32:10 +08:00
652e7fe4ab [regression-test] add some suites (#18286)
test_join_result_count.groovy
2023-04-16 18:19:52 +08:00
24dd3f19cd [feature](extension) support beats output to doris (#18448) 2023-04-16 18:17:48 +08:00