Commit Graph

13721 Commits

Author SHA1 Message Date
a2858bbbfc [fix](regression-test) fix wrong http port in regression-test case compaction-with-delete #18484 2023-04-08 15:17:52 +08:00
161678380c [bug](GC)the issue of incorrect disk usage (#18397) 2023-04-08 09:32:36 +08:00
432fef7aa8 [Enhancement](iceberg table)support inPredicate for iceberg table (#18226) 2023-04-08 08:59:23 +08:00
851793aa66 [fix](test) fix p0 regression case test_compaction_with_delete failed (#18479) 2023-04-07 22:11:40 +08:00
936807b5c9 [typo](doc) Fixed typos in SHOW-PROCESSLIST.md (#18469) 2023-04-07 22:02:37 +08:00
d881d71cd1 [Bug](cast) Fix bug for cast function between datetimev2 and string (#18442)
Fix bug for cast function between datetimev2 and string
2023-04-07 22:02:15 +08:00
2bb025a474 [typo](docs) fix some error in this files (#18259)
* [typo](docs) fix some error in this files

* [typo](docs) fix some error in this files

* update files
2023-04-07 18:01:07 +08:00
5678708b7d [Fix](nereids) change setting byte size from translator to catalog generate scalar type (#18450)
Problem: when used nereids to generate scalarType, byteSize would be set. After switch the optimizer to planner, planner would reuse scalarType in some cases.
Fix: change byteSize setting from Plan translator to toCatalogDataType
2023-04-07 17:38:08 +08:00
30f2abe5d3 [FIX](Map)fix calculate map offset in olap convertor (#18295)
Fix be core when load bigger kv data in one row for map.
2023-04-07 17:04:08 +08:00
4c7979be58 [fix](regression-test) fix failure of test case (#18454) 2023-04-07 16:44:35 +08:00
e3ff2e3d21 [fix](file cache) Fix be core while use block/whole/sub file cache (#18440)
BE will core dump while use whole/sub file cache.
Call func CachedRemoteFileReader/WholeFileCache/SubFileCache::read_at_impl() did not pass IOContext when reading segment footer.
2023-04-07 16:39:59 +08:00
8e8c5b5283 [feature](Nereids): deduplicate InPredicate. (#18428) 2023-04-07 16:22:21 +08:00
f6f4dac1d0 [Improvement](DECIMAL) Improve decimal operation (#18437) 2023-04-07 15:58:28 +08:00
c5d9e8529a [enhancement](Nereids) adjust runtime filter parameters when enable pipeline engine (#18427)
when enable pipeline engine, we

1. reduce non-broadcast join bloom filter size
2. turn in or bloom filter to bloom filter if target is remote
2023-04-07 15:48:25 +08:00
63994e351f [fix](Nereids) extract complicated time string in evaluating cost model framework (#17864)
1. The time string in the profile can be "xx s xx ms". The framework should extract time with re package to support more complicated time string
2. Add stats for sortNode and AggNode in `withChildren`
2023-04-07 15:33:04 +08:00
b0a13d019f [fix](planner) enable fallback to legacy planner when execute internal query (#18353) 2023-04-07 15:15:49 +08:00
926c6056c7 [refactor](config) Delete the environment variable enable_vectorized_engine (#18166) 2023-04-07 14:23:16 +08:00
308ff9a16f [enchancement](memory) tracking lru cache memory and page memory not in cache (#18361)
Statistics lru cache memory in metrics
Statistics page memory not in cache in mem tracker
2023-04-07 14:22:44 +08:00
ccb3541fa5 [chore](regression) print exception along with error sql when run sql file (#18374) 2023-04-07 14:19:47 +08:00
d36e9bd523 [chore](scan) Disable low cardinality optimization for compaction (#18424) 2023-04-07 14:19:11 +08:00
5d876414b5 [typo](docs) add autobucket to sidebars (#18453) 2023-04-07 14:17:36 +08:00
f42db08ccc [fix](Nereids) Fixed a problem with completing ClusterName (#18366) 2023-04-07 13:35:03 +08:00
b1956b42fb [enhancement](Nereids) disable heavy operator penalty in cost model v1 (#18422) 2023-04-07 13:16:59 +08:00
a7b708263d [fix](nereids) move validate data types before EliminateUnnecessaryProject rule (#18393)
validate supported data types checks if a project node's output contains any unsupported data types like array, map, etc in nereids. So this validation should run before EliminateUnnecessaryProject rule
2023-04-07 13:10:18 +08:00
c32adba1cf [Refactor](Pipeline) Refactor pipeline code to improve coverage (#18376)
Refactor pipeline code to improve coverage
2023-04-07 13:09:44 +08:00
2783b27788 [fix](Nereids): fix LogicalProject withXXX(). (#18441) 2023-04-07 12:38:53 +08:00
Pxl
e77da1519a [Enchancement](materialized-view) adjust desc table all display fields (#18357)
adjust desc table all display fields
2023-04-07 11:14:17 +08:00
Pxl
7631a8fb39 [Bug](materialized-view) fix mv define expr persistence replay incorrect after schema change (#18418)
fix mv define expr persistence replay incorrect after schema change
2023-04-07 11:11:55 +08:00
Pxl
267b690dad [Bug](materialized-view) fix materialized-view query match not consider with order by elements (#18384)
fix materialized-view query match not consider with order by elements
2023-04-07 11:11:18 +08:00
2b662ac26b [Fix](segment iterator) fix filter block size and filter size mismatch problem (#18395)
adding result column id to _column_filter in _output_index_result_column
2023-04-07 09:43:33 +08:00
505f25c580 [fix](planner)use base index if the where clause is a constant value (#18367)
sql : select bitmap_empty() from d_table where true;
should always use base index instead of any mv, because the conjuncts is constant (true) and use none of the column from any mv
2023-04-07 09:15:00 +08:00
4e1cdb9ce7 [fix](agg_sort)fix bug of agg sort group concat with order by(#18447) 2023-04-07 08:42:36 +08:00
759f1da32e [Enhencement](Backends) add HostName filed in backends table and delete backends table in information_schema (#18156)
1.  Add `HostName` field for `show backends` statement and `backends()` tvf.
2. delete the `backends` table in `information_schema` database
2023-04-07 08:30:42 +08:00
22deeecbe1 [Improvement](multi catalog)Cache File for Hive Table, instead of cache file splits. (#18419)
Currently, the session variable for Split size will not take effect after the file splits are cached.
1. This PR is to cache file for Hive Table, instead of cache file splits. And split the file every time using the current split size.
2. Use self splitter by default.
2023-04-07 00:07:23 +08:00
a059973cff [typo](doc)Add description that external table are no longer maintained (#18425)
Co-authored-by: hechao <hechao@selectdb.com>
2023-04-06 21:01:26 +08:00
e848e456be [config] modify tablet_shard to 4 and add some log (#18416)
modify the default value of BE config tablet_map_shard_size to 4. To reduce lock contention.
Add log when failed writing disk test file, for debug
2023-04-06 17:18:16 +08:00
981ead9032 [feature](Nereids) support binary arithmetic function (#18213)
support binary arithmetic functions like:

add(op1, op2) -> op1 + op2
subtract(op1, op2) -> op1 - op2
multiply(op1, op2) -> op1 * op2
divide(op1, op2) -> op1 / op2
mod(op1, op2) -> op1 % op2
2023-04-06 16:57:04 +08:00
33ae4524ce [fix](multi-catalog) Fix properties check in S3Storage and add hive socket timeout config (#18420)
Co-authored-by: jinzhe <jinzhe@selectdb.com>
2023-04-06 16:35:24 +08:00
27576ef8dc [fix](stats) Fix analyze table failed (#18386) 2023-04-06 15:45:09 +08:00
82248ab392 [FIX](complex-type) get_default to return real nested default value (#18413)
make real default value to return with nested type in complex type
2023-04-06 15:24:32 +08:00
591f76a6a4 [fix](alter inverted index) Temporary deal with add or drop inverted index by directly schema change (#18378)
In the current implementation of the function of dynamically add and drop inverted index, there is a problem that the inverted index information of historical data is out of date after compaction on the base tablet.

In the future, I will submit PRs to solve this problem. Now, temporarily add or drop inverted index by the directly schema change logic
2023-04-06 15:07:37 +08:00
550c8aa648 [Bug](DECIMALV3) fix wrong decimal scale returned by function round (#18375) 2023-04-06 14:44:21 +08:00
a474be6d03 [Bug](ES): Es object mapping error (#18382)
Issue Number: close #18379
2023-04-06 14:11:09 +08:00
db766bb073 [fix](planner) decimalv2 castTo decimalv2 should change type directly (#18297) 2023-04-06 13:51:50 +08:00
8b61709ec8 [feature](multi-catalog) support select current_catalog(); (#18163) 2023-04-06 12:06:10 +08:00
4ec6aa1691 [fix](planner) trying register constnat slotRef to table cause NPE (#18356)
could reproduced by:

CREATE TABLE t (
name varchar(128)
) ENGINE=OLAP
UNIQUE KEY(name)
DISTRIBUTED BY HASH(name) BUCKETS 1;

insert into t values('abc');

SELECT cd
FROM
(SELECT cast(now() as string) cd FROM t) t1
JOIN
(select cast(now() as string) td from t GROUP BY now()) t2
ON t1.cd = t2.td;

ERROR 1105 (HY000): errCode = 2, detailMessage = Unexpected exception: null
2023-04-06 11:50:12 +08:00
9a916cffe4 [Optimization](String) Optimize the injection of statistics. #18401
1. Remove useless partition statistics injection.
2. Adding judgment logic to avoid exception during numerical transformation.
2023-04-06 11:42:11 +08:00
f73189860f [tpch](nereids) trustable join condition (#18272)
(affects tpch q14/7/9)
1. equation estimation confidence level
For equation, if any side is almost unique, its estimation confidence is high, we call it trustable condition.
if a join contains more than one un-trustable condition, we only use the one whose selectivity is biggest in order to avoid error propagation.

2. like expression estimation factor: 0.2
give a separate default shrink ratio for like operator, default ratio is 0.2

3. disable fat-child-penalty
set HEAVY_OPERATOR_PUNISH_FACTOR=1
this change affect tpch q15. This factor should be adaptive to the implementation of BE.
2023-04-06 11:20:47 +08:00
Pxl
76d76f672c [Chore](build) enchancement for backend build time usage (#18344) 2023-04-06 11:13:21 +08:00
4ca0c0face [fix](join) fix wrong result of right join (#18365)
When processing data in hash table for right join and full outer join, if the output data rows of one hash bucket excceeds batch size, the logic when continue processing this bucket is wrong, it should differentiate between different join types.
2023-04-06 10:55:58 +08:00