doris

Author	SHA1	Message	Date
huangzhaowei	09d98c1663	[BugFix](MTMV)Set enable_mtmv_scheduler_framework master only to avoid regression fail (#18473 ) Set enable_mtmv_scheduler_framework master only to avoid regression fail	2023-04-09 08:47:18 +08:00
Mingyu Chen	60c0bbe272	[fix](profile) fix show load query profile (#18487 ) Sometimes, `show load profile` will only show part of the insert opertion's profile. This is because we assume that for all load operation(including insert), there is only one fragment in the plan. But actually, there will be more than 1 fragment in plan. eg: `insert into tbl1 select * from tbl1 limit 1` will have 2 fragments. This PR mainly changes: 1. modify the `show load profile` Before: `show load profile "/queryid/taskid/instanceid";` After: `show load profile "/queryid/taskid/fragmentid/instanceid";` 2. Modify the display of `ReadColumns` in OlapScanNode Because for wide table, the line of `ReadColumns` may be too long for show in profile. So I wrap it and each line contains at most 10 columns names. 3. Fix tvf not working with pipeline engine, follow up #18376	2023-04-09 08:41:18 +08:00
ZhangYu0123	fb50626075	[optimize](string) optimize concat function by SIMD memcpy (#18458 ) Optimize concat function 29% up by memcpy_small_allow_read_write_overflow15. Optimize string functions list: concat, convert_to, mask, initcap, lower, upper. concat function has 29% up:	2023-04-08 17:05:34 +08:00
ZhangYu0123	58bbd46c65	[Optimization](string) optimize constant empty string compare ( column='', column!='') (#18321 ) Optimize constant empty string compare: (1) When the constant empy string '' (size is 0), we can compare offsets in SIMD directly. q10: SELECT MobilePhoneModel, COUNT(DISTINCT UserID) AS u FROM hits WHERE MobilePhoneModel <> '' GROUP BY MobilePhoneModel ORDER BY u DESC LIMIT 10; q11: SELECT MobilePhone, MobilePhoneModel, COUNT(DISTINCT UserID) AS u FROM hits WHERE MobilePhoneModel <> '' GROUP BY MobilePhone, MobilePhoneModel ORDER BY u DESC LIMIT 10; q12: SELECT SearchPhrase, COUNT() AS c FROM hits WHERE SearchPhrase <> '' GROUP BY SearchPhrase ORDER BY c DESC LIMIT 10; q13: SELECT SearchPhrase, COUNT(DISTINCT UserID) AS u FROM hits WHERE SearchPhrase <> '' GROUP BY SearchPhrase ORDER BY u DESC LIMIT 10; q14: SELECT SearchEngineID, SearchPhrase, COUNT() AS c FROM hits WHERE SearchPhrase <> '' GROUP BY SearchEngineID, SearchPhrase ORDER BY c DESC LIMIT 10; Issue Number: close #xxx	2023-04-08 16:04:10 +08:00
ZhangYu0123	0517616242	[vectorized](function) support array_repeat function to be compatible with hive syntax (#18028 ) --------- Co-authored-by: zhangyu209 <zhangyu209@meituan.com>	2023-04-08 15:50:28 +08:00
q763562998	5aa58f5ce5	[Feature](multi-catalog) add refresh for catalog if it needs (#17884 ) Currently if a catalog is modified externally in doris, doris is not dynamically aware of it. So if a catalog is created with a refresh time configuration, I added a timer for it to refresh the catalog regularly.	2023-04-08 15:49:50 +08:00
YueW	0b8bc51b72	[fix](inverted index) Fix key column match query failed (#18436 ) * [fix](inverted index) Fix key column match query failed * [chore](regression case) add regression case * [fix] fix regression case no order by	2023-04-08 15:45:08 +08:00
Lei Zhang	25fba6b8b7	[fix](bdbje) handle `RollbackException` in `BDBJEJournal.open` (#18471 ) reference: handle bdb rollbackexception #6582 [fix](bdbje) fix handle bdb RollbackException incorrectly #17483	2023-04-08 15:31:24 +08:00
奕冷	a2858bbbfc	[fix](regression-test) fix wrong http port in regression-test case compaction-with-delete #18484	2023-04-08 15:17:52 +08:00
chenlinzhong	161678380c	[bug](GC)the issue of incorrect disk usage (#18397 )	2023-04-08 09:32:36 +08:00
GoGoWen	432fef7aa8	[Enhancement](iceberg table)support inPredicate for iceberg table (#18226 )	2023-04-08 08:59:23 +08:00
yixiutt	851793aa66	[fix](test) fix p0 regression case test_compaction_with_delete failed (#18479 )	2023-04-07 22:11:40 +08:00
zzzzzzzs	936807b5c9	[typo](doc) Fixed typos in SHOW-PROCESSLIST.md (#18469 )	2023-04-07 22:02:37 +08:00
Gabriel	d881d71cd1	[Bug](cast) Fix bug for cast function between datetimev2 and string (#18442 ) Fix bug for cast function between datetimev2 and string	2023-04-07 22:02:15 +08:00
yagagagaga	2bb025a474	[typo](docs) fix some error in this files (#18259 ) * [typo](docs) fix some error in this files * [typo](docs) fix some error in this files * update files	2023-04-07 18:01:07 +08:00
LiBinfeng	5678708b7d	[Fix](nereids) change setting byte size from translator to catalog generate scalar type (#18450 ) Problem: when used nereids to generate scalarType, byteSize would be set. After switch the optimizer to planner, planner would reuse scalarType in some cases. Fix: change byteSize setting from Plan translator to toCatalogDataType	2023-04-07 17:38:08 +08:00
amory	30f2abe5d3	[FIX](Map)fix calculate map offset in olap convertor (#18295 ) Fix be core when load bigger kv data in one row for map.	2023-04-07 17:04:08 +08:00
TengJianPing	4c7979be58	[fix](regression-test) fix failure of test case (#18454 )	2023-04-07 16:44:35 +08:00
zxealous	e3ff2e3d21	[fix](file cache) Fix be core while use block/whole/sub file cache (#18440 ) BE will core dump while use whole/sub file cache. Call func CachedRemoteFileReader/WholeFileCache/SubFileCache::read_at_impl() did not pass IOContext when reading segment footer.	2023-04-07 16:39:59 +08:00
jakevin	8e8c5b5283	[feature](Nereids): deduplicate InPredicate. (#18428 )	2023-04-07 16:22:21 +08:00
Gabriel	f6f4dac1d0	[Improvement](DECIMAL) Improve decimal operation (#18437 )	2023-04-07 15:58:28 +08:00
morrySnow	c5d9e8529a	[enhancement](Nereids) adjust runtime filter parameters when enable pipeline engine (#18427 ) when enable pipeline engine, we 1. reduce non-broadcast join bloom filter size 2. turn in or bloom filter to bloom filter if target is remote	2023-04-07 15:48:25 +08:00
谢健	63994e351f	[fix](Nereids) extract complicated time string in evaluating cost model framework (#17864 ) 1. The time string in the profile can be "xx s xx ms". The framework should extract time with re package to support more complicated time string 2. Add stats for sortNode and AggNode in `withChildren`	2023-04-07 15:33:04 +08:00
morrySnow	b0a13d019f	[fix](planner) enable fallback to legacy planner when execute internal query (#18353 )	2023-04-07 15:15:49 +08:00
Mryange	926c6056c7	[refactor](config) Delete the environment variable enable_vectorized_engine (#18166 )	2023-04-07 14:23:16 +08:00
Xinyi Zou	308ff9a16f	[enchancement](memory) tracking lru cache memory and page memory not in cache (#18361 ) Statistics lru cache memory in metrics Statistics page memory not in cache in mem tracker	2023-04-07 14:22:44 +08:00
AlexYue	ccb3541fa5	[chore](regression) print exception along with error sql when run sql file (#18374 )	2023-04-07 14:19:47 +08:00
Jerry Hu	d36e9bd523	[chore](scan) Disable low cardinality optimization for compaction (#18424 )	2023-04-07 14:19:11 +08:00
yongkang.zhong	5d876414b5	[typo](docs) add autobucket to sidebars (#18453 )	2023-04-07 14:17:36 +08:00
ElvinWei	f42db08ccc	[fix](Nereids) Fixed a problem with completing ClusterName (#18366 )	2023-04-07 13:35:03 +08:00
minghong	b1956b42fb	[enhancement](Nereids) disable heavy operator penalty in cost model v1 (#18422 )	2023-04-07 13:16:59 +08:00
starocean999	a7b708263d	[fix](nereids) move validate data types before EliminateUnnecessaryProject rule (#18393 ) validate supported data types checks if a project node's output contains any unsupported data types like array, map, etc in nereids. So this validation should run before EliminateUnnecessaryProject rule	2023-04-07 13:10:18 +08:00
HappenLee	c32adba1cf	[Refactor](Pipeline) Refactor pipeline code to improve coverage (#18376 ) Refactor pipeline code to improve coverage	2023-04-07 13:09:44 +08:00
jakevin	2783b27788	[fix](Nereids): fix LogicalProject withXXX(). (#18441 )	2023-04-07 12:38:53 +08:00
Pxl	e77da1519a	[Enchancement](materialized-view) adjust desc table all display fields (#18357 ) adjust desc table all display fields	2023-04-07 11:14:17 +08:00
Pxl	7631a8fb39	[Bug](materialized-view) fix mv define expr persistence replay incorrect after schema change (#18418 ) fix mv define expr persistence replay incorrect after schema change	2023-04-07 11:11:55 +08:00
Pxl	267b690dad	[Bug](materialized-view) fix materialized-view query match not consider with order by elements (#18384 ) fix materialized-view query match not consider with order by elements	2023-04-07 11:11:18 +08:00
airborne12	2b662ac26b	[Fix](segment iterator) fix filter block size and filter size mismatch problem (#18395 ) adding result column id to _column_filter in _output_index_result_column	2023-04-07 09:43:33 +08:00
starocean999	505f25c580	[fix](planner)use base index if the where clause is a constant value (#18367 ) sql : select bitmap_empty() from d_table where true; should always use base index instead of any mv, because the conjuncts is constant (true) and use none of the column from any mv	2023-04-07 09:15:00 +08:00
TengJianPing	4e1cdb9ce7	[fix](agg_sort)fix bug of agg sort group concat with order by(#18447 )	2023-04-07 08:42:36 +08:00
Tiewei Fang	759f1da32e	[Enhencement](Backends) add `HostName` filed in backends table and delete backends table in information_schema (#18156 ) 1. Add `HostName` field for `show backends` statement and `backends()` tvf. 2. delete the `backends` table in `information_schema` database	2023-04-07 08:30:42 +08:00
Jibing-Li	22deeecbe1	[Improvement](multi catalog)Cache File for Hive Table, instead of cache file splits. (#18419 ) Currently, the session variable for Split size will not take effect after the file splits are cached. 1. This PR is to cache file for Hive Table, instead of cache file splits. And split the file every time using the current split size. 2. Use self splitter by default.	2023-04-07 00:07:23 +08:00
superche	a059973cff	[typo](doc)Add description that external table are no longer maintained (#18425 ) Co-authored-by: hechao <hechao@selectdb.com>	2023-04-06 21:01:26 +08:00
Mingyu Chen	e848e456be	[config] modify tablet_shard to 4 and add some log (#18416 ) modify the default value of BE config tablet_map_shard_size to 4. To reduce lock contention. Add log when failed writing disk test file, for debug	2023-04-06 17:18:16 +08:00
mch_ucchi	981ead9032	[feature](Nereids) support binary arithmetic function (#18213 ) support binary arithmetic functions like: add(op1, op2) -> op1 + op2 subtract(op1, op2) -> op1 - op2 multiply(op1, op2) -> op1 * op2 divide(op1, op2) -> op1 / op2 mod(op1, op2) -> op1 % op2	2023-04-06 16:57:04 +08:00
slothever	33ae4524ce	[fix](multi-catalog) Fix properties check in S3Storage and add hive socket timeout config (#18420 ) Co-authored-by: jinzhe <jinzhe@selectdb.com>	2023-04-06 16:35:24 +08:00
AKIRA	27576ef8dc	[fix](stats) Fix analyze table failed (#18386 )	2023-04-06 15:45:09 +08:00
amory	82248ab392	[FIX](complex-type) get_default to return real nested default value (#18413 ) make real default value to return with nested type in complex type	2023-04-06 15:24:32 +08:00
YueW	591f76a6a4	[fix](alter inverted index) Temporary deal with add or drop inverted index by directly schema change (#18378 ) In the current implementation of the function of dynamically add and drop inverted index, there is a problem that the inverted index information of historical data is out of date after compaction on the base tablet. In the future, I will submit PRs to solve this problem. Now, temporarily add or drop inverted index by the directly schema change logic	2023-04-06 15:07:37 +08:00
Gabriel	550c8aa648	[Bug](DECIMALV3) fix wrong decimal scale returned by function `round` (#18375 )	2023-04-06 14:44:21 +08:00

1 2 3 4 5 ...

9779 Commits