doris

Author	SHA1	Message	Date
Pxl	8e4c0d1e81	[Bug](materialized-view) fix divide double can not match mv (#23504 ) * fix divide double can not match mv * fix * fix	2023-08-28 18:01:08 +08:00
AKIRA	10792ca0f7	[fix](nereids) Mistaken stats when analyzing table incrementally and partition number less than 512 #23507 Fix bug that mistaken stats when analyzing table incrementally and partition number less than 512 Fix bug that cron expression lost during analyzing Mark system job as running after registered to AnalysisManager to avoid submit same jobs if previous one take long time	2023-08-28 17:31:36 +08:00
starocean999	1cc6474487	[fix](planner)fix bug of pushing conjunct through agg node (#23483 )	2023-08-28 17:10:13 +08:00
bobhan1	2dff89a77a	[Log](Alter) Print table's state when `checkNormalStateForAlter()` failed (#23358 )	2023-08-28 17:04:01 +08:00
Qi Chen	6ac694aede	[Configuration](multi-catalog) Modify default external cache expire time to 10 mins. (#23490 ) Configuration Modify default external cache expire time to 10 mins.	2023-08-28 16:16:43 +08:00
Jack Drogon	f70638e895	[Fix](autobucket) Fix autobucket partition size by using getAllDataSize including cooldown size (#23557 )	2023-08-28 15:24:48 +08:00
Pxl	6e82178847	[Bug](materialized-view) fix loaddb analyze failed on MaterializedIndexMeta (#23442 ) * fix loaddb analyze failed on MaterializedIndexMeta * update * update	2023-08-28 15:18:18 +08:00
xy720	4c8fc06e40	[Feature](fe) Add admin set partition version statement (#23086 ) This commit add a statement to modify partition visible version.	2023-08-28 14:31:54 +08:00
谢健	f7d2c1faf6	[feature](Nereids) support select key encryptKey (#23257 ) Add select key ``` - CREATE ENCRYPTKEY key_name AS "key_string" - select key my_key +-----------------------------+ \| encryptKeyRef('', 'my_key') \| +-----------------------------+ \| ABCD123456789 \| +-----------------------------+ ```	2023-08-28 14:07:26 +08:00
Calvin Kirs	ef2fc44e5c	[Improve](Job)Allows modify the configuration of the Job queue and the number of consumer threads (#23547 )	2023-08-28 12:01:49 +08:00
morrySnow	e84989fb6d	[feature](Nereids) support map type (#23493 )	2023-08-28 11:31:44 +08:00
morrySnow	b181a9f099	[feature](Nereids) support array type in fold constant framework (#23373 ) 1. use legacy planner way to process constant folding result from be 2. support signature with complex type for constant folding on fe	2023-08-28 10:47:43 +08:00
zy-kkk	d19dcd6bc1	[improve](jdbc catalog) support sqlserver uniqueidentifier data type (#23297 )	2023-08-28 10:30:10 +08:00
xy720	eadffedb33	[Feature](fe) Add admin set table status statement (#23139 ) For some certain bugs, jobs is stuck in FE by the table state. For example, There is a bug which causes table remains ROLLUP state after adding rollup job, then other alter jobs later will not succeed because the table state is always ROLLUP but not NORMAL. This commit adds a statement which is used to set the state of the specified table.	2023-08-28 10:22:09 +08:00
jakevin	92bdf75836	[fix](Nereids): LogicalRepeat equals lack @Override (#23408 )	2023-08-28 10:07:39 +08:00
Chenyang Sun	153e8f0f72	[imporvement](table property) support for alter table property: skip wirte index , single compaction (#23475 )	2023-08-26 23:52:09 +08:00
caiconghui	93918253ba	[fix](metric) fix issue that the counter of rejected transactions does not cover some failed situations for load (#23363 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2023-08-26 20:06:42 +08:00
wangbo	30658ebeda	[Fix](planner) Fix query queue can not limit maxConcurrency #23418 2 Fix concurrent can not limit	2023-08-26 17:31:44 +08:00
Mingyu Chen	40be6a0b05	[fix](hive) do not split compress data file and support lz4/snappy block codec (#23245 ) 1. do not split compress data file Some data file in hive is compressed with gzip, deflate, etc. These kinds of file can not be splitted. 2. Support lz4 block codec for hive scan node, use lz4 block codec instead of lz4 frame codec 4. Support snappy block codec For hadoop snappy 5. Optimize the `count()` query of csv file For query like `select count() from tbl`, only need to split the line, no need to split the column. Need to pick to branch-2.0 after this PR: #22304	2023-08-26 12:59:05 +08:00
Mingyu Chen	36b7fcf055	[tmp](hive) support hive partition 00 (#23224 ) in some case, a hive table with int partition column may has following partition value: hour=00, hour=01 we need to support this.	2023-08-26 12:58:31 +08:00
zhangdong	db8d18eb40	[Enhance](auth)row policy support role (#23022 ) ``` CREATE ROW POLICY test_row_policy_1 ON test.table1 AS {RESTRICTIVE\|PERMISSIVE} [TO user] [TO ROLE role] USING (id in (1, 2)); // add `to role` DROP [ROW] POLICY [IF EXISTS] test_row_policy;//delete `for user` and `on table` SHOW ROW POLICY [FOR user][FOR ROLE role] // add `for role` ```	2023-08-26 10:24:59 +08:00
slothever	f66f161017	[fix](multi-catalog)fix hive table with cosn location issue (#23409 ) Sometimes, the partitions of a hive table may on different storage, eg, some is on HDFS, others on object storage(cos, etc). This PR mainly changes: 1. Fix the bug of accessing files via cosn. 2. Add a new field `fs_name` in TFileRangeDesc This is because, when accessing a file, the BE will get a hdfs client from hdfs client cache, and different file in one query request may have different fs name, eg, some of are `hdfs://`, some of are `cosn://`, so we need to specify fs name for each file, otherwise, it may return error: `reason: IllegalArgumentException: Wrong FS: cosn://doris-build-1308700295/xxxx, expected: hdfs://[172.xxxx:4007](http://172.xxxxx:4007/)`	2023-08-26 00:16:00 +08:00
Kaijie Chen	2b6d876280	[feature](move-memtable)[6/7] add options to enable memtable on sink node (#23470 ) Co-authored-by: Siyang Tang <82279870+TangSiyang2001@users.noreply.github.com>	2023-08-25 22:32:22 +08:00
Calvin Kirs	da21b1cb24	[Feature](Job)Allow Job to perform all insert operations, and limit permissions to allow Admin operations (#23492 )	2023-08-25 21:58:53 +08:00
AKIRA	006c88827f	[fix](stats) Fix auto analyze (#20426 ) We only reanalyze those partition that lastVisibleTime is later than job's updatetime, so we shouldn't set this field when creat e system jobs	2023-08-25 21:30:59 +08:00
wuwenchi	e3db0fddc1	[fix](iceberg) fix iceberg count(*) short circuit read bug (#23402 )	2023-08-25 21:30:30 +08:00
xzj7019	468dfc97db	[fix](meta) set broadcast_right_table_scale_factor when upgrading from 1.2 to 2.x (#23423 ) When upgrading from 1.2 to 2.x(future version higher than 2.0), the default value of parameter broadcast_right_table_scale_factor may not be upgraded from old default value 10.0 to new default 0.0, which will cause the broadcast join behavior unexpected and may have a big performance impact. This pr will force to reset the value to new default value 0.0, to make sure the behavior correct.	2023-08-25 21:26:19 +08:00
Jibing-Li	00826185c1	[fix](tvf view)Support Table valued function view for nereids (#23317 ) Nereids doesn't support view based table value function, because tvf view doesn't contain the proper qualifier (catalog, db and table name). This pr is to support this function. Also, fix nereids table value function explain output exprs incorrect bug.	2023-08-25 21:23:16 +08:00
Jibing-Li	8be0202b94	[improvement](old planner)Prune extra slots with old planner for sql like select count(1) from view (#23393 ) The sql like Select count(1) from view would contain all the columns in old planner's execution plan, which is slow, because BE need to read all the column in data files. This pr is to improve the plan to only contain one column.	2023-08-25 21:22:03 +08:00
TengJianPing	1312c12236	Revert "[fix](testcase) fix test case failure of insert null value into not null column (#20963 )" (#23462 ) * Revert "[fix](testcase) fix test case failure of insert null value into not null column (#20963)" This reverts commit 55a6649da962fb170ddb40fea8ef26bdc552a51a. Mannual Revert "fix in strict mode, return error for insert if datatype convert fails (#20378)" This mannual reverts commit 1b94b6368f5e871c9a0fe53dd7c64409079a4c9d * fix case failure	2023-08-25 16:47:14 +08:00
minghong	6d4f06689f	[fix](Nereids) avoid Stats NaN (#23445 ) tpcds 61 plan changed: improved from 1.75 sec to 1.67 sec	2023-08-25 16:27:34 +08:00
谢健	0ccb7262a7	[feature](Nereids) add password func (#23244 ) add password function ``` select password("123"); +-------------------------------------------+ \| password('123') \| +-------------------------------------------+ \| *23AE809DDACAF96AF0FD78ED04B6A265E05AA257 \| +-------------------------------------------+ ```	2023-08-25 14:04:49 +08:00
morrySnow	ba931d9eed	[fix](Nereids) infer predicates generate wrong result (#23456 ) We use two facilities to do predicate infer: PredicatePropagation and PullUpPredicates. In the prvious implementation, we use a set to save the intermediate result of PredicatePropagation. The purpose is infer new predicate though two equal relation. However, it is the wrong way. Because it could infer wrong predicate through outer join. For example ```sql select a.c1 from a left join b on a.c2 = b.c2 and a.c1 = '1' left join c on a.c2 = c.c2 and a.c1 = '2' inner join d on a.c3=d.c3 ``` the predicates `a.c1 = '1'` and `a.c1 = '2'` should not be inferred as filter to relation `a`. This PR: 1. revert the change from PR #22145, commit 3c58e9ba 2. Remove the unreasonable restrict in PullupPredicate. 3. Use new Filter node rather than new otherCondition on join node to save infer predicates	2023-08-25 11:59:28 +08:00
Kang	8ef6b4d996	[fix](json) fix json int128 overflow (#22917 ) * support int128 in jsonb * fix jsonb int128 write * fix jsonb to json int128 * fix json functions for int128 * add nereids function jsonb_extract_largeint * add testcase for json int128 * change docs for json int128 * add nereids function jsonb_extract_largeint * clang format * fix check style * using int128_t = __int128_t for all int128 * use fmt::format_to instead of snprintf digit by digit for int128 * clang format * delete useless check * add warn log * clang format	2023-08-25 11:40:30 +08:00
morrySnow	372f83df5c	[opt](Nereids) remove between expression to simplify planner (#23421 )	2023-08-25 11:28:12 +08:00
starocean999	37b90021b7	[fix](planner)literal expr should do nothing in substituteImpl() method (#23438 ) substitute a literal expr is pointless and wrong. This pr keep literal expr unchanged during substitute process	2023-08-25 11:21:35 +08:00
Tiewei Fang	18094511e7	[fix](Outfile/Nereids) fix that `csv_with_names` and `csv_with_names_and_types` file format could not be exported on nereids (#23387 ) This problem is casued by #21197 Fixed an issue that `csv_with_names` and `csv_with_names_and_types` file format could not be exported on nereids optimizer when using `select...into outfile`.	2023-08-25 11:12:04 +08:00
morrySnow	6614c219cb	[opt](Nereids) use NUMERIC_PRECEDENCE in int div (#23403 )	2023-08-25 11:03:50 +08:00
starocean999	69e75f04ab	[fix](feut) should not enable InternalSchemaDb in fe ut (#23400 )	2023-08-25 11:03:37 +08:00
morrySnow	3786ffec51	[opt](Nereids) add some array functions (#23324 ) 1. rename TVFProperties to Properties 2. add generating function explode and explode_outer 3. fix concat_ws could not apply on array 4. check tokenize second argument format on FE 5. add test case for concat_ws, tokenize, explode, explode_outer and split_by_string	2023-08-25 11:01:50 +08:00
TengJianPing	d30bb8042e	[fuzzy](hash join) disable fuzzy enable_hash_join_early_start_probe (#23413 )	2023-08-25 10:11:20 +08:00
zclllyybb	7cfb3cc0aa	[fix](functions) fix function substitute for datetimeV1/V2 (#23344 ) * fix * function fe	2023-08-25 09:59:38 +08:00
Siyang Tang	441a9fff6d	[fix](planner) fix now function param type error (#23446 )	2023-08-25 00:12:21 +08:00
zhangdong	6a4976921d	[fix](auth)Disable column auth temporarily (#23295 ) - add config `enable_col_auth` to temporarily disable column permissions(because old/new planner has bug when select from view) - Restore the old optimizer to the previous authentication method - Support for new optimizer authentication（Legacy issue: When querying the view, the permissions of the base table will be authenticated. The view's own permissions should be authenticated and processed after the new optimizer is improved） - fix: show grants for non-existent users - fix: role:`admin` can not grant/revoke to/from user	2023-08-24 23:37:06 +08:00
Jibing-Li	966561a6ed	[improvement and fix](statistics)Load the cache for external table row count while init table (#23170 ) 1. Load the cache for external table row count while init table, this could avoid no row number stats for the very first time to run an sql. 2. Show cardinality for an external scan node when explain the sql. 3. fix bugs introduced by https://github.com/apache/doris/pull/22963	2023-08-24 23:34:16 +08:00
Tiewei Fang	f6c5c8f7b5	[Fix](Nereids) fix that `select...from tablets()` are invalidated when there exists predicates (#23365 ) Problem: `select...from tablets()` are invalidated when there exists predicates, such as: ```sql // The all data is: mysql> select * from student3; +------+------+------+ \| id \| name \| age \| +------+------+------+ \| 1 \| ftw \| 18 \| \| 3 \| yy \| 19 \| \| 4 \| xx \| 21 \| \| 2 \| cyx \| 20 \| +------+------+------+ // when we specified tablet to read: mysql> select * from student3 tablet(131131); +------+------+------+ \| id \| name \| age \| +------+------+------+ \| 1 \| ftw \| 18 \| \| 3 \| yy \| 19 \| +------+------+------+ // Howerver, when there exists predicates, the `tablet(131131)` is invalidated mysql> select * from student3 tablet(131131) where id > 1; +------+------+------+ \| id \| name \| age \| +------+------+------+ \| 4 \| xx \| 21 \| \| 3 \| yy \| 19 \| \| 2 \| cyx \| 20 \| +------+------+------+ ``` After the fix, we get promising data ```sql mysql> select * from student3 tablet(131131) where id > 1; +------+------+------+ \| id \| name \| age \| +------+------+------+ \| 3 \| yy \| 19 \| +------+------+------+ ```	2023-08-24 23:29:59 +08:00
starocean999	320eda78e6	[fix](nereids) remove useless cast in in-predicate (#23171 ) consider sql "select * from test_simplify_in_predicate_t where a in ('1992-01-31', '1992-02-01', '1992-02-02', '1992-02-03', '1992-02-04');" before: ``` \| 0:VOlapScanNode \| \| TABLE: default_cluster:bugfix.test_simplify_in_predicate_t(test_simplify_in_predicate_t), PREAGGREGATION: OFF. Reason: No aggregate on scan. \| \| PREDICATES: CAST(a[#0] AS DATETIMEV2(0)) IN ('1992-01-31 00:00:00', '1992-02-01 00:00:00', '1992-02-02 00:00:00', '1992-02-03 00:00:00', '1992-02-04 00:00:00') AND __DORIS_DELETE_SIGN__[#1] = 0 \| \| partitions=0/1, tablets=0/0, tabletList= \| \| cardinality=1, avgRowSize=0.0, numNodes=1 \| \| pushAggOp=NONE \| \| projections: a[#0] \| \| project output tuple id: 1 \| \| tuple ids: 0 ``` after: ``` \| 0:VOlapScanNode \| \| TABLE: default_cluster:bugfix.test_simplify_in_predicate_t(test_simplify_in_predicate_t), PREAGGREGATION: OFF. Reason: No aggregate on scan. \| \| PREDICATES: a[#0] IN ('1992-01-31', '1992-02-01', '1992-02-02', '1992-02-03', '1992-02-04') AND __DORIS_DELETE_SIGN__[#1] = 0 \| \| partitions=0/1, tablets=0/0, tabletList= \| \| cardinality=1, avgRowSize=0.0, numNodes=1 \| \| pushAggOp=NONE \| \| projections: a[#0] \| \| project output tuple id: 1 \| \| tuple ids: 0 ```	2023-08-24 18:14:43 +08:00
amory	6c5072ffc5	[FIX](array-func) fix array index func with decimal (#23399 ) fix array index func with decimal in old analyzer when sql with array_position or array_contains with decimal , may loss precision to which will make result wrong	2023-08-24 17:58:20 +08:00
morrySnow	044423f902	[fix](Nereids) use Stopwatch to do timeout checker (#23383 ) 1. avoid thread leak if exception thrown in planning 2. avoid memory release delays since the timer task hold CascadesContext object	2023-08-24 09:49:35 +08:00
yujun	f0bc2c2eff	[fix](tablet clone) fix partition rebalance apply move exception (#23222 )	2023-08-23 21:50:50 +08:00

1 2 3 4 5 ...

5685 Commits