Commit Graph

13073 Commits

Author SHA1 Message Date
36b7fcf055 [tmp](hive) support hive partition 00 (#23224)
In some cases, a Hive table with an int partition column may have partition values such as:
hour=00, hour=01
We need to support this.
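A minimal sketch of the scenario, assuming a hypothetical Hive table `hive_catalog.db1.events` partitioned by an int column `hour`:
```sql
-- Partition directories are zero-padded (hour=00, hour=01, ...),
-- but the column type is int, so the value "00" must resolve to 0.
SELECT count(*)
FROM hive_catalog.db1.events
WHERE `hour` = 0; -- should match the hour=00 partition
```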
2023-08-26 12:58:31 +08:00
bc020112fc [enhancement](routineload) add debug conf and set broker.name.ttl = 0 (#23302)
* set broker.name.ttl = 0

* add debug config for librdkafka
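A hedged sketch of passing a librdkafka parameter through a routine load job, assuming custom Kafka client settings can be forwarded via the `property.` prefix; the job, table, broker, and topic names are assumptions:
```sql
CREATE ROUTINE LOAD db1.example_job ON table1
PROPERTIES ("format" = "json")
FROM KAFKA (
    "kafka_broker_list" = "broker1:9092",
    "kafka_topic" = "topic1",
    -- hypothetical pass-through of the librdkafka setting above
    "property.broker.name.ttl" = "0"
);
```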
2023-08-26 10:56:35 +08:00
db8d18eb40 [Enhance](auth)row policy support role (#23022)
```
CREATE ROW POLICY test_row_policy_1 ON test.table1 
AS {RESTRICTIVE|PERMISSIVE} [TO  user] [TO ROLE role] USING (id in (1, 2)); // add `to role`

DROP [ROW] POLICY [IF EXISTS] test_row_policy;//delete `for user` and `on table`

SHOW ROW POLICY [FOR user][FOR ROLE role] // add `for role`
```
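A concrete usage of the new `TO ROLE` clause following the syntax above; the role name `role1` is an assumption:
```sql
CREATE ROW POLICY test_row_policy_1 ON test.table1
AS RESTRICTIVE TO ROLE role1 USING (id in (1, 2));
```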
2023-08-26 10:24:59 +08:00
f32efe5758 [Fix](Outfile) Fix that it does not report error when export table to S3 with an incorrect ak/sk/bucket (#23441)
Problem:
It will return a result even though a wrong ak/sk/bucket name is used, such as:
```sql
mysql> select * from demo.student
    -> into outfile "s3://xxxx/exp_"
    -> format as csv
    -> properties(
    ->   "s3.endpoint" = "https://cos.ap-beijing.myqcloud.com",
    ->   "s3.region" = "ap-beijing",
    ->   "s3.access_key"= "xxx",
    ->   "s3.secret_key" = "yyyy"
    -> );
+------------+-----------+----------+----------------------------------------------------------------------------------------------------+
| FileNumber | TotalRows | FileSize | URL                                                                                                |
+------------+-----------+----------+----------------------------------------------------------------------------------------------------+
|          1 |         3 |       26 | s3://xxxx/exp_2ae166e2981d4c08-b577290f93aa82ba_ |
+------------+-----------+----------+----------------------------------------------------------------------------------------------------+
1 row in set (0.15 sec)
```

The reason for this is that we did not catch the error returned in the `close()` phase.
2023-08-26 00:19:30 +08:00
f66f161017 [fix](multi-catalog)fix hive table with cosn location issue (#23409)
Sometimes, the partitions of a Hive table may be on different storage; e.g., some are on HDFS, others on object storage (COS, etc.).
This PR mainly changes:

1. Fix the bug of accessing files via cosn.
2. Add a new field `fs_name` in TFileRangeDesc
    This is because, when accessing a file, the BE gets an HDFS client from the HDFS client cache, and different files in one query
request may have different fs names, e.g., some are `hdfs://` and some are `cosn://`, so we need to specify the fs name
for each file; otherwise it may return an error:

`reason: IllegalArgumentException: Wrong FS: cosn://doris-build-1308700295/xxxx, expected: hdfs://172.xxxx:4007`
2023-08-26 00:16:00 +08:00
8af1e7f27f [Fix](orc-reader) Fix incorrect result if null partition fields in orc file. (#23369)
Fix incorrect results when there are null partition fields in the orc file.

### Root Cause
Theoretically, the underlying files of a Hive partitioned table should not contain the partition fields. But we found that in some user scenarios, the partition fields do exist in the underlying orc/parquet files, with all-null values. As a result, a filter pushed down on a partition field sees those null values and filters incorrectly.

### Solution
We handle this case by reading only the non-partition fields. The parquet reader already handles it this way; this PR applies the same handling to the orc reader.
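A hedged repro sketch, assuming a hypothetical Hive table `t` partitioned by `p` whose ORC files also contain `p` as an all-NULL column:
```sql
-- The filter must use the partition value from the file path,
-- not the all-NULL p column stored inside the ORC file itself.
SELECT count(*)
FROM hive_catalog.db1.t
WHERE p = '2023-08-01';
```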
2023-08-26 00:13:11 +08:00
a3a951c71d [Fix](multi-catalog) Fix load string dict issue for transactional hive tables. (#23306)
Fix load string dict issue for transactional hive tables. The column name needs to be passed as 'row.column_name'.

apache/doris-thirdparty#112
2023-08-26 00:09:12 +08:00
2b6d876280 [feature](move-memtable)[6/7] add options to enable memtable on sink node (#23470)
Co-authored-by: Siyang Tang <82279870+TangSiyang2001@users.noreply.github.com>
2023-08-25 22:32:22 +08:00
da21b1cb24 [Feature](Job)Allow Job to perform all insert operations, and limit permissions to allow Admin operations (#23492) 2023-08-25 21:58:53 +08:00
6e6da733c6 [fix](invert index) fix the keyword type index length limit (#23503) 2023-08-25 21:34:11 +08:00
006c88827f [fix](stats) Fix auto analyze (#20426)
We only reanalyze those partitions whose lastVisibleTime is later than the job's update time, so we shouldn't set this field when creating system jobs.
2023-08-25 21:30:59 +08:00
e3db0fddc1 [fix](iceberg) fix iceberg count(*) short circuit read bug (#23402) 2023-08-25 21:30:30 +08:00
468dfc97db [fix](meta) set broadcast_right_table_scale_factor when upgrading from 1.2 to 2.x (#23423)
When upgrading from 1.2 to 2.x (a future version higher than 2.0), the default value of the parameter broadcast_right_table_scale_factor may not be upgraded from the old default 10.0 to the new default 0.0, which makes the broadcast join behave unexpectedly and may have a big performance impact. This PR forces the value to be reset to the new default 0.0, to make sure the behavior is correct.
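A hedged sketch of checking and resetting the value manually after an upgrade, assuming `broadcast_right_table_scale_factor` is exposed as a global session variable:
```sql
SHOW VARIABLES LIKE 'broadcast_right_table_scale_factor';
-- Reset to the new default if the old default 10.0 survived the upgrade.
SET GLOBAL broadcast_right_table_scale_factor = 0.0;
```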
2023-08-25 21:26:19 +08:00
17e7c1ca53 [fix](fqdn)Fqdn with ipv6 (#22454)
Currently, `hostname_to_ip` can only resolve IPv4 addresses; therefore, a method is provided to resolve IPv4 or IPv6 based on a parameter.
When `_heartbeat` calls `hostname_to_ip`, whether it resolves to IPv4 or IPv6 is determined by `BackendOptions.is_bind_ipv6`.
Additionally, a method is provided that first attempts to resolve the host to IPv4, and then tries IPv6 if that fails.
2023-08-25 21:24:55 +08:00
00826185c1 [fix](tvf view)Support Table valued function view for nereids (#23317)
Nereids doesn't support views based on table-valued functions, because a tvf view doesn't contain the proper qualifier (catalog, db and table name). This PR adds support for this.

Also fixes a bug where the explain output exprs of Nereids table-valued functions were incorrect.
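A hedged sketch of the now-supported pattern, using the `numbers` table-valued function; the view name is an assumption:
```sql
CREATE VIEW v_numbers AS
SELECT * FROM numbers("number" = "10");

-- Previously failed under Nereids because the tvf view lacked
-- the proper qualifier (catalog, db and table name).
SELECT count(*) FROM v_numbers;
```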
2023-08-25 21:23:16 +08:00
8be0202b94 [improvement](old planner)Prune extra slots with old planner for sql like select count(1) from view (#23393)
A SQL statement like
`SELECT count(1) FROM view`
would contain all the view's columns in the old planner's execution plan, which is slow because the BE needs to read all the columns from the data files. This PR improves the plan to contain only one column.
2023-08-25 21:22:03 +08:00
29273771f7 [Fix](multi-catalog) Fix hive incorrect result by disable string dict filter if exprs contain null expr. (#23361)
Issue Number: close #21960

Fix Hive incorrect results by disabling the string dict filter if exprs contain a null expr.
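A hedged sketch of a predicate whose expr list contains a null expr; the table and column names are assumptions:
```sql
-- With a NULL in the IN list, the string dict filter is now disabled
-- and the predicate is evaluated on the decoded string values instead.
SELECT count(*)
FROM hive_catalog.db1.t
WHERE s IN ('a', NULL);
```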
2023-08-25 21:16:43 +08:00
9d1c702b3a [improvement](function) do not use hyperscan for non-const patterns in like function (#23495) 2023-08-25 20:40:23 +08:00
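A hedged sketch of the non-const pattern case named in the commit above; the table and column names are assumptions:
```sql
-- The pattern differs per row, so a hyperscan database cannot be
-- compiled once and reused; a per-row matcher is used instead.
SELECT count(*)
FROM logs l, rules r
WHERE l.message LIKE r.pattern;
```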
2b7d60eb4d [fix](docs) fix stream load docs (#23472) 2023-08-25 19:28:40 +08:00
e955c32a89 update external pipeline to required (#23494) 2023-08-25 19:18:34 +08:00
e1367d509f [Fix](Full compaction) Fix full compaction by table id regression test #23496 2023-08-25 18:07:06 +08:00
04e520b5ec [fix](case) enable cases (#23491) 2023-08-25 17:59:13 +08:00
49a32c2ee0 [pipelineX](fix) fix two phase execution and add test cases (#23353) 2023-08-25 17:57:35 +08:00
f80b067990 [fix](column) add unimplemented function of ColumnFixedLengthObject (#23468) 2023-08-25 17:38:01 +08:00
1312c12236 Revert "[fix](testcase) fix test case failure of insert null value into not null column (#20963)" (#23462)
* Revert "[fix](testcase) fix test case failure of insert null value into not null column (#20963)"

This reverts commit 55a6649da962fb170ddb40fea8ef26bdc552a51a.

Manual Revert "fix in strict mode, return error for insert if datatype convert fails (#20378)"

This manually reverts commit 1b94b6368f5e871c9a0fe53dd7c64409079a4c9d

* fix case failure
2023-08-25 16:47:14 +08:00
6d4f06689f [fix](Nereids) avoid Stats NaN (#23445)
tpcds 61 plan changed:
improved from 1.75 sec to 1.67 sec
2023-08-25 16:27:34 +08:00
2f8d1765d4 [Regression](external)disable regression test case: test_hdfs_tvf (#23394) 2023-08-25 15:51:36 +08:00
3692c10654 [doc](flink-connector) add pg and sqlserver database sync (#23427) 2023-08-25 15:30:59 +08:00
ab991e142c Update check-pr-if-need-run-build.sh (#23401) 2023-08-25 14:45:38 +08:00
13a3b8c9b5 forbid test_full_compaction (#23480) 2023-08-25 14:19:58 +08:00
5c37be16fe [pipelineX](correctness) Fix close problem for local state (#23479) 2023-08-25 14:19:27 +08:00
b96b8f4370 [Bug](jdbc) support get_default on complex type (#23325)
support get_default on complex type
2023-08-25 14:08:24 +08:00
0ccb7262a7 [feature](Nereids) add password func (#23244)
add password function
```
select password("123");
+-------------------------------------------+
| password('123')                           |
+-------------------------------------------+
| *23AE809DDACAF96AF0FD78ED04B6A265E05AA257 |
+-------------------------------------------+
```
2023-08-25 14:04:49 +08:00
d8e499cb55 [fix](UT) fix flaky test in LoadStreamMgrTest (#23459) 2023-08-25 13:53:20 +08:00
59acf61ec5 [pipelineX](pick) pick 2 PR from pipeline engine (#23463) 2023-08-25 13:26:05 +08:00
ba931d9eed [fix](Nereids) infer predicates generate wrong result (#23456)
We use two facilities to do predicate inference: PredicatePropagation and
PullUpPredicates. In the previous implementation, we used a set to save
the intermediate result of PredicatePropagation, with the purpose of inferring
new predicates through two equality relations. However, that is the wrong way,
because it can infer wrong predicates through an outer join. For example:
```sql
select a.c1
   from a
   left join b on a.c2 = b.c2 and a.c1 = '1'
   left join c on a.c2 = c.c2 and a.c1 = '2'
   inner join d on a.c3=d.c3
```

the predicates `a.c1 = '1'` and `a.c1 = '2'` should not be inferred as
filters on relation `a`.

This PR:
1. reverts the change from PR #22145, commit 3c58e9ba
2. removes the unreasonable restriction in PullupPredicate
3. uses a new Filter node rather than a new otherCondition on the join node to
   save the inferred predicates
2023-08-25 11:59:28 +08:00
84792d0886 fix compile of master (#23467) 2023-08-25 11:47:39 +08:00
8ef6b4d996 [fix](json) fix json int128 overflow (#22917)
* support int128 in jsonb

* fix jsonb int128 write

* fix jsonb to json int128

* fix json functions for int128

* add nereids function jsonb_extract_largeint

* add testcase for json int128

* change docs for json int128

* add nereids function jsonb_extract_largeint

* clang format

* fix check style

* using int128_t = __int128_t for all int128

* use fmt::format_to instead of snprintf digit by digit for int128

* clang format

* delete useless check

* add warn log

* clang format
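A hedged usage sketch of the `jsonb_extract_largeint` function added above; the JSON payload (the maximum int128 value) is an assumption:
```sql
SELECT jsonb_extract_largeint(
    '{"v": 170141183460469231731687303715884105727}', '$.v');
```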
2023-08-25 11:40:30 +08:00
372f83df5c [opt](Nereids) remove between expression to simplify planner (#23421) 2023-08-25 11:28:12 +08:00
37b90021b7 [fix](planner)literal expr should do nothing in substituteImpl() method (#23438)
Substituting a literal expr is pointless and wrong. This PR keeps literal exprs unchanged during the substitution process.
2023-08-25 11:21:35 +08:00
18094511e7 [fix](Outfile/Nereids) fix that csv_with_names and csv_with_names_and_types file format could not be exported on nereids (#23387)
This problem was caused by #21197.

Fixed an issue where the `csv_with_names` and `csv_with_names_and_types` file formats could not be exported by the Nereids optimizer when using `select...into outfile`.
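A hedged sketch of an export hitting the fixed path; the table, bucket, and credentials are assumptions, mirroring the outfile example earlier in this log:
```sql
SELECT * FROM db1.t1
INTO OUTFILE "s3://bucket/exp_"
FORMAT AS csv_with_names
PROPERTIES (
    "s3.endpoint" = "https://cos.ap-beijing.myqcloud.com",
    "s3.region" = "ap-beijing",
    "s3.access_key" = "xxx",
    "s3.secret_key" = "yyy"
);
```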
2023-08-25 11:12:04 +08:00
ffadf09eec [fix](catalog)add custom jar (#23406)
- allow putting custom jars, such as `paimon-s3-0.4.0-incubating.jar`, in `${DORIS_HOME}/lib/java_extensions/custom_extension`
- add some notes for paimon and fqdn
2023-08-25 11:10:53 +08:00
6614c219cb [opt](Nereids) use NUMERIC_PRECEDENCE in int div (#23403) 2023-08-25 11:03:50 +08:00
69e75f04ab [fix](feut) should not enable InternalSchemaDb in fe ut (#23400) 2023-08-25 11:03:37 +08:00
3786ffec51 [opt](Nereids) add some array functions (#23324)
1. rename TVFProperties to Properties
2. add generating function explode and explode_outer
3. fix that concat_ws could not be applied to arrays
4. check the format of tokenize's second argument on the FE
5. add test case for concat_ws, tokenize, explode, explode_outer and split_by_string
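A hedged sketch of the new generating functions used through LATERAL VIEW; the table and column names are assumptions:
```sql
-- explode_outer keeps rows whose array is NULL or empty (emitting a
-- NULL element), while explode drops such rows.
SELECT id, e
FROM t1
LATERAL VIEW explode_outer(arr_col) tmp AS e;
```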
2023-08-25 11:01:50 +08:00
d331bfc513 [Performance](pipeline) support shared scan segment in mow (#23305) 2023-08-25 10:43:02 +08:00
d9db3f5431 [Improvement](scan) Remove redundant predicates on scan node (#23374)
* Remove redundant predicates on scan node

* update

* fix
2023-08-25 10:41:37 +08:00
a305f2ffc2 [fix](pipeline) update status when prepare failed #23419 2023-08-25 10:34:37 +08:00
0a70cbfe99 [feature](move-memtable)[5/7] add olap table sink v2 and writers (#23458)
Co-authored-by: laihui <1353307710@qq.com>
2023-08-25 10:20:06 +08:00
2847c5e5b8 [Optimize](index) Optimize implement the new internal lucene index query interface (#23389) 2023-08-25 10:14:02 +08:00