doris

Author	SHA1	Message	Date
zy-kkk	a20a6d2bea	[refactor](jdbc catalog) Refactor the JdbcClient code (#20109 ) This PR does the following: 1. This PR is a substantial refactor of the JDBC client architecture. The previous monolithic JDBC client has been refactored into an abstract base class `JdbcClient`, and a set of database-specific subclasses (e.g., `JdbcMySQLClient`, `JdbcOracleClient`, etc.), and the JdbcClient required config, abstract into an object. This allows for improved modularity, easier addition of support for new databases, and cleaner, more maintainable code. This change is backward-compatible and does not affect existing functionality. 2. As a result of client refactoring, OceanBaseClient can automatically recognize the mode of operation as MySQL or Oracle, so we cancel the oceanbase_mode property in the Jdbc Catalog, but due to the cancellation of the property, When creating a single OceanBase Jdbc Table, the table type needs to be filled in as oceanbase(mysql mode) or oceanbase_oracle(oracle_mode). The above work is a change in the usage behavior, please note. 3. For the PostgreSQL Jdbc Catalog, I did two things: 1. The adaptation to MATERIALIZED VIEW and FOREIGN TABLE is added 2. Fixed reading jsonb, which had been incorrectly changed to json in a previous PR 4. fix some jdbc catalog test case 5. modify oceanbase jdbc doc And,Thanks @wolfboys for the guidance	2023-06-02 17:58:10 +08:00
zy-kkk	c2121c831a	[typo](docs) Update the `help create` command display (#20357 )	2023-06-02 17:57:23 +08:00
yongjinhou	4395fb70c4	[Enhancement](tvf) Backends tvf supports authentication (#20333 ) Add authentication for backends tvf.	2023-06-02 17:53:44 +08:00
zy-kkk	9c9f5fec0f	[chore](function) Refactor FunctionSet Initialization for Better Maintainability and Compilation Success (#20285 ) In this PR, I have refactored the initialization of the FunctionSet. Previously, all the functions were in one large method which led to the generation of Java code that was too long. This posed a problem for the compiler, as the length of the method exceeded the limit imposed by the Java compiler. To resolve this issue and improve the readability and manageability of our code, I have categorized these functions by type, and created dedicated initialization methods for each type. As such, our code is now not only more readable and understandable, but also each method is of a length that is acceptable to the compiler and can be compiled successfully. Moreover, this change makes it easier for us to add new functions as we can directly locate the right category and add new functions there. This is a significant change aimed at enhancing the maintainability and scalability of our code, while ensuring that our code can be successfully compiled.	2023-06-02 17:50:47 +08:00
Adonis Ling	fb730fb653	[chore](third-party) Bump the version of hadoop_libs (#20369 ) Bump the version of hadoop_libs to build HDFS related libraries only.	2023-06-02 17:18:36 +08:00
minghong	386a4a0b43	[fix](nereids) add fragment id on all PhysicalRelation (#20371 ) fix "cannot find fragment id for scan" exception	2023-06-02 17:13:09 +08:00
ZhangYu0123	78c37b5244	[Optimize](Function) Add fast path for col like '%%' or col like '%' or regexp '\\.' (#20143 ) Add fast path for col like '%%' or col like '%' or regexp '\\.' (1) like about 34% speed up when use count() test support col like '%%' , col like '%', col not like '%%' , col not like '%' (2) regexp about 37% speed up when use count() test support col regexp '\\.', col not regexp '\\.' Q1: select count() From hits where url like '%'; Q2: select count() From hits where url regexp '\\.*';	2023-06-02 16:26:56 +08:00
morrySnow	422fcd6377	[fix](Nereids) forbid unexpected expression on filter and fix two more bugs (#20331 ) fix below bugs: 1. not check filter's expression, aggregate function, grouping scalar function and window expression should not appear in filter 2. show not change nullable of aggregate function when it is window function in window expression 3. bitmap and other metric types should not appear in order by or partition by of window expression	2023-06-02 16:19:50 +08:00
Yongqiang YANG	b1e6c6ffe5	[enhancement](txn) print commit backends when commit fails (#20367 ) Print commit backends when a commit fails.	2023-06-02 15:10:38 +08:00
amory	06e7c14320	[Improve](json-array) Support json array with nereids bool (#20248 ) Support json array with nereids bool now : ``` set enable_nereids_planner=true; mysql> SELECT json_array(1, "abc", NULL, TRUE, '10:00:00'); +----------------------------------------------+ \| json_array(1, 'abc', NULL, TRUE, '10:00:00') \| +----------------------------------------------+ \| [1,"abc",null,false,"10:00:00"] \| +----------------------------------------------+ 1 row in set (0.02 sec) ``` nereids boolean is "true"/"false" is not '0' /'1' , so we always get false	2023-06-02 14:47:24 +08:00
Dongyang Li	098c735064	[pipeline](fix) rm github_token, no need for it (#20360 )	2023-06-02 14:11:21 +08:00
amory	d68f3f3b3d	[Feature](array-functions)improve array functions for array_last_index (#20294 ) Now we just support array_first_index for lambda input , but no array_last_index	2023-06-02 13:54:03 +08:00
Jerry Hu	8ff8705b3f	[fix](olap) deletion statement with space conditions did not take effect (#20349 ) Deletion statement like this: delete from tb where k1 = ' '; The rows whose k1's value is ' ' will not be deleted.	2023-06-02 13:52:57 +08:00
Kaijie Chen	a869056567	[performance](load) support parallel memtable flush for unique key tables (#20308 )	2023-06-02 13:49:53 +08:00
AKIRA	e32eba8fdf	[refactor](stats) Persist status of analyze task to FE meta data (#20264 ) 1. In the past, we use a BE table named `analysis_jobs` to persist the status of analyze jobs/tasks, however there are many flaws such as, if BE crashed analyze job/task would failed however the status of analyze job/task couldn't get updated. 2. Support `DROP ANALYZE JOB [job_id]` to delete analyze job 3. Support `SHOW ANALYZE TASK STATUS [job_id] ` to get the task status of specific job 4. Restrict the execute condition of auto analyze, only when the last execution of auto analyze job finished a while ago could be executed again 5. Support analyze whole DB	2023-06-02 12:33:31 +08:00
zy-kkk	62c188d9a2	[typo](docs) fix release note 2.0 zh url (#20320 )	2023-06-02 11:45:24 +08:00
Gabriel	dc43e65d06	[Bug](pipeline) Fix memory leak if query is canceled caused by memory limit (#20316 )	2023-06-02 11:42:52 +08:00
HappenLee	576288cc89	[Profile](exec) Remove unless profile in pipeline exec engine (#20337 )	2023-06-02 11:39:11 +08:00
airborne12	c6b6dcdbc7	[Docs](inverted index) update docs for inverted index parser_mode and match_phrase support (#20266 )	2023-06-02 11:38:04 +08:00
Mingyu Chen	86d77084a4	[Fix](multi-catalog) fix oss access issue with aws s3 sdk (#20287 )	2023-06-02 10:40:07 +08:00
mch_ucchi	9d8043e4c1	[Fix](Nereids) should not gather data when sink (#20330 )	2023-06-02 10:33:11 +08:00
xy720	5a3b97bbf2	[enhancement](struct-type)support comment for struct field (#20200 ) support comment for struct field	2023-06-02 10:29:56 +08:00
Bin	075635ee50	[typo](docs)Correct the getting started document (#20245 )	2023-06-02 09:58:26 +08:00
Gabriel	937f04033f	[Bug](runtime filter) fix NPE if runtime filter has no target (#20338 )	2023-06-02 09:54:37 +08:00
HappenLee	8bec2b41db	[pipeline](rpc) support closure reuse in pipeline exec engine (#20278 )	2023-06-02 09:50:21 +08:00
starocean999	a8a4da9b9e	[fix](nereids)dphyper join reorder may cache wrong project list for project node (#20209 ) * [fix](nereids)dphyper join reorder may cache wrong project list for project node	2023-06-02 09:35:28 +08:00
xueweizhang	ecdc5124be	[feature-wip](duplicate-no-keys) schame change support for duplicate no keys (#19326 )	2023-06-02 09:22:41 +08:00
wangbo	0df073699d	[fix](planner)Fix missing kw for workload #20319 1 add usage docment for Workload Group query queue; 2 Fix missing KW for workload, this may cause create workload group failed.	2023-06-02 09:04:22 +08:00
shuke	01770ba68a	[fix](regression-test) variable's scope returned by curl (#20347 )	2023-06-01 23:38:39 +08:00
yangshijie	9b936049b6	[feature-wip](duplicate_no_keys) Add some test cases of all the duplicate tables in test case tpcds_sf100_without_key_p2 and make them duplicate tables without keys (#20332 )	2023-06-01 22:29:51 +08:00
Yongqiang YANG	363e78f08f	[enhancement](publish) print detailed info for failed publish (#20309 )	2023-06-01 22:24:16 +08:00
zhangstar333	34c1cda14a	[bug](udaf) fix java-udaf test case failed with decimal (#20315 ) java-udaf have some test case with decimal will be failed in P0, because the decimal of scale is not set correctly	2023-06-01 20:14:54 +08:00
shuke	05b7c65509	[fix](regression-test) fix multi-thread problem of regression-test #20322	2023-06-01 18:57:17 +08:00
HappenLee	608d2a3eca	[Bug](exec) push down no group by agg min cause error result (#20289 ) sql """ CREATE TABLE t1_int ( num int(11) NULL, dgs_jkrq bigint(20) NULL ) ENGINE=OLAP DUPLICATE KEY(num) COMMENT 'OLAP' DISTRIBUTED BY HASH(num) BUCKETS 1 PROPERTIES ( "replication_allocation" = "tag.location.default: 1", "storage_format" = "V2", "light_schema_change" = "true", "disable_auto_compaction" = "false", "enable_single_replica_compaction" = "false" ); """ sql """insert into t1_int values(1,1),(1,2),(1,3),(1,4),(1,null);""" qt_sql """ select min(dgs_jkrq) from t1_int; """ get the error result：4 after change we get the right result：1	2023-06-01 17:29:46 +08:00
hufengkai	e416c4d95f	[fix](docs)Correct the year and month format placeholder to lower case (#20210 )	2023-06-01 16:14:00 +08:00
LiBinfeng	24fcc2011f	[Fix](Nereids) Fix function test case unstable by adding order by (#20295 ) Nereids function case do not have a order by clause, so the result will be unstable, so order by is added to ensure stability.	2023-06-01 15:18:25 +08:00
Gabriel	a8b273ae31	[P2](test) Fix P2 output (#20311 )	2023-06-01 15:11:12 +08:00
lihangyu	f0513a861d	[Improve](Scan) add a session variable to make scan run serial (#20220 ) Parallel scanning can result in some read amplification, for example, select * from xx where limit 1 actually requires only one row of data. However, due to parallel scanning of multiple tablets, read amplification occurs, leading to performance bottlenecks in high-concurrency scenarios. This PR Adding a SessionVariable to enforce serial scanning can help mitigate this issue.	2023-06-01 15:06:35 +08:00
jakevin	0ff3073fc4	[improvement](Nereids): limit Memo groupExpression size. (#20272 )	2023-06-01 13:30:19 +08:00
Mryange	519f01133a	[feature](decimal)support cast rounding half up and div precision increment in decimalv3. (#19811 )	2023-06-01 13:09:58 +08:00
LiBinfeng	04644c6dfa	[fix](regression) regression test test_bitmap_filter_nereids could not run (#20293 )	2023-06-01 12:56:32 +08:00
Jibing-Li	1b968c4ade	[fix](multi catalog)Fix nereids planner text format include extra column index bug (#20260 ) Nereids planner include all columns index in TFileScanRangeParams, this may cause the column projection incorrect for text format table. Because csv reader use the column index position to split a line. Extra column index will cause get wrong split result. This PR is to reset the column index after Projection, remove the useless column index.	2023-06-01 12:17:47 +08:00
mch_ucchi	cc41cb0e7e	[Fix](Nereids) fix some insert into select bugs (#20052 ) fix 3 bugs: 1. failed to insert into a table with mv. ```sql create table t ( id int, c1 int, c2 int, c3 int ) duplicate key(id) distributed by hash(id) buckets 4 create materialized view k12s3m as select id, sum(c1), max(c3) from t group by id; insert into t select -4, -4, -4, 'd'; ``` insert will rise exception because mv column is not handled. now we will add a target column and value as defineExpr. 2. failed to insert into a table with not all the columns. ```sql insert into t(c1, c2) select c1, c2 from t ``` and t(id ukey, c1, c2, c3), will insert too many data, we fix it by change the output partitions. 3. failed to insert into a table with complex select. the select statement has join or agg, fix the bug by the way similar to the one at 2nd bug.	2023-06-01 12:15:19 +08:00
yiguolei	6befa53caa	fix fe meta upgrade error (#20291 ) Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-06-01 12:09:08 +08:00
Gabriel	4387f47fb5	[pipeline](load) support pipeline load (#20217 )	2023-06-01 11:42:43 +08:00
zhangstar333	e748b43d3d	[bug](parse) fix can't create aggregate column with agg_state (#20235 ) fix can't create aggregate column with agg_state	2023-06-01 11:18:40 +08:00
starocean999	68e593fbf1	[fix](nereids)(planner) case when should return NullLiteral when all case result is NullLiteral (#20280 )	2023-06-01 11:11:41 +08:00
shuke	4a682a0a46	[fix][regression-test] set timeout of curl in regression test to avoid hanged when be crashed. (#20222 ) Currently in regression-test, when a be crash, because curl does not set a timeout, suite-thread will get stuck. To solve this, encapsulate the call to be into a function, set the timeout uniformly, and avoid getting stuck	2023-06-01 11:00:09 +08:00
shuke	492154ee55	[fix](regression-test) add jdbc timeout (#20228 ) In some cases ( or bugs), doris may returned query to jdbc, but jdbc can not recognized what doris sent back, so hanged. To fix this, add a timeout of 30 minutes to jdbc connection.	2023-06-01 10:50:17 +08:00
lihangyu	9e21318834	[refactor](dynamic table) Make segment_writer unaware of dynamic schema, and ensure parsing is exception-safe. (#19594 ) 1. make ColumnObject exception safe 2. introduce FlushContext and construct schema at memtable flush stage to make segment independent from dynamic schema 3. add more test cases	2023-06-01 10:25:04 +08:00

1 2 3 4 5 ...

10932 Commits