Commit Graph

18263 Commits

Author SHA1 Message Date
e9104765ea [fix](doc) modify the document error,about outfile (#22475) 2023-09-02 21:14:59 +08:00
bf1b281388 [typo][doc] delete redundant '/' in flink-doris-connector. (#23185) 2023-09-02 21:01:37 +08:00
a86aa7161f [enhancement](doc) Lack of detailed parameter documentation (#23247) 2023-09-02 20:54:23 +08:00
eedd24316d [Feature](CCR) Support MoW for CCR (#22798) 2023-09-02 20:40:06 +08:00
6b56896a01 [chore](json reader) add original data to error messge for tracing (#22803) 2023-09-02 20:15:18 +08:00
9898c08620 [enhancement](merge-on-write) Add delete bitmap correctness check in commit phase (#23316) 2023-09-02 20:03:00 +08:00
4854651901 [chore](clang-tidy) Apply uninitialized variables check of clang-tidy (#23497) 2023-09-02 19:19:13 +08:00
45414db1ba [enhancement](table-meta) flush column unique ids for tables before 1.2 automatically (#23616) 2023-09-02 14:56:48 +08:00
d2417c168b [test](ColdHotSeparation) refresh case (#23741) 2023-09-02 14:43:57 +08:00
26571a00bf [Fix](IndexColumnWriter) Add logic for IndexedColumnWriter::add when the current page is full (#23766) 2023-09-02 14:28:59 +08:00
a6dff2faf0 [Feature](config) allow update multiple be configs in one request (#23702) 2023-09-02 14:26:54 +08:00
1d7ce83801 [coverage][test] modify run-be-ut.sh for run be cov, no need merge (#23755) 2023-09-02 14:24:30 +08:00
de8fa2cff5 [Fix](thrift) Add fe master check in some thrift calls (#23757)
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2023-09-02 14:05:31 +08:00
f1c354e0cf [improvement](colocate table) forbit change colocate table's replica allocation (#23064) 2023-09-02 13:54:25 +08:00
a542f107db [feature](move-memtable) buffer messages in load stream stub (#23721) 2023-09-02 13:42:34 +08:00
bbc893c953 [Enhancement](binlog) Add ModifyPartition, BatchModifyPartitions && ReplacePartitionOperationLog support (#23773) 2023-09-02 13:19:07 +08:00
228f0ac5bb [Feature](Multi-Catalog) support query doris bitmap column in external jdbc catalog (#23021) 2023-09-02 12:46:33 +08:00
8bad3bbd62 [fix](be) doris_be compile failed(#20932) (#20932)
Co-authored-by: yiguolei <676222867@qq.com>
2023-09-02 08:26:24 +08:00
68aa4867b0 [fix](map_agg) lost scale information for decimal type (#23776) 2023-09-02 08:03:33 +08:00
e5d1248c72 [Fix] spark load not found file #23502
Use the thrift interface to assign values to variables.
Otherwise, __isset will returns false.
2023-09-02 01:16:38 +08:00
18d470ecf7 [improvement](config) add a specific be config for segment_cache_capacity (#23701)
* add segment_cache_capacity config istead of fd limit * 2/5
* default -1 for backward compatibility
2023-09-02 01:14:14 +08:00
657e927d50 [fix](json)Fix the bug that read json file Out of bounds access (#23411) 2023-09-02 01:11:37 +08:00
b3ccc53f24 [improvement](profile)Add workload group in audit log and profile (#23761) 2023-09-02 01:09:23 +08:00
6630f92878 [Enhancement](Load) stream tvf support json (#23752)
stream tvf support json

[{"id":1, "name":"ftw", "age":18}]
[{"id":2, "name":"xxx", "age":17}]
[{"id":3, "name":"yyy", "age":19}]
example:

curl -v --location-trusted -u root: -H "sql: insert into test.t1(c1, c2) select id, name from http_stream(\"format\" = \"json\", \"strip_outer_array\" = \"true\", \"read_json_by_line\" = \"true\")" -T /root/json_file.json http://127.0.0.1:8030/api/_http_stream
2023-09-02 01:09:06 +08:00
75e2bc8a25 [function](bitmap) support bitmap_to_base64 and bitmap_from_base64 (#23759) 2023-09-02 00:58:48 +08:00
e0efda1234 [Improvement](Slice) support move constructor and operator for Slice (#23694) 2023-09-01 21:05:10 +08:00
ba863a992e [doc](alter table) add notes for modify column (#23720) 2023-09-01 19:32:44 +08:00
295ea482a1 [improvement](log) optimize template function log for performance (#23746)
change log level to debug and use format in template function log for performance.
2023-09-01 19:02:33 +08:00
0b94eee4c7 [fix](rest)query_info returns empty rows #23595 2023-09-01 18:50:49 +08:00
0d50c11d5c [Doc](AuditLoader) improvement auditLoader doc (#23758) 2023-09-01 18:48:39 +08:00
a8de805a7a [fix](Nereids) fix stats inject in or_expansion.groovy (#23748)
make stats injection run first
2023-09-01 18:31:58 +08:00
797d9de192 [fix](Nereids) When col stats is Unknow, not expression should return the stats with selectivity of 1 2023-09-01 17:36:31 +08:00
e3bbba82cf [Fix](planner) fix to_date failed in create table as select (#23613)
Problem:
when create table as select using to_date function, it would failed

Example:
create table test_to_date properties('replication_num' = '1') as select to_date('20230816') as datev2;

Reason:
after release version 2.0, datev1 is disabled, but to_date function signature does not upgrade, so it failed when checking return type of to_date

Solved:
when getfunction, forbidden to_date with return type date_v1, datetime v1 also changed to datetime v2 and decimal v2 changed to decimal v3
2023-09-01 17:28:40 +08:00
b5232ce0d7 [fix](nereids) NormalizeAggregate may push redundant expr to child project node (#23700)
NormalizeAggregate may push exprs to child project node. We need make sure there is no redundant expr in the pushed down expr list. This pr use 'Set' to make sure of that.
2023-09-01 17:16:10 +08:00
fe5feae480 [chroe](ci) add script for create_issue_comment (#23723)
Co-authored-by: stephen <hello-stephen@qq.com>
2023-09-01 15:46:25 +08:00
e3886bcf2a [fix](tablet sheduler) change sched period back to 1s (#23573)
This reverts commit 285bf978442fdff65fda5264ff40bd8291954ef2.

* change tablet sched peroid back to 1s
2023-09-01 15:29:59 +08:00
9d2fc78bd5 [fix](cooldown) Fix potential data loss when clone task's dst tablet is cooldown replica (#17644)
Co-authored-by: Yongqiang YANG <98214048+dataroaring@users.noreply.github.com>
Co-authored-by: Kang <kxiao.tiger@gmail.com>
2023-09-01 15:27:52 +08:00
91c5640cae [fix](tablet clone) fix clone backend chose wrong disk (#23729) 2023-09-01 15:12:35 +08:00
b843b79ddc [fix](tablet clone) fix tablet sched ctx toString cause null exeption (#23731) 2023-09-01 15:05:28 +08:00
Pxl
32853a529c [Bug](cte) fix multi cast data stream source not open expr (#23740)
fix multi cast data stream source not open expr
2023-09-01 14:57:12 +08:00
Pxl
0e9dd348fb [Improvment](materialized-view) add short circuit for selectBestMV #23743 2023-09-01 14:46:54 +08:00
eaf2a6a80e [fix](date) return right date value even if out of the range of date dictionary(#23664)
PR(https://github.com/apache/doris/pull/22360) and PR(https://github.com/apache/doris/pull/22384) optimized the performance of date type. However hive supports date out of 1970~2038, leading wrong date value in tpcds benchmark.
How to fix:
1. Increase dictionary range: 1900 ~ 2038
2. The date out of 1900 ~ 2038 is regenerated.
2023-09-01 14:40:20 +08:00
5b2360e836 [opt](planner) speed up computeColumnsFilter on ScanNode (#23742)
computeColumnsFilter compute filter on all table base schema's column.
However, it table is very wide, such as 5000 columns. It will take a
long time. This PR compare conjuncts size and columns size. If conjuncts
size is small than columns size, then collect slots from conjuncts to
avoid traverse all columns.
2023-09-01 14:22:17 +08:00
e88c218390 [Improve](Job)Job internal interface provides immediate scheduling (#23735)
Delete meaningless job status
System scheduling is executed in the time wheel
Optimize window calculation code
2023-09-01 12:50:08 +08:00
c31cb5fd11 [enhance] use correct default value for show config action (#19284) 2023-09-01 11:28:26 +08:00
d96bc2de1a [enhance](policy) Support to change table's storage policy if the two policy has same resource (#23665) 2023-09-01 11:25:27 +08:00
d6450a3f1c [Fix](statistics)Fix external table auto analyze bugs (#23574)
1. Fix auto analyze external table recursively load schema cache bug.
2. Move some function in StatisticsAutoAnalyzer class to TableIf. So that external table and internal table could implement the logic separately. 
3. Disable external catalog auto analyze by default, could open it by adding catalog property "enable.auto.analyze"="true"
2023-09-01 10:58:14 +08:00
9a7e8b298a [Improvement](statistics)Show column stats even when error occurred (#23703)
Before, show column stats will ignore column with error.
In this pr, when min or max value failed to deserialize, show column stats will use N/A as value of min or max, and still show the rest stats. (count, null_count, ndv and so on).
2023-09-01 10:57:37 +08:00
b93a1a83a5 [opt](Nereids) let keywords list same with legacy planner (#23632) 2023-09-01 10:24:30 +08:00
e1090d6a63 [Fix](column predicate) seperate CHAR primitive type for column predicate (#23581) 2023-09-01 09:41:53 +08:00