Commit Graph

5382 Commits

Author SHA1 Message Date
6b9698a248 [bugfix](insert into) should not send profile during report process (#24127)
Co-authored-by: yiguolei <yiguolei@gmail.com>
2023-09-09 17:12:35 +08:00
c3f3195721 [Fix](clucene) fix clucene build error in arm (#24130) 2023-09-09 15:31:40 +08:00
03757d0672 [bug](explode) fix table node not implement alloc_resource function (#24031)
fix table node not implement alloc_resource function
2023-09-09 08:25:28 +08:00
153c7982f3 [Optimize](invert index) Optimize multiple terms conjunction query (#23871) 2023-09-09 01:52:58 +08:00
0f408d1192 [improvement](executor)Add name for task scheduler #23983 2023-09-09 00:56:39 +08:00
0f0ffa3482 [Fix](Parquet Reader) fix parquet read issue (#24092) 2023-09-09 00:35:18 +08:00
0143ae8266 [fix]Add logging before _builtin_unreachable() (#24101)
Co-authored-by: 宋光璠 <songguangfan@sf.com>
2023-09-09 00:30:11 +08:00
e140938d81 [Perfomance][export] Opt the export of CSV tranformer (#24003) 2023-09-08 20:26:54 +08:00
0b24bd6a42 [Bug](pipelineX) init runtime filter profile at first (#24106) 2023-09-08 20:01:02 +08:00
2638ad0550 [fix](compaction) rowid_conversion should ignore deleted row on normal compaction (#24005) 2023-09-08 19:44:24 +08:00
f8fd8a3d17 [fix](trash) fix clean trash not working (#23936)
When admin clean trash is executed while the backend daemon clean thread is already cleaning trash, the SQL command returns immediately. However, the daemon thread does not clean all trash; it cleans only expired trash.
Also, if there is a lot of trash, the daemon clean thread will be busy handling it for a long time.
2023-09-08 18:13:22 +08:00
76ca57cf21 [bug](join) fix outer join not add tuple is null column when build rows is 0 (#23974)
fix outer join not add tuple is null column when build rows is 0
2023-09-08 17:55:03 +08:00
Pxl
69868f18d6 [Bug](join) fix nested loop join some problems (#24034) 2023-09-08 17:40:41 +08:00
1abf5e779d [pipelineX](refactor) refactor debug string (#24083) 2023-09-08 16:58:53 +08:00
82dc970916 [feature](insert) Support group commit insert (#22829) 2023-09-08 15:51:03 +08:00
2965b9b3b4 fix update delete bitmap when rowset is blank (#24075)
If the rowset (derived from a clone) does not have a segment, there is no need to update the delete bitmap.
2023-09-08 12:43:42 +08:00
cb29d1a395 fix compile error with gcc12 (#24049) 2023-09-08 10:36:30 +08:00
b73f345479 [fix](intersect) fix wrong result of intersect node (#24044)
Issue Number: close #24046
2023-09-08 10:27:37 +08:00
3927ceac95 [Bug](runtime filter) Fix runtime filter initialization (#24063)
be.WARNING contains many log lines like 'runtime filter params meet error'. This message is misleading.
2023-09-08 10:27:20 +08:00
cdb1b341c7 [pipelineX](runtime filter) Support runtime filter (#24054) 2023-09-08 10:17:22 +08:00
68acb8597b [fix](nested_loop_join) null value should be output in semi-anti join (#23971)
create table t1 (
    k1 bigint,
    k2 bigint
)
ENGINE=OLAP
DUPLICATE KEY(k1, k2)
COMMENT 'OLAP'
DISTRIBUTED BY HASH(k2) BUCKETS 1
PROPERTIES (
    "replication_allocation" = "tag.location.default: 1",
    "is_being_synced" = "false",
    "storage_format" = "V2",
    "light_schema_change" = "true",
    "disable_auto_compaction" = "false",
    "enable_single_replica_compaction" = "false"
);
create table t3 (
    k1 bigint,
    k2 bigint
)
ENGINE=OLAP
DUPLICATE KEY(k1, k2)
COMMENT 'OLAP'
DISTRIBUTED BY HASH(k2) BUCKETS 1
PROPERTIES (
    "replication_allocation" = "tag.location.default: 1",
    "is_being_synced" = "false",
    "storage_format" = "V2",
    "light_schema_change" = "true",
    "disable_auto_compaction" = "false",
    "enable_single_replica_compaction" = "false"
);
Data:

insert into t1 values (1,null),(null,1),(1,2), (null,2),(1,3), (2,4), (2,5), (3,3), (3,4), (20,2), (22,3), (24,4),(null,null);
insert into t3 values (1,null),(null,1),(1,4), (1,2), (null,3), (2,4), (3,7), (3,9),(null,null),(5,1);
Query:

 select t1.* from t1 where not exists ( select k1 from t3 where t1.k2 < t3.k2 );
Result:

Empty set
Expected result:

+------+------+
| k1   | k2   |
+------+------+
| NULL | NULL |
|    1 | NULL |
+------+------+
2023-09-08 09:28:55 +08:00
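The expected result in the commit above follows from SQL three-valued logic: when t1.k2 is NULL, the comparison t1.k2 < t3.k2 is never true, so the subquery returns no rows and NOT EXISTS keeps the row. The sketch below replays the same data and query in an in-memory SQLite database. SQLite is only a stand-in chosen for illustration (an assumption, not what the commit tests against), the Doris-specific table clauses are dropped, and the function name run_not_exists_query is hypothetical.

```python
import sqlite3


def run_not_exists_query():
    """Replay the commit's data and query in SQLite (a stand-in, not Doris)."""
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()

    # Plain tables: the ENGINE/DISTRIBUTED/PROPERTIES clauses are Doris-only.
    cur.execute("create table t1 (k1 bigint, k2 bigint)")
    cur.execute("create table t3 (k1 bigint, k2 bigint)")

    # Same rows as in the commit message; None is stored as SQL NULL.
    t1_rows = [
        (1, None), (None, 1), (1, 2), (None, 2), (1, 3), (2, 4), (2, 5),
        (3, 3), (3, 4), (20, 2), (22, 3), (24, 4), (None, None),
    ]
    t3_rows = [
        (1, None), (None, 1), (1, 4), (1, 2), (None, 3), (2, 4), (3, 7),
        (3, 9), (None, None), (5, 1),
    ]
    cur.executemany("insert into t1 values (?, ?)", t1_rows)
    cur.executemany("insert into t3 values (?, ?)", t3_rows)

    # Same query as in the commit message.
    rows = cur.execute(
        "select t1.* from t1 "
        "where not exists (select k1 from t3 where t1.k2 < t3.k2)"
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    for row in run_not_exists_query():
        print(row)
```

Every non-NULL t1.k2 in this data is smaller than the largest t3.k2 (9), so only the two rows whose k2 is NULL survive the anti join.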
f2ebe65ea4 [enhancement](exchange) not use thread pool to handle exchange block (#23970)
Co-authored-by: yiguolei <yiguolei@gmail.com>
2023-09-07 19:40:48 +08:00
20b3e5eafe [feature](Datetime) add from_microsecond / from_millisecond function (#23902) 2023-09-07 19:03:49 +08:00
fdb7a44f57 Revert "[Feature](partitions) Support auto partition" (#24024)
* Revert "[Feature](partitions) Support auto partition (#23236)"

This reverts commit 6c544dd2011d731b8c9c51384c77bcf19c017981.

* Update config.h
2023-09-07 17:08:26 +08:00
Pxl
f13d3008a2 [Bug](type) coredump on get_type_as_primitive_type #24012 2023-09-07 11:28:41 +08:00
349c090469 [fix](move-memtable) lock when send data in load stream stub (#23949) 2023-09-07 11:19:21 +08:00
9b494f4b36 [Fix](autoinc) skip to fill the auto increment column when the input column is not nullable (#23905) 2023-09-07 11:13:22 +08:00
3acf026172 [Enhancement](inverted index) add bkd index query cache to improve perf (#23952)
Use a query cache to accelerate BKD query performance, especially under high concurrency.
2023-09-07 10:24:27 +08:00
4144e62c5f [pipeline](log) disable some unless error status warning log (#23961) 2023-09-07 10:22:05 +08:00
xy
45dfbeb8b3 [feature](agg) add the aggregation function 'array_agg' #23474
This function takes one argument, as in ARRAY_AGG(col), where col is the column whose values you want to aggregate.
It aggregates the values in a column, including NULL, into an array and returns a value of the ARRAY data type.
2023-09-07 10:21:55 +08:00
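The behavior described for array_agg in the commit above (collect a column's values, NULL included, into one array) can be sketched in a few lines. This is an illustration only, not the engine's implementation: the per-group layout, the parameter names key_index and value_index, and the use of input order inside each array are assumptions made for the example.

```python
from collections import defaultdict


def array_agg(rows, key_index, value_index):
    """Collect one column's values per group into a list, keeping None (NULL).

    rows: iterable of tuples; key_index selects the grouping column and
    value_index the aggregated column (both hypothetical parameter names).
    """
    groups = defaultdict(list)
    for row in rows:
        # None is appended like any other value, mirroring "including NULL".
        groups[row[key_index]].append(row[value_index])
    return dict(groups)


if __name__ == "__main__":
    data = [("a", 1), ("a", None), ("b", 3)]
    print(array_agg(data, key_index=0, value_index=1))
```

The point of the sketch is that NULL values are not filtered out before aggregation, unlike aggregates such as SUM or COUNT(col).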
537369f4e2 [Fix](http) Fix curl return HTTP_ERROR && Add not_found HttpClientTest, fix (#23984)
Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>
2023-09-07 10:10:51 +08:00
2f8b075b71 [improvement](bitmap) support version for ser/deser of bitmap (#23959) 2023-09-07 09:55:29 +08:00
d183e08f6d [opt](MergedIO) optimize merge small IO, prevent amplified read (#23849)
There were two flaws in the previous fix (https://github.com/apache/doris/pull/20305):
1. `next_content` may not necessarily be a truly readable range
2. The last range of the merged data may be a hollow

This PR fundamentally solves the read amplification problem by rechecking the calculated range. Under the algorithm, read amplification can arise in only one case: when only a small amount of data lies within the 4k (`MIN_READ_SIZE`) range. However, 4k is generally the minimum IO size, so there is no need for further segmentation.
2023-09-06 22:45:31 +08:00
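As a rough illustration of the trade-off in the commit above, the sketch below merges nearby byte ranges into fewer reads and measures the resulting read amplification. It is not the MergedIO algorithm from the PR. The merge rule (merge when the hollow between two ranges is at most MIN_READ_SIZE), the function names, and the assumption that input ranges do not overlap are all hypothetical choices for the example; only the 4k value of MIN_READ_SIZE comes from the commit message.

```python
MIN_READ_SIZE = 4 * 1024  # 4k, the value named in the commit message


def merge_ranges(ranges, max_gap=MIN_READ_SIZE):
    """Merge half-open (start, end) byte ranges whose gap is <= max_gap.

    Each merged read starts and ends on requested data, so a merged read
    never ends in a hollow (a gap that holds no requested bytes).
    """
    merged = []
    for start, end in sorted(ranges):
        if merged and start - merged[-1][1] <= max_gap:
            # Small hollow: extend the previous read to cover this range.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def read_amplification(ranges, merged):
    """Bytes actually read divided by bytes requested (ranges must not overlap)."""
    needed = sum(end - start for start, end in ranges)
    read = sum(end - start for start, end in merged)
    return read / needed


if __name__ == "__main__":
    wanted = [(0, 100), (200, 300), (10000, 10100)]
    reads = merge_ranges(wanted)
    print(reads, read_amplification(wanted, reads))
```

With this rule the extra bytes read per merge are bounded by max_gap, which matches the commit's observation that amplification within a 4k window is acceptable because 4k is generally the minimum IO size.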
95ae5376f3 [Fix](BinaryPrefixPage) stop to read values when current pos reached the end of the page in BinaryPrefixPageDecoder::next_batch (#23855) 2023-09-06 16:34:38 +08:00
6c544dd201 [Feature](partitions) Support auto partition (#23236)
Co-authored-by: zhangstar333 <2561612514@qq.com>
2023-09-06 16:26:45 +08:00
Pxl
a96adc01aa [Chore](function) refactor of quantile_state (#23862)
refactor of quantile_state
2023-09-06 15:39:19 +08:00
7625d1514a [pipelineX](feature) support select operator and add TPCH test cases (#23954) 2023-09-06 15:34:31 +08:00
b2861975ec [FIX](array/map)fix array map batch append data with right next_array_item_rowid (#23779) 2023-09-06 14:47:37 +08:00
dc28878f0e [FIX](function) fix size function for array map (#23920)
Issue Number: close #xxx
Currently, select size(map(1, 2)); makes the BE core dump.
This change makes the size function handle both array and map columns.
2023-09-06 14:32:06 +08:00
29e1c59cca [enhancement](merge-on-write) print the detailed error message on calculating delete bitmap failed (#23919) 2023-09-06 14:28:17 +08:00
893e53ea3f [fix](segcompaction) disable segcompaction when calculating delete bitmap (#23927)
Calculating the delete bitmap may generate segments and, as a result, may
trigger segcompaction. But the BetaRowsetWriter is a transient one in the
publishing context, which is bug-prone and hard to roll back. So disable
the trigger in the delete bitmap calculation code path.

Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>
2023-09-06 12:11:13 +08:00
47e47c575b [pipelineX](join) Fix hash join (#23895) 2023-09-06 11:37:15 +08:00
hzq
80097f0752 [fix](fragment mgr) Do not cancel queries when upgrading cluster. #23924 2023-09-06 09:04:28 +08:00
09bcedb116 [feature](merge-cloud) Remove deprecated old cache (#23881)
* Remove deprecated old cache
2023-09-06 08:07:05 +08:00
d7f1a4fb9a [Fix](inverted index) need_read_data only supports DUP_KEYS (#23818)
The need_read_data optimization does not work properly when the table is UNIQUE_KEY MOR.
2023-09-05 19:57:22 +08:00
1a8913f8f4 [fix](shared hash table) fix p0 test failure (#23907) 2023-09-05 14:48:46 +08:00
801ddc0313 [feature-wip](arrow-flight) BE not start Arrow Flight Service by default (#23901) 2023-09-05 14:48:29 +08:00
ee98cea8a1 [improve](segment-cache) Change the segment cache granularity from rowset_id to rowset_id+segment_id (#23561) 2023-09-05 14:32:07 +08:00
d9461d77ba [Fix](bitmap index) like predicate does not work in bitmap index (#23819) 2023-09-05 11:43:16 +08:00
32dd088d30 [fix](inverted)fix inverted index report be hostname error #23872
When the inverted index reports task status to FE, it does not set the correct host.
Also delete the unused _backend.
2023-09-05 10:52:23 +08:00