Commit Graph

16093 Commits

Author SHA1 Message Date
28ff349381 [doc](fix)invalid character 。 in en docs (#29355)
Co-authored-by: Rohit Satardekar <rohitrs1983@gmail.com>
2024-01-03 12:59:59 +08:00
193b7518ab [enhancement](nereids)throw readable exception when meet missing column in agg's output (#29243) 2024-01-03 12:59:32 +08:00
12286f0a63 [docs][hive-transactional-tables] Add hive transactional tables documents. (#29369) 2024-01-03 12:57:20 +08:00
2a9b4a0f76 [enhancement](paimon)support predict for null and notnull (#29134) 2024-01-03 12:53:39 +08:00
79eb575d7c [Improvement](nereids)Support ODBC table for new planner. (#29129) 2024-01-03 12:51:07 +08:00
1fbbff32b2 [fix](pipelinex) coredump caused by VRuntimeFilterSlots::_is_global was not set (#29446) 2024-01-03 12:40:41 +08:00
dbf61005df change boost thirdparty url to offical archives.boost.io (#29401) 2024-01-03 12:12:43 +08:00
c0db8533af [fix](load) fix single replica load with auto partition 2024-01-03 11:53:09 +08:00
5eb38301dd [Chore](CI)Re-configure branch-2.0 shell-check as a required check (#29448)
https://github.com/apache/doris/pull/29289 done
2024-01-03 11:44:38 +08:00
xy
fab1a627fc [fix](scan) _insert_data_normal should catch exception when BlockReader::_unique_key_next_block (#29426)
Co-authored-by: xingying01 <xingying01@corp.netease.com>
2024-01-03 11:44:02 +08:00
08353f6027 [Enhance](fe) Iceberg table in HMS catalog supports broker scan (#28107)
My organization uses HMS catalog to accelerate Lake query. Sine we have custom distributed file system and hard to integrate to FE / BE, we introduce HMS Catalog broker scan support (#24830) and implement custom distributed file system adaption in broker.

We want to expand the scope of use to Iceberg table scan in HMS Catalog. This PR introduces broker-scan-related `IcebergBrokerIO`, `BrokerInputFile`, `BrokerInputStream` for Iceberg table scan
2024-01-03 11:29:12 +08:00
1e8bb75182 [improve](move-memtable) add log on idle timeout (#29438) 2024-01-03 11:26:26 +08:00
be1d9c3358 [fix](memory) Fix mem tracker web page notice #29361 2024-01-03 11:25:00 +08:00
14e7eb7624 [Opt](rf) Opt broadcast join remote runtime filter merge and wait (#29439) 2024-01-03 11:21:28 +08:00
067a9a3a22 [opt](ES catalog)Add more description for limitations of docvalue_scan (#29420)
> [ignore_above](https://www.elastic.co/guide/en/elasticsearch/reference/current/ignore-above.html)
Do not index any string longer than this value. Defaults to 2147483647 so that all values would be accepted. Please however note that default dynamic mapping rules create a sub keyword field that overrides this default by setting ignore_above: 256.

When the field value is longer than the limitation, we will get an empty result.
The solution is turn off `enable_docvalue_scan`, to get data from `_source` field.
2024-01-03 11:17:48 +08:00
b3da9bad3e [fix](move-memtable) fix load stream stub fault injection test (#29431) 2024-01-03 11:15:18 +08:00
3a8ae590ec [fix](pipelineX) Fix unexpected OOM on pipelineX (#29436) 2024-01-03 10:40:44 +08:00
83d36927b3 [Choricals] (ci) Temporarily cancel the mandatory restrictions on the 2.0 branch shellCheck (#29442) 2024-01-03 10:39:52 +08:00
2578f61451 [Chore](ci)Temporarily cancel the mandatory restrictions of shellCheck (#29440)
https://github.com/apache/doris/pull/29289
I modified the shell-check attribute, once the PR was combined, I revert this
2024-01-03 10:20:02 +08:00
Pxl
358995e4ac [Chore](sort) fix block used after it was moved(#29416)
/mnt/disk2/pengyu/codebase/apache/doris/be/src/vec/common/sort/heap_sorter.cpp:91:23: error: 'tmp_block' used after it was moved [bugprone-use-after-move,-warnings-as-errors]
2024-01-03 10:02:35 +08:00
cdf2bb24a4 [fix](Nereids) topn runtime filter only support simplest case (#29312)
only support simple case: select ... from tbl [where ...] order by ... limit ...
2024-01-03 09:55:00 +08:00
97234a6485 [Enhancement](inverted index) strictly checkout inverted index properties (#29421) 2024-01-03 09:54:31 +08:00
298d0c6904 [Chore](Notice)Update Notice year (#29415) 2024-01-03 09:50:30 +08:00
f54f79515c [Bug](fix) str_to_date "" should be null (#29402) 2024-01-03 08:25:22 +08:00
1edf5b31b6 [Regression-test](wal) Add fault injection case for wal mem back pressure (#29298) 2024-01-03 00:06:52 +08:00
797238cbb7 [fix](merge-on-write) fix schema change may result in delete bitmap incorrect (#29386) 2024-01-02 23:45:04 +08:00
69a01e0cf5 [improve](move-memtable) skip load stream stub close wait when cancel (#29427) 2024-01-02 23:35:50 +08:00
69524ccf98 [improve](group commit) Group commit support commit by data size (#29428) 2024-01-02 23:20:23 +08:00
d54cde51ba [fix](group commit) fix http stream group commit begin a txn (#29399) 2024-01-02 21:58:00 +08:00
5db496d844 [Improve](Variant) make output stable (#29389) 2024-01-02 20:29:17 +08:00
9490d5e9a2 [Debug](Variant) sanitize variant in write_column_to_mysql (#29380) 2024-01-02 20:28:59 +08:00
243b075eef [improve](move-memtable) report failed tablets with status (#29351) 2024-01-02 20:24:53 +08:00
f2fa62f524 [minor](Nereids): cleanup code (#29413) 2024-01-02 20:13:59 +08:00
af39217d14 [improve](agg)support push down min/max on unique table (#29242) 2024-01-02 19:40:23 +08:00
3eca457edd [fix](statistics)Fix set partition loaded replayer try to write edit log bug (#29395)
While observer or checkpointer replay SetPartitionLoaded log, it shouldn't try to write the log. This pr is to fix the bug.
2024-01-02 19:38:34 +08:00
04abcc1245 [cases](index compaction)Add index compaction cases (#29393)
Add some cases for index compaction:

1. index compaction with duplicate key table
2. index compaction with unique key table 
3. optimize index compaction with multi segments in one  inverted index
2024-01-02 18:29:09 +08:00
Pxl
d474d4e701 [Bug](rollup) fallback and disable tulpe when rollup select failed on old planner (#29226)
* fallback and disable tulpe when rollup select failed on old planner

* add case
2024-01-02 17:54:56 +08:00
b3f69211b7 [refactor](wal) move group commit load content length to runtime state (#29188) 2024-01-02 15:53:38 +08:00
4692a62e47 [fix](planner)nullable info of agg node's intermediate and output slots should be same (#29315) 2024-01-02 15:53:28 +08:00
706463781c [refactor](group commit) refactor group commit wal code (#29375) 2024-01-02 15:52:03 +08:00
c07e3b7b47 [feature](Nereids): eliminate left outer join by unique (#28853) 2024-01-02 15:50:49 +08:00
2ed122b787 [improvement](task exec context) add parent class HasTaskExecutionCtx to own the task ctx (#29388)
---------

Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-01-02 15:28:27 +08:00
4581618b09 [improvement](disk) pick disk randomly when usage is less than 0.7 (#29368) 2024-01-02 14:08:09 +08:00
eac96005b3 [Fix](Job)Concurrency may result in event loss (#29385) 2024-01-02 14:06:16 +08:00
f6443efe1a [doc](fix)fix kill doc error and add example (#29313) 2024-01-02 14:05:19 +08:00
bf6488e518 [fix](doc) show catalog example output fix (#29150)
Co-authored-by: Rohit Satardekar <rohitrs1983@gmail.com>
2024-01-02 14:05:05 +08:00
90b2ee90b2 [nereids] consider numNulls in filter estimation (#29184)
consider numNulls in filter estimation
2024-01-02 13:51:11 +08:00
b7487430da Revert "[improve](move-memtable) cancel load rapidly when stream close wait (#29322)" (#29371)
This reverts commit bbf58c5aa42d40e66bc6ccc9ed91a4fcb4bdfff7.
2024-01-02 11:32:14 +08:00
7cb3239599 [fix](sink) add writting restriction for OlapTableSinkV2Operator (#29340) 2024-01-02 11:16:06 +08:00
cec69f4cbd [Enhancement](wal) Add timout for wal memory back pressure (#29178) 2024-01-02 11:02:17 +08:00