3fcd64366f
[opt](Nereids) use 1 as narrowest column when do column pruning on union ( #41719 ) ( #41975 )
...
pick from master #41719
just like previous PR #41548
this PR process union node to ensure not require any column from its
children when it is required by its parent with empty slot set
2024-10-17 15:28:27 +08:00
b4875c2789
[fix](jni)fix jni use timezone_obj get timezone be core. ( #41956 ) ( #42003 )
...
bp #41956
This PR #40225 try to pass time zone info from BE to JNI, and it use
`_state->timezone_obj().name()`
to get the timezone name.
But when we do some rolling upgrade of BE, it may coredump like:
```
*** SIGSEGV address not mapped to object (@0x610) received by PID 72661 (TID 73538 OR 0x7f2e898d1640) from PID 1552; stack trace: ***
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_branch-2.1/doris/be/src/common/signal_handler.h:421
1# os::Linux::chained_handler(int, siginfo_t*, void*) in /usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server/libjvm.so
2# JVM_handle_linux_signal in /usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server/libjvm.so
3# signalHandler(int, siginfo_t*, void*) in /usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server/libjvm.so
4# 0x00007F3070D3E520 in /lib/x86_64-linux-gnu/libc.so.6
5# cctz::time_zone::name[abi:cxx11]() const in /mnt/hdd01/ci/compatibility-deploy/be/lib/doris_be
6# doris::vectorized::JniConnector::open(doris::RuntimeState*, doris::RuntimeProfile*) at /home/zcp/repo_center/doris_branch-2.1/doris/be/src/vec/exec/jni_connector.cpp:87
7# doris::vectorized::AvroJNIReader::init_fetch_table_schema_reader() at /home/zcp/repo_center/doris_branch-2.1/doris/be/src/vec/exec/format/avro/avro_jni_reader.cpp:119
8# std::_Function_handler::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
9# doris::WorkThreadPool::work_thread(int) at /home/zcp/repo_center/doris_branch-2.1/doris/be/src/util/work_thread_pool.hpp:159
10# execute_native_thread_routine at ../../../../../libstdc++-v3/src/c++11/thread.cc:84
11# start_thread at ./nptl/pthread_create.c:442
12# 0x00007F3070E22850 at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:83
172.20.50.206 last coredump sql: 2024-10-13 04:12:23,985 [query]
```
This PR use another method: `_state->timezone()`, which just return a
string, instead of reading and initializing
time zone info file, to avoid potential coredump.
2024-10-17 14:47:33 +08:00
67d057a711
[cherry-pick](branch-21) fix conv function parser string failure return wrong result ( #40530 ) ( #41964 )
...
## Proposed changes
Issue Number: close #39618
cherry-pick from master (#40530 )
2024-10-17 14:45:46 +08:00
0b41cd2472
[fix](serde)fix the bug in DataTypeNullableSerDe.deserialize_column_from_fixed_json ( #41217 ) ( #41960 )
...
bp #41217
## Proposed changes
Issue Number: close #xxx
<!--Describe your changes.-->
2024-10-17 14:36:01 +08:00
1c5c27eceb
[Enhancement](ExternalTable)Optimize the performance of getCachedRowCount when reading ExternalTable ( #41659 ) ( #41959 )
...
bp #41659
## Proposed changes
Because ExternalTable will initialize the previously uninitialized table
when `getCachedRowCount()`, which is unnecessary. So for the
uninitialized table, we directly return -1.
This will increase the speed of our query `information_schema.tables`.
2024-10-17 14:34:23 +08:00
968e33f07e
[cherry-pick](branch-21) pick ( #39057 ) ( #41352 ) ( #41958 )
...
## Proposed changes
pick from master (#39057 ) (#41352 )
<!--Describe your changes.-->
---------
Co-authored-by: Mryange <59914473+Mryange@users.noreply.github.com >
2024-10-17 14:30:40 +08:00
1b901f6fcc
[cherry-pick](branch-2.1) add parquet tvf cases and fix some parquet bug ( #41931 )
...
## Proposed changes
pick pr:
https://github.com/apache/doris/pull/41683
https://github.com/apache/doris/pull/41506
https://github.com/apache/doris/pull/41338
https://github.com/apache/doris/pull/39326
---------
Co-authored-by: morningman <morningman@163.com >
2024-10-17 14:20:58 +08:00
eea916e466
[cherry-pick](branch-21) should check the expr of auto range partition ( #41626 ) ( #41872 )
...
## Proposed changes
cherry-pick from master (#41626 )
<!--Describe your changes.-->
2024-10-17 12:45:49 +08:00
b8214952a1
[branch-2.1] Fix is_partial_update parameter is not set in append_block_with_partial_content() ( #41865 )
...
https://github.com/apache/doris/pull/41439 forgets to set
`is_partial_update` parameter for `Tablet::lookup_row_key()` in
`append_block_with_partial_content()`
2024-10-17 12:44:41 +08:00
3ff67350d0
[opt](Nereids) support all syntax to avoid fallback in multi-statement query ( #41811 )
2024-10-17 12:42:01 +08:00
5736dc537c
[fix](mtmv) Fix duplicate column name not check when create materialized view #40658 ( #41822 )
...
## Proposed changes
pr: https://github.com/apache/doris/pull/40658
commitId: 252aeeb6
2024-10-17 12:19:15 +08:00
95c0a7a8e3
[chore](planner) change decimal literal toSql as plainString ( #41809 ) ( #41976 )
...
pick from master #41809
for example, 0 with decimal(38,4) will return 0.0000
2024-10-17 12:16:10 +08:00
5521a25392
[enhance](insertoverwrite)insert overwrite not fallback ( #41799 )
...
- Insert overwrite on NEREIDS can automatically clean up garbage
temporary partitions after restart, which is not available on old
optimizers
- When insert fails, no longer throw nereids exceptions
2024-10-17 12:14:17 +08:00
7daf423e98
[opt](profile) Move ExecutedByFrontend to execution summary profile #41761 ( #41831 )
...
cherry pick from #41761
2024-10-17 11:37:48 +08:00
19784d420c
[opt](inverted index) Improved top-N optimization by refining the sorting column check. ( #39496 ) ( #41954 )
...
https://github.com/apache/doris/pull/39496
2024-10-17 11:31:11 +08:00
cf2ec26bc2
[fix](catalog) should return error if try using a unknown database ( #40479 ) ( #41971 )
...
bp #40479
2024-10-17 11:13:56 +08:00
169a12058b
[chore](Variant) forbid variant type as hash join key ( #41673 ) ( #41974 )
...
pick from master #41673
2024-10-17 11:06:37 +08:00
f98aa1d08b
[Fix](Branch-2.1) fix fallback to legacy planner when set group commit in session variable ( #41984 )
2024-10-17 10:40:33 +08:00
0b6447faeb
[Fix](SchemaChange) refactor variant root column iterator to make row… ( #41941 )
...
pick #41700
2024-10-17 10:39:07 +08:00
d04082f685
[improvement](statistics)Use min row count of all replicas as tablet/table row count. ( #41894 ) ( #41978 )
...
backport: https://github.com/apache/doris/pull/41894
2024-10-16 21:45:37 +08:00
7d99d5fcc4
[fix](analytic) Fix data distribution after analytic operator ( #41902 ) ( #41949 )
...
Fix data distribution after analytic operator
pick #41902
2024-10-16 18:41:56 +08:00
5bd33fc88c
[pick](branch-2.1) pick #41292 #41350 #41589 #41628 #41743 #41601 #41667 #41751 ( #41927 )
...
## Proposed changes
pick #41292 #41350 #41589 #41628 #41743 #41601 #41667 #41751
<!--Describe your changes.-->
---------
Co-authored-by: Pxl <pxl290@qq.com >
2024-10-16 15:41:28 +08:00
e56216211e
[pick](branch-2.1) pick #40667 #40714 ( #41905 )
...
pick
#40667
#40714
---------
Co-authored-by: wangbo <wangbo@apache.org >
2024-10-16 14:09:03 +08:00
95d429b7de
[pick](branch-2.1) pick #41891 ( #41929 )
...
pick #41891
2024-10-16 13:59:46 +08:00
e6545a36a3
[improvement](iceberg)Parallelize splits for count(*) for 2.1 ( #41169 ) ( #41880 )
...
bp: #41169
2024-10-16 10:52:06 +08:00
e62e47700d
[fix](Nereids) fixed the limit offset error pick 39316 ( #41878 )
2024-10-16 10:38:05 +08:00
b185dfcbf6
[pick](branch-2.1) pick #41676 #41740 #41857 ( #41904 )
...
pick #41676 #41740 #41857
2024-10-15 22:41:17 +08:00
a4b7d93ded
[bugfix](iceberg)add prefix for endpoint with s3 client for 2.1 ( #41336 ) ( #41877 )
...
bp: #41336
2024-10-15 19:59:10 +08:00
b91d8e2327
[Improvement](minor) Reduce locking scope ( #41845 ) ( #41844 )
...
pick #41845
2024-10-15 18:39:53 +08:00
78b6157aa9
[fix](ip/variant) fix information meta ( #41871 )
...
fix datatype information meta for ip/variant (#41666 )
## Proposed changes
Issue Number: close #xxx
<!--Describe your changes.-->
2024-10-15 18:01:14 +08:00
5fbefa084c
[opt](hive) make supported hive table error msg clearer ( #41616 ) ( #41851 )
...
bp #41616
2024-10-15 17:36:27 +08:00
f3389973e0
[fix](heartbeat) fill default value for required field in TFrontendPingFrontendResult ( #41609 ) ( #41854 )
...
bp #41609
2024-10-15 17:33:46 +08:00
24ceb60ac7
[fix](glue) support glue on aws ( #41084 ) ( #41855 )
...
bp #41084
2024-10-15 17:33:25 +08:00
abcba778ff
[fix](cancel) Fix cancel msg on branch-2.1 ( #41798 )
...
Make sure we can tell cancel reason from:
1. user cancel
2. timeout
3. others
```text
mysql [demo]>set query_timeout=1;
--------------
set query_timeout=1
--------------
Query OK, 0 rows affected (0.00 sec)
mysql [demo]>select sleep(5);
--------------
select sleep(5)
--------------
ERROR 1105 (HY000): errCode = 2, detailMessage = Timeout
mysql [demo]>select sleep(5);
--------------
select sleep(5)
--------------
^C^C -- sending "KILL QUERY 0" to server ...
^C -- query aborted
ERROR 1105 (HY000): errCode = 2, detailMessage = cancel query by user from 127.0.0.1:64208
```
2024-10-15 17:15:05 +08:00
77fbe6397a
[fix](http) Remove file if downloading faile is failed #41778 ( #41827 )
...
cherry pick from #41778
2024-10-15 15:30:29 +08:00
736bb161b1
[ci](ga) skip scope-label ( #41612 ) ( #41848 )
...
bp #41612
2024-10-15 14:02:21 +08:00
7aec6fc42b
[fix](auto bucket) fix auto buckets calc using the first k partition #41675 ( #41820 )
...
cherry pick from #41675
2024-10-15 10:26:46 +08:00
94687a2f3c
[fix](array/map) fix resize impl in array/map ( #41595 ) ( #41699 )
...
backport: https://github.com/apache/doris/pull/41595
2024-10-15 09:50:11 +08:00
d97642e9b5
[cherry-pick](branch-21) fix tablet sink shuffle without project not match the output tuple ( #40299 )( #41293 ) ( #41327 )
...
## Proposed changes
cherry-pick from master (#40299 )(#41293 )
<!--Describe your changes.-->
2024-10-15 00:12:23 +08:00
afdc6800d8
[fix](Nerieds) column prune should retain at least one column for union all ( #41613 ) ( #41771 )
...
cherry-pick #41613 to branch-2.1
2024-10-15 00:09:02 +08:00
4888c632f4
[cherry-pick](branch2.1) support escape.delim and serialization.null.format for hive text ( #41684 )
...
## Proposed changes
pick from master:
https://github.com/apache/doris/pull/40291
2024-10-15 00:08:23 +08:00
19be696120
[Refactor](statistics)Refactor of statistics buildConnectContext. ( #41553 ) ( #41814 )
...
backport: https://github.com/apache/doris/pull/41553
2024-10-14 22:09:57 +08:00
b84e128974
[fix](statistics)Fix clear stale mv row count bug. ( #41653 ) ( #41807 )
...
backport: https://github.com/apache/doris/pull/41653
2024-10-14 21:14:05 +08:00
345cc9172f
[improvement](statistics)Drop table stats in unprotectDropTable so that dropDB could call it as well. ( #39685 ) ( #41801 )
...
backport: https://github.com/apache/doris/pull/39685
2024-10-14 19:30:49 +08:00
ff52e73a07
[Fix](inverted index) fix match null for inverted index #41746 ( #41787 )
...
cherry pick from #41746
2024-10-14 14:45:36 +08:00
f112af0fd2
[pick](branch-2.1) pick #41555 #41592 #38204 ( #41781 )
...
pick #41555 #41592 #38204
2024-10-14 14:05:08 +08:00
ec0c008317
[feature](paimon)support paimon with dlf for 2.1 ( #41247 ) ( #41694 )
...
bp: #41247
2024-10-13 20:04:01 +08:00
629f500646
[cherry-pick](branch-2.1) Pick "[Fix](delete) Fix some delete p2 case ( #37689 ) ( #41678 )" ( #41757 )
...
Pick #37689 #41678
2024-10-12 22:43:21 +08:00
87005aa5b2
[fix](delete) Fix potential delete job stuck util timeout if exception happend in FE DeleteJob execution ( #41672 ) ( #41765 )
...
pick: #41672
Fail task should also count down for the count down latch to prevent job
stuck.
2024-10-12 22:29:10 +08:00
68ae6d025e
[branch-2.1][improvement](jdbc catalog) Print more errors when Jdbc Catalog fails to obtain a connection on FE ( #41769 )
2024-10-12 21:21:54 +08:00