Commit Graph

6849 Commits

Author SHA1 Message Date
dfde10d4c8 [improvement](function) switch inet(6)_aton alias origin function (#30196) 2024-01-23 10:09:54 +08:00
e5f1d8d7ec [fix](phrase_prefix) fix match_phrase_prefix query incorrect result (#29946) 2024-01-23 10:09:54 +08:00
9dd368f8dc [config](load) set default memtable_flush_running_count_limit to 2 (#30146) 2024-01-23 10:07:51 +08:00
a9ab094614 [Bug](fix) try to fix the coredump of streambyte decode of sse (#30190) 2024-01-23 10:07:51 +08:00
ead3b4ac1d [feature](function) support ip function is_ipv4_compat, is_ipv4_mapped (#29954) 2024-01-23 10:07:51 +08:00
21db1ecff2 [fix](tablet meta) stale rowset is not loaded while BE start (#30077) 2024-01-23 10:06:44 +08:00
f66f6b2a82 [refactor](close) refactor ispendingfinish logic and close logic to do close more quickly (#30021) 2024-01-23 10:06:05 +08:00
afd440fea7 [fix](move-memtable) only do close wait on the last sink (#30098) 2024-01-19 15:48:56 +08:00
eb86872b70 [improve](profile) add init probe side timer in join node (#30079) 2024-01-19 15:48:56 +08:00
97b2a3b993 [improvement](ip function) refactor some ip functions and remove dirty codes (#30080) 2024-01-19 15:48:56 +08:00
fd6ec97731 [fix](injection) don't disturb CLOSE_LOAD message in LoadStream (#30097) 2024-01-19 15:48:56 +08:00
Pxl
a5ca8833d7 [Improvement](aggregate) optimize for small string aggregate (#29919) 2024-01-19 15:48:15 +08:00
0ccd706a30 [Enhancement](Jdbc Catalog) Map Jdbc Catalog JSON Type to String for Improved Performance and Compatibility (#30035)
This PR proposes mapping external catalog JSON types to String instead of JsonB in Apache Doris. The motivation is that JDBC retrieves JSON data as a JSON string regardless of its storage format (Json(String) or Json(Binary)). Mapping to String streamlines data retrieval, simplifies write-backs, and keeps compatibility with all JSON(String) and JSON(Binary) functions, although Doris will then display JSON data as Strings, which may be misleading. This approach avoids the performance overhead and complexity of converting each row of data from JsonB to String, which makes the process more efficient.

About Upgrade
To ensure query compatibility with existing Catalogs in the upgraded version, we currently still retain the capability to query external JSON types as JSONB. However, once you upgrade to the new version and either refresh the Catalog or create a new one, all external JSON types will be treated as Strings. To ensure consistent behavior, and because support for querying JSON as JSONB may be removed in the future, it is highly recommended that you manually refresh your Catalog as soon as possible after upgrading to the new version.
2024-01-18 12:03:07 +08:00
7e821f3d66 [Performance](Join) Support all match one logic (#30019)
Support all match one logic
2024-01-18 12:03:07 +08:00
990d5d8664 [cleanup](insert-into) clean up some insert into log (#30063) 2024-01-18 12:03:07 +08:00
Pxl
30378f9bbc [Chore](config) remove some unused config (#29983)
remove some unused config
2024-01-18 12:03:07 +08:00
e894911cda [function](char) change char function behaviour same with mysql (#30034)
select char(0) = '\0';
should return true;
2024-01-18 10:04:21 +08:00
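The expected result above follows from how MySQL's documented CHAR() treats its arguments: each integer becomes one or more bytes, so CHAR(0) is the single NUL byte. The sketch below models that rule in Python for illustration only. The helper name `mysql_char` is hypothetical, it assumes small non-negative integer arguments, and it is not the Doris implementation, which may differ in details.

```python
def mysql_char(*codes):
    """Model MySQL CHAR(): turn integer arguments into a byte string.

    Hypothetical helper for illustration; assumes non-negative ints.
    """
    out = bytearray()
    for code in codes:
        if code is None:
            # MySQL skips NULL arguments.
            continue
        if code < 0:
            raise ValueError("this sketch only handles non-negative codes")
        # Values above 255 expand to several bytes, most significant first;
        # 0 still produces exactly one byte.
        length = max(1, (code.bit_length() + 7) // 8)
        out += code.to_bytes(length, "big")
    return bytes(out)


if __name__ == "__main__":
    print(mysql_char(0) == b"\x00")
    print(mysql_char(77, 121, 83, 81, 76))
```

Under this rule, `mysql_char(256)` yields the same bytes as `mysql_char(1, 0)`, matching the equivalence described in the MySQL manual.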
7e3ed8a19a [fix](index_tool) fix index_tool compile error (#30049) 2024-01-18 10:04:21 +08:00
6ef9ed08aa [fix](multi-table-load) fix multi table load can not finish (#29957) 2024-01-18 10:03:35 +08:00
f0a4ec5f85 [pipelineX](fix) Fix runtime filter dependency DCHECK failed (#30038) 2024-01-18 08:56:47 +08:00
9714a3c873 [fix](regression) fault injection may cause fd to be closed twice (#30070) 2024-01-18 08:55:53 +08:00
2dcdf07dd4 [Feature](profile)Support active_queries TVF (#29999) 2024-01-16 21:25:02 +08:00
2fa511f80e [improve](multi-table-load) avoid plan and execute too many plan at once (#29951) 2024-01-16 21:14:35 +08:00
f7a340a2df [improve](move-memtable) add cancel method to load stream stub (#29994) 2024-01-16 20:23:09 +08:00
a0355a6580 [opt](scanner) optimize the number of threads of scanners, follow up #28640 (#30030)
Follow up #28640: doris_scanner_thread_pool_thread_num may be too large on machines with 128 cores.
2024-01-16 20:23:09 +08:00
2916745cf2 [Chore](arm) Support auto-SIMD on arm and refactor compilation docs (#29956) 2024-01-16 18:49:04 +08:00
d11e797d4c [fix](decimal) fix wrong decimal overflow caused by uninitialized nested column of null value (#29960) 2024-01-16 18:48:20 +08:00
d3bf23d70d [chore](removelogs) remove debug query timeout logs (#30006)
---------

Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-01-16 18:48:18 +08:00
08db92a49a [fix](regression) spare .testfile to make disk checker happy when injecting fault (#30012) 2024-01-16 18:46:19 +08:00
e1bcdc35fd (fix)[group-commit] Fix some group commit case (#30008) 2024-01-16 18:46:19 +08:00
4d6de1f181 [opt](invert index) optimize the code structure of inverted index queries (#29771) 2024-01-16 18:46:19 +08:00
779ed25972 [fix](scan) crashing caused by unlocked reading of tablet (#30005) 2024-01-16 18:46:19 +08:00
ece5f8e86c [pipelineX](fix) Fix input data distribution for distinct streaming agg (#29980) 2024-01-16 18:42:09 +08:00
66513d57f9 [feature](function) support ip function named ipv6_cidr_to_range(addr, cidr) (#29812) 2024-01-16 18:42:09 +08:00
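For readers unfamiliar with CIDR ranges, the computation behind a function like ipv6_cidr_to_range(addr, cidr) can be sketched with Python's standard `ipaddress` module. This is only an illustration of the arithmetic (lowest and highest address in the prefix); the Python function below is a stand-in with an assumed return shape, not the Doris implementation.

```python
import ipaddress


def ipv6_cidr_to_range(addr, cidr):
    """Return (lowest, highest) IPv6 address covered by addr/cidr.

    Illustrative stand-in; the return shape is an assumption.
    """
    # strict=False lets addr carry host bits; they are masked off.
    network = ipaddress.IPv6Network((addr, cidr), strict=False)
    return network.network_address, network.broadcast_address


if __name__ == "__main__":
    low, high = ipv6_cidr_to_range("2001:db8::1", 32)
    print(low, high)
```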
43597afe2c [bugfix](core) writer status is read and write concurrently and will core by use after free (#29982)
Co-authored-by: yiguolei <yiguolei@gmail.com>
2024-01-16 18:42:09 +08:00
c9cf9ab841 [pipelineX](improvement) Improve data distribution for streaming agg (#29969) 2024-01-16 18:40:32 +08:00
9e30a67a2a [Improve](topn opt) avoid crash when rpc returned row contains duplicated row entry (#29872)
1. Add more info to trace a potential bug and avoid the crash
2. Use the correct permutation size to do `column->permute`
2024-01-16 18:40:31 +08:00
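Why the permutation size matters can be shown with a generic, bounds-checked permute. The sketch below is an assumption-laden illustration in Python, not the Doris `column->permute` code: the function name, the `limit` convention (0 means "all rows"), and the error raised are all hypothetical. The point is that the number of rows read through the permutation must never exceed the permutation's own length, which can happen if the row count and the permutation disagree (for example, after duplicated row entries).

```python
def permute(column, perm, limit=0):
    """Return column reordered by perm, truncated to limit rows.

    Hypothetical sketch: limit == 0 means "use every row of column".
    """
    size = len(column)
    limit = size if limit == 0 else min(size, limit)
    # Refuse to index past the end of the permutation instead of crashing.
    if len(perm) < limit:
        raise ValueError("size of permutation is less than required")
    return [column[perm[i]] for i in range(limit)]


if __name__ == "__main__":
    rows = [10, 20, 30, 40]
    print(permute(rows, [3, 0, 2, 1], limit=2))
```

Passing a `limit` that matches the permutation actually received keeps the call well defined even when fewer distinct rows come back than expected.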
ffc6f58e85 [pipelineX](fix) Fix incorrect partition number (#29963) 2024-01-16 18:39:37 +08:00
be893d792c [fix](jni) fix jni_reader function name get_nex_block to get_next_block (#29943) 2024-01-16 18:39:00 +08:00
05a65b9f81 [improve](join) remove join probe dependency of wait rf publish finish #29792 2024-01-16 18:39:00 +08:00
e35b26f4fc [feature](auditlog)Add runtime cpu time/peak memory metric (#29925) 2024-01-16 18:39:00 +08:00
b7b8e59392 [opt](scanner) use buffered queue to avoid acquiring locks frequently (#29938) 2024-01-16 18:37:44 +08:00
c8845c9e07 [opt](scanner) Improve the efficiency of TOPN opt (#29937) 2024-01-16 18:37:44 +08:00
4b4fd1a290 [improvement](log) add txn log (#28875) 2024-01-16 18:37:06 +08:00
8ca807578f [fix](migrate disk) fix migrate disk lost data during publish version (#29887)
Co-authored-by: Yongqiang YANG <98214048+dataroaring@users.noreply.github.com>
2024-01-16 18:37:06 +08:00
74e4486c65 [fix](partition) Add more log for single replica load when partition id eq 0 (#28707) 2024-01-16 18:35:32 +08:00
615d94bbc7 [log](insertadd log in parse insert into values data (#29903) 2024-01-16 18:35:32 +08:00
7309061db4 [pipelineX](improvement) Adjust local exchange strategy (#29915) 2024-01-16 18:35:32 +08:00
25428bd7fb [fix](kerberos) fix BE kerberos ccache renew, optimize kerbero options (#29291)
1. We need to remove BE kinit and use JNI login with keytab, because kinit cannot renew the TGT for Doris in many complex cases.
> This pull request will support creating a new instance from keytab: https://github.com/apache/doris-thirdparty/pull/173, so we no longer need the kinit cmd; just log in with keytab and principal.

2. add `kerberos_ccache_path` to set kerberos credentials cache path manually.

3. add `max_hdfs_file_handle_cache_time_ms` to set hdfs fs handle cache time.
2024-01-16 18:35:29 +08:00
5e697990a8 [bugfix](timeout) serving_blocks_num may cause timeout, try to fix it (#29912)
Although serving_blocks_num is an atomic variable, its ++ and -- are not protected by the transfer lock.
I am not sure about the memory order of ++ and --.
I think it may be the root cause of the query timeout, so I remove the check and test it in the GitHub pipeline.
2024-01-16 18:34:19 +08:00