doris

Author	SHA1	Message	Date
Xin Liao	0aa7108ee2	[fix](merge-on-write) incorrect result caused by key range filter with pk (#31456 )	2024-02-29 19:51:47 +08:00
zclllyybb	82add8dfc1	[Fix](timezone) Introduce a config to use Doris tzdata directly (#31561 )	2024-02-29 12:38:03 +08:00
zclllyybb	b177b26d39	[branch-2.1](tracing) Pick pipeline tracing and relative bugfix (#31367 ) * [Feature](pipeline) Trace pipeline scheduling (part I) (#31027) * [fix](compile) Fix performance compile fail #31305 * [fix](compile) Fix macOS compilation issues for PURE macro and CPU core identification (#31357) * [fix](compile) Correct PURE macro definition to fix compilation on macOS * 2 --------- Co-authored-by: zy-kkk <zhongyk10@gmail.com>	2024-02-29 08:42:35 +08:00
AlexYue	f18c853495	[enhance](S3) Init default retry strategy for aws s3 sdk (#31329 )	2024-02-28 13:08:36 +08:00
yiguolei	70304bffd2	[refactor](wg) move memory gc logic to workload group (#31334 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-02-23 23:12:09 +08:00
yangshijie	8f77e6363a	[Feature](function) Support xxhash function like murmur hash function (#31193 )	2024-02-23 19:03:28 +08:00
AlexYue	2f9bd3e3bb	(enhance)(S3) Change s3 metric from bvar adder to latency recorder (#28861 )	2024-02-19 17:22:03 +08:00
lsy3993	870a9342b7	[fix](function) fix extract_url_parameter's bug then get the last key (#30929 ) fix extract_url_parameter's bug then get the last key	2024-02-18 14:45:25 +08:00
koarz	6cf7468073	[enhancement](function) change some function nullable mode (#30991 ) change some function nullable mode	2024-02-18 14:45:25 +08:00
zhiqiang	73940f96d3	[opt](string_to_unsigned_int) performance opt (#30825 )	2024-02-05 22:23:16 +08:00
yiguolei	2c99c53812	[refactor](taskqueue) remove old task scheduler based wg (#30832 ) --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-02-05 22:00:27 +08:00
lihangyu	90d3f0d805	[Fix](json) avoid print warn log when parse failed (#30656 )	2024-02-01 19:00:50 +08:00
TengJianPing	ef8d9ad9a4	[pipelinex](profile) improve memory counter of pipelineX (#30538 )	2024-01-31 23:53:39 +08:00
slothever	b1a9370004	[fix](glue)support access glue iceberg with credential list (#30473 ) merge from #30292	2024-01-28 18:23:07 +08:00
yujun	8ca807578f	[fix](migrate disk) fix migrate disk lost data during publish version (#29887 ) Co-authored-by: Yongqiang YANG <98214048+dataroaring@users.noreply.github.com>	2024-01-16 18:37:06 +08:00
Pxl	068367063f	[Improvement](function) optimization for substr with ascii string (#29799 )	2024-01-12 11:59:52 +08:00
zhangstar333	f8d3b20911	[improve](fmt) use format_to and FMT_COMPILE to speed up write data #29682	2024-01-12 11:53:57 +08:00
plat1ko	b0cac0014d	[enhance](FS) Improve FS error code (#29432 )	2024-01-06 21:17:22 +08:00
HappenLee	92533d544f	[LOG](exec) Add timer for hostname_to_ip (#29497 )	2024-01-04 18:27:27 +08:00
Xinyi Zou	82635d4b59	[opt](memory) All LRU Cache inherit from LRUCachePolicy (#28940 ) After all LRU Cache inherits from LRUCachePolicy, this will allow prune stale entry, eviction when memory exceeds limit, and define common properties. LRUCache constructor change to private, only allow LRUCachePolicy to construct it. Impl DummyLRUCache, when LRU Cache capacity is 0, will no longer be meaningless insert and evict.	2023-12-29 16:15:56 +08:00
TengJianPing	a525d5c5a3	[refactor](decimal) change type name Decimal128 to Decimal128V2, Decimal128I to Decimal128V3 to avoid confusion (#29265 ) change type name Decimal128 to Decimal128V2, Decimal128I to Decimal128V3 to avoid confusion	2023-12-29 10:11:44 +08:00
TengJianPing	5129ab5738	[fix](decimalv2) fix decimalv2 agg errors (#29246 )	2023-12-28 21:17:16 +08:00
yangshijie	3dc3e81734	[Improvement](datatype) Update Parser for IPv4/v6 data types (#29044 ) Transforming from parsing std:: string to parsing char * to accelerate the parsing of ipv4/v6 data types.	2023-12-28 11:00:38 +08:00
zhiqiang	8c59e16f81	[opt](query cancel) optimization for query cancel #28778	2023-12-22 12:48:37 +08:00
Xinyi Zou	83e7235bab	[fix](memory) Add thread asynchronous purge jemalloc dirty pages (#28655 ) jemallctl purge all arena dirty pages may take several seconds, which will block memory GC and cause OOM. So purge asynchronously in a thread.	2023-12-22 12:05:20 +08:00
zclllyybb	453e3c18f4	[refactor](buffer) remove download buffer since it is no longer useful (#28832 ) remove download buffer since it is no longer useful	2023-12-22 11:53:31 +08:00
Gavin Chou	95073053bc	[chore] Add bvar for meta operations of BE (#28374 )	2023-12-19 15:54:19 +08:00
Liqf	0f25a4b3c6	[bug](json)Fix the problem of be down caused by json path ending with \ (#28180 )	2023-12-15 15:57:08 +08:00
Xinyi Zou	cd6d75e518	[fix](memory) `TabletSchema` and `Schema` no longer track memory, only track columns count. (#28149 ) TabletSchema and Schema no longer track memory, only track columns count. because cannot accurately track memory size. TabletMeta MemTracker changed to track TabletSchema columns count. Segment::_meta_mem_usage Unknown value overflow, causes the value of SegmentMeta MemTracker is similar to -2912341218700198079. So, temporarily put it in experimental type tracker.	2023-12-13 15:06:46 +08:00
Xinyi Zou	a719d7a222	[fix](memory) Fix LRU Cache of type `NUMBER` charge (#28175 )	2023-12-13 11:15:57 +08:00
Xinyi Zou	226a0c3b1d	[chore](memory) Warning in log when turning on THP (#28122 )	2023-12-08 17:38:38 +08:00
julic20s	d8d8f15bf3	[improvement](vectorization) Use requires instead of specialization for doris::vectorized::Decimal (#28027 ) Use requires instead of specialization for doris::vectorized::Decimal	2023-12-08 09:59:52 +08:00
Kaijie Chen	84a651d976	[improve](load) rewrite memtable memory limiter rules (#27759 )	2023-12-07 17:26:26 +08:00
wuwenchi	54d062ddee	[feature](stream load) (step one)Add arrow data type for stream load (#26709 ) By using the Arrow data format, we can reduce the streamload of data transferred and improve the data import performance	2023-12-06 23:29:46 +08:00
Mingyu Chen	990b8d4fa5	[minor](s3client) use LOG(INFO) to print s3 client debug log if enabled (#28059 )	2023-12-06 20:39:12 +08:00
Kaijie Chen	36c54da03d	fix owned slice capacity (#28002 )	2023-12-06 00:26:34 +08:00
TengJianPing	17016b9797	[improvement](decimal) use new way for decimal arithmetic precision promotion (#27787 ) * [DNM](decimal) use new way for decimal arithmetic precision promotion * [improvement](decimal) [DNM](decimal) use new way for decimal arithmetic precision promotion 1. [DNM](decimal) use new way for decimal arithmetic precision promotion 2. throw exception if it overflows for decimal arithmetics 3. throw exception if it overflows when casting among number types * fix compile error of gcc * improvement --------- Co-authored-by: morrySnow <morrysnow@126.com>	2023-12-05 12:54:40 +08:00
amory	358d73a0ae	[FIX](complextype) fix empty quote with complex type (#27942 )	2023-12-05 12:25:26 +08:00
plat1ko	1afdbfe723	[enhance](BE) Refactor TaskWorkerPool (#27555 )	2023-12-04 21:46:10 +08:00
Xinyi Zou	b096062680	[feature-wip](arrow-flight)(step6) Support regression test (#27847 ) Design Documentation Linked to #25514 Regression test add a new group: arrow_flight_sql, ./run-regression-test.sh -g arrow_flight_sql to run regression-test, can use jdbc:arrow-flight-sql to run all Suites whose group contains arrow_flight_sql. ./run-regression-test.sh -g p0,arrow_flight_sql to run regression-test, can use jdbc:arrow-flight-sql to run all Suites whose group contains arrow_flight_sql, and use jdbc:mysql to run other Suites whose group contains p0 but does not contain arrow_flight_sql. Requires attention, the formats of jdbc:arrow-flight-sql and jdbc:mysql and mysql client query results are different, for example: Datatime field type: jdbc:mysql returns 2010-01-02T05:09:06, mysql client returns 2010-01-02 05:09:06, jdbc:arrow-flight-sql also returns 2010-01-02 05:09 :06. Array and Map field types: jdbc:mysql returns ["ab", "efg", null], {"f1": 1, "f2": "a"}, jdbc:arrow-flight-sql returns ["ab ","efg",null], {"f1":1,"f2":"a"}, which is missing spaces. Float field type: jdbc:mysql and mysql client returns 6.333, jdbc:arrow-flight-sql returns 6.333000183105469, in query_p0/subquery/test_subquery.groovy. If the query result is empty, jdbc:arrow-flight-sql returns empty and jdbc:mysql returns \N. use database; and query should be divided into two SQL executions as much as possible. otherwise the results may not be as expected. For example: USE information_schema; select cast ("0.0101031417" as datetime) The result is 2000-01-01 03:14:1 (constant fold), select cast ("0.0101031417" as datetime) The result is null (no constant fold), In addition, doris jdbc:arrow-flight-sql still has unfinished parts, such as: Unsupported data type: Decimal256. INVALID_ARGUMENT: [INTERNAL_ERROR]Fail to convert block data to arrow data, error: [E3] write_column_to_arrow with type Decimal256 Unsupported null value of map key. INVALID_ARGUMENT: [INTERNAL_ERROR]Fail to convert block data to arrow data, error: [E33] Can not write null value of map key to arrow. Unsupported data type: ARRAY<MAP<TEXT,TEXT>> jdbc:arrow-flight-sql not support connecting to specify DB name, such asjdbc:arrow-flight-sql://127.0.0.1:9090/{db_name}", In order to be compatible with regression-test, use db_nameis added before all SQLs whenjdbc:arrow-flight-sql` runs regression test. select timediff("2010-01-01 01:00:00", "2010-01-02 01:00:00");, error java.lang.NumberFormatException: For input string: "-24:00:00"	2023-12-04 19:23:56 +08:00
yangshijie	4d1aa131ee	[Feature](datatype) add be ut codes for IPv4/v6 (#26534 ) Add unit test codes for IP types	2023-12-04 15:25:02 +08:00
HHoflittlefish777	97d36b4f38	[fix](csv_reader) fix trim_double_quotes behavior change (#27882 )	2023-12-03 22:57:55 +08:00
daidai	80d2c7ab41	[feature](parquet)support read parquet lzo compress. (#27706 )	2023-12-03 09:55:52 +08:00
yiguolei	be30bd1e40	[improvement](spinlock) remove some potential bad spinlock usage (#27904 ) * [improvement](spinlock) remove some potential spinlock usage --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2023-12-02 20:33:54 +08:00
Mryange	68525fc112	[feature](profile) add RuntimeFilterInfo in merge profile #27869	2023-12-01 21:42:25 +08:00
Qi Chen	60bc3be8a2	[Opt](Compression) Opt zstd block decompression by `ZSTD_decompressDCtx()`. (#27534 ) Opt zstd block decompression by `ZSTD_decompressDCtx()` to replace streaming decompression. It will improve performance but consume more memory. Test result: - env: 1 node(16 cores, 64G). - parquet column: 100 million rows of char(255) column. - result: 5.2 -> 4.6.	2023-12-01 09:10:32 +08:00
Kaijie Chen	8ca8a0655e	[fix](memtracking) require size in Allocator::free (#27795 )	2023-11-30 15:57:15 +08:00
Xinyi Zou	d96e2dfefb	[feature-wip](arrow-flight)(step5) Support JDBC and PreparedStatement and Fix Bug (#27661 )	2023-11-29 21:17:20 +08:00
daidai	ce271ff382	[fix](parquet)fix can not read parquet lz4 compress. (#27383 ) Fixed the problem of not being able to read parquet lz4 compressed format. By default, it is decompressed according to the Hadoop lz4 format. If it fails, it will fall back to the standard lz4 compression format.	2023-11-29 19:04:53 +08:00
Kaijie Chen	b152abc292	[fix](faststring) fix memtracking in faststring free (#27731 )	2023-11-29 17:05:14 +08:00

1 2 3 4 5 ...

857 Commits