doris

Author	SHA1	Message	Date
wangbo	a55e109e97	[pick][Improment]Add schema table workload_group_privileges (#38436 ) (#39708 ) pick #38436	2024-08-22 00:44:43 +08:00
Mingyu Chen	0bfcee1251	[opt](file-cache) support system table file_cache_statistics (#39552 ) 1. Add new system table: `file_cache_statistics` This table is used for viewing metrics related to file cache on BE side ``` mysql> select * from information_schema.file_cache_statistics limit 10; +-------+---------------+----------------------------+--------------------------------+--------------------+ \| BE_ID \| BE_IP \| CACHE_PATH \| METRIC_NAME \| METRIC_VALUE \| +-------+---------------+----------------------------+--------------------------------+--------------------+ \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| disposable_queue_curr_elements \| 0 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| disposable_queue_curr_size \| 0 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| disposable_queue_max_elements \| 102400 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| disposable_queue_max_size \| 21474836480 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| hits_ratio \| 0.8539634687001242 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| hits_ratio_1h \| 0 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| hits_ratio_5m \| 0 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| index_queue_curr_elements \| 0 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| index_queue_curr_size \| 0 \| \| 10003 \| 172.20.32.136 \| /mnt/output/be/file_cache/ \| index_queue_max_elements \| 102400 \| +-------+---------------+----------------------------+--------------------------------+--------------------+ ``` It will show metrics of file caches on each BE. 2. Add new metrics `hits_ratio_1h` and `hits_ratio_5m` for file cache This 2 metrics will show the hit ratio of file cache in recent 1 hour or 5 minutes. So that we can know recent hit ratio instead of global historical hit ratio.	2024-08-21 10:03:39 +08:00
qiye	43cc8d648d	[fix](ES Catalog)Check isArray before parse json to array (#39104 ) (#39273 ) ## Proposed changes bp #39104	2024-08-13 15:13:40 +08:00
wangbo	fc0222a64c	[opt](info) processlist schema table support show all fe (#38701 ) (#38953 ) pick #38701	2024-08-07 11:01:46 +08:00
Gabriel	9d23ccf1f2	[Improvement](schema scan) Use async scanner for schema scanners (#38… (#38666 ) …403)	2024-08-01 16:05:24 +08:00
Mryange	017dad8c54	[fix](type)support runtime predicate for time type (#38258 ) (#38465 ) ## Proposed changes https://github.com/apache/doris/pull/38258 Issue Number: close #xxx <!--Describe your changes.-->	2024-07-31 10:27:36 +08:00
zzzxl	e2bb86e7f8	[fix](inverted index) fixed in_list condition not indexed on pipelinex (#38178 ) ## Proposed changes https://github.com/apache/doris/pull/36565 https://github.com/apache/doris/pull/37842 https://github.com/apache/doris/pull/37921 https://github.com/apache/doris/pull/37386 <!--Describe your changes.-->	2024-07-25 14:42:34 +08:00
Xinyi Zou	10c5c336d8	[branch-2.1](arrow-flight-sql) Add config arrow_flight_result_sink_buffer_size_rows (#38223 ) pick #38221	2024-07-24 15:15:39 +08:00
qiye	e5339a4014	[feature](ES Catalog)Support control scroll level by config #37180 (#37290 ) ## Proposed changes backport #37180	2024-07-15 16:41:38 +08:00
qiye	f8cee439b6	[feature](ES Catalog) map nested/object type in ES to JSON type in Doris (#37101 ) (#37182 ) backport #37101	2024-07-05 10:48:32 +08:00
abmdocrt	02fad48870	[Fix](upgrade) Fix fields not handled correctly during upgrade and downgrade (#36691 ) master version is #36690	2024-06-22 14:23:04 +08:00
lihangyu	445d42a57d	[fix](topn-opt) remove redundant check for fetch phase (#36676 ) #36629 Issue Number: close #xxx <!--Describe your changes.-->	2024-06-21 22:28:38 +08:00
zclllyybb	bd47d5a681	[branch-2.1](auto-partition) Fix auto partition load failure in multi replica (#36586 ) this pr 1. picked #35630, which was reverted #36098 before. 2. picked #36344 from master these two pr fixed existing bug about auto partition load. --------- Co-authored-by: Kaijie Chen <ckj@apache.org>	2024-06-20 17:51:18 +08:00
Pxl	dda25cceb6	[Bug](information-schema) fix some bug of information_schema.PROCESSLIST (#36447 ) ## Proposed changes pick from #36409	2024-06-18 16:45:48 +08:00
zclllyybb	3b23eee37c	Revert "[fix](auto-partition) fix auto partition load lost data in multi sender (#35287 )" (#36098 ) Reverts apache/doris#35630 because it brought some more damaging bugs. we will fix it and merge in next version	2024-06-11 17:11:42 +08:00
wangbo	75a6f28f2e	[cherry-pick]Add query type when report (#35918 ) pick #34978	2024-06-11 10:51:59 +08:00
amory	b5a35b9cef	[FIX] Pick array inverted index bugfix (#35837 ) here with some array with inverted index bugfix: see also: https://github.com/apache/doris/pull/34766 https://github.com/apache/doris/pull/35086 https://github.com/apache/doris/pull/34683 https://github.com/apache/doris/pull/34076	2024-06-06 09:54:14 +08:00
amory	fe1a4c4136	[Feature](IP) support ipv4/ipv6 with inverted index and conjuncts for query (#35734 ) support data type ipv4/ipv6 with inverted index and then we can query like "> or < or >= or <= or in/not in " this conjuncts expr for ip with inverted index speeding up	2024-06-03 23:24:03 +08:00
Kaijie Chen	c2fc485327	[fix](auto-partition) fix auto partition load lost data in multi sender (#35287 ) (#35630 ) ## Proposed changes Change `use_cnt` mechanism for incremental (auto partition) channels and streams, it's now dynamically counted. Use `close_wait()` of regular partitions as a synchronize point to make sure all sinks are in close phase before closing any incremental (auto partition) channels and streams. Add dummy (fake) partition and tablet if there is no regular partition in the auto partition table. Backport #35287 Co-authored-by: zhaochangle <zhaochangle@selectdb.com>	2024-05-31 10:27:03 +08:00
Qi Chen	b91d2caab8	[Feature](iceberg-writer) Implements iceberg sink basic functionality for inserting into table. (#35587 ) backport #34929	2024-05-29 16:40:54 +08:00
yiguolei	f38ecd349c	[enhancement](memory) return error if allocate memory failed during add rows method (#35085 ) * return error when add rows failed * f --------- Co-authored-by: yiguolei <yiguolei@gmail.com>	2024-05-22 00:53:34 +08:00
shee	fb28d0b185	[BUG] fix scan range boundary handling is incorrect (#34832 ) fix scan range boundary handling is incorrect Co-authored-by: shizhiqiang03 <shizhiqiang03@meituan.com>	2024-05-21 13:00:50 +08:00
Tiewei Fang	c0fd98abe5	[Fix](tvf) Fix that tvf reading empty files in compressed formats. (#34926 ) 1. Fix the issue with tvf reading empty compressed files. 2. move two test cases (`test_local_tvf_compression` and `test_s3_tvf_compression`) from p2 to p0	2024-05-21 12:59:31 +08:00
abmdocrt	42425808a1	[Cherry-Pick](branch-2.1) Pick "Fix multiple replica partial update auto inc data inconsistency problem #34788 " (#35056 ) * [Fix](auto inc) Fix multiple replica partial update auto inc data inconsistency problem (#34788) * Problem: For tables with auto-increment columns, updating partial columns can cause data inconsistency among replicas. Cause: Previously, the implementation for updating partial columns in tables with auto-increment columns was done independently on each BE (Backend), leading to potential inconsistencies in the auto-increment column values generated by each BE. Solution: Before distributing blocks, determine if the update involves partial columns of a table with an auto-increment column. If so, add the auto-increment column to the last column of the block. After distributing to each BE, each BE will check if the data key for the partial column update exists. If it exists, the previous auto-increment column value is used; if not, the auto-increment column value from the last column of the block is used. This ensures that the auto-increment column values are consistent across different BEs. * 2 * [Fix](regression-test) Fix auto inc partial update unstable regression test (#34940)	2024-05-20 15:43:46 +08:00
xueweizhang	9b5028785d	[fix](prepare) fix datetimev2 return err when binary_row_format (#34662 ) fix datetimev2 return err when binary_row_format. before pr, Backend return datetimev2 alwary by to_string. fix datatimev2 return metadata loss scale.	2024-05-18 18:37:41 +08:00
zzzxl	cc00666be6	[opt](inverted index) add inlist condition handling to compound (#34134 ) 1. Previously, the compound did not support the inlist condition, which could impact performance if an inverted index was created.	2024-05-10 14:35:47 +08:00
wangbo	39fdc9ba0c	[refactor](executor)Rename workload schedule policy #34497	2024-05-08 08:35:20 +08:00
wangbo	299d069da9	Fix alter policy failed (#33910 )	2024-04-22 22:33:24 +08:00
wangbo	03c3419265	[Refactor](executor)Add workload schedule policy table (#33729 )	2024-04-20 20:06:34 +08:00
zclllyybb	25358564ca	[Fix](compile) Fix gcc compile on master (#33864 ) This is imported by #33511. wrongly used ColumnStr<T> (); which violate C++20 standard(see https://wg21.cmeerw.net/cwg/issue2237) but still supported by clang up until now(see llvm/llvm-project#58112)	2024-04-19 23:41:37 +08:00
Mryange	74590e4836	[refine](node) Remove the cse DCHECK from the constructor (#33856 ) It's possible that a failure in the fe caused the check to fail, and at that moment, it may not be possible to retrieve the corresponding query ID from be.out.	2024-04-19 23:41:37 +08:00
HappenLee	1300317723	[Exec](join) Support column string64 to avoid join failed in string size overflow the uint32 (#33511 ) (#33850 )	2024-04-18 19:43:08 +08:00
abmdocrt	06a155abb0	[branch-2.1](cherry-pick) Pick some partial-update PR from master (#33639 ) * [Fix](partial-update) Fix partial update fail when the datetime default value is 'current_time' (#32926) * Problem: When importing data that includes datetime with a default value of current time for partial column updates, the import fails. Reason: Partial column updates do not handle the logic for datetime default values. Solution: During partial column updates, when the default value is set to current time, read the current time from the runtime state and write it into the data. * [Enhancement](partial update)Add timezone case for partial update timestamp #33177 * [fix](partial update) Support partial update when the date default value is 'current_date'. This PR is a extension of PR #32926. (#33394)	2024-04-17 23:42:12 +08:00
Vallish Pai	face7c42fd	[enhancement](plsql) Support select * from routines (#32866 ) Support show of plsql procedure using select * from routines.	2024-04-17 23:42:12 +08:00
zy-kkk	1be753ed75	[enhancement](mysql compatible) add user and procs_priv tables to mysql db in all catalogs (#33058 ) Issue Number: close #xxx This PR aims to enhance the compatibility of BI tools (such as Dbeaver, DataGrip) when using the mysql connector to connect to Doris, because some BI tools query some tables in the mysql database. In our tests, the user and procs_priv tables were mainly queried. This PR adds these two tables and adds actual data to the user table. However, please note that most of the fields in the user table are in Doris' own format rather than mysql format, so it can only ensure that the BI tool is querying No error is reported when accessing these tables, which does not guarantee that the data is completely displayed, and the tables under Doris's mysql database do not support data modification. Thanks to @liujiwen-up for assisting in testing	2024-04-17 23:42:12 +08:00
zclllyybb	3d66723214	[branch-2.1](auto-partition) pick auto partition and some more prs (#33523 )	2024-04-11 17:12:17 +08:00
Mryange	f7d52b5b1c	[feature](expr) add type check when expr prepare (#33330 )	2024-04-11 09:31:50 +08:00
超威老仲	b0b5f84e40	[feature](load) support compressed JSON format data for broker load (#30809 )	2024-04-10 14:20:53 +08:00
Mryange	8e19cdd745	[featrue](expr) support common subexpression elimination be part (#32673 )	2024-04-10 11:56:21 +08:00
wangbo	97a2977f2a	[improvement](executor)Add tag property for workload group #32874	2024-04-10 11:34:29 +08:00
zclllyybb	e574b35833	[Enhancement](partition) Refine some auto partition behaviours (#32737 ) (#33412 ) fix legacy planner grammer fix nereids planner parsing fix cases forbid auto range partition with null column fix CreateTableStmt with auto partition and some partition items. 1 and 2 are about #31585 doc pr: apache/doris-website#488	2024-04-09 15:51:02 +08:00
zy-kkk	fae55e0e46	[Feature](information_schema) add processlist table for information_schema db (#32511 )	2024-04-07 23:24:22 +08:00
Mryange	ad2d20348a	[fix](pipeline) fix use error row desc when origin block clear #32803 (#32849 ) * fix * add case	2024-03-26 20:02:46 +08:00
wangbo	326a264fcd	[Improvement](executor)Add spill property for workload group #32554	2024-03-22 16:38:19 +08:00
Mryange	baf3ae1a93	[refactor](nereids)unify outputTupleDesc and projection be part (#32439 )	2024-03-22 16:35:43 +08:00
zy-kkk	fd0bc720e9	[opt](information_schema) Add DEFAULT_ENCRYPTION column to schemata table (#32501 )	2024-03-22 08:52:16 +08:00
Mingyu Chen	2e564036ef	[fix](profile) avoid update profile in deconstructor (#32131 ) In previous, the counter in `profile` may be updated when close the file reader. And the file reader may be closed when the object being deconstruted. But at that time, the `profile` object may already be deleted, causing NPE and BE will crash. This PR try to fix this issue: 1. Remove the "profile counter update" logic from all `close()` method. 2. Add a new interface `ProfileCollector` It has 2 methods: - `collect_profile_at_runtime()` It can be called at runtime, eg, in every `get_next_block()` method. So that the counter in profile can be updated at runtime. - `collect_profile_before_close()` Should be called before the object call `close()`. And it will only be called once. 3. Derived from `ProfileCollector` All classes which may update the profile counter in `close()` method should extends the `ProfileCollector`. Such as `GenericReader`, etc. And implement `collect_profile_before_close()` And `collect_profile_before_close()` will be called in `scanner->mark_to_need_to_close()`.	2024-03-21 14:07:22 +08:00
zhiqiang	0990014e94	[fix](datetime) fix datetime rounding on BE (#32075 )	2024-03-21 14:07:19 +08:00
Mingyu Chen	ef2151ae66	[Feature-WIP](multi-catalog) Add Hive sink on BE side. (#32306 ) (#32364 ) bp #32306 Co-authored-by: Qi Chen <kaka11.chen@gmail.com>	2024-03-18 11:23:01 +08:00
wangbo	83ab61ad22	Add QUEUE_START_TIME/QUEUE_END_TIME/QUERY_STATUS column for active_queries (#32259 )	2024-03-16 20:53:46 +08:00

1 2 3 4 5 ...

1049 Commits