doris

Author	SHA1	Message	Date
feifeifeimoon	d17ac99abe	[feature](coverage): refresh the coverage file before exiting the program (#28354 )	2023-12-19 10:54:57 +08:00
yujun	f9ddf8c7ef	[improvement](be report) add be report http (#28424 )	2023-12-19 10:39:19 +08:00
Xinyi Zou	a719d7a222	[fix](memory) Fix LRU Cache of type `NUMBER` charge (#28175 )	2023-12-13 11:15:57 +08:00
Mryange	877935442f	[feature](pipelineX)use markFragments instead of markInstances in pipelineX (#27829 )	2023-12-11 17:59:53 +08:00
Sun Chenyang	573b594df3	[improvement](Variant Type) Support displaying subcolumns expanded for the variant column (#27764 )	2023-12-08 20:34:58 +08:00
Yongqiang YANG	ffa4ea66d5	[enhancement](main) donot coredump when be can not start (#27928 )	2023-12-05 20:11:24 +08:00
plat1ko	1afdbfe723	[enhance](BE) Refactor TaskWorkerPool (#27555 )	2023-12-04 21:46:10 +08:00
Xinyi Zou	d96e2dfefb	[feature-wip](arrow-flight)(step5) Support JDBC and PreparedStatement and Fix Bug (#27661 )	2023-11-29 21:17:20 +08:00
zhiqiang	4633a5c49b	[chore](log) Warning log to trace send fragment #27738	2023-11-29 16:43:25 +08:00
lihangyu	7398c3daf1	[Feature-Variant](Variant Type) support variant type query and index (#27676 )	2023-11-29 10:37:28 +08:00
ShowCode	f565f60bc3	[refactor](standard)BE:Initialize pointer variables in the class to nullptr by default (#27587 )	2023-11-28 13:02:30 +08:00
meiyi	553e4a8903	[feature-wip](merge-on-write) MOW table support different primary keys and sort keys (#24788 )	2023-11-24 16:37:30 +08:00
walter	b457856bd2	[chore](be) remove bthread scanner related codes (#27417 )	2023-11-23 15:18:49 +08:00
Kaijie Chen	b79f5d77f1	[improve](move-memtable) improve logging messages (#27443 )	2023-11-23 11:46:29 +08:00
Gabriel	b1eef30b49	[pipelineX](dependency) Wake up task by dependencies (#26879 ) --------- Co-authored-by: Mryange <2319153948@qq.com>	2023-11-18 03:20:24 +08:00
huanghaibin	5d548935e0	[improvement](insert) support schema change and decommission for group commit (#26359 )	2023-11-17 21:41:38 +08:00
lihangyu	a4d78682ff	[Optimize](point query) clear names to reduce mem consumption and cpu cost related to block column name (#26931 )	2023-11-17 10:18:21 +08:00
Adonis Ling	d2eea9b3ae	[chore](macOS) Reduce the size of executables on macOS arm64 (#26894 ) Like #15641, we should reduce the size of executables on macOS arm64. Otherwise, we can not run doris_be and doris_be_test with ASAN build type on macOS arm64 now.	2023-11-14 12:21:08 +08:00
Xinyi Zou	de6ecd2035	[fix](tls) Manually track memory in Allocator instead of mem hook and ThreadContext life cycle to manual control (#26904 ) Manually track query/load/compaction/etc. memory in Allocator instead of mem hook. Can still use Mem Hook when cannot manually track memory code segments and find memory locations during debugging. This will cause memory tracking loss for Query, loss less than 10% compared to the past, but this is expected to be more controllable. Similarly, Mem Hook will no longer track unowned memory to the orphan mem tracker by default, so the total memory of all MemTrackers will be less than before. Not need to get memory size from jemalloc in Mem Hook each memory alloc and free, which would lose performance in the past. Not require caching bthread local in pthread local for memory hook, in the past this has caused core dumps inside bthread, seems to be a bug in bthread. ThreadContext life cycle to manual control In the past, ThreadContext was automatically created when it was used for the first time (this was usually in the Jemalloc Hook when the first malloc memory), and was automatically destroyed when the thread exited. Now instead of manually controlling the create and destroy of ThreadContext, it is mainly created manually when the task thread start and destroyed before the task thread end. Run 43 clickbench query tests. Use MemHook in the past:	2023-11-14 10:30:42 +08:00
yiguolei	8cf360fff7	[refactor](closure) remove ref count closure using auto release closure (#26718 ) 1. closure should be managed by a unique ptr and released by brpc , should not hold by our code. If hold by our code, we need to wait brpc finished during cancel or close. 2. closure should be exception safe, if any exception happens, should not memory leak. 3. using a specific callback interface to be implemented by Doris's code, we could write any code and doris should manage callback's lifecycle. 4. using a weak ptr between callback and closure. If callback is deconstruted before closure'Run, should not core.	2023-11-12 11:57:46 +08:00
zhannngchen	899630d0eb	[chore](key_util) remove useless null_first parameter (#26635 ) Doris always put null in the first when sorting key, the parameter null_first of encode_keys is useless.	2023-11-10 14:27:47 +08:00
plat1ko	d767804815	[feature](merge-cloud) Decouple rowset id generator and local rowsets gc implementation (#25921 )	2023-11-10 10:07:02 +08:00
zhiqiang	a5565f68b2	[Refactor](opentelemetry) Remove opentelemetry (#26605 )	2023-11-09 18:05:34 +08:00
Kaijie Chen	58bf79f79e	[fix](move-memtable) pass load stream num to backends (#26198 )	2023-11-08 16:16:33 +08:00
meiyi	9502cc758d	[fix](regression) fix group commit regression test (#26557 )	2023-11-08 11:57:07 +08:00
Xinyi Zou	1544110c1b	[feature-wip](arrow-flight)(step4) Support other DML and DDL statements, besides `Select` (#25919 ) Design Documentation Linked to #25514	2023-11-08 10:50:42 +08:00
Kaijie Chen	32b36d3c9c	[refactor](move-memtable) rename proto OpenStreamSink to OpenLoadStream (#26527 )	2023-11-07 22:41:20 +08:00
Jack Drogon	2cc68381ec	[feature](binlog) Add ingest_binlog/http_get_snapshot limit download speed && Add async ingest_binlog (#26323 )	2023-11-06 11:14:44 +08:00
meiyi	b19f275714	[improvement](insert) refactor group commit insert into (#25795 )	2023-11-03 12:02:40 +08:00
TengJianPing	7ba4f91258	[fix](log) avoid redundent log printing (#26188 ) Co-authored-by: yiguolei <676222867@qq.com>	2023-11-01 13:49:08 +08:00
Mingyu Chen	e20cab64f4	[improvement](scan) avoid too many scanners for file scan node (#25727 ) In previous, when using file scan node(eq, querying hive table), the max number of scanner for each scan node will be the `doris_scanner_thread_pool_thread_num`(default is 48). And if the query parallelism is N, the total number of scanner would be 48 * N, which is too many. In this PR, I change the logic, the max number of scanner for each scan node will be the `doris_scanner_thread_pool_thread_num / query parallelism`. So that the total number of scanners will be up to `doris_scanner_thread_pool_thread_num`. Reduce the number of scanner can significantly reduce the memory usage of query.	2023-10-29 17:41:31 +08:00
Kaijie Chen	6a85f46ff3	[refactor](move-memtable) rename open_stream_sink rpc to open_load_stream (#25883 )	2023-10-29 10:07:14 +08:00
wangbo	1ba8a9bae4	[feature-wip](executor)Fe send topic info to be (#25798 )	2023-10-26 15:52:48 +08:00
Kaijie Chen	69c3e08699	[improve](load) set load id in signal handler (#25813 )	2023-10-24 21:58:39 +08:00
zhannngchen	0c8bce4292	[fix](partial update) fix some bugs about delete sign (#25712 )	2023-10-24 14:33:33 +08:00
lihangyu	76fd566403	[Improve](topn opt) change `multiget_data` RPC worker pool from `_heavy_work_pool` to `_light_work_pool` (#25741 ) Under some workload the `multiget_data` maybe stuck and raise rpc timeout. Inorder to to minimize the latency of point queries, so it should be placed in `_light_work_pool`, since it's typically finished in hundreds of millisecond	2023-10-24 14:07:52 +08:00
zhiqiang	87b414cdae	[Fix](query execution) Fix result sink fragment can't be cancelled in non-pipeline (#25524 )	2023-10-24 11:30:29 +08:00
plat1ko	9c9fc84f39	[feature](merge-cloud) Abstract BaseTablet for CloudTablet (#24929 )	2023-10-18 20:29:04 +08:00
lihangyu	b0e0a0569a	[Fix](row store) Real default value should be used instead of default… (#25230 ) Before this PR the default value is not correct, we should use default value in Frontend schema.	2023-10-18 10:13:44 +08:00
bobhan1	1514f78b87	[refactor](partial-update) Split partial update infos from tablet schema (#25147 )	2023-10-17 14:21:40 +08:00
AlexYue	59ebbb351e	[feature](merge-cloud) Enable write into cache when uploading file to s3 using s3 file writer (#24364 )	2023-10-16 21:31:02 +08:00
meiyi	e514d52232	[fix](point-query) Support mow table with sequence column (#25308 )	2023-10-11 18:22:16 +08:00
zzzzzzzs	6fe060b79e	[fix](streamload) fix http_stream retry mechanism (#24978 ) If a failure occurs, doris may retry. Due to ctx->is_read_schema is a global variable that has not been reset in a timely manner, which may cause exceptions. --------- Co-authored-by: yiguolei <676222867@qq.com>	2023-10-08 11:16:21 +08:00
bobhan1	642e5cdb69	[Fix](Status) Make `Status` `[[nodiscard]]` and handle returned `Status` correctly (#23395 )	2023-09-29 22:38:52 +08:00
zhengyu	d23bedf170	[fix](single-replica-load) fix duplicated done run in request_slave_tablet_pull_rowset (#25013 ) BE will crash because done run twice when try_offer() failed in request_slave_tablet_pull_rowset. Signed-off-by: freemandealer <freeman.zhang1992@gmail.com>	2023-09-28 21:08:18 +08:00
Lijia Liu	864a0f9bcb	[opt](pipeline) Make pipeline fragment context send_report asynchronized (#23142 )	2023-09-28 17:55:53 +08:00
Xinyi Zou	87a30dc41d	[feature-wip](arrow-flight)(step3) Support authentication and user session (#24772 )	2023-09-27 14:53:58 +08:00
yujun	8679095e5c	[feature](debug) support debug point used in debug code (#24502 )	2023-09-25 17:56:12 +08:00
Yongqiang YANG	8eb14eec7c	[enhancement](baddisk) record bad disk in be_custom.conf to handle (#24639 )	2023-09-21 18:31:58 +08:00
Jack Drogon	ee5b307e63	[Fix](binlog) Add more log for ingest_binlog && Fix ingest_binlog not rewrite rowset_meta tablet_uid (#24617 ) * Add more log for ingest_binlog && Fix ingest_binlog not rewrite rowset_meta tablet_uid Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com> * Add lost thrift TDebugProtocol Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com> --------- Signed-off-by: Jack Drogon <jack.xsuperman@gmail.com>	2023-09-21 10:40:08 +08:00

1 2 3 4 5 ...

408 Commits