doris

Author	SHA1	Message	Date
hui lai	423483ed8f	[branch-2.1](routine-load) optimize out of range error message (#37391 ) ## Proposed changes pick #36450 before ``` ErrorReason{code=errCode = 105, msg='be 10002 abort task, task id: d846f3d3-7c9e-44a7-bee0-3eff8cd11c6f job id: 11310 with reason: [INTERNAL_ERROR]Offset out of range, 0# doris::Status doris::Status::Error<6, true>(std::basic_string_view<char, std::char_traits<char> >) at /mnt/disk1/laihui/doris/be/src/common/status.h:422 1# doris::Status doris::Status::InternalError<true>(std::basic_string_view<char, std::char_traits<char> >) at /mnt/disk1/laihui/doris/be/src/common/status.h:468 2# doris::KafkaDataConsumer::group_consume(doris::BlockingQueue<RdKafka::Message>, long) at /mnt/disk1/laihui/doris/be/src/runtime/routine_load/data_consumer.cpp:226 3# doris::KafkaDataConsumerGroup::actual_consume(std::shared_ptr<doris::DataConsumer>, doris::BlockingQueue<RdKafka::Message>, long, std::function<void (doris::Status const&)>) at /mnt/disk1/laihui/doris/be/src/runtime/routine_load/data_consumer_group.cpp:200 4# void std::__invoke_impl<void, void (doris::KafkaDataConsumerGroup::&)(std::shared_ptr<doris::DataConsumer>, doris::BlockingQueue<RdKafka::Message>, long, std::function<void (doris::Status const&)>), doris::KafkaDataConsumerGroup&, std::shared_ptr<doris::DataConsumer>&, doris::BlockingQueue<RdKafka::Message>&, long&, doris::KafkaDataConsumerGroup::start_all(std::shared_ptr<doris::StreamLoadContext>, std::shared_ptr<doris::io::KafkaConsumerPipe>)::$_0&>(std::__invoke_memfun_deref, void (doris::KafkaDataConsumerGroup::&)(std::shared_ptr<doris::DataConsumer>, doris::BlockingQueue<RdKafka::Message>, long, std::function<void (doris::Status const&)>), doris::KafkaDataConsumerGroup&, std::shared_ptr<doris::DataConsumer>&, doris::BlockingQueue<RdKafka::Message>&, long&, doris::KafkaDataConsumerGroup::start_all(std::shared_ptr<doris::StreamLoadContext>, std::shared_ptr<doris::io::KafkaConsumerPipe>)::$_0&) at 
/mnt/disk1/laihui/build/ldb_toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/invoke.h:74 ... ``` now ``` ErrorReason{code=errCode = 105, msg='be 10002 abort task, task id: 3ba0c0f4-d13c-4dfa-90ce-3df922fd9340 job id: 11310 with reason: [INTERNAL_ERROR]Offset out of range, consume partition 0, consume offset 100, the offset used by job does not exist in kafka, please check the offset, using the Alter ROUTINE LOAD command to modify it, and resume the job'} ```	2024-07-07 18:29:04 +08:00
HHoflittlefish777	7ba66c5890	[branch-2.1](routine-load) do not schedule task when there is no data (#34654 )	2024-05-11 11:01:18 +08:00
HHoflittlefish777	e0ec2da29b	[fix](routine-load) fix get kafka offset timeout that may be too long (#33502 )	2024-04-17 23:42:12 +08:00
HHoflittlefish777	6610307eb0	[opt](routine-load) end Kafka consume when meets partition EOF #32046	2024-03-12 18:50:06 +08:00
Yongqiang YANG	bc020112fc	[enhancement](routineload) add debug conf and set broker.name.ttl = 0 (#23302 ) * set broker.name.ttl = 0 * add debug config for librdkafka	2023-08-26 10:56:35 +08:00
Adonis Ling	e412dd12e8	[chore](build) Use include-what-you-use to optimize includes (PART II) (#18761 ) Currently, there are some useless includes in the codebase. We can use a tool named include-what-you-use to optimize these includes. By using a strict include-what-you-use policy, we can get lots of benefits from it.	2023-04-19 23:11:48 +08:00
yiguolei	03a4fe6f39	[enhancement](streamload) make stream load context as shared ptr and save it in global load mgr (#16996 )	2023-02-24 11:15:29 +08:00
Pxl	f50edff59d	[Chore](build) enable fallthrough check and fix some fallthrough bug (#16748 ) * enable fallthrough check and fix some fallthrough bug * fix * fix	2023-02-15 15:58:43 +08:00
Zhengguo Yang	98cdeed6e0	[chore](routine load) remove deprecated property of librdkafka reconnect.backoff.jitter.ms #15172	2022-12-20 10:13:56 +08:00
plat1ko	f3aea7f0f0	[Enhancement](status) Unify error code and enable customed err msg for BE internal errors (#14744 )	2022-12-11 23:33:18 +08:00
caiconghui	1a173a854e	[fix](routine-load) Fix that routine load cannot work with old kafka version (#10554 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-07-04 10:47:50 +08:00
Tiewei Fang	c9f86bc7e2	[refactor] Refactoring Status static methods to format message using fmt(#9533 )	2022-07-02 18:58:23 +08:00
chenlinzhong	c9961c9bb9	[style] clang-format all c++ code (#9305 ) - sh build-support/clang-format.sh to clang-format all c++ code	2022-04-29 16:14:22 +08:00
caiconghui	df3a8545dc	[fix](routine_load) Add retry mechanism for routine load task which encounter Broker transport failure (#9067 )	2022-04-20 14:49:58 +08:00
Zhengguo Yang	f3817829bb	[fix] fix malloc and free mismatch issue (#7702 ) Memory allocated by `malloc` should be freed by `free`	2022-01-14 09:32:33 +08:00
caiconghui	83f6eef506	[improvement](routine-load) Make routine load work with old kafka version (#7630 ) Co-authored-by: caiconghui1 <caiconghui1@jd.com>	2022-01-10 17:30:24 +08:00
Zhengguo Yang	760fc02bfe	Added brpc stub cache check and reset api, used to test whether the brpc stub cache is available, and reset the brpc stub cache (#6916 ) Add a config used for auto check and reset of the brpc stub	2021-11-05 09:45:37 +08:00
Mingyu Chen	5ef3f59928	[Optimize][RoutineLoad] Avoid sending tasks if there is no data to be consumed (#6805 ) 1. Avoid sending tasks if there is no data to be consumed, by fetching the latest offset of each partition before sending tasks (fixes "[Optimize] Avoid too many abort task in routine load job", #6803 ). 2. Add a preCheckNeedSchedule phase in update() of routine load, to avoid holding the write lock of the job for a long time while getting all kafka partitions from the kafka server. 3. Upgrade librdkafka's version to 1.7.0 to fix the "Local: Unknown partition" bug (see "offsetsForTimes fails with 'Local: Unknown partition'", edenhill/librdkafka#3295 ). 4. Avoid unnecessary storage migration tasks if that storage medium does not exist on the BE (fixes "[Bug] Too many unnecessary storage migration tasks", #6804 ).	2021-10-13 11:39:01 +08:00
Mingyu Chen	2c208e932b	[Bug][RoutineLoad] Avoid TOO_MANY_TASKS error (#6342 ) Use `commitAsync` to commit offsets to kafka instead of `commitSync`, which may block for a long time. Also assign a group.id to the routine load job if the user has not specified the "property.group.id" property, so that all consumers of this job use the same group.id instead of a random id for each consume task.	2021-08-03 11:59:06 +08:00
crazyleeyang	8b4721c941	[Bug] Fix kafka consumer reuse bug (#6007 ) When judging whether consumer can be reused, it is necessary to judge whether the parameter content is equal.	2021-06-16 09:39:05 +08:00
Mingyu Chen	07ad038870	[Feature][RoutineLoad] Support for consuming kafka from the point of time (#5832 ) Support when creating a kafka routine load, start consumption from a specified point in time instead of a specific offset. eg: ``` FROM KAFKA ( "kafka_broker_list" = "broker1:9092,broker2:9092", "kafka_topic" = "my_topic", "property.kafka_default_offsets" = "2021-10-10 11:00:00" ); or FROM KAFKA ( "kafka_broker_list" = "broker1:9092,broker2:9092", "kafka_topic" = "my_topic", "kafka_partitions" = "0,1,2", "kafka_offsets" = "2021-10-10 11:00:00, 2021-10-10 11:00:00, 2021-10-10 12:00:00" ); ``` This PR also reconstructed the analysis method of properties when creating or altering routine load jobs, and unified the analysis process in the `RoutineLoadDataSourceProperties` class.	2021-05-22 23:37:53 +08:00
stdpain	5fed34fcfe	[optimize] provide a better defer operator (#5706 )	2021-05-12 10:37:23 +08:00
Zhengguo Yang	93a4c7efc1	[LOG] Standardize the use of VLOG in code (#5264 ) At present, the application of vlog in the code is quite confusing. It is inherited from impala VLOG_XX format, and there is also VLOG(number) format. VLOG(number) format does not have a unified specification, so this pr standardizes the use of VLOG	2021-01-21 12:09:09 +08:00
sduzh	6fedf5881b	[CodeFormat] Clang-format cpp sources (#4965 ) Clang-format all c++ source files.	2020-11-28 18:36:49 +08:00
Mingyu Chen	796f44beac	[Bug] Fix bug that routine load blocked with TOO_MANY_TASKS error (#4861 ) When receiving empty msg from kafka, the load process will quit abnormally. Fix #4860	2020-11-12 10:05:10 +08:00
Zhengguo Yang	09f97f8a05	[Refactor] Fixes some be typo part 2 (#4747 )	2020-10-20 09:28:57 +08:00
LingBin	569d0bb3af	Replace all remaining boost::split() with strings::split() (#2302 )	2019-11-26 22:22:14 +08:00
yiguolei	f852f50acb	Improve unique id performance (#1911 ) Remove the default constructor for UniqueID Add a gen_uid method in UniqueId. If need to generate a new uid, users should call this api explicitly. Reuse boost random generator not generate a new one every time.	2019-09-29 18:20:02 +08:00
HangyuanLiu	235cdb0ecd	Commit kafka offset (#1734 ) Commit kafka offset in routine load. Kafka will decide whether to delete data based on whether all consumer groups have committed their offsets. If no offset is committed, the kafka server disk may become full	2019-09-10 14:27:06 +08:00
ZHAO Chun	9d03ba236b	Uniform Status (#1317 )	2019-06-14 23:38:31 +08:00
Mingyu Chen	ff0dd0d2da	Support SSL authentication with Kafka in routine load job (#1235 )	2019-06-07 16:29:01 +08:00
HangyuanLiu	9d19c6c315	Support arbitrary kafka properties (#1204 )	2019-05-28 10:03:50 +08:00
Mingyu Chen	722a9e71c7	Optimize json functions (#1177 ) 1. get_json_xxx() now supports using quotes to escape dots 2. Implement json_path_prepare() function to preprocess json_path. Running time of get_json_string() on 1000000 rows drops from 2.27s to 0.27s	2019-05-21 09:13:12 +08:00
Mingyu Chen	cf1e7aa844	Add close tablet writer log (#1014 )	2019-04-28 10:33:50 +08:00
Mingyu Chen	2b4d02b2fa	Add error load log url for routine load job (#938 )	2019-04-28 10:33:50 +08:00
Mingyu Chen	400d8a906f	Optimize the consumer assignment of Kafka routine load job (#870 ) 1. Use a data consumer group to share a single stream load pipe among multiple data consumers. This increases the consuming speed of Kafka messages and reduces the number of tasks of a routine load job. Test results: * 1 consumer, 1 partition: consume time: 4.469s, rows: 990140, bytes: 128737139. 221557 rows/s, 28M/s * 1 consumer, 3 partitions: consume time: 12.765s, rows: 2000143, bytes: 258631271. 156689 rows/s, 20M/s blocking get time(us): 12268241, blocking put time(us): 1886431 * 3 consumers, 3 partitions: consume time(all 3): 6.095s, rows: 2000503, bytes: 258631576. 328220 rows/s, 42M/s blocking get time(us): 1041639, blocking put time(us): 10356581 The last 2 cases show that we can achieve higher speed by adding more consumers. But the bottleneck then shifts from the Kafka consumer to Doris ingestion, so 3 consumers in a group is enough. I also add a Backend config `max_consumer_num_per_group` to change the number of consumers in a data consumer group; the default value is 3. In my test (1 Backend, 2 tablets, 1 replica), 1 routine load task can achieve 10M/s, which is the same as raw stream load. 2. Add OFFSET_BEGINNING and OFFSET_END support for Kafka routine load	2019-04-28 10:33:50 +08:00
Mingyu Chen	9d08be3c5f	Add metrics for routine load (#795 ) * Add metrics for routine load * limit the max number of routine load task in backend to 10 * Fix bug that some partitions will not be assigned	2019-04-28 10:33:50 +08:00
Mingyu Chen	8d2de42b36	Fix some routine load bugs (#787 ) 1. Preserve the column order in load stmt. 2. Fix some replay bugs of routine load task.	2019-04-28 10:33:50 +08:00
Mingyu Chen	9fa5e1b768	Add a cleaner bg thread to clean idle data consumer (#776 )	2019-04-28 10:33:50 +08:00
Mingyu Chen	8f781f95c7	Add persist operations for routine load job (#754 )	2019-04-28 10:33:50 +08:00
EmmyMiao87	8b52787114	Stream load with no data will abort txn (#735 ) 1. stream load executor will abort txn when no correct data in task 2. change txn label to DebugUtil.print(UUID) which is same as task id printed by be 3. change print uuid to hi-lo	2019-04-28 10:33:50 +08:00
Mingyu Chen	8474061d63	Add some logs (#711 )	2019-04-28 10:33:50 +08:00
Mingyu Chen	567d5de2de	Add a data consumer pool to reuse the data consumer (#691 )	2019-04-28 10:33:50 +08:00
Mingyu Chen	20b2b2c37f	Modify interface (#684 ) 1. Add batch submit interface 2. Add Kafka Event callback to catch Kafka events	2019-04-28 10:33:50 +08:00
Mingyu Chen	9618d20a72	Add unit test (#675 )	2019-04-28 10:33:50 +08:00
Mingyu Chen	0820a29b8d	Implement the routine load process of Kafka on Backend (#671 )	2019-04-28 10:33:50 +08:00

46 Commits