Replies: 1 comment 4 replies
-
Are you actually using Strimzi? |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Description
After restarting the Kafka Connect application, Mongo connectors become available (appear in the Kafka Connect UI) after a prolonged period, ranging from 5 to 15 minutes. Connectors become available simultaneously, and there are no subsequent issues with their operation.
Environment Details
-Xmx4096m -Xms2048m
Kafka Distributed Properties
Observations
Based on monitoring data, there are no network delays; CPU and Memory resources appear to be sufficient, indicating the application is not resource-bound.
During the researching on this problem the second test environment was set up. Restart on this stand occurs almost instantly and connectors become available rather fast.
Configuration settings between environments are identical. The only diff is the size of the
kafka-connect-offsets
system topic, which is automatically created by Kafka Connect. The topic has 25 partitions by default. Newly created stand has the topic’s size in MBs, but original stand has kafka-connect-offset topic’s size from 0.8Gb to 2.4Gb (depends on compaction time)Please check the screen with 30 days statistics on the kafka-connect-offset topic's size
Kafka Connect Offsets Topic Settings
Compaction Observations
Despite compaction occurring periodically, the first offset remains at 0, and the total number of messages continues to grow despite the size of the topic decreasing. There are numerous duplicates, especially with heartbeat messages.
Typical Heartbeat Message Structure
Key:
Value:
Every 10 seconds, each of the 200 connectors sends this message, resulting in 1200 messages per minute. The topic continues to grow, leading me to suspect there may be an issue with compaction, as the size is decreasing while the total message count remains unchanged. Also, IDK. Is it normal, that the first offset is still equal 0?
Resume
What strategies can we employ to reduce the restart time of the Kafka Connect application?
We suspect that the delays are due to Kafka Connect handling a high volume of messages from the topic. Increasing resources hasn’t noticeably improved the restart speed.
Has anyone else faced similar challenges or have any optimization tips?
Any assistance would be greatly appreciated
Beta Was this translation helpful? Give feedback.
All reactions