General questions about Managed Service for Apache Kafka®
How do I ensure my cluster is fault tolerant?
Make sure it meets all fault tolerance conditions.
What do I do if I get an error like "disk size must be at least ... according to topics partitions number and replication factor, but size is ..."?
This error means that the log segments of your topics require more space than is available in broker storage. For more information about calculating the required space, see Minimum storage size.
To fix the issue, increase the disk size or reduce the log segment size for the topic or for the entire cluster.
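As a rough illustration of why the partition count and replication factor appear in the error message, the sketch below estimates the space that log segments may require. The formula (two segments per partition replica) is an assumption made for this example, and the function and parameter names are hypothetical; check the Minimum storage size page for the exact rule the service applies.

```python
GIB = 2**30


def min_storage_bytes(segment_bytes, partitions, replication_factor,
                      segments_per_replica=2):
    """Estimate the storage needed for topic log segments, in bytes.

    Assumption (not confirmed by this FAQ): each partition replica needs
    room for `segments_per_replica` log segments of `segment_bytes` each.
    """
    return segments_per_replica * segment_bytes * partitions * replication_factor


# Example: 1 GiB segments, 100 partitions in total, replication factor 3.
required = min_storage_bytes(1 * GIB, partitions=100, replication_factor=3)
print(required, "bytes, or", required // GIB, "GiB")
```

Under this assumption, halving the segment size halves the estimate, which is why reducing the segment size is an alternative to increasing the disk size.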
Can I get logs of my operations with services?
Yes, you can request log records about your resources from Yandex Cloud services. For more information, see Data requests.
What is the retention period for logs?
Cluster logs are stored for 30 days.
How do I move a cluster to a different network?
You cannot move a Managed Service for Apache Kafka® cluster to another network.
Here is an alternative solution:
- Create a new cluster with the same configuration in the target network.
- Use MirrorMaker to move the topics of the source cluster to the new one.
How do I set up an alert that triggers as soon as a certain percentage of disk space has been used up?
Create an alert with the disk.used_bytes metric in Yandex Monitoring. This metric shows the disk space usage in the Managed Service for Apache Kafka® cluster.
For disk.used_bytes, use notification thresholds. The recommended values are as follows:
- Alarm: 90% of the disk space
- Warning: 80% of the disk space
Thresholds are set in bytes only. For example, the recommended values for a 100 GB disk are as follows:
- Alarm: 96636764160 bytes (90%)
- Warning: 85899345920 bytes (80%)
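Since thresholds must be entered in bytes, a small helper can convert a disk size and the recommended percentages into the values to type in. This is a minimal sketch: the function name is hypothetical, and it assumes 1 GB = 2^30 bytes, which is the convention that reproduces the 100 GB figures above.

```python
GIB = 2**30  # the 100 GB example above corresponds to 100 * 2**30 bytes


def disk_thresholds(disk_size_gb, alarm_pct=90, warning_pct=80):
    """Return the alarm and warning thresholds for disk.used_bytes, in bytes."""
    total_bytes = disk_size_gb * GIB
    return {
        "alarm": total_bytes * alarm_pct // 100,
        "warning": total_bytes * warning_pct // 100,
    }


print(disk_thresholds(100))
# {'alarm': 96636764160, 'warning': 85899345920}
```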
Why is the cluster slow even though the computing resources are not used fully?
The maximum storage IOPS and bandwidth may be insufficient to handle the current number of requests. In this case, throttling is triggered and the performance of the entire cluster degrades.
The maximum IOPS and bandwidth values increase by a fixed value when the storage size increases by a certain step. The step and increment values depend on the disk type:
Disk type | Step, GB | Max IOPS increase (read/write) | Max bandwidth increase (read/write), MB/s |
---|---|---|---|
network-hdd | 256 | 300/300 | 30/30 |
network-ssd | 32 | 1,000/1,000 | 15/15 |
network-ssd-nonreplicated | 93 | 28,000/5,600 | 110/82 |
To increase the maximum IOPS and bandwidth values and make throttling less likely, increase the storage size.
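To get a feel for how storage size affects these limits, the step table can be turned into a rough estimate. This is a sketch under stated assumptions, not the service's actual formula: it counts only whole steps, starts from zero, and ignores any upper caps a disk type may have. The function name and return structure are hypothetical.

```python
# Disk type -> (step in GB, IOPS increase (read, write),
#               bandwidth increase in MB/s (read, write)), from the table above.
STEPS = {
    "network-hdd": (256, (300, 300), (30, 30)),
    "network-ssd": (32, (1000, 1000), (15, 15)),
    "network-ssd-nonreplicated": (93, (28000, 5600), (110, 82)),
}


def estimate_limits(disk_type, size_gb):
    """Estimate max IOPS and bandwidth for a given storage size.

    Assumptions: only whole steps count, and there is no upper cap.
    """
    step_gb, iops, bandwidth = STEPS[disk_type]
    steps = size_gb // step_gb
    return {
        "iops_read": steps * iops[0],
        "iops_write": steps * iops[1],
        "mbps_read": steps * bandwidth[0],
        "mbps_write": steps * bandwidth[1],
    }


# Doubling a network-ssd disk from 128 GB to 256 GB doubles the estimate.
print(estimate_limits("network-ssd", 128))
print(estimate_limits("network-ssd", 256))
```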
Can I connect to cluster hosts via SSH or get superuser permissions on hosts?
You cannot connect to hosts via SSH, nor can you get superuser permissions. This is done for the sake of security and user cluster fault tolerance because direct changes inside a host can render it completely inoperable.