© 2025 Direct Cursus Technology L.L.C.

High availability of a Managed Service for Apache Kafka® cluster

Written by
Yandex Cloud
Updated at July 9, 2025
  • Number and placement of cluster hosts
    • Single-host cluster
    • Two-host cluster
    • Cluster with three or more hosts
  • Cluster availability during maintenance
  • Other settings
    • See also

High availability of a Managed Service for Apache Kafka® cluster depends on the number and placement of its hosts, topic settings, and other cluster parameters.

Number and placement of cluster hosts

The Service Level Agreement (SLA) defines a high-availability cluster as one made up of three or more broker hosts residing in different availability zones.

Single-host cluster

A single-host cluster is both the cheapest and the easiest to operate. We recommend using it for test clusters or for production apps where high cluster availability is not critical.

Here is why a single-host cluster is not a high-availability solution:

  • If the broker host VM fails, your cluster will be unavailable for reading and writing until the VM is fully recovered.
  • If the broker host switches to read-only mode, your cluster will be unavailable for writing until you manually increase the storage size.

Two-host cluster

For a cluster with two broker hosts, the maximum topic replication factor is two, and the SLA does not apply to such clusters.

Compared to a single-host cluster, a cluster with two hosts offers the following advantages:

  • At the application level, you can balance data reads and writes between the two broker hosts, which can speed up your cluster.
  • You can replicate topic partitions if the topic has a replication factor of 2. This ensures availability if one of the cluster hosts fails.
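The replication limit for small clusters can be illustrated with a minimal sketch. It relies on the general Apache Kafka® rule that each replica of a partition lives on a different broker, so the replication factor cannot exceed the number of broker hosts. The function names are hypothetical and are not part of any Managed Service for Apache Kafka® API.

```python
def validate_replication_factor(broker_count: int, replication_factor: int) -> None:
    """Raise ValueError if the replication factor cannot be satisfied.

    Each replica of a partition is placed on a different broker, so the
    replication factor is capped by the number of broker hosts.
    """
    if broker_count < 1:
        raise ValueError("a cluster needs at least one broker host")
    if replication_factor < 1:
        raise ValueError("replication factor must be at least 1")
    if replication_factor > broker_count:
        raise ValueError(
            f"replication factor {replication_factor} exceeds "
            f"the number of broker hosts ({broker_count})"
        )


def replicas_left_after_failure(replication_factor: int, failed_hosts: int = 1) -> int:
    """Worst-case number of replicas a partition keeps after hosts fail.

    The worst case is a partition that had a replica on every failed host.
    """
    return max(replication_factor - failed_hosts, 0)


if __name__ == "__main__":
    validate_replication_factor(broker_count=2, replication_factor=2)  # accepted
    print(replicas_left_after_failure(replication_factor=2))
    print(replicas_left_after_failure(replication_factor=1))
```

With a replication factor of 2, a partition still has one replica after a single host failure. With a replication factor of 1, it has none.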

To ensure high availability of your cluster under the SLA, you can increase the number of broker hosts.

Cluster with three or more hosts

A cluster with three or more hosts offers reliable storage and continuous data availability if each of the three availability zones has at least one broker host. Such a cluster meets the high availability criteria and is subject to the SLA.
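The placement criterion above can be expressed as a simple check. This is a sketch under stated assumptions: the zone names are placeholders, the function name is hypothetical, and the thresholds (at least three hosts spread over three zones) are taken from the text rather than from any service API.

```python
from collections import Counter


def meets_ha_placement(host_zones: list[str],
                       required_zones: int = 3,
                       min_hosts: int = 3) -> bool:
    """Check the host count and zone spread of a cluster.

    host_zones holds one entry per broker host: the name of the
    availability zone that host resides in.
    """
    hosts_per_zone = Counter(host_zones)
    return len(host_zones) >= min_hosts and len(hosts_per_zone) >= required_zones


if __name__ == "__main__":
    # Placeholder zone names, for illustration only.
    print(meets_ha_placement(["zone-a", "zone-b", "zone-c"]))
    print(meets_ha_placement(["zone-a", "zone-a", "zone-b"]))
```

The first call describes one host per zone and passes the check. The second has three hosts but only two zones, so it does not.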

To qualify for high availability under the SLA, your cluster topics must have the following parameters:

  • Replication factor: 3
  • Minimum number of in-sync replicas: 2

We also recommend specifying the acks=all parameter in the producer configuration. In this case, a write to a topic is considered successful only after Apache Kafka® gets a write confirmation from at least as many broker hosts as specified in the Minimum number of in-sync replicas parameter. For more information, see the Apache Kafka® documentation.
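To show how these settings interact, here is a simplified model of write acceptance. It is not an Apache Kafka® client: the class and function names are hypothetical, and the model only captures the rule that, with acks=all, a write is rejected when fewer replicas are in sync than the minimum number of in-sync replicas. The only real configuration key used is the producer setting acks.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TopicSettings:
    replication_factor: int
    min_insync_replicas: int


def meets_sla_topic_settings(topic: TopicSettings) -> bool:
    """Compare topic settings with the values listed in the text above."""
    return topic.replication_factor == 3 and topic.min_insync_replicas == 2


def write_accepted(acks: str, in_sync_replicas: int, topic: TopicSettings) -> bool:
    """Simplified model of whether a produce request succeeds.

    in_sync_replicas counts the partition leader as well.
    """
    if in_sync_replicas < 1:
        return False  # no leader available for the partition
    if acks == "all":
        # The minimum in-sync replica count is enforced only for acks=all.
        return in_sync_replicas >= topic.min_insync_replicas
    return True  # other acks values only need the leader


# Producer setting recommended in the text; other settings are omitted.
PRODUCER_CONFIG = {"acks": "all"}

if __name__ == "__main__":
    topic = TopicSettings(replication_factor=3, min_insync_replicas=2)
    print(meets_sla_topic_settings(topic))
    # One of three brokers is down: two replicas remain in sync.
    print(write_accepted(PRODUCER_CONFIG["acks"], 2, topic))
    # Two of three brokers are down: only the leader remains.
    print(write_accepted(PRODUCER_CONFIG["acks"], 1, topic))
```

In this model, a topic with a replication factor of 3 and a minimum of 2 in-sync replicas keeps accepting acks=all writes when one broker host is down, and rejects them when two are down, instead of acknowledging a message that is stored on a single host.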

Cluster availability during maintenance

The table below lists the possible cluster availability options based on the number of broker hosts.

| Cluster topology | Availability during maintenance |
|---|---|
| One host | Completely unavailable during reboots or updates. |
| Two hosts | Only the rebooting host is temporarily unavailable. When upgrading the Apache Kafka® version, cluster topics are unavailable if their replication factor is 1. |
| Three hosts | Only the rebooting host is temporarily unavailable. |

Consider the expected load on your cluster when selecting the maintenance window.

Other settings

High availability of your cluster also depends on the following:

  • Storage disk type you selected.
  • Host classes.
  • Quotas and limits.

See also

  • Recommendations on fault tolerance in Yandex Cloud
