Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
  • Blog
  • Pricing
  • Documentation
Yandex project
© 2025 Yandex.Cloud LLC
Yandex Managed Service for Kubernetes
  • Comparison with other Yandex Cloud services
  • Getting started
    • Resource relationships
    • Release channels and updates
    • Encryption
      • Cluster autoscaler
      • Evicting pods from nodes
      • Dynamic resource allocation for a node
      • Node groups with GPUs
    • Networking in Managed Service for Kubernetes
    • Network settings and cluster policies
    • Automatic scaling
    • Audit policy
    • External cluster nodes
    • Quotas and limits
    • Managed Service for Kubernetes usage recommendations
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Release notes

In this article:

  • How the autoscaler works
  • Autoscaling features
  • Use cases
  1. Concepts
  2. Node group
  3. Cluster autoscaler

Node group autoscaling in Managed Service for Kubernetes

Written by
Yandex Cloud
Updated at May 5, 2025
  • How the autoscaler works
  • Autoscaling features
  • Use cases

The cluster autoscaler resizes the node group based on resource requests for the pods running on the nodes of that group.

How the autoscaler works

When creating a node group, you specify the minimum and maximum number of nodes in the group, and the Kubernetes cluster will check the status of pods and nodes from time to time:

  • If pods cannot be assigned due to a lack of resources (vCPU, RAM) on the existing nodes, the number of nodes in the group will gradually increase to the specified maximum number.
  • If the load on the nodes is insufficient, and all pods can be assigned with fewer nodes per group, the number of nodes per group will gradually decrease to the specified minimum number.

Autoscaling features

You can resize an autoscaling group using Kubernetes cluster-autoscaler, but there are many non-trivial features. In particular:

  • If you can distribute the load in a cluster to a smaller number of nodes, but you cannot evict pods from the nodes for some reason, the number of nodes in a group will not decrease.
  • The actual number of nodes in a group may be below the minimum size of a node group. This may occur if:
    • Autoscaling group was scaled up from its minimum size after being created.
    • Actual cluster load does not require more nodes than the cluster currently has.
  • You can place an autoscaling node group only in one availability zone.

Tip

If unexpected behavior occurs, we recommend referring to the Kubernetes cluster-autoscaler FAQ.

Note

When calculating the current limits and quotas, Managed Service for Kubernetes uses the specified maximum node group size as its actual size, regardless of the current group size.

The node group autoscaler is different from Horizontal Pod Autoscaler. The Horizontal Pod Autoscaler uses a controller to periodically request resource usage based on the metrics defined in each Kubernetes API HorizontalPodAutoscaler object. You can use both mechanisms in parallel.

Use cases

  • DNS autoscaling based on cluster size

Was the article helpful?

Previous
Service
Next
Evicting pods from nodes
Yandex project
© 2025 Yandex.Cloud LLC