© 2025 Direct Cursus Technology L.L.C.
Yandex Managed Service for YDB

Autoscaling

Written by
Yandex Cloud
Updated at October 20, 2025

Note

Autoscaling is only available for the OLTP mode.

In dedicated mode, Managed Service for YDB supports autoscaling of computing resources.

Autoscaling adjusts the number of database nodes as the load changes, so you can handle increased load and reduce costs during low-load periods. Yandex Managed Service for YDB bases scaling decisions on CPU utilization, i.e., the percentage of CPU cores in use across all computing pools.

To use autoscaling, set the target CPU utilization, averaged across the database nodes, and the minimum and maximum number of nodes. Yandex Managed Service for YDB will try to keep CPU utilization at the target level by adjusting the number of database nodes within these limits. Newly added nodes have identical configurations and are evenly distributed across availability zones.
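The three settings described above can be pictured as a small validated record. The following is a minimal sketch for illustration only: `AutoscalingSettings`, its field names, and the `clamp` helper are hypothetical and are not part of the Managed Service for YDB API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AutoscalingSettings:
    """Hypothetical container for the three autoscaling inputs (not a YDB API type)."""

    target_cpu_percent: float  # target CPU utilization, averaged across database nodes
    min_nodes: int             # lower limit on the number of database nodes
    max_nodes: int             # upper limit on the number of database nodes

    def __post_init__(self) -> None:
        # Reject values that cannot describe a meaningful scaling range.
        if not 0 < self.target_cpu_percent <= 100:
            raise ValueError("target_cpu_percent must be in (0, 100]")
        if self.min_nodes < 1:
            raise ValueError("min_nodes must be at least 1")
        if self.max_nodes < self.min_nodes:
            raise ValueError("max_nodes must be >= min_nodes")

    def clamp(self, node_count: int) -> int:
        """Keep a proposed node count within the configured limits."""
        return max(self.min_nodes, min(self.max_nodes, node_count))


settings = AutoscalingSettings(target_cpu_percent=50, min_nodes=3, max_nodes=9)
print(settings.clamp(12))  # a proposal above the maximum is capped at max_nodes
```

Whatever node count the service considers, the result always stays between the minimum and the maximum, which is what the `clamp` helper models here.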

Note

For reliable and stable performance, a database should have more than one node and maintain CPU utilization below 60%.

Example

Your database is deployed with six compute nodes, and you need to maintain CPU utilization at 50% under a fluctuating load. This lets the system handle traffic spikes without allocating too many resources. To achieve this, autoscaling is configured as follows: a target CPU utilization of 50%, a minimum of three nodes, and a maximum of nine. With this configuration, the system automatically increases the number of compute nodes when CPU utilization exceeds 50% to handle higher loads. Conversely, when CPU utilization drops below 50%, the number of nodes is scaled down to optimize costs during periods of lower load. At all times, the total number of nodes remains between three and nine.
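To make the example concrete, here is a rough sketch of how a node count might follow the load. The article does not describe the actual scaling algorithm, so the proportional rule below (resize so that the same total CPU work lands near the target utilization) is an assumption, and `desired_nodes` is a hypothetical helper rather than a service function.

```python
import math


def desired_nodes(current_nodes: int, avg_cpu_percent: float,
                  target_cpu_percent: float, min_nodes: int, max_nodes: int) -> int:
    """Assumed proportional rule: spread the current CPU work so the average nears the target."""
    raw = math.ceil(current_nodes * avg_cpu_percent / target_cpu_percent)
    # The result never leaves the configured [min_nodes, max_nodes] range.
    return max(min_nodes, min(max_nodes, raw))


# Settings from the example: six nodes now, 50% target, between three and nine nodes.
for cpu in (20, 50, 70, 90):
    print(f"avg CPU {cpu}% -> {desired_nodes(6, cpu, 50, 3, 9)} nodes")
# avg CPU 20% -> 3 nodes
# avg CPU 50% -> 6 nodes
# avg CPU 70% -> 9 nodes
# avg CPU 90% -> 9 nodes
```

Under this assumed rule, a load at the 50% target leaves the six nodes unchanged, a light load shrinks the database to the three-node minimum, and heavy loads are capped at nine nodes even when the proportional estimate is higher.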
