Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
  • Blog
  • Pricing
  • Documentation
Yandex project
© 2025 Yandex.Cloud LLC
Yandex Data Processing
  • Getting started
    • Resource relationships
    • Runtime environment
    • Yandex Data Processing component interfaces and ports
    • Jobs in Yandex Data Processing
    • Spark jobs
    • Automatic scaling
    • Decommissioning subclusters and hosts
    • Networking in Yandex Data Processing
    • Maintenance
    • Quotas and limits
    • Storage in Yandex Data Processing
    • Component properties
    • Apache Iceberg™ in Yandex Data Processing
    • Delta Lake in Yandex Data Processing
    • Logs in Yandex Data Processing
    • Initialization scripts
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Public materials
  • FAQ
  1. Concepts
  2. Decommissioning subclusters and hosts

Decommissioning Yandex Data Processing subclusters and hosts

Written by
Yandex Cloud
Updated at February 18, 2025

Decommissioning is the process of reducing the cluster capacity (the number of hosts and their class) without stopping the cluster and interrupting its workload. Decommissioning is supported for Yandex Data Processing clusters version 1.2 and higher.

Yandex Data Processing implements decommissioning based on YARN and HDFS. Decommissioning will not interrupt running user tasks or cause data to be lost.

You can specify a decommissioning timeout for YARN clusters. In this case, the cluster will wait for the current operations to complete, but not longer than the specified time. Without a timeout, subcluster hosts will terminate immediately. The hosts being decommissioned will not perform new operations or accept new data.

The decommissioning duration depends on the timeout and the time spent modifying a cluster. The maximum timeout is 24 hours. The maximum duration of cluster operations is 1 hour.

YARN subcluster resources are decommissioned when:

  • Changing the host class.
  • Increasing the disk size.
  • Reducing the number of hosts in data processing subclusters.

HDFS subcluster resources are decommissioned if the number of hosts in data storage subclusters is reduced.

If the hosts must be rebooted for a cluster to update:

  1. The hosts to update or delete are added to YARN's excluded list.
  2. No new jobs are run on the hosts from the excluded list. As the running jobs are completed, the hosts are updated and restarted.
  3. If jobs are not completed before the decommissioning timeout, they are aborted and the host is updated and restarted.
  4. After restarting, the hosts are removed from the excluded list.

When all hosts change their status to Alive, decommissioning is considered complete.

For more information about YARN subcluster decommissioning, see the Apache Hadoop documentation.

Was the article helpful?

Previous
Automatic scaling
Next
Networking in Yandex Data Processing
Yandex project
© 2025 Yandex.Cloud LLC