Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • AI for business
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Yandex Data Processing
  • Getting started
    • Resource relationships
    • Runtime environment
    • Yandex Data Processing component interfaces and ports
    • Yandex Data Processing jobs
    • Spark jobs
    • Autoscaling
    • Decommissioning of subclusters and hosts
    • Networking in Yandex Data Processing
    • Maintenance
    • Quotas and limits
    • Storage in Yandex Data Processing
    • Component properties
    • Apache Iceberg™ in Yandex Data Processing
    • Delta Lake in Yandex Data Processing
    • Yandex Data Processing logs
    • Initialization scripts
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Public materials
  • FAQ
  1. Concepts
  2. Decommissioning of subclusters and hosts

Decommissioning Yandex Data Processing subclusters and hosts

Written by
Yandex Cloud
Updated at February 18, 2025

Decommissioning is the process of reducing the cluster capacity (the number of hosts and their class) without stopping the cluster and interrupting its workload. Decommissioning is supported for Yandex Data Processing clusters version 1.2 and higher.

Yandex Data Processing implements decommissioning based on YARN and HDFS. Decommissioning will not interrupt running user tasks or cause data to be lost.

You can specify a decommissioning timeout for YARN clusters. In this case, the cluster will wait for the current operations to complete, but not longer than the specified time. Without a timeout, subcluster hosts will terminate immediately. The hosts being decommissioned will not perform new operations or accept new data.

The decommissioning duration depends on the timeout and the time spent modifying a cluster. The maximum timeout is 24 hours. The maximum duration of cluster operations is 1 hour.

YARN subcluster resources are decommissioned when:

  • Changing the host class.
  • Increasing the disk size.
  • Reducing the number of hosts in data processing subclusters.

HDFS subcluster resources are decommissioned if the number of hosts in data storage subclusters is reduced.

If the hosts must be rebooted for a cluster to update:

  1. The hosts to update or delete are added to YARN's excluded list.
  2. No new jobs are run on the hosts from the excluded list. As the running jobs are completed, the hosts are updated and restarted.
  3. If jobs are not completed before the decommissioning timeout, they are aborted and the host is updated and restarted.
  4. After restarting, the hosts are removed from the excluded list.

When all hosts change their status to Alive, decommissioning is considered complete.

For more information about YARN subcluster decommissioning, see the Apache Hadoop documentation.

Was the article helpful?

Previous
Autoscaling
Next
Networking in Yandex Data Processing
© 2025 Direct Cursus Technology L.L.C.