Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML Services
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Yandex Virtual Private Cloud
  • Getting started
    • All tutorials
      • Architecture and protection of a basic web service
      • DHCP settings for interfacing with a corporate DNS server
      • Setting up networking between resources from different folders
      • Implementing fault-tolerant scenarios for NAT VMs
      • Configuring a network for Yandex Data Processing
      • Network connection switching during Yandex Data Processing cluster recreation
      • Migrating an HDFS Yandex Data Processing cluster to a different availability zone
      • Deploying an Always On availability group with an internal network load balancer
  • DDoS Protection
  • Access management
  • Pricing policy
  • Terraform reference
  • Audit Trails events
  • Release notes
  • FAQ

In this article:

  • Required paid resources
  • Create resources
  • Delete the resources you created
  1. Tutorials
  2. Configuring network resources and interaction between them
  3. Configuring a network for Yandex Data Processing

Configuring a network for Yandex Data Processing

Written by
Yandex Cloud
Updated at September 25, 2025
  • Required paid resources
  • Create resources
  • Delete the resources you created

In this tutorial, you will learn how to create a Yandex Data Processing cluster and set up subnets and a NAT gateway.

Required paid resourcesRequired paid resources

The support cost includes:

  • Yandex Data Processing cluster fee: using VM computing resources and Compute Cloud network disks, and Cloud Logging for log management (see Yandex Data Processing pricing).
  • Fee for a NAT gateway (see Virtual Private Cloud pricing).
  • Fee for an Object Storage bucket: data storage and operations (see Object Storage pricing).

Create resourcesCreate resources

Manually
Terraform
  1. Create a network named data-proc-network with the Create subnets option disabled.

  2. In data-proc-network, create a subnet with the following settings:

    • Name: data-proc-subnet-a
    • Availability zone: ru-central1-a
    • CIDR: 192.168.1.0/24
  3. Create a NAT gateway and a route table named data-proc-route-table in data-proc-network. Associate the table with data-proc-subnet-a.

  4. In data-proc-network, create a security group named data-proc-security-group with the following rules:

    • One rule for incoming and another one for outgoing service traffic:

      • Port range: 0-65535
      • Protocol: Any
      • Source/Destination name: Security group
      • Security group: Current
    • Rule for outgoing HTTPS traffic:

      • Port range: 443
      • Protocol: TCP
      • Destination name: CIDR
      • CIDR blocks: 0.0.0.0/0
    • Rule that allows access to NTP servers for time syncing:

      • Port range: 123
      • Protocol: UDP
      • Destination name: CIDR
      • CIDR blocks: 0.0.0.0/0

    Note

    You can configure additional security group rules to connect to cluster hosts.

  5. Create a service account named data-proc-sa with the following roles:

    • dataproc.agent
    • dataproc.provisioner
    • storage.uploader
    • storage.viewer
  6. Create a Yandex Object Storage bucket with restricted access.

  7. Create a Yandex Data Processing cluster in any suitable configuration with the following settings:

    • Service account: data-proc-sa.
    • Bucket ID format: List.
    • Bucket name: Select the bucket you created earlier.
    • Network: data-proc-network.
    • Security groups: data-proc-security-group.
  1. If you do not have Terraform yet, install it and configure the Yandex Cloud provider.

  2. Get the authentication credentials and specify the Yandex Cloud provider installation source (see Configure your provider, Step 1).

  3. Download the cluster configuration file to the same working directory.

    This file describes:

    • Network.
    • Subnet.
    • NAT gateway and route table.
    • Security group.
    • Service account to work with cluster resources.
    • Service account for bucket management.
    • Static access key required to grant the service account permissions for the bucket.
    • Bucket to store job dependencies and results.
    • Yandex Data Processing cluster.

    Note

    You can configure additional security group rules to connect to cluster hosts.

  4. In the configuration file, specify all the relevant parameters.

  5. Run the terraform init command in the working directory with the configuration files. This command initializes the provider specified in the configuration files and enables you to use its resources and data sources.

  6. Make sure the Terraform configuration files are correct using this command:

    terraform validate
    

    Terraform will show any errors found in your configuration files.

  7. Create the required infrastructure:

    1. Run this command to view the intended changes:

      terraform plan
      

      If you described the configuration correctly, the terminal will display a list of the resources to update and their parameters. This is a verification step that does not apply changes to your resources.

    2. If everything looks correct, apply the changes:

      1. Run this command:

        terraform apply
        
      2. Confirm updating the resources.

      3. Wait for the operation to complete.

All the resources you need will be created in the specified folder. You can check the new resources and their settings using the management console.

Delete the resources you createdDelete the resources you created

Some resources are not free of charge. To avoid paying for them, delete the resources you no longer need:

Manually
Terraform
  1. Delete the Yandex Data Processing cluster.
  2. If you reserved public static IP addresses, release and delete them.
  3. Delete the subnet.
  4. Delete the route table.
  5. Delete the NAT gateway.
  6. Delete the network.
  1. In the terminal window, go to the directory containing the infrastructure plan.

    Warning

    Make sure the directory has no Terraform manifests with the resources you want to keep. Terraform deletes all resources that were created using the manifests in the current directory.

  2. Delete resources:

    1. Run this command:

      terraform destroy
      
    2. Confirm deleting the resources and wait for the operation to complete.

    All the resources described in the Terraform manifests will be deleted.

Was the article helpful?

Previous
Implementing fault-tolerant scenarios for NAT VMs
Next
Network connection switching during Yandex Data Processing cluster recreation
© 2025 Direct Cursus Technology L.L.C.