Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
    • Yandex Cloud Partner program
  • Blog
  • Pricing
  • Documentation
© 2025 Direct Cursus Technology L.L.C.
Yandex Data Processing
  • Getting started
  • Access management
  • Pricing policy
    • Overview
      • Overview
      • add-labels
      • create
      • delete
      • get
      • list
      • list-hosts
      • list-operations
      • list-ui-links
      • remove-labels
      • start
      • stop
      • update
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Public materials
  • FAQ
  1. CLI reference
  2. cluster
  3. create

yc dataproc cluster create

Written by
Yandex Cloud
Updated at January 31, 2025

Create a cluster.

Command UsageCommand Usage

Syntax:

yc dataproc cluster create <CLUSTER-NAME> [Flags...] [Global Flags...]

FlagsFlags

Flag Description
--name string
Name of the cluster.
--description string
Description of the cluster.
--zone string
ID of the availability zone where the cluster resides.
--service-account-id string
Service account id.
--service-account-name string
Service account name.
--version string
Version of an image for cluster provisioning.
--services List of used services in a cluster. If no service is specified, the default set will be used.
A list can be specified by listing services separated by commas as well as passing this flag multiple times.
Available services are: 'hdfs', 'yarn', 'mapreduce', 'hive', 'tez', 'zookeeper', 'hbase', 'sqoop', 'flume', 'spark', 'zeppelin'.
--property value[,value]
Properties passed to all hosts *-site.xml configurations in <service>:<property>=<value> format.
For example setting property 'dfs.replication' to 3 in /etc/hadoop/conf/hdfs-site.xml requires specifying --property "hdfs:dfs.replication=3"
This flag can be passed multiple times.
--ssh-public-keys-file string
Specify a list of SSH public keys that will be set as authorized keys to root user on cluster hosts.
--subcluster PROPERTY=VALUE[,PROPERTY=VALUE...]
Specifies options for subclusters. Can be passed multiple times.

Possible property names:
  • name: Name of a subcluster.
  • role: Role of a subcluster Values: 'masternode', 'datanode', 'computenode'
  • resource-preset: Preset of computational resources available to a host
  • disk-type: Type of the storage environment for a host.
  • disk-size: Amount of disk storage available to a host in GB.
  • subnet-name: Name of the subnet that subcluster's hosts should be created in.
  • subnet-id: ID of the subnet that subcluster's hosts should be created in.
  • hosts-count: Specifies a number of hosts in a subcluster. (Minimum number of hosts for autoscaling compute subcluster)
  • assign-public-ip: Assigns public Ip for all hosts in a subcluster.
  • max-hosts-count: Specifies a maximum number of hosts for autoscaling compute subcluster.
  • preemptible: Enables VMs preemption for autoscaling compute subcluster.
  • warmup-duration: Specifies a warmup duration for autoscaling compute subcluster.
  • stabilization-duration: Specifies a stabilization duration for autoscaling compute subcluster.
  • measurement-duration: Specifies a measurement duration for autoscaling compute subcluster.
  • cpu-utilization-target: Specifies a CPU utilization threshold. In percents (10-100). When this threshold is passed the subcluster nodes number will grow.
  • autoscaling-decommission-timeout: Specifies a decommission timeout (in seconds) for nodes during automatic downscaling.
--initialization-action PROPERTY=VALUE[,PROPERTY=VALUE...]
Specifies initialization action. Can be passed multiple times.

Possible property names:
  • uri: URI of an initialization action.
  • args: Arguments for initialization action.
  • timeout: Specifies initialization action execution timeout (in seconds).
--bucket string
Object Storage bucket to be used for Data Proc jobs that are run in the cluster.
--labels key=value[,key=value...]
A list of cluster labels as key-value pairs.
--ui-proxy Whether to enable UI Proxy feature.
--security-group-ids value[,value]
A list of security groups for the Data Proc cluster.
--host-group-ids value[,value]
A list of host groups to place VMs of cluster on.
--deletion-protection Deletion Protection inhibits deletion of the cluster.
--log-group-id string
Id of a log group to write cluster logs to.
--environment string
Cluster environment. Values: production, prestable.
--async Display information about the operation in progress, without waiting for the operation to complete.

Global FlagsGlobal Flags

Flag Description
--profile string
Set the custom configuration file.
--debug Debug logging.
--debug-grpc Debug gRPC logging. Very verbose, used for debugging connection problems.
--no-user-output Disable printing user intended output to stderr.
--retry int
Enable gRPC retries. By default, retries are enabled with maximum 5 attempts.
Pass 0 to disable retries. Pass any negative value for infinite retries.
Even infinite retries are capped with 2 minutes timeout.
--cloud-id string
Set the ID of the cloud to use.
--folder-id string
Set the ID of the folder to use.
--folder-name string
Set the name of the folder to use (will be resolved to id).
--endpoint string
Set the Cloud API endpoint (host:port).
--token string
Set the OAuth token to use.
--impersonate-service-account-id string
Set the ID of the service account to impersonate.
--no-browser Disable opening browser for authentication.
--format string
Set the output format: text (default), yaml, json, json-rest.
--jq string
Query to select values from the response using jq syntax
-h,--help Display help for the command.

Was the article helpful?

Previous
add-labels
Next
delete
© 2025 Direct Cursus Technology L.L.C.