Yandex project
© 2025 Yandex.Cloud LLC
Yandex MetaData Hub

Connecting Yandex Data Processing to Metastore

Written by
Yandex Cloud
Updated on March 6, 2025

Note

To use a Metastore cluster, your Yandex Data Processing cluster must contain the SPARK and YARN components.

Management console
  1. Create a Metastore cluster.

  2. When creating or updating a Yandex Data Processing cluster, specify the following property:

    spark:spark.hive.metastore.uris : thrift://<Metastore_cluster_IP_address>:9083
    

    To find the Metastore cluster IP address, open Yandex MetaData Hub in the management console and select the Metastore page in the left-hand panel. Copy the cluster's value from the IP address column.

  3. If the Metastore cluster and Yandex Data Processing cluster are hosted in different cloud networks, set up routing between these cloud networks so that the Metastore subnet is accessible from the Yandex Data Processing subnet.

    There are multiple ways to configure routing. For example, you can create an IPsec tunnel.

  4. If the cloud network uses security groups, configure the security group of the Yandex Data Processing cluster to allow connections to Metastore. To do this, add the following rule for outgoing traffic:

    • Port range: 9083
    • Protocol: Any
    • Destination: CIDR
    • CIDR blocks: 0.0.0.0/0
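
The property from step 2 and the network requirements from steps 3 and 4 can be checked with a short script. The following is a minimal sketch in Python, not part of the official guide: the helper names `metastore_property` and `is_reachable` and the address `10.0.0.5` are hypothetical, and the sketch assumes the Metastore cluster has an IPv4 address. It only builds the property string and tests whether a TCP connection to port 9083 can be opened; it does not talk to the Yandex Cloud API.

```python
import ipaddress
import socket

METASTORE_PORT = 9083  # Thrift port used in the property from step 2


def metastore_property(ip: str) -> tuple[str, str]:
    """Return the (key, value) pair for the cluster property from step 2.

    Hypothetical helper: it only formats the string shown in the guide.
    """
    # Reject anything that is not a valid IPv4 address (raises ValueError).
    ipaddress.IPv4Address(ip)
    return "spark:spark.hive.metastore.uris", f"thrift://{ip}:{METASTORE_PORT}"


def is_reachable(host: str, port: int = METASTORE_PORT, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout.

    A False result suggests that routing (step 3) or the outgoing
    security group rule (step 4) is not yet in place.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    ip = "10.0.0.5"  # hypothetical Metastore cluster IP address
    key, value = metastore_property(ip)
    print(f"{key} : {value}")
```

Running `is_reachable("<Metastore_cluster_IP_address>")` from a host in the Yandex Data Processing subnet gives a quick indication of whether port 9083 is open along the path. A successful TCP connection does not by itself confirm that the Metastore service is healthy.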

For an example of using Yandex Data Processing with a Metastore cluster connected, see the Shared use of Yandex Data Processing tables through Metastore tutorial.
