Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • AI Studio
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Service page
Yandex MetaData Hub
Documentation
Yandex MetaData Hub
  • About Yandex MetaData Hub
    • Getting started
      • Creating a Metastore cluster
      • Configuring security groups
      • Connecting Yandex Data Processing to Metastore
      • Exporting and importing a Metastore cluster
      • Connecting to a Yandex Object Storage bucket with a bucket policy
      • Transferring logs from Metastore to Cloud Logging
      • Deleting a Metastore cluster
    • Service roles for access management
    • Troubleshooting
    • Audit Trails events
  • Access management
  • Pricing policy
  • Public materials
  • Release notes
  1. Hive Metastore
  2. Step-by-step guides
  3. Connecting Yandex Data Processing to Metastore

Connecting Yandex Data Processing to Metastore

Written by
Yandex Cloud
Updated at March 6, 2025

Note

To use a Metastore cluster, your Yandex Data Processing cluster must contain the SPARK and YARN components.

Management console
  1. Create a Metastore cluster.

  2. When creating or updating a Yandex Data Processing cluster, specify the following property:

    spark:spark.hive.metastore.uris : thrift://<Metastore_cluster_IP_address>:9083
    

    To find out the Metastore cluster IP address, select Yandex MetaData Hub in the management console and then select the Metastore page in the left-hand panel. Copy the IP address column value for the cluster.

  3. If the Metastore cluster and Yandex Data Processing cluster are hosted in different cloud networks, set up routing between these cloud networks so that the Metastore subnet is accessible from the Yandex Data Processing subnet.

    There are multiple ways to configure routing. For example, you can create an IPsec tunnel.

  4. If the cloud network uses security groups, set up the security group of the Yandex Data Processing cluster to work with Metastore. To do this, add the following rule for outgoing traffic:

    • Port range: 9083
    • Protocol: Any (Any)
    • Source: CIDR
    • CIDR blocks: 0.0.0.0/0

For an example of using Yandex Data Processing with a Metastore cluster connected, see the Shared use of Yandex Data Processing tables through Metastore tutorial.

Was the article helpful?

Previous
Configuring security groups
Next
Exporting and importing a Metastore cluster
© 2025 Direct Cursus Technology L.L.C.