Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • AI for business
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Yandex Data Processing
  • Getting started
    • All guides
      • Information about existing clusters
      • Creating a cluster
      • Creating an autoscaling cluster in another folder's network
        • Connecting to a cluster
        • Configuring security groups
        • Getting an FQDN of a host
        • Connecting to a host over SSH
        • Connecting to a host via OS Login
        • Connecting to a host from graphical IDEs
      • Updating a cluster
      • Migrating a lightweight cluster to a different availability zone
      • Deleting a cluster
    • Creating and using Python virtual environments
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Public materials
  • FAQ
  1. Step-by-step guides
  2. Yandex Data Processing clusters
  3. Connection
  4. Connecting to a host from graphical IDEs

Connecting to a Yandex Data Processing host from graphical IDEs

Written by
Yandex Cloud
Updated at September 25, 2025

You can connect to a Yandex Data Processing cluster using graphical IDEs.

Before connecting:

  1. Configure security groups.
  2. Get the host FQDN.

Connect using graphical IDEsConnect using graphical IDEs

Connections were tested in the following environment:

  • Ubuntu 20.04, DBeaver: 22.2.4
  • MacOS Monterey 12.7:
    • JetBrains DataGrip: 2023.3.4
    • DBeaver Community: 24.0.0

To use graphical IDEs, save a certificate to a local folder and specify the path to it in the connection settings.

DataGrip
DBeaver
  1. Create a data source:
    1. Select File → New → Data Source → Apache Hive.

      Note

      Select the data source depending on the Yandex Data Processing component you are connecting to:

      • Hive: Select Apache Hive.
      • Spark: Select Apache Spark.

      The list of settings does not change.

    2. Specify the connection settings on the General tab:

      • Host: FQDN of the cluster master host or its public IP address.
      • If connecting for the first time, click Download to download the connection driver.
    3. On the SSH/SSL tab:

      1. Enable the Use SSL setting and specify the SSL connection settings:
        • CA file: Downloaded SSL certificate for the connection.
        • Client key file, Client key password: File with the private key required to connect to the Yandex Data Processing cluster and its password.
      2. Optionally, to connect via a jump host VM, configure the SSH tunnel settings:
        1. Select Use SSH tunnel, create an SSH configuration, and specify these settings:
          • Host: VM IP address.
          • User name: VM username.
          • Private key file, Passphrase: Private key file required to connect to the VM and its password.
        2. Click Test Connection to test the connection to the VM from DataGrip.
        3. Click OK to save the configuration.
  2. Click Test Connection. If the connection is successful, you will get the OK connection status and information about the DBMS and driver.
  3. Click OK to save the data source.
  1. Download the SSH key to the local machine or VM to connect to a Yandex Data Processing cluster.
  2. Create a new DB connection:
    1. From the Database menu, select New connection.

    2. Select a data source from the DB list depending on the configuration of the Yandex Data Processing cluster you are connecting to:

      • If the cluster uses Hive, select Apache Hive.
      • If only Spark is enabled in the cluster and the Thrift server is enabled, select Apache Spark.

      The list of connection settings remains the same regardless of the selected data source.

    3. Click Next.

    4. On the SSH tab, enable the Use SSH tunnel setting and specify these settings:

      • Host/IP: FQDN (to connect via a jump host VM) or public IP address of the master host.
      • Username: Enter the username:
        • For version 2.0: ubuntu.
        • For version 1.4: root.
      • Authentication method: Public key.
      • Secret key: Path to the cluster’s private key file.
      • Passphrase: Private key password.
      • Optionally, to connect via a jump host VM, enable the Use jump server setting and specify the settings:
        • Host/IP: Public IP address of the VM for connection.
        • Username: Username for connecting to the VM.
        • Authentication method: Public key.
        • Secret key: Path to the VM’s private key file.
        • Passphrase: Private key password.
  3. Click Test Connection .... If the connection is successful, you will see the connection status and information about the DBMS and driver.
  4. Click Ready to save the database connection settings.

Was the article helpful?

Previous
Connecting to a host via OS Login
Next
Updating a cluster
© 2025 Direct Cursus Technology L.L.C.