© 2026 Direct Cursus Technology L.L.C.
Building a data platform based on Yandex Cloud

Written by Yandex Cloud
Updated on January 19, 2026
  • Apache Kafka®
  • Apache Airflow™
  • ClickHouse®
  • Greenplum®
  • MongoDB/Yandex StoreDoc
  • MySQL®
  • OpenSearch
  • PostgreSQL
  • Valkey™
  • YDB
  • Yandex Cloud DNS
  • Yandex Data Processing
    • Basic examples of working with jobs
    • Advanced examples of working with jobs
  • Yandex Query
  • Yandex Data Streams
  • Yandex Data Transfer
  • Yandex Managed Service for Apache Spark™

Apache Kafka®

  • Unassisted deployment of the Apache Kafka® web interface
  • Upgrading a Managed Service for Apache Kafka® cluster to migrate from ZooKeeper to KRaft
  • Migrating a database from a third-party Apache Kafka® cluster to Yandex Managed Service for Apache Kafka®
  • Moving data between Managed Service for Apache Kafka® clusters using Data Transfer
  • Delivering data from Yandex Managed Service for MySQL® to Yandex Managed Service for Apache Kafka® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for MySQL® to Yandex Managed Service for Apache Kafka® using Debezium
  • Delivering data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for Apache Kafka® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for Apache Kafka® using Debezium
  • Delivering data from Yandex Managed Service for YDB to Yandex Managed Service for Apache Kafka® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for ClickHouse® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex StoreDoc using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for MySQL® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for OpenSearch using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for PostgreSQL using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for YDB using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Data Streams using Yandex Data Transfer
  • Delivering data from a Data Streams queue to Managed Service for Apache Kafka® using Yandex Data Transfer
  • Configuring Kafka Connect to work with a Yandex Managed Service for Apache Kafka® cluster
  • Syncing data from Apache Kafka® topics to an Object Storage bucket without using the internet
  • Using a schema registry with Yandex Managed Service for Apache Kafka®:
    • Managing data schemas in Managed Service for Apache Kafka®
    • Using Managed Schema Registry with Yandex Managed Service for Apache Kafka®
    • Using Managed Schema Registry with Yandex Managed Service for Apache Kafka® through the REST API
    • Using Confluent Schema Registry with Yandex Managed Service for Apache Kafka®
  • Monitoring message loss in an Apache Kafka® topic

Apache Airflow™

  • Automating Yandex Query tasks with Yandex Managed Service for Apache Airflow™
  • Sending requests to the Yandex Cloud API via the Yandex Cloud Python SDK
  • Configuring an SMTP server to send e-mail notifications
  • Running a PySpark job using Yandex Managed Service for Apache Airflow™

ClickHouse®

  • Adding data to ClickHouse®
  • Migrating data to Managed Service for ClickHouse® using ClickHouse®
  • Migrating data to Yandex Managed Service for ClickHouse® using Yandex Data Transfer
  • Migrating databases from MySQL® to ClickHouse® using Yandex Data Transfer
  • Asynchronously replicating data from PostgreSQL to ClickHouse®
  • Exchanging data between Yandex Managed Service for ClickHouse® and Yandex Data Processing
  • Configuring Yandex Managed Service for ClickHouse® for Graphite
  • Fetching data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for ClickHouse®
  • Delivering data to ksqlDB
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for ClickHouse® using Yandex Data Transfer
  • Fetching data from RabbitMQ to Yandex Managed Service for ClickHouse®
  • Saving a Yandex Data Streams data stream in Yandex Managed Service for ClickHouse®
  • Using hybrid storage in Yandex Managed Service for ClickHouse®
  • Sharding Yandex Managed Service for ClickHouse® tables
  • Loading data from Yandex Direct to a Yandex Managed Service for ClickHouse® data mart using Yandex Cloud Functions, Yandex Object Storage, and Yandex Data Transfer
  • Loading data from Yandex Object Storage to Yandex Managed Service for ClickHouse® using Yandex Data Transfer
  • Copying data from Managed Service for OpenSearch to Managed Service for ClickHouse® using Yandex Data Transfer
  • Loading data from Yandex Managed Service for YDB to Yandex Managed Service for ClickHouse® using Yandex Data Transfer
  • Migrating databases from Google BigQuery to Yandex Managed Service for ClickHouse®
  • Yandex Managed Service for ClickHouse® integration with external Microsoft SQL Server database via ClickHouse® JDBC Bridge
  • Yandex Managed Service for ClickHouse® integration with Oracle via ClickHouse® JDBC Bridge

Greenplum®

  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Migrating data from Yandex Managed Service for MySQL® to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Migrating databases from Greenplum® to ClickHouse®
  • Migrating databases from Greenplum® to PostgreSQL
  • Exporting Greenplum® data to cold storage in Yandex Object Storage
  • Loading data from Yandex Object Storage to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Copying data from Managed Service for OpenSearch to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Creating an external table from a Yandex Object Storage bucket table using a configuration file
  • Getting data from external sources using named queries

MongoDB/Yandex StoreDoc

  • Migrating collections from MongoDB to Yandex StoreDoc
  • Migrating data to Yandex StoreDoc
  • Migrating a Yandex StoreDoc cluster from version 4.4 to 6.0 using Yandex Data Transfer
  • Sharding Yandex StoreDoc collections
  • Performance analysis and tuning of Yandex StoreDoc
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex StoreDoc using Yandex Data Transfer

MySQL®

  • Migrating a database from a third-party MySQL® cluster to a Yandex Managed Service for MySQL® cluster
  • Managed Service for MySQL® performance analysis and tuning
  • Syncing data from a third-party MySQL® cluster to Yandex Managed Service for MySQL® using Yandex Data Transfer
  • Migrating databases from Yandex Managed Service for MySQL® to a third-party MySQL® cluster
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for MySQL® using Yandex Data Transfer
  • Migrating databases from MySQL® to ClickHouse® using Yandex Data Transfer
  • Migrating databases from Yandex Managed Service for MySQL® to Yandex Object Storage
  • Migrating data from Yandex Object Storage to Yandex Managed Service for MySQL® using Yandex Data Transfer
  • MySQL® change data capture and delivery to YDS
  • Migrating data from Managed Service for MySQL® to Managed Service for PostgreSQL using Data Transfer
  • Migrating data from Yandex Managed Service for MySQL® to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Importing data from Yandex Managed Service for MySQL® to Yandex Data Processing using Sqoop
  • Delivering data from Yandex Managed Service for MySQL® to Yandex Managed Service for Apache Kafka® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for MySQL® to Yandex Managed Service for Apache Kafka® using Debezium
  • Migrating databases from Yandex Managed Service for MySQL® to Yandex Managed Service for YDB using Yandex Data Transfer

OpenSearch

  • Configuring an index policy in Yandex Managed Service for OpenSearch
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for OpenSearch using Yandex Data Transfer
  • Migrating data from a third-party OpenSearch cluster to Yandex Managed Service for OpenSearch using Yandex Data Transfer
  • Loading data from Yandex Managed Service for OpenSearch to Yandex Object Storage using Yandex Data Transfer
  • Migrating data from Yandex Managed Service for OpenSearch to Yandex Managed Service for YDB using Yandex Data Transfer
  • Copying data from Managed Service for OpenSearch to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Copying data from Managed Service for OpenSearch to Managed Service for ClickHouse® using Yandex Data Transfer
  • Migrating data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for OpenSearch using Yandex Data Transfer
  • Authenticating a Yandex Managed Service for OpenSearch cluster in OpenSearch Dashboards using Keycloak
  • Using the yandex-lemmer plugin in Yandex Managed Service for OpenSearch

PostgreSQL

  • Creating a PostgreSQL cluster for 1C:Enterprise
  • Searching for Managed Service for PostgreSQL cluster performance issues
  • Managed Service for PostgreSQL performance analysis and tuning
  • Logical replication in PostgreSQL
  • Migrating a database from a third-party PostgreSQL cluster to Managed Service for PostgreSQL
  • Migrating a database from Managed Service for PostgreSQL
  • Asynchronously replicating data from PostgreSQL to ClickHouse®
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for PostgreSQL using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for Apache Kafka® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for Apache Kafka® using Debezium
  • Importing data from Yandex Managed Service for PostgreSQL to Yandex Data Processing using Sqoop
  • Delivering data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for YDB using Yandex Data Transfer
  • Migrating databases from Managed Service for PostgreSQL to Object Storage
  • Migrating data from Yandex Object Storage to Yandex Managed Service for PostgreSQL using Yandex Data Transfer
  • Migrating data from Managed Service for PostgreSQL to Managed Service for MySQL® using Data Transfer
  • PostgreSQL change data capture and delivery to YDS
  • Migrating data from AWS RDS for PostgreSQL to Yandex Managed Service for PostgreSQL using Yandex Data Transfer
  • Migrating data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for OpenSearch using Yandex Data Transfer
  • Fixing string sorting issues in PostgreSQL after upgrading glibc

Valkey™

  • Migrating a database from a third-party Valkey™ cluster to Yandex Managed Service for Valkey™
  • Using a Yandex Managed Service for Valkey™ cluster as a PHP session storage

YDB

  • Delivering data from Yandex Managed Service for YDB to Yandex Managed Service for Apache Kafka® using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for Apache Kafka® to Yandex Managed Service for YDB using Yandex Data Transfer
  • Migrating databases from Yandex Managed Service for MySQL® to Yandex Managed Service for YDB using Yandex Data Transfer
  • Delivering data from Yandex Managed Service for PostgreSQL to Yandex Managed Service for YDB using Yandex Data Transfer
  • Loading data from Yandex Object Storage to Yandex Managed Service for YDB using Yandex Data Transfer
  • Loading data from Yandex Managed Service for YDB to Yandex Object Storage using Yandex Data Transfer
  • Loading data from Yandex Managed Service for YDB to Yandex Managed Service for ClickHouse® using Yandex Data Transfer
  • Migrating data from Yandex Managed Service for OpenSearch to Yandex Managed Service for YDB using Yandex Data Transfer
  • Delivering data from a Data Streams queue to Managed Service for YDB using Yandex Data Transfer
  • Yandex Managed Service for YDB change data capture and delivery to Yandex Data Streams

Yandex Cloud DNS

  • Configuring Yandex Cloud DNS to access a Yandex Managed Service for ClickHouse® cluster from other cloud networks

Yandex Data Processing

  • Migrating an HDFS Yandex Data Processing cluster to a different availability zone
  • Exchanging data between Yandex Managed Service for ClickHouse® and Yandex Data Processing
  • Importing data from Yandex Managed Service for MySQL® to Yandex Data Processing using Sqoop
  • Importing data from Yandex Managed Service for PostgreSQL to Yandex Data Processing using Sqoop
  • Mounting Yandex Object Storage buckets to the file system of Yandex Data Processing hosts
  • Working with Apache Kafka® topics using PySpark jobs in Yandex Data Processing
  • Automating operations with Yandex Data Processing using Yandex Managed Service for Apache Airflow™
  • Shared use of Yandex Data Processing tables through Apache Hive™ Metastore
  • Transferring metadata between Yandex Data Processing clusters using Apache Hive™ Metastore
  • Importing data from Yandex Object Storage, processing and exporting to Yandex Managed Service for ClickHouse®

Basic examples of working with jobs

  • Working with Hive jobs
  • Working with MapReduce jobs
  • Working with PySpark jobs
  • Working with Spark jobs

Advanced examples of working with jobs

  • Running Apache Hive jobs
  • Launching and managing applications for Spark and PySpark
  • Running jobs from remote hosts that are not part of the Yandex Data Processing cluster

Yandex Query

  • Processing Yandex Audit Trails events
  • Processing Yandex Cloud Logging logs
  • Processing Debezium CDC streams
  • Analyzing data with Jupyter
  • Processing files with usage details in Yandex Cloud Billing

Yandex Data Streams

  • Ingesting data into storage systems
  • Smart log processing
  • Data transfer in microservice architectures
  • Migrating data to Yandex Object Storage using Yandex Data Transfer

Yandex Data Transfer

  • Migrating data from a third-party Greenplum® or PostgreSQL cluster to Yandex MPP Analytics for PostgreSQL using Yandex Data Transfer
  • Migrating MongoDB clusters
  • Migrating MySQL® clusters
  • Migrating to a third-party MySQL® cluster
  • Migrating PostgreSQL clusters
  • Creating a schema registry to deliver data in Debezium CDC format from Apache Kafka®

Yandex Managed Service for Apache Spark™

  • Automating operations using Yandex Managed Service for Apache Airflow™
  • Working with an Object Storage table from a PySpark job using Apache Hive™ Metastore and Apache Iceberg™
  • Yandex Managed Service for Apache Spark™ integration with Apache Hive™ Metastore
  • Running a PySpark job using Yandex Managed Service for Apache Airflow™
  • Using Yandex Object Storage in Yandex Managed Service for Apache Spark™

ClickHouse® is a registered trademark of ClickHouse, Inc.
