Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • AI for business
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Yandex Data Transfer
  • Available transfers
  • Getting started
    • All guides
    • Preparing for a transfer
      • Managing endpoints
      • Migrating endpoints to a different availability zone
        • Source
        • Target
    • Managing transfer process
    • Working with databases during transfer
    • Monitoring transfer status
  • Troubleshooting
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • Public materials

In this article:

  • Scenarios for transferring data to YTsaurus using Yandex Data Transfer
  • Configuring the data source
  • Preparing the target database
  • Configuring the YTsaurus target endpoint
  • Supported data delivery schemas and limits
  • Managed Service for YTsaurus cluster
  • Advanced settings
  1. Step-by-step guides
  2. Configuring endpoints
  3. YTsaurus
  4. Target

Transferring data to a YTsaurus target endpoint

Written by
Yandex Cloud
Updated at November 1, 2025
  • Scenarios for transferring data to YTsaurus using Yandex Data Transfer
  • Configuring the data source
  • Preparing the target database
  • Configuring the YTsaurus target endpoint
    • Supported data delivery schemas and limits
    • Managed Service for YTsaurus cluster
    • Advanced settings

Yandex Data Transfer enables you to migrate data to YTsaurus and implement various data transfer, processing, and transformation scenarios.

There are two types of target endpoints available for YTsaurus:

  • YTSaurus Dynamic: Writing data to dynamic tables.
  • YTSaurus Static: Writing data to static tables.

To implement a transfer:

  1. Explore possible data transfer scenarios.
  2. Configure one of the supported data sources.
  3. Configure the target endpoint in Yandex Data Transfer.
  4. Create a transfer and start it.
  5. Perform the required operations with the database and see how the transfer is going.

Scenarios for transferring data to YTsaurus using Yandex Data TransferScenarios for transferring data to YTsaurus using Yandex Data Transfer

You can implement scenarios for loading data from tables into Yandex Cloud managed databases for storage in the cloud, processing, and loading into data marts for visualization.

For a detailed description of possible Yandex Data Transfer scenarios, see Tutorials.

Configuring the data sourceConfiguring the data source

Configure one of the supported data sources:

  • Apache Kafka® (transfer is only possible to a YTSaurus Dynamic target)
  • ClickHouse®
  • Greenplum®
  • MongoDB
  • MySQL®
  • PostgreSQL
  • Yandex Object Storage
  • Oracle
  • Managed Service for YDB

For a complete list of supported sources and targets in Yandex Data Transfer, see Available transfers.

Preparing the target databasePreparing the target database

  1. To transfer to static tables, grant permissions to create and write to tables (write permission for the directory with tables). If data cleanup is performed before the transfer, grant the remove permission. To perform a transfer, you will also need the use permission for the account associated with the directory the tables will be in.

  2. To transfer to dynamic tables, in addition to the write, remove, and use permissions, grant the permission to mount tables.

Configuring the YTsaurus target endpointConfiguring the YTsaurus target endpoint

When creating or updating an endpoint, you can define:

  • Settings for connecting to a Yandex Managed Service for YTsaurus cluster. These are required parameters.
  • Additional settings.

Supported data delivery schemas and limitsSupported data delivery schemas and limits

Data delivery schemas Support level Constraints
Delivering data to static tables Without data transformation (sharding, rotation, or splitting tables into subtables)
Parallel copy of data to static tables Under development.
Delivering data to dynamic tables through static ones All intermediate operations on table parts are run as non-transactional and users can see them.
Disabled and Drop cleanup policy limits 1.
Parallel copy of data to dynamic tables through static ones All intermediate operations on table parts are run as non-transactional and users can see them.
Disabled and Drop cleanup policy limits 1.

1 Cleanup policy limits:

  • If the Disabled cleanup policy is used, there is no guarantee that new data in existing tables will take precedence over the old should the keys overlap.
  • With the Drop policy, the old tables are cleaned up before parts of new tables are added to them. We will fix that.

Warning

For dynamic tables, primary keys in the data are a requirement. YTsaurus dynamic tables store data in key:value format, where the key and its associated value must also be specified. If there is no non-key column value, the __dummy non-key stub column will be created. If there is no primary key, the transfer will end with an error.

Managed Service for YTsaurus clusterManaged Service for YTsaurus cluster

Management console
  • Service account ID: Select or create a service account with the managed-ytsaurus.editor role that Data Transfer will use to connect to the cluster.

  • Cluster ID: Select the cluster to connect to.

  • Security groups: Select the following:

    • Cloud network for hosting the endpoint.
    • Security groups for network traffic.

    Security group rules apply to a transfer. They allow opening up network access from the transfer VM to the cluster. Learn more in Networking in Yandex Data Transfer.

  • Path: Path to the folder to write the transferred data to.

  • Cleanup policy: Select a way to clean up data in the target database before the transfer:

    • Drop: Fully delete tables included in the transfer (default).

      Use this option so that the latest version of the table schema is always transferred to the target database from the source whenever the transfer is activated.

    • Disabled: Do not clean.

      Select this option if only replication without copying data is performed.

Advanced settingsAdvanced settings

Settings for the YTSaurus Dynamic database typeSettings for the YTSaurus Dynamic database type

Management console
  • Table settings:

    • Medium: Select the type of storage for your data:

      • HDD (primary_medium=default): Multiple HHD disks in a cluster.
      • SSD (primary_medium=ssd_blobs): Multiple SSD disks in a cluster.
      • SSD for logs (primary_medium=ssd_journals): Multiple SSD disks to store dynamic table logs.
      • RAM (primary_medium=in_memory): Dedicated space in cluster node RAM.
    • Chunk format: Select the format for storing data in the chunk:

      • Columnar (optimize_for=scan): To optimize scanning.
      • Line-by-line (optimize_for=lookup): To optimize search.
    • Atomic transactions: Enable this if you want transactions for tables to be completely atomic (atomicity=full property).

    • TTL: Specify a table data storage period. After this period, the data will be permanently deleted.

    • Custom attributes: Allows you to add custom attributes to tables created in YSON format. To add a new attribute, click + Attribute and enter its name and value.

  • Write settings:

    • Disable schema migration: Select to prevent changes to the target data schema when the source schema is modified. By default, when the source schema is modified, the transfer will update the target schema accordingly: create new tables, add new columns, add new enumerated values and enumerated types. By default, changes like deleting tables and columns are not applied.

    • Discard large values: Enable it to ignore non-critical data that fails to comply with the limits. If the size of a column value does not comply with the YTsaurus limitations, this value will be replaced with BigStringValueStub.

    • Copy with static table: Select this option to perform copy operations via temporary static tables. For the Drop cleanup policy, existing data in the target will be deleted once the copying is completed. If no drop mode is used, new and existing data will be merged.

    • YT computing pool: Specify the computing pool for operations on tables.

Settings for the YTSaurus Static database typeSettings for the YTSaurus Static database type

Management console
  • Table settings:

    • Chunk format: Select the format for storing data in the chunk:

      • Columnar (optimize_for=scan): To optimize scanning.
      • Line-by-line (optimize_for=lookup): To optimize search.
    • Sort static tables: Enable this option if you need to sort table records by key.
    • Custom attributes: Allows you to add custom attributes to tables created in YSON format. To add a new attribute, click + Attribute and enter its name and value.

  • Write settings:

    • Discard large values: Enable it to ignore non-critical data that fails to comply with the limits. If the size of a column value does not comply with the YTsaurus limitations, this value will be replaced with BigStringValueStub.

    • YT computing pool: Specify the computing pool for operations on tables.

After configuring the data source and target, create and start the transfer.

Was the article helpful?

Previous
Source
Next
Managing transfer process
© 2025 Direct Cursus Technology L.L.C.