© 2025 Direct Cursus Technology L.L.C.
Yandex Managed Service for ClickHouse®

Questions about ClickHouse®

Written by Yandex Cloud
Updated on December 10, 2025
  • Why should I use ClickHouse® in Managed Service for ClickHouse® rather than my own VM-based installation?

  • When should I use ClickHouse® rather than PostgreSQL?

  • How do I load data into ClickHouse®?

  • How do I load a large data volume into ClickHouse®?

  • What will happen to my cluster if one of its nodes fails?

  • Can I deploy a ClickHouse® database cluster in multiple availability zones?

  • How does replication work for ClickHouse®?

  • Why does my ClickHouse® cluster use three extra hosts?

  • How does ClickHouse® handle data deletion based on TTL?

  • Can I use JSON data for tables in ClickHouse®?

  • Why is my cluster slow even though the computing resources are not fully utilized?

Why should I use ClickHouse® in Managed Service for ClickHouse® rather than my own VM-based installation?

Managed Service for ClickHouse® automates routine database maintenance:

  • Quick DB deployment with the required available resources.

  • Data backup.

  • Regular software updates.

  • Ensuring DB cluster fault tolerance.

  • Database usage monitoring and statistics.

When should I use ClickHouse® rather than PostgreSQL?

ClickHouse® is primarily designed for analytics (OLAP), so it is optimized for adding and reading data rather than for frequent updates or deletions of individual rows. For other purposes, such as transactional workloads, you might want to use PostgreSQL.

How do I load data into ClickHouse®?

Use the INSERT statement described in this ClickHouse® article.

How do I load a large data volume into ClickHouse®?

Use the CLI, which efficiently compresses data in transit. We recommend issuing no more than one INSERT command per second, so send data in large batches rather than in many small inserts.

Currently, data transfer from physical media is not supported.
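
The one-INSERT-per-second recommendation can be sketched in Python as a simple batching loop. This is a minimal illustration, not part of the service: send_batch is a hypothetical callback standing in for whatever performs a single INSERT (for example, a call to your client library), and the batch size and interval defaults are assumptions you should tune.

```python
import time
from typing import Callable, Iterable, Iterator, List, Optional, Sequence

Row = Sequence[object]


def batched(rows: Iterable[Row], batch_size: int) -> Iterator[List[Row]]:
    """Group rows into lists of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be positive")
    batch: List[Row] = []
    for row in rows:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch  # final, possibly smaller, batch


def load_in_batches(
    rows: Iterable[Row],
    send_batch: Callable[[List[Row]], None],  # hypothetical: performs one INSERT
    batch_size: int = 100_000,                # assumed default, tune as needed
    min_interval_s: float = 1.0,              # at most one INSERT per second
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Send rows in large batches, starting at most one INSERT per interval.

    Returns the number of batches sent.
    """
    sent = 0
    last_start: Optional[float] = None
    for batch in batched(rows, batch_size):
        if last_start is not None:
            # Wait out the remainder of the interval since the previous INSERT.
            wait = min_interval_s - (clock() - last_start)
            if wait > 0:
                sleep(wait)
        last_start = clock()
        send_batch(batch)
        sent += 1
    return sent
```

The clock and sleep parameters exist only so the pacing logic can be exercised without real delays.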

What will happen to my cluster if one of its nodes fails?

DB clusters consist of at least two replicas, so if one node is down, the cluster will keep running.

You may lose data only if a node with a non-replicated table fails.

Can I deploy a ClickHouse® database cluster in multiple availability zones?

Yes. A database cluster may consist of hosts residing in different availability zones or even regions.

How does replication work for ClickHouse®?

Managed Service for ClickHouse® clusters use ClickHouse® Keeper or ZooKeeper for replication. In the first case, replication and fault tolerance are enabled by default, so no further configuration is needed. In the second case, each ClickHouse® cluster comes with a ZooKeeper cluster containing at least three hosts.

Yandex Cloud users do not have access to ZooKeeper and cannot configure it.

Why does my ClickHouse® cluster use three extra hosts?

When creating a ClickHouse® cluster of two or more hosts, Managed Service for ClickHouse® automatically creates a cluster of three ZooKeeper hosts to manage replication and fault tolerance, unless ClickHouse® Keeper support is enabled. These hosts are counted towards both the cloud resource quota and the cluster cost. By default, ZooKeeper hosts come with the minimum host class.

For more information about using ZooKeeper, see this ClickHouse® article.

How does ClickHouse® handle data deletion based on TTL?

With TTL, data is deleted either as entire data parts or during merges, rather than row by row.

Deleting entire data parts is more efficient and uses fewer server resources, but requires the TTL expression and partitioning key to match or be of the same granularity for all rows in a data part.

Deleting data during merges uses more resources and takes place either along with regular background merges or during unscheduled merges. Merge frequency is defined by the merge_with_ttl_timeout parameter, which indicates the minimum time in seconds before a repeat merge to process data with expired TTL. You set this parameter when creating the table. The default value is 14,400 seconds (4 hours).

We recommend managing data with TTL so that old data is always deleted in data parts. To do this, set ttl_only_drop_parts to true when creating tables.
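
To see why matching the partitioning granularity to the TTL matters, here is a small Python model of the decision described above. It is only an illustration under stated assumptions: DataPart and classify_part are hypothetical names, not ClickHouse® internals, and a row is treated as expired once its date plus the TTL lies in the past.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import List


@dataclass(frozen=True)
class DataPart:
    """Hypothetical stand-in for a data part: a name plus its rows' dates."""
    name: str
    row_dates: List[date]


def classify_part(part: DataPart, ttl: timedelta, today: date) -> str:
    """Return how expired rows in this part could be removed.

    "drop"  - every row has expired, so the whole part can be deleted
    "merge" - only some rows have expired; removing them needs a merge
    "keep"  - no row has expired yet
    """
    expired = [d + ttl < today for d in part.row_dates]
    if all(expired):
        return "drop"
    if any(expired):
        return "merge"
    return "keep"


today = date(2025, 12, 10)
ttl = timedelta(days=90)

# A part holding a single month of data expires as a whole.
monthly = DataPart("2025-08", [date(2025, 8, 1), date(2025, 8, 31)])
# A part mixing old and recent rows can only be cleaned by a merge.
mixed = DataPart("mixed", [date(2025, 8, 1), date(2025, 12, 1)])

print(classify_part(monthly, ttl, today))  # drop
print(classify_part(mixed, ttl, today))    # merge
```

In this model, partitioning by month with a TTL counted in months keeps parts in the "drop" or "keep" state, which is the situation ttl_only_drop_parts is meant for.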

Can I use JSON data for tables in ClickHouse®?

Yes, you can. However, JSON is currently an experimental data type in ClickHouse®. To allow creating tables of this type, run this query:

SET allow_experimental_object_type=1;

Note

The SET queries are not supported when connecting to a cluster via the management console. To run such a query, use a different cluster connection method, e.g., via clickhouse-client.

Make sure you have the latest client version installed.

For more information, see this ClickHouse® article.

Why is my cluster slow even though the computing resources are not fully utilized?

Your storage may have insufficient maximum IOPS and bandwidth to process the current number of requests. In this case, throttling occurs, which degrades the performance of the entire cluster.

The maximum IOPS and bandwidth values increase by a fixed value when the storage size increases by a certain step. The step and increment values depend on the disk type:

Disk type                                       Step, GB   Max IOPS increase (read/write)   Max bandwidth increase (read/write), MB/s
network-hdd                                     256        300/300                          30/30
network-ssd                                     32         1,000/1,000                      15/15
network-ssd-nonreplicated, network-ssd-io-m3    93         28,000/5,600                     110/82

To increase the maximum IOPS and bandwidth values and make throttling less likely, expand the storage when updating your cluster.
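
As a rough aid for sizing, the table above can be turned into a small Python estimate of how much the limits grow when you expand storage. This is a sketch under assumptions the text does not confirm: one increment is counted per full step contained in the storage size, and any absolute per-disk cap is ignored. The names INCREMENTS and added_limits are illustrative.

```python
from typing import Dict, NamedTuple, Tuple


class StepIncrement(NamedTuple):
    step_gb: int
    iops: Tuple[int, int]           # (read, write)
    bandwidth_mbs: Tuple[int, int]  # (read, write), MB/s


# Values copied from the table above.
INCREMENTS: Dict[str, StepIncrement] = {
    "network-hdd": StepIncrement(256, (300, 300), (30, 30)),
    "network-ssd": StepIncrement(32, (1_000, 1_000), (15, 15)),
    "network-ssd-nonreplicated": StepIncrement(93, (28_000, 5_600), (110, 82)),
    "network-ssd-io-m3": StepIncrement(93, (28_000, 5_600), (110, 82)),
}


def added_limits(disk_type: str, old_size_gb: int, new_size_gb: int) -> Dict[str, int]:
    """Estimate the limit increase gained by growing storage.

    Assumption: limits grow once per full step contained in the size;
    absolute caps, if any, are not modeled.
    """
    if new_size_gb < old_size_gb:
        raise ValueError("new_size_gb must not be smaller than old_size_gb")
    inc = INCREMENTS[disk_type]
    steps = new_size_gb // inc.step_gb - old_size_gb // inc.step_gb
    return {
        "read_iops": steps * inc.iops[0],
        "write_iops": steps * inc.iops[1],
        "read_mbs": steps * inc.bandwidth_mbs[0],
        "write_mbs": steps * inc.bandwidth_mbs[1],
    }


# Growing a network-ssd disk from 64 GB to 128 GB adds two 32 GB steps.
print(added_limits("network-ssd", 64, 128))
```

Treat the result as an estimate only and check the actual limits for your disk type before resizing.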

If you are using the network-hdd storage, consider switching to network-ssd or network-ssd-nonreplicated by restoring the cluster from a backup.

ClickHouse® is a registered trademark of ClickHouse, Inc.
