Storage in Yandex Data Processing
Yandex Data Processing lets you use network storage drives for subcluster storage. Network drives are based on network blocks, which are virtual disks in the Yandex Cloud infrastructure.
When creating a cluster, you can choose between the following storage types:
- Network HDD storage (network-hdd): Most cost-effective option for clusters that do not require high read/write performance.
- Network SSD storage (network-ssd): Balanced solution. Such disks are slower than non-replicated SSDs, but, unlike non-replicated disks, they ensure data integrity in case Yandex Cloud hardware fails.
- Non-replicated SSD storage (network-ssd-nonreplicated): Network SSDs with enhanced performance achieved by eliminating redundancy. The storage size can only be increased in 93 GB increments.
For more information about this disk type, see the Yandex Compute Cloud documentation.
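The 93 GB increment for non-replicated SSD storage can make capacity planning less obvious. The sketch below rounds a requested size up to the next multiple of 93 GB. It assumes that valid sizes are whole multiples of the increment, which the text above does not state explicitly. The function and constant names are hypothetical and are not part of any Yandex Cloud API.

```python
# Increment stated in the text above for network-ssd-nonreplicated disks.
NONREPLICATED_INCREMENT_GB = 93


def round_up_to_increment(requested_gb: int,
                          increment_gb: int = NONREPLICATED_INCREMENT_GB) -> int:
    """Return the smallest multiple of increment_gb that is >= requested_gb.

    Assumption: valid disk sizes are whole multiples of the increment.
    """
    if requested_gb <= 0:
        raise ValueError("requested_gb must be positive")
    # Integer ceiling division, then scale back to GB.
    increments = (requested_gb + increment_gb - 1) // increment_gb
    return increments * increment_gb


if __name__ == "__main__":
    for size in (50, 93, 100, 500):
        print(size, "->", round_up_to_increment(size))
```

For example, under this assumption a 100 GB requirement would need a 186 GB disk, that is, two increments.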
Note
Up to 5% of disk space is reserved for system use, so the disks may have less available space than indicated when creating a cluster.
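As a rough illustration of the note above, the following sketch estimates the minimum space left for data on a disk of a given nominal size. It treats the 5% figure as a worst-case upper bound. The function name is hypothetical, and the actual reserved share on a given disk may be smaller.

```python
# Upper bound on the share of disk space reserved for system use (from the note).
MAX_RESERVED_FRACTION = 0.05


def min_usable_gb(nominal_gb: float,
                  reserved_fraction: float = MAX_RESERVED_FRACTION) -> float:
    """Return the worst-case usable space in GB for a disk of nominal_gb."""
    if nominal_gb < 0:
        raise ValueError("nominal_gb must be non-negative")
    if not 0 <= reserved_fraction < 1:
        raise ValueError("reserved_fraction must be in [0, 1)")
    return nominal_gb * (1 - reserved_fraction)


if __name__ == "__main__":
    # Worst-case usable space on a single 93 GB non-replicated disk.
    print(round(min_usable_gb(93), 2))
```

When sizing a cluster, it may be safer to plan against this lower bound than against the nominal disk size.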
For more information about sizes and performance of different disk types, see the Yandex Compute Cloud documentation.
Specifics of non-replicated SSD storage
Non-replicated SSD storage does not provide fault tolerance: if a disk fails, its data is permanently lost.