Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
  • Blog
  • Pricing
  • Documentation
Yandex project
© 2025 Yandex.Cloud LLC
Yandex Query
    • Overview
    • Terms and definitions
    • Quotas and limits
    • Query processing
      • Description
      • Federated queries
      • Data partitioning
      • Partition projection
    • Unified analysis of streaming and analytical data
    • Backups
  • Access management
  • Pricing policy
  • Integration
  • Audit Trails events
  • FAQ
  1. Concepts
  2. Batch processing
  3. Description

Batch processing

Written by
Yandex Cloud
Improved by
Max Z.
Updated at March 7, 2025

Batch processing is a technology for processing data that involves preparing aggregated information based on large arrays of data. This type of data analysis is traditional and is used for processing data stored, for example, in a DBMS.

Data volumes increase with time but that does not mean all data is accessed often. So, rarely used data is usually transferred to storage systems like Yandex Object Storage which are much more cost-effective than DBMS.

Data is stored in Yandex Object Storage as a file structure with directories and files. To store data in files, use standard storage formats: CSV, JSON, etc.

Yandex Query allows you to access data stored in Yandex Object Storage in the same way as a DBMS by making queries in an SQL dialect called YQL.

Yandex Object Storage usually stores massive amounts of data. Yandex Query analyzes how much data needs processing and runs dozens to hundreds of concurrent data processing jobs within the computing cluster. This allows maintaining a high processing speed even for large data volumes.

Use cases

  • Analyzing data with Jupyter
  • Analyzing data with Query
  • Visualizing Yandex Object Storage data in Yandex DataLens
  • Processing files with usage details in Yandex Cloud Billing
  • Working with data in Yandex Managed Service for ClickHouse®
  • Working with data in Yandex Managed Service for PostgreSQL
  • Working with data in Yandex Object Storage

Was the article helpful?

Previous
Query processing
Next
Federated queries
Yandex project
© 2025 Yandex.Cloud LLC