Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • AI for business
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Yandex Query
    • Overview
    • Terms and definitions
    • Quotas and limits
    • Query processing
    • Unified analysis of streaming and analytical data
    • Backups
  • Access management
  • Pricing policy
  • Integration
  • Audit Trails events
  • FAQ

In this article:

  • Support for raw data storage
  • Streaming data processing
  • Integration with external systems
  • Streaming processing
  • Batch processing
  • Yandex DataLens
  1. Concepts
  2. Overview

Yandex Query overview

Written by
Yandex Cloud
Updated at October 22, 2024
  • Support for raw data storage
  • Streaming data processing
  • Integration with external systems
    • Streaming processing
    • Batch processing
    • Yandex DataLens

Yandex Query is a data service that can run federated queries against Yandex Object Storage object storage, Managed Service for ClickHouse®, Yandex MPP Analytics for PostgreSQL, Managed Service for MySQL®, Managed Service for PostgreSQL, and Managed Service for YDB managed databases, and Yandex Data Streams real-time streams. Yandex Query uses YQL – a unified SQL dialect – to aggregate query results across these systems.

Yandex Query is a fully managed cloud service, meaning that you need no running servers with software deployed. All the resources you need for your queries are allocated the moment you run them and vacated as soon as the queries are complete. The queries themselves start running instantly.

Yandex Query allows you to:

  • Use the same written query in scenarios for analyzing data stored in Yandex Object Storage and analyzing data in real time.
  • Aggregate query execution results across different systems.
  • Save on development thanks to using a common query language, YQL, and a common approach.

Yandex Query combines data virtualization features and a real-time streaming data analysis system. This architecture is called Unified Lambda.

The Unified Lambda model uses a unified SQL query text for processing streaming data and data stored in storage systems of different classes.

Support for raw data storageSupport for raw data storage

Companies prefer to store large volumes of rarely accessed data in object storage of the Yandex Object Storage class. Long-term storage of rarely processed data in storage systems like this is most cost-efficient. Data is stored in Yandex Object Storage in unstructured form and this data needs to be processed in a simple and analyst-friendly way.

Streaming data processingStreaming data processing

Streaming processing is based on grouping window functions that receive data streams, group them by source and time window, make computations, and send execution results to external systems. A distinctive feature of Yandex Query is a unified text of SQL queries used for both streaming and batch processing.

Integration with external systemsIntegration with external systems

Streaming processingStreaming processing

Streaming queries can get data from the following sources:

  • Yandex Data Streams. Application logs, Debezium database CDC streams, or any other information can be used as input data.

Streaming processing results are exported to:

  • Monitoring as metrics for creating charts and dashboards or alerting.
  • Yandex Data Streams. Using Yandex Data Transfer, data from Yandex Data Streams can be sent to different systems, including various DBMS.

Batch processingBatch processing

Analytical queries in Yandex Query can get data from Yandex Object Storage in JSON, CSV/TSV, and Parquet formats compressed using different algorithms. You can also run analytical queries against Managed Service for ClickHouse®, Yandex MPP Analytics for PostgreSQL, Managed Service for MySQL®, Managed Service for PostgreSQL, and Managed Service for YDB managed databases.

You can use Yandex Query for cross-service data analytics, accessing all supported data sources in a single query.

You can download the query execution results from the Yandex Query user interface. If required, you can also save them to Yandex Object Storage.

Yandex DataLensYandex DataLens

With Yandex Query, you can visualize data stored in Yandex Object Storage.

ClickHouse® is a registered trademark of ClickHouse, Inc.

Was the article helpful?

Previous
Unified analysis of streaming and analytical data
Next
Terms and definitions
© 2025 Direct Cursus Technology L.L.C.