Yandex Cloud
Search
Contact UsTry it for free
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
  • Marketplace
    • Featured
    • Infrastructure & Network
    • Data Platform
    • AI for business
    • Security
    • DevOps tools
    • Serverless
    • Monitoring & Resources
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
    • Price calculator
    • Pricing plans
  • Customer Stories
  • Documentation
  • Blog
© 2026 Direct Cursus Technology L.L.C.
Yandex Cloud Stackland
  • What's new
  • Installation
    • All tutorials
    • Installing Stackland on Yandex BareMetal
    • Setting up external access to a pod in a cluster
    • All guides
    • Projects
    • Resource model
      • Overview
      • Certificate Manager
      • DNS
      • IAM
      • Logging Stack
      • Managed Service for Apache Kafka®
      • Managed Service for PostgreSQL
      • Managed Service for ClickHouse®
      • DataLens
      • Monitoring
      • Object Storage
      • Disk subsystem
      • NVIDIA® GPU support
      • Policy Manager
      • Secrets Store
      • SpeechSense
  • Access management
  • Pricing policy
  • Diagnostics and troubleshooting

In this article:

  • Architecture
  • Infrastructure requirements
  • GPU
  • Resources
  • Dependencies
  • Configuration
  • See also
  1. Concepts
  2. Components
  3. SpeechSense

SpeechSense

Written by
Yandex Cloud
Updated at April 8, 2026
  • Architecture
  • Infrastructure requirements
    • GPU
    • Resources
  • Dependencies
  • Configuration
  • See also

Yandex SpeechSense is a speech analytics platform within Stackland. It transcribes and analyzes audio recordings of conversations, extracts text, classifies customer interactions, and generates analytical reports.

SpeechSense does not come with the basic Stackland package and requires a separate license.

ArchitectureArchitecture

SpeechSense comprises three components:

  • YaGPT: Handles natural language processing and text generation. It is used to create conversation summaries and classify interactions. Requires GPU resources.
  • SpeechKit: Performs speech recognition and synthesis, converts audio files to text. Requires GPU resources.
  • SpeechSense: Provides speech analytics, data processing, and the web UI. SpeechSense orchestrates the YaGPT and SpeechKit workflows and features a UI for managing analysis results.

Infrastructure requirementsInfrastructure requirements

GPUGPU

SpeechSense requires the following NVIDIA® GPU nodes:

  • YaGPT Pro: 2 × NVIDIA® H100
  • SpeechKit STT Backend: 1 × NVIDIA® A100
  • SpeechKit Embeddings: 1 × NVIDIA® H100

Before installing SpeechSense, enable the NVIDIA® GPU support component.

ResourcesResources

TA services operate without a GPU but require significant CPU and RAM resources. We recommend allocating at least 32 vCPUs and 64 GB RAM for TA services.

DependenciesDependencies

SpeechSense relies on the following Stackland components:

  • Managed Service for PostgreSQL: Metadata and state storage.
  • Managed Service for ClickHouse®: Analytical queries and storage of large-scale data.
  • Managed Service for Apache Kafka®: Streaming data processing.
  • Object Storage: Storage of audio files and models.
  • NVIDIA® GPU support: GPU resource management.
  • Identity and Access Management: User authentication and authorization.

When SpeechSense is enabled, the controller automatically checks for these dependencies and provisions the required database clusters, Apache Kafka® topics, and certificates.

ConfigurationConfiguration

To manage SpeechSense, use the SpeechsenseConfig custom resource.

Here is an example:

apiVersion: stackland.yandex.cloud/v1alpha1
kind: SpeechsenseConfig
metadata:
  name: default
spec:
  enabled: true
  settings:
    s3: # Optional. Add data if you need an external storage
      endpoint: "<object_storage_address>"
      accessKeyID: "<key_ID>"
      secretAccessKey: "<secret_key>"

Where:

  • enabled: Enables and disables the component.
  • settings.s3.endpoint: Object Storage address.
  • settings.s3.accessKeyID: Storage access key ID.
  • settings.s3.secretAccessKey: Storage secret access key.

See alsoSee also

  • Uploading SpeechSense images: Pulling SpeechSense images.
  • Configuring and activating SpeechSense: Configuring and activating SpeechSense.
  • Disabling SpeechSense: Disabling SpeechSense.
  • NVIDIA® GPU support: GPU management in a cluster.

Was the article helpful?

Previous
Secrets Store
Next
ClickHouse monitoring
© 2026 Direct Cursus Technology L.L.C.