Yandex Cloud
© 2025 Direct Cursus Technology L.L.C.
Yandex SpeechKit

Synchronous audio recognition

Written by
Yandex Cloud
Improved by
amatol
Updated on April 30, 2025

Warning

This feature is only available in the Russia region.

Synchronous audio recognition returns results quickly and is suited to short, pre-recorded, single-channel audio fragments.

If you want to send audio and receive results over a single open connection, use streaming mode. Streaming mode also returns intermediate recognition results.

Audio requirements

The audio you send must meet the following requirements:

  • Maximum file size: 1 MB
  • Maximum length: 30 seconds
  • Maximum number of audio channels: 1

If your file is larger, longer, or has more audio channels, use asynchronous recognition.
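The limits above can be checked before sending a file. The following is a minimal sketch, not part of SpeechKit: it assumes the audio is LPCM in a WAV container (so that Python's standard `wave` module can read it) and that 1 MB means 1024 * 1024 bytes. The names `check_wav` and `choose_mode` are hypothetical helpers introduced for illustration.

```python
import io
import wave

# Limits from the requirements list. Assumption: 1 MB = 1024 * 1024 bytes.
MAX_BYTES = 1024 * 1024
MAX_SECONDS = 30.0
MAX_CHANNELS = 1


def check_wav(data: bytes) -> list[str]:
    """Return the names of the limits that the WAV audio exceeds."""
    problems = []
    if len(data) > MAX_BYTES:
        problems.append("size")
    with wave.open(io.BytesIO(data), "rb") as wav:
        # Duration in seconds = number of frames / frames per second.
        duration = wav.getnframes() / wav.getframerate()
        if duration > MAX_SECONDS:
            problems.append("length")
        if wav.getnchannels() > MAX_CHANNELS:
            problems.append("channels")
    return problems


def choose_mode(data: bytes) -> str:
    """Pick synchronous recognition only when no limit is exceeded."""
    return "asynchronous" if check_wav(data) else "synchronous"
```

For example, `choose_mode(open("speech.wav", "rb").read())` would return `"asynchronous"` for a stereo recording, because it exceeds the one-channel limit. OggOpus and MP3 files would need a different way to read the duration and channel count.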

SpeechKit allows you to recognize and synthesize the following audio formats:

  • LPCM
  • OggOpus
  • MP3

For more information about each format's special features, see Supported audio formats.

Use cases

  • Example of using the API v1 for synchronous recognition
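As a rough illustration of what a synchronous recognition call might look like, the sketch below builds an HTTP request with Python's standard library. The endpoint URL, the query parameters (`lang`, `folderId`, `format`), the `Bearer` authorization scheme, and the `result` response field are assumptions that this page does not state; check them against the synchronous recognition API reference before relying on them. The function `build_request` is a hypothetical helper.

```python
import urllib.parse
import urllib.request

# Assumed API v1 endpoint; verify against the API reference.
STT_URL = "https://stt.api.cloud.yandex.net/speech/v1/stt:recognize"


def build_request(
    audio: bytes,
    iam_token: str,
    folder_id: str,
    lang: str = "ru-RU",
    audio_format: str = "oggopus",
) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the raw audio bytes."""
    # Assumed query parameters: recognition language, folder ID, audio format.
    query = urllib.parse.urlencode(
        {"lang": lang, "folderId": folder_id, "format": audio_format}
    )
    return urllib.request.Request(
        f"{STT_URL}?{query}",
        data=audio,
        headers={"Authorization": f"Bearer {iam_token}"},
        method="POST",
    )


# Sending the request (requires valid credentials, so it is left commented out):
# import json
# with urllib.request.urlopen(build_request(audio, token, folder)) as resp:
#     text = json.load(resp)["result"]  # assumed response field
```

Keeping request construction separate from sending makes it easy to inspect the URL and headers before any audio leaves the machine.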

See also

  • Synchronous recognition API
