© 2025 Direct Cursus Technology L.L.C.
Yandex SpeechKit

In this article:

  • Query parameters
  • Parameters in the request body
  • Response
  • Use cases

Synchronous recognition API

Written by
Yandex Cloud
Updated on February 10, 2025

With the synchronous recognition API, you can transcribe pre-recorded audio files that stay within the following limits:

  • Maximum file size: 1 MB
  • Maximum duration: 30 seconds
  • Maximum number of audio channels: 1
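The limits above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the SpeechKit API: the function names are hypothetical, "1 MB" is read as 1,048,576 bytes (the article does not say whether it means that or 1,000,000 bytes), and the duration helper assumes headerless LPCM with 16-bit samples.

```python
# Assumption: "1 MB" is interpreted as 1 MiB; the article does not specify.
MAX_BYTES = 1024 * 1024
MAX_SECONDS = 30
MAX_CHANNELS = 1


def lpcm_duration_seconds(num_bytes, sample_rate_hertz, channels=1, bytes_per_sample=2):
    """Estimate the duration of headerless LPCM audio.

    Assumption: 16-bit (2-byte) samples unless bytes_per_sample says otherwise.
    """
    return num_bytes / (sample_rate_hertz * channels * bytes_per_sample)


def check_limits(num_bytes, duration_seconds, channels):
    """Return a list of limit violations; an empty list means the audio fits."""
    problems = []
    if num_bytes > MAX_BYTES:
        problems.append(f"file size {num_bytes} bytes exceeds {MAX_BYTES} bytes")
    if duration_seconds > MAX_SECONDS:
        problems.append(f"duration {duration_seconds} s exceeds {MAX_SECONDS} s")
    if channels > MAX_CHANNELS:
        problems.append(f"{channels} channels exceeds {MAX_CHANNELS}")
    return problems
```

For OggOpus input, the duration has to come from a decoder or from file metadata; the byte-based estimate only applies to raw LPCM.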

The synchronous recognition service is available at the following address: stt.api.cloud.yandex.net/speech/v1/stt:recognize

Query parameters

Each parameter is listed with its type, followed by its description.

lang (string)
Recognition language. See the model description for acceptable values. The default value is ru-RU (Russian).

topic (string)
Language model to use for recognition. The more accurately you choose the model, the better the recognition result. You can specify only one model per request. The acceptable values depend on the language you select. The default value is general.

profanityFilter (boolean)
Controls the profanity filter in recognized speech. Acceptable values:
  • false (default): Profanities are not excluded from the recognition results.
  • true: Profanities are excluded from the recognition results.

rawResults (boolean)
Controls how numbers are written: true for words, false (default) for digits.

format (string)
Format of the submitted audio. Acceptable values:
  • lpcm: LPCM without a WAV header.
  • oggopus (default): OggOpus.

sampleRateHertz (string)
Sampling rate of the submitted audio. Applies only if format is lpcm. Acceptable values:
  • 48000 (default): 48 kHz.
  • 16000: 16 kHz.
  • 8000: 8 kHz.

folderId (string)
ID of the folder you have access to. Required for authorization with a user account (see Authentication with the SpeechKit API resource). Do not use this field if you make a request on behalf of a service account. The maximum string length is 50 characters.
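To illustrate how the query parameters combine, here is a minimal sketch that assembles the request URL with Python's standard library. The function name build_url and its keyword arguments are hypothetical, and the https scheme is an assumption, since the article gives the address without one. The defaults and acceptable values mirror the parameter descriptions above.

```python
from urllib.parse import urlencode

# Assumption: the service is reached over HTTPS; the article lists only host and path.
ENDPOINT = "https://stt.api.cloud.yandex.net/speech/v1/stt:recognize"

ALLOWED_FORMATS = ("lpcm", "oggopus")
ALLOWED_SAMPLE_RATES = (48000, 16000, 8000)


def build_url(lang="ru-RU", topic="general", profanity_filter=False,
              raw_results=False, audio_format="oggopus",
              sample_rate_hertz=None, folder_id=None):
    """Build the recognition URL from the documented query parameters."""
    if audio_format not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported format: {audio_format}")

    params = {
        "lang": lang,
        "topic": topic,
        # Booleans are sent as lowercase strings.
        "profanityFilter": "true" if profanity_filter else "false",
        "rawResults": "true" if raw_results else "false",
        "format": audio_format,
    }

    # sampleRateHertz applies only to LPCM audio; 48000 is the documented default.
    if audio_format == "lpcm":
        rate = 48000 if sample_rate_hertz is None else sample_rate_hertz
        if rate not in ALLOWED_SAMPLE_RATES:
            raise ValueError(f"unsupported sample rate: {rate}")
        params["sampleRateHertz"] = str(rate)

    # folderId is needed only when authorizing with a user account.
    if folder_id is not None:
        if len(folder_id) > 50:
            raise ValueError("folderId must be at most 50 characters")
        params["folderId"] = folder_id

    return ENDPOINT + "?" + urlencode(params)
```

Calling build_url(audio_format="lpcm", sample_rate_hertz=8000) would, under these assumptions, produce a URL suited to 8 kHz telephony recordings.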

Parameters in the request body

The request body must contain the binary content of the audio file.
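As a sketch of what this looks like in practice, the snippet below wraps raw audio bytes in an HTTP request object using Python's standard library. Two details are assumptions not stated in this article: that the request uses the POST method, and that an IAM token is passed as "Authorization: Bearer <token>" (see the API authentication article for the supported schemes). The function name build_request is hypothetical.

```python
import urllib.request


def build_request(url, audio_bytes, iam_token):
    """Create a request whose body is the raw audio content.

    Assumptions: POST method and Bearer-token authorization header.
    """
    return urllib.request.Request(
        url,
        data=audio_bytes,  # binary audio, sent as-is without any wrapping
        method="POST",
        headers={"Authorization": f"Bearer {iam_token}"},
    )


# Sending the request (not executed here) could then look like:
#   with open("speech.ogg", "rb") as f:
#       req = build_request(url, f.read(), iam_token)
#   with urllib.request.urlopen(req) as resp:
#       body = resp.read()
```

The audio is not base64-encoded or placed in a JSON or multipart envelope; the file bytes are the whole body.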

Response

The recognized text is returned in the result field of the response.

{
  "result": <recognized_text>
}

For more information about the response format and codes, see Response status codes.
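Reading the recognized text out of the response can be sketched as follows. The function name extract_result is hypothetical, and the sketch assumes a successful response whose body is the JSON object shown above; error responses are not handled here.

```python
import json


def extract_result(response_body):
    """Return the recognized text from a JSON response body (str or bytes)."""
    payload = json.loads(response_body)
    if "result" not in payload:
        # The article documents only the result field of a successful response.
        raise ValueError("response has no 'result' field")
    return payload["result"]
```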

Use cases

  • Example of using the API v1 for synchronous recognition.
