Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
    • Yandex Cloud Partner program
  • Blog
  • Pricing
  • Documentation
© 2025 Direct Cursus Technology L.L.C.
Yandex SpeechKit
  • SpeechKit technology overview
    • Overview
    • How to recognize short audio files in the API v1
    • How to recognize long audio files in the API v3 and v2
    • How to synthesize speech in the API v1
    • How to synthesize speech in the API v3
  • Supported audio formats
  • IVR integration
  • Quotas and limits
  • Access management
  • Pricing policy

In this article:

  • Authentication for API access
  • Execute a request
  1. Getting started
  2. How to recognize short audio files in the API v1

How to recognize short audio files in SpeechKit

Written by
Yandex Cloud
Updated at March 28, 2025
  • Authentication for API access
  • Execute a request

The service can recognize speech in different ways. In the example below, the audio file is recognized using the synchronous recognition API. This API has the following limitations:

  • Maximum audio duration: 30 seconds
  • Maximum file size: 1 MB

To complete this example, prepare an audio file in one of the supported formats and compliant with the above limitations.

In the example, the API is used via the cURL utility.

Authentication for API accessAuthentication for API access

To work with the SpeechKit API, you need to pass authentication. The authentication method depends on the account type:

Yandex or federated account
Service account
  1. Get an IAM token for your Yandex account or federated account.
  2. Get the ID of the folder for which your account has the ai.speechkit-stt.user, ai.speechkit-tts.user, or higher roles.
  3. When accessing SpeechKit via the API, provide the received parameters in each request:

    • For API v1 and API v2:

      Specify the IAM token in the Authorization header in the following format:

      Authorization: Bearer <IAM token>
      

      Specify the folder ID in the request body in the folderId parameter.

    • For API v3:

      • Specify the IAM token in the Authorization header.
      • Specify the folder ID in the x-folder-id header.
      Authorization: Bearer <IAM_token>
      x-folder-id <folder_ID>
      

SpeechKit supports two authentication methods based on service accounts:

  • With an IAM token:

    1. Get an IAM token.

    2. Provide the IAM token in the Authorization header in the following format:

      Authorization: Bearer <IAM_token>
      
  • With API keys.

    Use API keys if requesting an IAM token automatically is not an option.

    1. Get an API key.

    2. Provide the API key in the Authorization header in the following format:

      Authorization: Api-Key <API_key>
      

Do not specify the folder ID in your requests, as the service uses the folder the service account was created in.

In the example below, authentication is performed under a Yandex account.

Execute a requestExecute a request

Send a speech recognition request:

export FOLDER_ID=<folder_ID>
export IAM_TOKEN=<IAM_token>
curl \
  --request POST \
  --header "Authorization: Bearer ${IAM_TOKEN}" \
  --data-binary "@speech.ogg" \
  "https://stt.api.cloud.yandex.net/speech/v1/stt:recognize?folderId=${FOLDER_ID}&lang=ru-RU"

Where:

  • FOLDER_ID: Folder ID you got earlier.
  • IAM_TOKEN: IAM token you got earlier.
  • lang: Recognition language.
  • speech.ogg: Speech audio file you prepared.

The service will respond with the recognized text, for example:

{
   "result":"I'm Yandex SpeechKit. I can turn any text into speech. Now you can, too!"
}

More informationMore information

  • Synchronous recognition API
  • Example of using the API v1 for synchronous recognition
  • Asynchronous recognition API v2

Was the article helpful?

Previous
Overview
Next
How to recognize long audio files in the API v3 and v2
© 2025 Direct Cursus Technology L.L.C.