Speech recognition using Playground

Written by

Yandex Cloud

Updated at December 12, 2025

To recognize speech from an audio file via the SpeechKit Playground:

Management console

In the management console, select the folder you are going to use to work with SpeechKit.
Go to SpeechKit.
In the left-hand panel, select SpeechKit Playground.
Navigate to the Speech recognition tab.
Under Recognition parameters:
- Language: Select the language or leave Automatic.
- Text normalization: Writes dates and times in numeric format, converts spelled-out numbers to digits, and gives access to additional settings.
- Profanity filter: Masks profanity.
- Literature text: Adds capital letters and punctuation marks.
- Speaker recognition: Attributes each recognized phrase to a particular speaker.
- Grouping speaker phrases: Divides phrases into two groups by speaker.
Click Select file or drag the audio file to the loading area.

Tip

Convert the file to a supported audio format beforehand: MP3, WAV, or OGG with the OPUS audio codec. Maximum file size: 60 MB.
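
The limits in this tip can be checked locally before uploading. The following is a minimal sketch, not part of SpeechKit: the helper name `check_audio_file` is hypothetical, "60 MB" is assumed to mean 60 × 1024 × 1024 bytes, and the check looks only at the file extension, so it cannot confirm that an OGG file actually uses the OPUS codec.

```python
from pathlib import Path

# Limits taken from the tip above. Treating "60 MB" as 60 MiB is an assumption.
MAX_SIZE_BYTES = 60 * 1024 * 1024
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".ogg"}


def check_audio_file(path: str) -> list[str]:
    """Return a list of problems; an empty list means the file passed both checks.

    Hypothetical helper: it inspects only the extension and the size on disk.
    """
    file = Path(path)
    if not file.is_file():
        return [f"{path}: file not found"]

    problems = []
    if file.suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append(f"{path}: extension {file.suffix!r} is not MP3, WAV, or OGG")
    if file.stat().st_size > MAX_SIZE_BYTES:
        problems.append(f"{path}: larger than 60 MB")
    return problems
```

For example, `check_audio_file("call.wav")` returns an empty list for an existing WAV file under the size limit, and a list of messages otherwise.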
Classifiers: Finds phrases of a given category in the text, e.g., greetings, negative or obscene language. This works only for Russian.
Result processing: Processes the recognition results using an LLM:
- Model: Select a model for processing. The processing cost depends on the model you select.
- Instructions:
  - Enter a prompt in the input field or select a ready-made one.
  - Result format: Specify your preferred recognition result format.
  - Add instructions: Add another instruction. You can add up to five instructions in total.
Click Start recognition to recognize the speech in the audio file.

Recognition may take from a few seconds to a few minutes depending on the audio file size.
Click View code to get the request code for Python REST or Python gRPC.

SpeechKit Playground features basic speech recognition options. For more flexible recognition settings, use the API.
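
As a rough illustration of what an API call can look like, here is a minimal sketch that assumes the SpeechKit API v1 synchronous recognition endpoint, which is meant for short recordings. The function names `build_request` and `recognize` are hypothetical, the `ru-RU` default and the `oggopus` format value are assumptions about that endpoint, and the folder ID and IAM token must be supplied by you. The code shown by View code in the Playground is the authoritative version of the request.

```python
import json
import urllib.parse
import urllib.request

# Assumption: SpeechKit API v1 synchronous recognition endpoint (short audio only).
STT_URL = "https://stt.api.cloud.yandex.net/speech/v1/stt:recognize"


def build_request(audio: bytes, folder_id: str, iam_token: str,
                  lang: str = "ru-RU",
                  profanity_filter: bool = False) -> urllib.request.Request:
    """Build (but do not send) a recognition request for OGG/OPUS audio bytes."""
    query = urllib.parse.urlencode({
        "lang": lang,
        "folderId": folder_id,
        "format": "oggopus",
        "profanityFilter": str(profanity_filter).lower(),
    })
    return urllib.request.Request(
        f"{STT_URL}?{query}",
        data=audio,
        headers={"Authorization": f"Bearer {iam_token}"},
        method="POST",
    )


def recognize(audio: bytes, folder_id: str, iam_token: str) -> str:
    """Send the request and return the recognized text from the JSON response."""
    request = build_request(audio, folder_id, iam_token)
    with urllib.request.urlopen(request) as response:
        return json.load(response)["result"]
```

Keeping request construction separate from sending makes it possible to inspect the URL and headers without network access or credentials.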
