Getting started with Yandex SpeechSense
Yandex SpeechSense enables you to analyze your business's communication channels through call recordings and supports integration with your PBX and CRM systems. SpeechSense uses Yandex SpeechKit speech technologies to transcribe speech and perform high-quality analysis of dialogs.
You can upload your own audio recordings to SpeechSense or use a demo recording.
Getting started
- Go to the management console and log in to Yandex Cloud, or sign up if you do not have an account yet. For information on how to get started with Yandex Cloud, see Getting started with Yandex Cloud.
- Accept the user agreement.
- In Yandex Cloud Billing, make sure you have a billing account linked and that it has the `ACTIVE` or `TRIAL_ACTIVE` status. If you do not have a billing account yet, create one.
- Make sure that your account has the `speech-sense.spaces.creator` role assigned.
- Open the SpeechSense home page.
- Select the organization to work with SpeechSense in, or create a new one.
Configure the environment
- Create a space where all your projects will be stored: select Create space, enter a name, add a description (optional), and click Create.
- Link a billing account to the space. This account will be debited for the use of SpeechSense.
Tip
You can only manage a billing account if you have a Yandex account. If you use Yandex Cloud through an identity federation, contact support.
- Go to the Connections tab and create a connection using your audio's metadata:
  - Enter the Connection name.
  - Select Two-channel audio under Data type.
  - Set connection parameters using the metadata of your audio recordings:
    - Under Agent, specify the number of the track that contains the agent's voice in your audio recordings and define their displayed name (`Agent` by default).
    - Under Customer, specify the number of the track that contains the customer's voice in your audio recordings and define their displayed name (`Customer` by default).
    - Under Shared metadata, change the parameter names used in the system, if necessary.
    - If your metadata includes additional information you want to save and analyze, add it to the appropriate section. Provide the parameter key (it must match the key in the metadata file), specify a type, and enter a display name to use in the system. The supported additional parameter types are `Date`, `String`, `Number`, `Logical`, and `JSON`.
  - Click Create connection.
Example of metadata.json:

```json
{
  "direction_outgoing": "true",
  "client_id": "456",
  "client_name": "John Doe",
  "date": "2023-09-29T09:08:38.958Z",
  "date_to": "2023-09-29T09:15:07.897Z",
  "language": "RU",
  "operator_id": "123",
  "operator_name": "Jane Smith"
}
```
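Since the parameter keys in your metadata file must match the keys you configured in the connection, it can help to validate files before upload. Below is a minimal sketch of such a check; the `REQUIRED_KEYS` set mirrors the example above and is an assumption you should adjust to your own connection parameters:

```python
import json

# Keys the example connection above expects. This set is an
# assumption based on the sample metadata.json; adjust it to match
# the parameters configured in your own connection.
REQUIRED_KEYS = {
    "direction_outgoing", "client_id", "client_name",
    "date", "date_to", "language", "operator_id", "operator_name",
}

def check_metadata(path):
    """Load a metadata file and raise if any expected key is missing."""
    with open(path, encoding="utf-8") as f:
        metadata = json.load(f)
    missing = REQUIRED_KEYS - metadata.keys()
    if missing:
        raise ValueError(f"metadata is missing keys: {sorted(missing)}")
    return metadata
```

Running this over each metadata file before upload catches mismatched or missing keys early, instead of after the dialog fails to appear in your project.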
- Create a project: on the space page, click Create project, enter a project name, and add a connection to the project. For each connection, you can select up to two filtering rules based on the connection metadata. With filters, you can ensure that only the dialogs you need are added to the project. Once you are done adding connections and configuring filters, click Create project.
Upload audio data
SpeechSense uses the gRPC API to upload data.
SpeechSense supports the following audio file formats:
- LPCM: `AUDIO_ENCODING_LINEAR16_PCM`
- WAV: `CONTAINER_AUDIO_TYPE_WAV`
- OggOpus: `CONTAINER_AUDIO_TYPE_OGG_OPUS`
- MP3: `CONTAINER_AUDIO_TYPE_MP3`
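When uploading programmatically, you need to pass the right format identifier for each file. The identifiers below come from the list above; the extension-based mapping helper itself is only an illustrative convention (raw LPCM has no standard extension, so `.pcm` is assumed here):

```python
from pathlib import Path

# Map common file extensions to the SpeechSense format identifiers
# listed above. The ".pcm" entry is an assumed convention for raw LPCM.
AUDIO_FORMATS = {
    ".wav": "CONTAINER_AUDIO_TYPE_WAV",
    ".ogg": "CONTAINER_AUDIO_TYPE_OGG_OPUS",
    ".opus": "CONTAINER_AUDIO_TYPE_OGG_OPUS",
    ".mp3": "CONTAINER_AUDIO_TYPE_MP3",
    ".pcm": "AUDIO_ENCODING_LINEAR16_PCM",
}

def audio_format_for(filename):
    """Return the SpeechSense format identifier for a file name."""
    ext = Path(filename).suffix.lower()
    try:
        return AUDIO_FORMATS[ext]
    except KeyError:
        raise ValueError(f"unsupported audio format: {ext!r}")
```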
To upload audio data to SpeechSense:
- Create a service account.
- Add the service account to the space with the `speech-sense.data.editor` role. To learn more about the roles available in the service, see Access management in SpeechSense.
- Create an API key or IAM token for the service account to authenticate with the API. Learn more about authentication in the SpeechSense API.
- Upload your data using a Python data transfer script. You must provide your entire audio recording as one message.
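The actual upload call is made through gRPC stubs generated from the SpeechSense API definitions, which are not reproduced here. The sketch below only gathers the pieces that call needs: the full audio file as a single bytes object (the entire recording must go in one message), the parsed metadata, and an authorization header. The `Api-Key` header format follows the general Yandex Cloud API-key convention and is an assumption here; check the SpeechSense API documentation for the exact request shape:

```python
import json
from pathlib import Path

def build_upload_payload(audio_path, metadata_path, api_key):
    """Gather the inputs for a SpeechSense upload request.

    The gRPC request and stub classes generated from the SpeechSense
    API are not shown; this helper only prepares their inputs.
    """
    # The entire recording is read at once: SpeechSense expects
    # the whole audio file in a single message.
    audio_bytes = Path(audio_path).read_bytes()
    metadata = json.loads(Path(metadata_path).read_text(encoding="utf-8"))
    # Assumed header format for API-key authentication.
    auth_header = ("authorization", f"Api-Key {api_key}")
    return audio_bytes, metadata, auth_header
```

You would then pass `auth_header` as gRPC call metadata and place `audio_bytes` and the metadata fields into the generated request message, per the data transfer script from the SpeechSense documentation.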