Getting started with Yandex SpeechSense
Yandex SpeechSense allows you to analyze your business communication channels based on dialog audio recordings or text chat messages and supports integration with your PBX and CRM systems. SpeechSense utilizes Yandex SpeechKit voice technologies to transcribe speech and perform statistical and qualitative analysis of recorded audio dialogs.
SpeechSense can analyze audio and text dialogs in Russian.
You can upload your data to SpeechSense or use a demo recording
Getting started
-
Go to the management console
and log in to Yandex Cloud or sign up if not signed up yet. For information on how to get started with Yandex Cloud, see Getting started with Yandex Cloud. -
Accept the user agreement.
-
In Yandex Cloud Billing
, make sure you have a billing account linked and its status isACTIVE
orTRIAL_ACTIVE
. If you do not have a billing account yet, create one. -
Assign the
speech-sense.spaces.creator
role to your Yandex Cloud account.Note
If you are unable to manage roles, contact your cloud or organization administrator.
-
Open the SpeechSense home page
. -
Select the organization to work with SpeechSense in or create a new one.
Set up your environment
-
Create a space to store all your projects: select Create space, enter a name, add a description if required, and click Create.
-
Link a billing account to the space. This account will be debited for the use of SpeechSense.
Tip
To manage a billing account, you must have a Yandex account. If using Yandex Cloud through an identity federation, contact
support. -
Navigate to the Connections tab and create a connection based on the metainformation of your audio recordings or text chats:
- Enter Connection name.
- Select the Two-channel audio or Сhat data type.
- Set the connection parameters. For more information, see Creating a connection.
- Click Create connection.
Example of metadata.json for loading audio data
{ "direction_outgoung": "true", "client_id": "456", "client_name": "John Doe", "date": "2023-09-29T09:08:38.958Z", "date_to": "2023-09-29T09:15:07.897Z", "language": "RU", "operator_id": "123", "operator_name": "Jane Doe" }
-
Create a project by clicking Create project on the space page, entering a name for the project, and adding a connection to it. For each connection, you can select filtering rules based on its metainformation and the YandexGPT Pro criteria. With filters, you can ensure that only the dialogs you need are added to the project. After adding all required connections and configuring filters, click Create project.
Upload your data
SpeechSense uses the gRPC API to upload data.
To upload data:
-
Add the service account to the namespace with the
speech-sense.data.editor
role. To learn more about the roles available in the service, see Access management in SpeechSense. -
Create an API key or IAM token for the service account to authenticate with the API. Learn more about authentication in the SpeechSense API.
-
Upload your audio data or chat history data using a Python data transfer script.
Use one message for the whole audio.
SpeechSense supports the following audio file formats: