A Yandex project
© 2025 Yandex.Cloud LLC
Yandex SpeechSense

In this article:

  • gRPC request
  • GetTalkRequest
  • GetTalkResponse
  • Talk
  • Field
  • Transcription
  • Phrase
  • PhraseText
  • Word
  • PhraseStatistics
  • UtteranceStatistics
  • AudioSegmentBoundaries
  • DescriptiveStatistics
  • Quantile
  • RecognitionClassifierResult
  • PhraseHighlight
  • RecognitionClassifierLabel
  • AlgorithmMetadata
  • Error
  • SpeechStatistics
  • SilenceStatistics
  • InterruptsStatistics
  • InterruptsEvaluation
  • ConversationStatistics
  • SpeakerStatistics
  • Points
  • Quiz
  • TextClassifiers
  • ClassificationResult
  • ClassifierStatistics
  • Histogram
  • Summarization
  • SummarizationStatement
  • SummarizationField
  • TalkState
  • AlgorithmProcessingInfo

Talk Analytics API, gRPC: TalkService.Get

Written by Yandex Cloud
Updated on November 26, 2024

RPC for bulk retrieval of talks.

gRPC request

rpc Get (GetTalkRequest) returns (GetTalkResponse)

GetTalkRequest

{
  "organization_id": "string",
  "space_id": "string",
  "connection_id": "string",
  "project_id": "string",
  "talk_ids": [
    "string"
  ],
  "results_mask": "google.protobuf.FieldMask"
}

Field

Description

organization_id

string

ID of the organization

space_id

string

ID of the space

connection_id

string

ID of the connection to search for data in

project_id

string

ID of the project to search for data in

talk_ids[]

string

IDs of the talks to return. Requesting too many talks may result in a "message exceeds maximum size" error.
Up to 100 talks per request is recommended.

results_mask

google.protobuf.FieldMask

Mask of the analysis results to return. If not set, all types of analysis are returned.
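
The 100-talk recommendation for talk_ids[] can be followed by splitting a long ID list into batches. The sketch below is only an illustration: it builds plain dicts shaped like GetTalkRequest rather than the generated protobuf classes, the helper name build_get_talk_requests is hypothetical, and the mask path "transcription" is an assumed example, not a documented value.

```python
from typing import Dict, Iterator, List, Optional

# Recommended upper bound taken from the talk_ids[] description.
MAX_TALKS_PER_REQUEST = 100


def build_get_talk_requests(
    organization_id: str,
    space_id: str,
    connection_id: str,
    project_id: str,
    talk_ids: List[str],
    mask_paths: Optional[List[str]] = None,
) -> Iterator[Dict]:
    """Yield GetTalkRequest-shaped dicts with at most 100 talk IDs each."""
    for start in range(0, len(talk_ids), MAX_TALKS_PER_REQUEST):
        request: Dict = {
            "organization_id": organization_id,
            "space_id": space_id,
            "connection_id": connection_id,
            "project_id": project_id,
            "talk_ids": talk_ids[start:start + MAX_TALKS_PER_REQUEST],
        }
        if mask_paths:
            # google.protobuf.FieldMask carries a list of field paths.
            request["results_mask"] = {"paths": list(mask_paths)}
        yield request


batches = list(build_get_talk_requests(
    "org-1", "space-1", "conn-1", "proj-1",
    [f"talk-{i}" for i in range(250)],
    mask_paths=["transcription"],  # hypothetical path, for illustration only
))
print([len(b["talk_ids"]) for b in batches])  # [100, 100, 50]
```

Each yielded dict would then be converted to a real GetTalkRequest message and sent as a separate Get call.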

GetTalkResponse

{
  "talk": [
    {
      "id": "string",
      "organization_id": "string",
      "space_id": "string",
      "connection_id": "string",
      "project_ids": [
        "string"
      ],
      "created_by": "string",
      "created_at": "google.protobuf.Timestamp",
      "modified_by": "string",
      "modified_at": "google.protobuf.Timestamp",
      "talk_fields": [
        {
          "name": "string",
          "value": "string",
          "type": "FieldType"
        }
      ],
      "transcription": {
        "phrases": [
          {
            "channel_number": "int64",
            "start_time_ms": "int64",
            "end_time_ms": "int64",
            "phrase": {
              "text": "string",
              "language": "string",
              "normalized_text": "string",
              "words": [
                {
                  "word": "string",
                  "start_time_ms": "int64",
                  "end_time_ms": "int64"
                }
              ]
            },
            "statistics": {
              "statistics": {
                "speaker_tag": "string",
                "speech_boundaries": {
                  "start_time_ms": "int64",
                  "end_time_ms": "int64",
                  "duration_seconds": "int64"
                },
                "total_speech_ms": "int64",
                "speech_ratio": "double",
                "total_silence_ms": "int64",
                "silence_ratio": "double",
                "words_count": "int64",
                "letters_count": "int64",
                "words_per_second": {
                  "min": "double",
                  "max": "double",
                  "mean": "double",
                  "std": "double",
                  "quantiles": [
                    {
                      "level": "double",
                      "value": "double"
                    }
                  ]
                },
                "letters_per_second": {
                  "min": "double",
                  "max": "double",
                  "mean": "double",
                  "std": "double",
                  "quantiles": [
                    {
                      "level": "double",
                      "value": "double"
                    }
                  ]
                }
              }
            },
            "classifiers": [
              {
                "start_time_ms": "int64",
                "end_time_ms": "int64",
                "classifier": "string",
                "highlights": [
                  {
                    "text": "string",
                    "offset": "int64",
                    "count": "int64"
                  }
                ],
                "labels": [
                  {
                    "label": "string",
                    "confidence": "double"
                  }
                ]
              }
            ]
          }
        ],
        "algorithms_metadata": [
          {
            "created_task_date": "google.protobuf.Timestamp",
            "completed_task_date": "google.protobuf.Timestamp",
            "error": {
              "code": "string",
              "message": "string"
            },
            "trace_id": "string",
            "name": "string"
          }
        ]
      },
      "speech_statistics": {
        "total_simultaneous_speech_duration_seconds": "int64",
        "total_simultaneous_speech_duration_ms": "int64",
        "total_simultaneous_speech_ratio": "double",
        "simultaneous_speech_duration_estimation": {
          "min": "double",
          "max": "double",
          "mean": "double",
          "std": "double",
          "quantiles": [
            {
              "level": "double",
              "value": "double"
            }
          ]
        }
      },
      "silence_statistics": {
        "total_simultaneous_silence_duration_ms": "int64",
        "total_simultaneous_silence_ratio": "double",
        "simultaneous_silence_duration_estimation": {
          "min": "double",
          "max": "double",
          "mean": "double",
          "std": "double",
          "quantiles": [
            {
              "level": "double",
              "value": "double"
            }
          ]
        },
        "total_simultaneous_silence_duration_seconds": "int64"
      },
      "interrupts_statistics": {
        "speaker_interrupts": [
          {
            "speaker_tag": "string",
            "interrupts_count": "int64",
            "interrupts_duration_ms": "int64",
            "interrupts": [
              {
                "start_time_ms": "int64",
                "end_time_ms": "int64",
                "duration_seconds": "int64"
              }
            ],
            "interrupts_duration_seconds": "int64"
          }
        ]
      },
      "conversation_statistics": {
        "conversation_boundaries": {
          "start_time_ms": "int64",
          "end_time_ms": "int64",
          "duration_seconds": "int64"
        },
        "speaker_statistics": [
          {
            "speaker_tag": "string",
            "complete_statistics": {
              "speaker_tag": "string",
              "speech_boundaries": {
                "start_time_ms": "int64",
                "end_time_ms": "int64",
                "duration_seconds": "int64"
              },
              "total_speech_ms": "int64",
              "speech_ratio": "double",
              "total_silence_ms": "int64",
              "silence_ratio": "double",
              "words_count": "int64",
              "letters_count": "int64",
              "words_per_second": {
                "min": "double",
                "max": "double",
                "mean": "double",
                "std": "double",
                "quantiles": [
                  {
                    "level": "double",
                    "value": "double"
                  }
                ]
              },
              "letters_per_second": {
                "min": "double",
                "max": "double",
                "mean": "double",
                "std": "double",
                "quantiles": [
                  {
                    "level": "double",
                    "value": "double"
                  }
                ]
              }
            },
            "words_per_utterance": {
              "min": "double",
              "max": "double",
              "mean": "double",
              "std": "double",
              "quantiles": [
                {
                  "level": "double",
                  "value": "double"
                }
              ]
            },
            "letters_per_utterance": {
              "min": "double",
              "max": "double",
              "mean": "double",
              "std": "double",
              "quantiles": [
                {
                  "level": "double",
                  "value": "double"
                }
              ]
            },
            "utterance_count": "int64",
            "utterance_duration_estimation": {
              "min": "double",
              "max": "double",
              "mean": "double",
              "std": "double",
              "quantiles": [
                {
                  "level": "double",
                  "value": "double"
                }
              ]
            }
          }
        ]
      },
      "points": {
        "quiz": [
          {
            "request": "string",
            "response": "google.protobuf.StringValue",
            "id": "string"
          }
        ]
      },
      "text_classifiers": {
        "classification_result": [
          {
            "classifier": "string",
            "classifier_statistics": [
              {
                "channel_number": "google.protobuf.Int64Value",
                "total_count": "int64",
                "histograms": [
                  {
                    "count_values": [
                      "int64"
                    ]
                  }
                ]
              }
            ]
          }
        ]
      },
      "summarization": {
        "statements": [
          {
            "field": {
              "id": "string",
              "name": "string",
              "type": "SummarizationFieldType"
            },
            "response": [
              "string"
            ]
          }
        ]
      },
      "talk_state": {
        "processing_state": "ProcessingState",
        "algorithm_processing_infos": [
          {
            "algorithm": "Algorithm",
            "processing_state": "ProcessingState"
          }
        ]
      }
    }
  ]
}

Field

Description

talk[]

Talk

Talk

Field

Description

id

string

Talk ID

organization_id

string

space_id

string

connection_id

string

project_ids[]

string

created_by

string

Audit information

created_at

google.protobuf.Timestamp

modified_by

string

modified_at

google.protobuf.Timestamp

talk_fields[]

Field

Key-value representation of the talk fields and their values

transcription

Transcription

Various ML analysis results

speech_statistics

SpeechStatistics

silence_statistics

SilenceStatistics

interrupts_statistics

InterruptsStatistics

conversation_statistics

ConversationStatistics

points

Points

text_classifiers

TextClassifiers

summarization

Summarization

talk_state

TalkState

Field

connection field value

Field

Description

name

string

name of the field

value

string

field value

type

enum FieldType

field type

  • FIELD_TYPE_UNSPECIFIED
  • FIELD_TYPE_STRING
  • FIELD_TYPE_NUMBER
  • FIELD_TYPE_DECIMAL
  • FIELD_TYPE_BOOLEAN
  • FIELD_TYPE_DATE
  • FIELD_TYPE_JSON
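
Because value is always a string, a client usually converts it according to type. The following is a minimal sketch over plain dicts. The string encodings are assumptions that this page does not specify: NUMBER is taken to be an integer literal, BOOLEAN to be "true" or "false", and DATE is left as text because its format is not documented here.

```python
import json
from decimal import Decimal


def parse_field(field: dict):
    """Convert a Field's string value according to its FieldType."""
    field_type = field.get("type", "FIELD_TYPE_UNSPECIFIED")
    value = field["value"]
    if field_type == "FIELD_TYPE_NUMBER":
        return int(value)  # assumption: integer literal
    if field_type == "FIELD_TYPE_DECIMAL":
        return Decimal(value)
    if field_type == "FIELD_TYPE_BOOLEAN":
        return value.strip().lower() == "true"  # assumption: "true"/"false"
    if field_type == "FIELD_TYPE_JSON":
        return json.loads(value)
    # STRING, DATE and UNSPECIFIED values are returned unchanged.
    return value


sample_fields = [
    {"name": "attempts", "value": "3", "type": "FIELD_TYPE_NUMBER"},
    {"name": "score", "value": "4.50", "type": "FIELD_TYPE_DECIMAL"},
    {"name": "vip", "value": "true", "type": "FIELD_TYPE_BOOLEAN"},
    {"name": "extra", "value": "{\"lang\": \"en\"}", "type": "FIELD_TYPE_JSON"},
]
parsed = {f["name"]: parse_field(f) for f in sample_fields}
print(parsed)
```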

Transcription

Field

Description

phrases[]

Phrase

algorithms_metadata[]

AlgorithmMetadata

There might be several algorithms that work on the talk transcription, for example, SpeechKit and a translator, so there might be other fields here for tracing.

Phrase

Field

Description

channel_number

int64

start_time_ms

int64

end_time_ms

int64

phrase

PhraseText

statistics

PhraseStatistics

classifiers[]

RecognitionClassifierResult
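
As an illustration of how the Phrase fields fit together, the sketch below turns a Transcription into a readable, time-ordered transcript. It assumes the response has been converted to plain dicts (for example, with the protobuf JSON mapping, which renders int64 values as strings, hence the int() calls). The function name render_transcript and the sample data are hypothetical.

```python
def render_transcript(transcription: dict) -> str:
    """Return one line per phrase, ordered by phrase start time."""
    phrases = sorted(
        transcription.get("phrases", []),
        key=lambda p: int(p["start_time_ms"]),
    )
    lines = []
    for p in phrases:
        start_s = int(p["start_time_ms"]) / 1000
        channel = p["channel_number"]
        text = p["phrase"]["text"]
        lines.append(f"[{start_s:.2f}s] ch{channel}: {text}")
    return "\n".join(lines)


sample_transcription = {
    "phrases": [
        {
            "channel_number": "1",
            "start_time_ms": "2500",
            "end_time_ms": "4000",
            "phrase": {"text": "Hi, how can I help?"},
        },
        {
            "channel_number": "0",
            "start_time_ms": "0",
            "end_time_ms": "900",
            "phrase": {"text": "Hello"},
        },
    ]
}
print(render_transcript(sample_transcription))
```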

PhraseText

Field

Description

text

string

language

string

normalized_text

string

words[]

Word

Word

Field

Description

word

string

start_time_ms

int64

end_time_ms

int64

PhraseStatistics

Field

Description

statistics

UtteranceStatistics

UtteranceStatistics

Field

Description

speaker_tag

string

speech_boundaries

AudioSegmentBoundaries

Audio segment boundaries

total_speech_ms

int64

Total speech duration in ms

speech_ratio

double

Speech ratio within audio segment

total_silence_ms

int64

Total silence duration in ms

silence_ratio

double

Silence ratio within audio segment

words_count

int64

Number of words in recognized speech

letters_count

int64

Number of letters in recognized speech

words_per_second

DescriptiveStatistics

Descriptive statistics for words per second distribution

letters_per_second

DescriptiveStatistics

Descriptive statistics for letters per second distribution

AudioSegmentBoundaries

Field

Description

start_time_ms

int64

Audio segment start time in ms

end_time_ms

int64

Audio segment end time in ms

duration_seconds

int64

Duration in seconds

DescriptiveStatistics

Field

Description

min

double

Minimum observed value

max

double

Maximum observed value

mean

double

Estimated mean of distribution

std

double

Estimated standard deviation of distribution

quantiles[]

Quantile

List of evaluated quantiles

Quantile

Field

Description

level

double

Quantile level in range (0, 1)

value

double

Quantile value
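
To read a specific quantile (for example, the median) out of a DescriptiveStatistics message, a client can search quantiles[] by level. This is a sketch over plain dicts. Which levels the service actually returns is not stated on this page, so the helper returns None when the requested level is absent. The name quantile_value and the sample numbers are hypothetical.

```python
import math
from typing import Optional


def quantile_value(stats: dict, level: float) -> Optional[float]:
    """Return the value of the quantile with the given level, if present."""
    for q in stats.get("quantiles", []):
        # Compare with a tolerance because levels are floating-point numbers.
        if math.isclose(q["level"], level, abs_tol=1e-9):
            return q["value"]
    return None


words_per_second = {
    "min": 0.5,
    "max": 4.0,
    "mean": 2.1,
    "std": 0.7,
    "quantiles": [
        {"level": 0.5, "value": 2.0},
        {"level": 0.9, "value": 3.2},
    ],
}
print(quantile_value(words_per_second, 0.5))   # 2.0
print(quantile_value(words_per_second, 0.25))  # None
```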

RecognitionClassifierResult

Field

Description

start_time_ms

int64

Start time of the audio segment used for classification

end_time_ms

int64

End time of the audio segment used for classification

classifier

string

Name of the triggered classifier

highlights[]

PhraseHighlight

List of highlights, i.e. parts of phrase that determine the result of the classification

labels[]

RecognitionClassifierLabel

Classifier predictions

PhraseHighlight

Field

Description

text

string

Text transcription of the highlighted audio segment

offset

int64

Offset, in characters, from the beginning of the whole phrase to where the highlight begins

count

int64

Number of characters in the highlighted text
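
The offset and count fields can be used to locate a highlight inside the phrase it belongs to. The sketch below assumes that both are measured in Unicode code points (which is how Python indexes strings) and that they refer to the phrase's text field rather than normalized_text. Neither assumption is confirmed on this page. The helper name highlight_span is hypothetical.

```python
def highlight_span(phrase_text: str, highlight: dict) -> str:
    """Slice the highlighted part out of the phrase text."""
    offset = int(highlight["offset"])
    count = int(highlight["count"])
    return phrase_text[offset:offset + count]


phrase_text = "thank you for calling, how can I help"
highlight = {"text": "thank you", "offset": "0", "count": "9"}

span = highlight_span(phrase_text, highlight)
print(span)                       # thank you
print(span == highlight["text"])  # True
```

Comparing the slice with the highlight's own text field, as in the last line, is a cheap way to check whether the assumed indexing matches real responses.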

RecognitionClassifierLabel

Field

Description

label

string

The label of the class predicted by the classifier

confidence

double

The prediction confidence
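
A common way to consume labels[] is to keep only the most confident prediction. A minimal sketch over plain dicts follows. The helper name top_label and the label strings are made up for the example.

```python
from typing import Optional, Tuple


def top_label(result: dict) -> Optional[Tuple[str, float]]:
    """Return (label, confidence) of the most confident prediction."""
    labels = result.get("labels", [])
    if not labels:
        return None
    best = max(labels, key=lambda item: item["confidence"])
    return best["label"], best["confidence"]


classifier_result = {
    "classifier": "politeness",  # hypothetical classifier name
    "labels": [
        {"label": "polite", "confidence": 0.91},
        {"label": "impolite", "confidence": 0.09},
    ],
}
print(top_label(classifier_result))  # ('polite', 0.91)
```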

AlgorithmMetadata

Field

Description

created_task_date

google.protobuf.Timestamp

completed_task_date

google.protobuf.Timestamp

error

Error

trace_id

string

name

string

Error

Field

Description

code

string

message

string
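
One possible use of algorithms_metadata[] is to collect the algorithms that reported an error, together with the trace_id needed for a support request. The sketch assumes, without confirmation from this page, that a successful run leaves error unset or empty. The helper name failed_algorithms and all sample values are hypothetical.

```python
from typing import List, Tuple


def failed_algorithms(transcription: dict) -> List[Tuple[str, str, str, str]]:
    """Return (name, code, message, trace_id) for entries with an error."""
    failures = []
    for meta in transcription.get("algorithms_metadata", []):
        error = meta.get("error") or {}
        # Assumption: an empty or missing error means the run succeeded.
        if error.get("code") or error.get("message"):
            failures.append((
                meta.get("name", ""),
                error.get("code", ""),
                error.get("message", ""),
                meta.get("trace_id", ""),
            ))
    return failures


sample = {
    "algorithms_metadata": [
        {"name": "speechkit", "trace_id": "t-1"},
        {
            "name": "translator",
            "trace_id": "t-2",
            "error": {"code": "E42", "message": "timeout"},
        },
    ]
}
print(failed_algorithms(sample))
```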

SpeechStatistics

Field

Description

total_simultaneous_speech_duration_seconds

int64

Total simultaneous speech duration in seconds

total_simultaneous_speech_duration_ms

int64

Total simultaneous speech duration in ms

total_simultaneous_speech_ratio

double

Simultaneous speech ratio within audio segment

simultaneous_speech_duration_estimation

DescriptiveStatistics

Descriptive statistics for simultaneous speech duration distribution

SilenceStatistics

Field

Description

total_simultaneous_silence_duration_ms

int64

total_simultaneous_silence_ratio

double

Simultaneous silence ratio within audio segment

simultaneous_silence_duration_estimation

DescriptiveStatistics

Descriptive statistics for simultaneous silence duration distribution

total_simultaneous_silence_duration_seconds

int64

InterruptsStatistics

Field

Description

speaker_interrupts[]

InterruptsEvaluation

Interrupts description for every speaker

InterruptsEvaluation

Field

Description

speaker_tag

string

Speaker tag

interrupts_count

int64

Number of interrupts made by the speaker

interrupts_duration_ms

int64

Total duration of all interrupts in ms

interrupts[]

AudioSegmentBoundaries

Boundaries for every interrupt

interrupts_duration_seconds

int64

Total duration of all interrupts in seconds
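
The per-speaker interrupt data can be condensed into a small summary. In the sketch below, the reported total is shown next to a total recomputed from the individual interrupts[] boundaries. Whether the two always match is not stated on this page, so the recomputed value is only a cross-check. The helper name interrupts_summary and the speaker tag are hypothetical.

```python
def interrupts_summary(stats: dict) -> dict:
    """Map speaker_tag to interrupt count and total duration in ms."""
    summary = {}
    for evaluation in stats.get("speaker_interrupts", []):
        # Recompute the total from the boundaries of every interrupt.
        from_boundaries = sum(
            int(b["end_time_ms"]) - int(b["start_time_ms"])
            for b in evaluation.get("interrupts", [])
        )
        summary[evaluation["speaker_tag"]] = {
            "count": int(evaluation["interrupts_count"]),
            "reported_ms": int(evaluation["interrupts_duration_ms"]),
            "from_boundaries_ms": from_boundaries,
        }
    return summary


interrupts_statistics = {
    "speaker_interrupts": [
        {
            "speaker_tag": "operator",
            "interrupts_count": "2",
            "interrupts_duration_ms": "1500",
            "interrupts": [
                {"start_time_ms": "1000", "end_time_ms": "1800"},
                {"start_time_ms": "5000", "end_time_ms": "5700"},
            ],
        }
    ]
}
print(interrupts_summary(interrupts_statistics))
```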

ConversationStatistics

Field

Description

conversation_boundaries

AudioSegmentBoundaries

Audio segment boundaries

speaker_statistics[]

SpeakerStatistics

Average statistics for each speaker

SpeakerStatistics

Field

Description

speaker_tag

string

Speaker tag

complete_statistics

UtteranceStatistics

Analysis of all phrases, treated as a single utterance

words_per_utterance

DescriptiveStatistics

Descriptive statistics for words per utterance distribution

letters_per_utterance

DescriptiveStatistics

Descriptive statistics for letters per utterance distribution

utterance_count

int64

Number of utterances

utterance_duration_estimation

DescriptiveStatistics

Descriptive statistics for utterance duration distribution
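
As an example of combining SpeakerStatistics entries, the sketch below computes each speaker's share of the total speech time from complete_statistics.total_speech_ms. It works on plain dicts. The helper name speech_share and the speaker tags are made up for the illustration.

```python
def speech_share(conversation_statistics: dict) -> dict:
    """Map speaker_tag to that speaker's fraction of all speech time."""
    totals = {
        s["speaker_tag"]: int(s["complete_statistics"]["total_speech_ms"])
        for s in conversation_statistics.get("speaker_statistics", [])
    }
    overall = sum(totals.values())
    if overall == 0:
        # Avoid division by zero when nobody spoke.
        return {tag: 0.0 for tag in totals}
    return {tag: ms / overall for tag, ms in totals.items()}


conversation_statistics = {
    "speaker_statistics": [
        {
            "speaker_tag": "operator",
            "complete_statistics": {"total_speech_ms": "30000"},
        },
        {
            "speaker_tag": "client",
            "complete_statistics": {"total_speech_ms": "10000"},
        },
    ]
}
print(speech_share(conversation_statistics))
# {'operator': 0.75, 'client': 0.25}
```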

Points

Field

Description

quiz[]

Quiz

Quiz

Field

Description

request

string

response

google.protobuf.StringValue

id

string

TextClassifiers

Field

Description

classification_result[]

ClassificationResult

ClassificationResult

Field

Description

classifier

string

Classifier name

classifier_statistics[]

ClassifierStatistics

Classifier statistics

ClassifierStatistics

Field

Description

channel_number

google.protobuf.Int64Value

Channel number; null for the whole talk

total_count

int64

Total count for the classifier

histograms[]

Histogram

Various histograms built on top of the classifiers

Histogram

Field

Description

count_values[]

int64

Histogram count values. For example:
if len(count_values) = 2, the histogram is split 50/50;
if len(count_values) = 3, the [0] value represents the first third, [1] the second third, [2] the last third, and so on.
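
The bucket layout described for count_values[] can be made explicit in code. The sketch below treats the N values as N equal consecutive parts (halves, thirds, and so on) and reports each part's relative position and share of the total count. Reading the parts as segments of the talk timeline is an assumption, and the helper name histogram_buckets is hypothetical.

```python
from typing import List


def histogram_buckets(count_values: List) -> List[dict]:
    """Describe every bucket as a relative range with its count and share."""
    counts = [int(v) for v in count_values]
    total = sum(counts)
    n = len(counts)
    buckets = []
    for i, count in enumerate(counts):
        buckets.append({
            "from": i / n,        # relative start of the part, 0..1
            "to": (i + 1) / n,    # relative end of the part, 0..1
            "count": count,
            "share": count / total if total else 0.0,
        })
    return buckets


for bucket in histogram_buckets(["1", "0", "3"]):
    print(bucket)
```

With three values, the first bucket covers the first third, the second covers the second third, and the last covers the final third, matching the description above.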

Summarization

Field

Description

statements[]

SummarizationStatement

SummarizationStatement

Field

Description

field

SummarizationField

response[]

string

SummarizationField

Field

Description

id

string

name

string

type

enum SummarizationFieldType

  • SUMMARIZATION_FIELD_TYPE_UNSPECIFIED
  • TEXT
  • TEXT_ARRAY
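
A client may want to flatten statements[] into a simple name-to-answer mapping. The sketch below assumes that a TEXT field is best shown as a single string (its response items joined with spaces) while a TEXT_ARRAY field keeps its list. That interpretation is a guess based on the enum names only. The helper name and the sample field names are hypothetical.

```python
def summarization_to_dict(summarization: dict) -> dict:
    """Map each summarization field name to its response."""
    result = {}
    for statement in summarization.get("statements", []):
        field = statement["field"]
        responses = list(statement.get("response", []))
        if field.get("type") == "TEXT":
            # Assumption: TEXT answers are meant to be read as one string.
            result[field["name"]] = " ".join(responses)
        else:
            result[field["name"]] = responses
    return result


summarization = {
    "statements": [
        {
            "field": {"id": "f1", "name": "Topic", "type": "TEXT"},
            "response": ["Billing question"],
        },
        {
            "field": {"id": "f2", "name": "Action items", "type": "TEXT_ARRAY"},
            "response": ["Send invoice", "Call back"],
        },
    ]
}
print(summarization_to_dict(summarization))
```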

TalkState

Field

Description

processing_state

enum ProcessingState

  • PROCESSING_STATE_UNSPECIFIED
  • PROCESSING_STATE_NOT_STARTED
  • PROCESSING_STATE_PROCESSING
  • PROCESSING_STATE_SUCCESS
  • PROCESSING_STATE_FAILED

algorithm_processing_infos[]

AlgorithmProcessingInfo

AlgorithmProcessingInfo

Field

Description

algorithm

enum Algorithm

  • ALGORITHM_UNSPECIFIED
  • ALGORITHM_SPEECHKIT
  • ALGORITHM_YGPT
  • ALGORITHM_CLASSIFIER
  • ALGORITHM_SUMMARIZATION
  • ALGORITHM_EMBEDDING
  • ALGORITHM_STATISTICS

processing_state

enum ProcessingState

  • PROCESSING_STATE_UNSPECIFIED
  • PROCESSING_STATE_NOT_STARTED
  • PROCESSING_STATE_PROCESSING
  • PROCESSING_STATE_SUCCESS
  • PROCESSING_STATE_FAILED
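
Before reading analysis results, a client can check talk_state to see whether processing has finished and which algorithms failed. The sketch below works on plain dicts with enum values given as their names. Treating SUCCESS and FAILED as the only terminal states is an assumption drawn from the enum names, and the helper names are hypothetical.

```python
from typing import List

# Assumption: these two states mean that no further processing will happen.
TERMINAL_STATES = {"PROCESSING_STATE_SUCCESS", "PROCESSING_STATE_FAILED"}


def is_finished(talk_state: dict) -> bool:
    """Return True when the talk is in a terminal processing state."""
    return talk_state.get("processing_state") in TERMINAL_STATES


def algorithms_in_state(talk_state: dict, state: str) -> List[str]:
    """List the algorithms whose processing_state equals the given state."""
    return [
        info["algorithm"]
        for info in talk_state.get("algorithm_processing_infos", [])
        if info.get("processing_state") == state
    ]


talk_state = {
    "processing_state": "PROCESSING_STATE_FAILED",
    "algorithm_processing_infos": [
        {"algorithm": "ALGORITHM_SPEECHKIT",
         "processing_state": "PROCESSING_STATE_SUCCESS"},
        {"algorithm": "ALGORITHM_SUMMARIZATION",
         "processing_state": "PROCESSING_STATE_FAILED"},
    ],
}
print(is_finished(talk_state))  # True
print(algorithms_in_state(talk_state, "PROCESSING_STATE_FAILED"))
# ['ALGORITHM_SUMMARIZATION']
```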
