Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
  • Blog
  • Pricing
  • Documentation
Yandex project
© 2025 Yandex.Cloud LLC
Yandex SpeechKit
  • SpeechKit technology overview
    • About the technology
    • Supported languages
    • Streaming recognition
    • Recognition result normalization
    • Analyzing recognition results
    • Speaker labeling
    • Extending a speech recognition model
    • Uploading fine-tuning data for a speech recognition model
    • Detecting the end of utterance
  • Supported audio formats
  • IVR integration
  • Quotas and limits
  • Access management
  • Pricing policy

In this article:

  • Managing normalization parameters
  • API v1
  • API v2
  • API v3
  1. Speech recognition
  2. Recognition result normalization

Recognition result normalization

Written by
Yandex Cloud
Updated at April 11, 2025
  • Managing normalization parameters
    • API v1
    • API v2
    • API v3

Normalization is converting (post-processing) the text that results from running a Yandex SpeechKit speech recognition model. It is carried out based on the parameters specified in the API requests.

Note

Normalization is not done in auto language (auto language code) recognition mode.

You can use the following normalization levels:

  • Representing dates and time in numeric format.
  • Converting numbers from dictionary to numeric format.
  • Capitalizing the beginning of a sentence and proper nouns.
  • Profanity masking.
  • Formatting phone numbers, e.g., 7(999)123-45-67 instead of 7 999 123 45 67.
  • Adding punctuation marks.

Warning

Rules for the presentation of normalized text may change. Please consider this during integration and stay tuned for updates.

Managing normalization parameters

API v1

  • REST API v1 for synchronous recognition.
Normalization level API parameter
Profanity masking profanityFilter=true
Converting numbers from dictionary to numeric format Enabled by default. To disable, set rawResults=true

API v2

  • gRPC API v2 for streaming recognition.
Normalization level API parameter
Profanity masking config.specification.profanityFilter=true
Converting numbers from dictionary to numeric format Enabled by default. To disable, set config.specification.rawResults=true
  • REST API v2 for asynchronous recognition.
Normalization level API parameter
Profanity masking config.specification.profanityFilter=true
Converting numbers from dictionary to numeric format Enabled by default. To disable, set config.specification.rawResults=true
Capitalizing the beginning of a sentence and proper nouns. Adding punctuation marks config.specification.literature_text=true

API v3

To enable normalization, set the text_normalization=TEXT_NORMALIZATION_ENABLED parameter in the API. Some options are activated by additional parameters if you set text_normalization=TEXT_NORMALIZATION_ENABLED.

Normalization level API parameter
Representing dates and time in numeric format text_normalization=TEXT_NORMALIZATION_ENABLED
Converting numbers from dictionary to numeric format text_normalization=TEXT_NORMALIZATION_ENABLED
Formatting phone numbers text_normalization=TEXT_NORMALIZATION_ENABLED. To disable this level only, additionally specify phone_formatting_mode=PHONE_FORMATTING_MODE_DISABLED.
Profanity masking profanity_filter=true with text_normalization enabled
Capitalizing the beginning of a sentence and proper nouns. Adding punctuation marks literature_text=true with text_normalization enabled

Was the article helpful?

Previous
Asynchronous recognition
Next
Analyzing recognition results
Yandex project
© 2025 Yandex.Cloud LLC