Speech synthesis using Playground
Written by
Updated at February 18, 2025
To convert text to speech via the SpeechKit Playground interface:
-
In the management console
, select the folder you are going to use to work with SpeechKit. -
From the list of services, select SpeechKit.
-
Go to the Speech synthesis tab.
-
Paste up to 5,000 characters of text into the central part of the window.
-
In the settings section on the left side of the window:
- Pauses: Set up fixed pauses between words using tags, e.g.,
<[small]>
,<[large]>
. For a pause of a particular length, use thesil<[t]>
tag, wheret
is the the pause length in milliseconds. - Emphasize word: Accent a word using the
<[accented]>
tag or by enclosing it in asterisks (** **
). - Stress: Mark the stressed vowel in homographs by prefixing it with
+
. - Phonemes: Tag words with
[[]]
to ensure proper pronunciation using phonemes.
- Pauses: Set up fixed pauses between words using tags, e.g.,
-
Under Synthesis settings on the right side of the window:
- Language: Set the speaker's language.
- Voice: Select the speaker's voice.
- Role: Control tone and emotion by selecting the speaker's role.
- Speech speed: Set the speaker's speech rate in the range from
0.1
to3.0
, where1.0
is the average human speech rate. - Voice pitch: Adjust the speaker's voice pitch. The higher the value, the greater the intonation contour of the synthesized audio in Hz.
- Audio format: Select the audio format.
-
Click
Synthesize and playback to synthesize speech. -
To download the result, click
.
SpeechKit Playground features basic speech synthesis options. For more flexible synthesis settings, use the API.