Speech synthesis (TTS)
How can I voice long texts?
To voice a large text, break it into parts in any way convenient for you. The maximum size of speech synthesis requests is limited to 5,000 characters.
How do I configure stress and pronunciation?
To adjust the pronunciation of individual words and the text in general, use the SSML or TTS markup.
How do I add a pause in text?
To add a pause to your text, use TTS markup. Specify the pause duration in milliseconds in parentheses. A pause will appear where you place the tag. For example: Start sil<[3000]> continue in 3 seconds
.
Note that the pause is indicated as a recommendation only. SpeechKit is designed to generate natural speech.
A cURL request does not work in Windows PowerShell
In the Windows PowerShell terminal, the curl
command is an alias for the Invoke-WebRequest
The Yandex Cloud documentation provides examples of API calls using the Bash shell syntax. You can run them as is in the Linux console, macOS terminal, or WSL in Windows 10 or higher. To run the examples in Windows PowerShell, you will have to modify them yourself. For more on command equivalents between Bash and PowerShell, as well as tips, see Working with the Yandex Cloud CLI and API in Microsoft Windows.
What goes into the cost of synthesis?
For examples of calculating the usage cost, pricing rules, and effective prices, see SpeechKit pricing policy.