System requirements for CPU-based servers
To install SpeechKit Hybrid, you need a Linux server with Docker Engine support. For a list of supported operating systems, see the Docker official documentation
Warning
The CPU must support the AVX2
On Linux, to check whether your host supports AVX2, run:
grep -q avx2 /proc/cpuinfo && echo AVX2 || echo No AVX2
Hardware requirements
Recommended hardware configurations to run SpeechKit Hybrid containers:
Container | CPU | RAM | HDD |
---|---|---|---|
Speech recognition (40 channels) | Two Intel® Xeon® Gold 6230R CPUs | 128 GB | 300 GB |
Speech synthesis (10-16 SPS1) | One Intel® Xeon® Gold 6230R CPU | 64 GB | 300 GB |
1 Seconds per second (SPS): Number of seconds of synthesized text generated per runtime second.
Software requirements
To install and configure SpeechKit Hybrid services, you will need the Yandex Cloud CLI and a registry in Yandex Container Registry.
-
If you do not have the Yandex Cloud command line interface yet, install and initialize it.
-
Create a registry in Yandex Container Registry.
The folder specified in the CLI profile is used by default. You can specify a different folder using the
--folder-name
or--folder-id
parameter.yc container registry create --name speechkit-hybrid
Result:
id: <registry_ID> folder_id: <folder_ID> name: speechkit-hybrid status: ACTIVE created_at: "<creation_date_and_time>"
-
Create a service account with the
editor
role for the selected folder. -
Create an API key for the service account.
-
Notify the SpeechKit team of the created registry ID. All the required containers will appear in your registry, and you will be provided with the
docker-compose.yaml
file that contains the deployment settings.