Yandex Foundation Models pricing policy
To calculate the cost of using the service, see the prices in this section.
In the management console
- YandexGPT API: 10 free requests per hour.
- YandexART: 10 free requests per day.
What goes into the cost of using Yandex Foundation Models
Billing unit
Foundation Models usage is detailed out in billing units. The cost of a billing unit is different for text generation and vectorization.
Text generation
Text generation cost is based on the overall number of prompt and response tokens and depends on the YandexGPT API request parameters. Namely, the cost depends on the following:
- Model that gets a request.
- Model working mode.
The number of prompt and response tokens for the same text may vary depending on model.
The total number of billing units is based on the overall number of prompt and response tokens and is rounded up to a whole number.
Tokenization
The use of tokenizer (TokenizerService calls and Tokenizer methods) is not charged.
Fine-tuned models
The use of models fine-tuned in Yandex DataSphere is charged according to the YandexGPT Pro policy.
Text classification
At the Preview stage, the use of classifiers based on YandexGPT is free of charge.
Text vectorization
The cost of text vectorization (getting text embeddings) depends on the size of the text submitted for vectorization.
Image generation
You are charged for each generation request in YandexART. The requests are not idempotent; therefore, two requests with the same settings and generation prompt are two separate requests.
Internal server errors
You are not charged for a request that fails due to an internal server error.
Prices for the Russia region
Warning
Prices for Yandex Cloud resources vary from region to region. For more information about the available regions, see Regions.
The currency that can be used to pay for resources depends on which legal entity the user has entered into agreement with. For more information about account registration, see Registering an account in Yandex Cloud.
Text generation in YandexGPT API
Number | Cost, without VAT |
---|---|
1,000 units | $0.0016 |
Model parameters | Number of units per token | Cost per 1,000 tokens, without VAT |
---|---|---|
YandexGPT Lite, synchronous mode | 1 | $0.0016 |
YandexGPT Lite, asynchronous mode | 0.5 | $0.0008 |
YandexGPT Pro, synchronous mode | 6 | $0.0096 |
YandexGPT Pro, asynchronous mode | 3 | $0.0048 |
Models fine-tuned in DataSphere, synchronous mode | 6 | $0.0096 |
Models fine-tuned in DataSphere, asynchronous mode | 3 | $0.0048 |
Text vectorization in YandexGPT API
Number | Cost, without VAT |
---|---|
1,000 units | $0.00008 |
Model parameters | Number of units per token | Cost per 1,000 tokens, without VAT |
---|---|---|
Embeddings | 1 | $0.00008 |
Image generation
Warning
The prices listed below will come into effect starting November 1, 2024.
Service | Cost, without VAT |
---|---|
1 request for YandexART image generation | $0.0176 |
Examples of YandexGPT API usage cost calculation
Calculating text generation cost
Example 1
Cost of using YandexGPT API for text generation with the following parameters:
- Number of prompt tokens: 225
- Number of response tokens: 525
- Model: YandexGPT Lite
- Model working mode: Synchronous
Total: ($0.0016 / 1,000 units) × 750 units = $0.0015
Example 2
Cost of using YandexGPT API for text generation with the following parameters:
- Number of prompt tokens: 115
- Number of response tokens: 1,500
- Model: YandexGPT Pro
- Model working mode: Asynchronous
The cost is calculated as follows:
- Number of prompt and response tokens: 115 + 1,500 = 1,615
- Price per 1,000 tokens for the YandexGPT Pro model, asynchronous mode: $0.0048
- Number of units per token for the YandexGPT Pro model, asynchronous mode: 3
- Total number of units in usage details: 1,615 × 3 = 4,845
Total: ($0.0048 / 1,000 tokens) × 1,615 tokens = $0.007752
Example 3
Cost of using YandexGPT API for text generation with the following parameters:
- Number of prompt tokens: 1,020
- Number of response tokens: 30
- Model: YandexGPT Pro fine-tuned in DataSphere
- Model working mode: Synchronous
The cost is calculated as follows:
- Number of prompt and response tokens: 1,020 + 30 = 1,050
- Price per 1,000 tokens for the model fine-tuned in DataSphere, synchronous mode: $0.0096
- Number of units per token for the model fine-tuned in DataSphere, synchronous mode: 6
- Total number of units in usage details: 1,050 × 6 = 6,300
Total: ($0.0016 / 1,000 units) × 6,300 units = $0.01008 or ($0.0096 / 1,000 tokens) × 1,050 tokens = $0.01008
Calculating text vectorization cost
Cost of using YandexGPT API for text vectorization with the following parameter:
- Number of tokens in the request: 2,000