Text generation models
YandexGPT API provides access to large generative models:
- The standard YandexGPT Lite suitable for real-time tasks.
- The large YandexGPT Pro for more accurate responses to sophisticated prompts.
If out-of-the-box models are not enough, you can fine-tune YandexGPT Lite and Llama 8b1 for them to provide more accurate responses to your requests.
To access your model via the API, under modelUri
, specify its URI/latest
, /rc
, and /deprecated
segments indicate the model version. /latest
is used by default.
Generative models
When updating models, generations available in different branches (/latest
, /rc
, and /deprecated
segments) may change.
Model |
URI |
Generation |
|
YandexGPT Lite |
|
344 |
Asynchronous, synchronous |
YandexGPT Pro |
|
344 |
Asynchronous, synchronous |
YandexGPT Pro 32k |
|
44 |
Synchronous2 |
Model fine-tuned in Yandex DataSphere |
|
3 |
Asynchronous, synchronous |
Llama 8b1 |
|
3.1 |
Asynchronous, synchronous |
Llama 70b1 |
|
3.1 |
Asynchronous, synchronous |
Modified models share usage quotas with their basic models.
1 Llama was created by Meta. Meta is designated as an extremist organization and its activities are prohibited in Russia.
2 32kYandexGPT Pro features an expanded context and is designed specifically to handle large texts in synchronous mode. In asynchronous mode, the YandexGPT Pro model supports the same amount of context.
Model lifecycle
Each model has certain lifecycle characteristics, such as the model name, branch, and release date. These characteristics allow you to precisely identify the model version. Below, you can see our rules for updating models. Refer to these rules to adjust your solutions to a new version as apporpriate.
For each model, there are three branches (in the order from the oldest to the newest one): Deprecated
, Latest
, and Release Candidate
(RC
). Each of the branches is subject to the SLA.
The RC
branch is updated as the new model is ready and may change at any time. When a model in the RC
branch is ready for general use, we announce the upcoming release both in the release notes and our Telegram community
One month after the announcement, the RC
version becomes the Latest
one, and the Latest
version is moved to the Deprecated
branch. We continue the support of the Deprecated
version for one more month, after which models in the Deprecated
and Latest
branches become identical.
Tuning capabilities
You cannot fine-tune a text generation model based on new data, e.g., the knowledge base of your support service. However, you can train the model to generate responses in a specific format or analyze texts. You can train the model to:
- Summarize and rewrite texts.
- Generate questions and answers from text input.
- Provide responses in a particular format or style.
- Classify texts, queries, and conversations.
- Extract entities from texts.