Common instance models
Yandex AI Studio provides access to large generative models from different vendors. If out-of-the-box models are not enough, you can fine-tune some of them for more accurate responses. All roles required for working with the models are listed in Access management in Yandex AI Studio.
In a common instance, model resources are available to all Yandex Cloud users and shared between them, so model response time may increase under heavy workloads. We guarantee that no other user can access the context of your exchanges with the model: even with logging on, requests are stored anonymized and potentially sensitive information is masked. However, we recommend disabling data logging whenever you use our models to process sensitive information.
Common instance models are subject to the update rules described in Model lifecycle. When updating models, generations available in different branches (/latest
, /rc
, and /deprecated
segments) may change. Modified models share usage quotas with their basic models.
Model and URI |
Generation |
Context |
|
YandexGPT Lite |
Deprecated 5Latest 5RC 5 |
32,000 |
Asynchronous, synchronous |
YandexGPT Pro |
Deprecated 5Latest 5RC 5.1 |
32,000 |
Asynchronous, synchronous |
Qwen3 235B |
— |
256,000 |
|
gpt-oss-120b |
— |
128,000 |
|
gpt-oss-20b |
— |
128,000 |
|
Fine-tuned text models |
Depends on the basic model |
Depends on the basic model |
Asynchronous, synchronous |
Gemma 3 27B |
— |
128 000 |
|
YandexART |
— |
— |
Asynchronous |
1 Llama was created by Meta. Meta is designated as an extremist organization and its activities are prohibited in Russia.
Gemma 3 27B processes Base64-encoded images. The model can handle images of any aspect ratio thanks to an adaptive algorithm that scales the longer side of the image to 896 pixels while preserving important visual details. Each image uses 256 context tokens.
Model lifecycle
Each model has a number of lifecycle characteristics, such as model name, branch, and release date. These allow you to uniquely identify the model version. Refer to the model update rules provided below to adapt your solutions to the new version as needed.
There are three model branches (from old to new): Deprecated
, Latest
, Release Candidate
(RC
). Each of the branches is subject to the SLA.
The RC
branch is updated as the new model is ready and may change at any time. When a model in the RC
branch is ready for general use, we announce the upcoming release both in the release notes and our Telegram community
One month after the announcement, the RC
version becomes the Latest
one, and the Latest
version is moved to the Deprecated
branch. We continue the support of the Deprecated
version for one more month, after which models in the Deprecated
and Latest
branches become identical.
Accessing models
You can access text generation models of different versions in a number of ways.
When operating text generation models via Yandex Cloud ML SDK, use one of the following formats:
-
Model name, provided as a string. Only the
Latest
versions are available.# Text generation model = ( sdk.models.completions("yandexgpt") ) # Image generation model = ( sdk.models.image_generation("yandex-art") )
-
Model name and version, provided as strings in the
model_name
andmodel_version
fields, respectively.# Text generation model = ( sdk.models.completions(model_name="yandexgpt-lite", model_version="rc") ) # Image generation model = ( sdk.models.image_generation(model_name="yandex-art", model_version="latest") )
This example explicitly specifies the
YandexGPT Lite
model of theRelease Candidate
version and theYandexART
model of theLatest
version. -
Model URI, provided as a string containing the full URI of the required model version. You can also use this method to access fine-tuned models.
# Text generation model = ( sdk.models.completions("gpt://b1gt6g8ht345********/yandexgpt/deprecated") ) # Image generation model = ( sdk.models.image_generation("art://b1gt6g8ht345********/yandex-art/latest") )
This example explicitly specifies the
YandexGPT Pro
model of theDeprecated
version and theYandexART
model of theLatest
version.
To access YandexGPT models via the REST API or gRPC API, specify the model URI containing the folder ID in the modelUri
field of the request body. The /latest
, /rc
, and /deprecated
segments indicate the model version. /latest
is used by default.
To access a YandexART model via the REST API or gRPC API, specify the model URI containing the folder ID in the modelUri
field of the request body. The /latest
segment indicates the model version and is optional.
-
Accessing the
Latest
versions:{ "modelUri": "gpt://b1gt6g8ht345********/yandexgpt-lite/latest" ... "modelUri": "art://b1gt6g8ht345********/yandex-art/latest" }
-
Accessing the
RC
version (if available):{ "modelUri": "gpt://b1gt6g8ht345********/yandexgpt-lite/rc" ... }