© 2025 Direct Cursus Technology L.L.C.

In this article:

  • Text generation models
  • Multimodal models
  • Use cases

Batch processing

Written by
Yandex Cloud
Updated on September 30, 2025

Text generation models

| Model | License | URI | Context (tokens) |
|---|---|---|---|
| Qwen 2.5 7B Instruct | Apache 2.0 license | gpt://<folder_ID>/qwen2.5-7b-instruct | 32,768 |
| Qwen 2.5 72B Instruct | Qwen license | gpt://<folder_ID>/qwen2.5-72b-instruct | 16,384 |
| QwQ 32B Instruct | Apache 2.0 license | gpt://<folder_ID>/qwq-32b | 32,768 |
| Llama 3.3 70B Instruct [2] | Llama 3.3 license | gpt://<folder_ID>/llama3.3-70b-instruct | 8,192 |
| Llama 3.1 70B Instruct [2] | Llama 3.1 license | gpt://<folder_ID>/llama3.1-70b-instruct | 8,192 |
| DeepSeek-R1 Distill Llama 70B | MIT license; based on Llama-3.3-70B-Instruct (Llama-3.3-70B-Instruct Terms of Use) | gpt://<folder_ID>/deepseek-r1-distill-llama-70b | 8,192 |
| Qwen2.5 32B Instruct | Apache 2.0 license | gpt://<folder_ID>/qwen2.5-32b-instruct | 32,768 |
| DeepSeek-R1 Distill Qwen 32B | MIT license | gpt://<folder_ID>/deepseek-r1-distill-qwen-32b | 32,768 |
| phi-4 | MIT license | gpt://<folder_ID>/phi-4 | 16,384 |
| Gemma 3 1B it | Gemma Terms of Use | gpt://<folder_ID>/gemma-3-1b-it | 32,768 |
| Gemma 3 4B it | Gemma Terms of Use | gpt://<folder_ID>/gemma-3-4b-it | 131,072 |
| Gemma 3 12B it | Gemma Terms of Use | gpt://<folder_ID>/gemma-3-12b-it | 65,536 |
| Gemma 3 27B it | Gemma Terms of Use | gpt://<folder_ID>/gemma-3-27b-it | 32,768 |
| Qwen3 0.6B | Apache 2.0 license | gpt://<folder_ID>/qwen3-0.6b | 32,768 |
| Qwen3 1.7B | Apache 2.0 license | gpt://<folder_ID>/qwen3-1.7b | 32,768 |
| Qwen3 4B | Apache 2.0 license | gpt://<folder_ID>/qwen3-4b | 32,768 |
| Qwen3 8B | Apache 2.0 license | gpt://<folder_ID>/qwen3-8b | 32,768 |
| Qwen3 14B | Apache 2.0 license | gpt://<folder_ID>/qwen3-14b | 32,768 |
| Qwen3 32B | Apache 2.0 license | gpt://<folder_ID>/qwen3-32b | 32,768 |
| Qwen3 30B A3B | Apache 2.0 license | gpt://<folder_ID>/qwen3-30b-a3b | 32,768 |
| Qwen3 235B A22B | Apache 2.0 license | gpt://<folder_ID>/qwen3-235b-a22b | 32,768 |

[2] Llama was created by Meta. Meta is designated as an extremist organization and its activities are prohibited in Russia.
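
The URIs listed for text generation models all follow the pattern gpt://<folder_ID>/<model_name>. As a minimal sketch, the Python helpers below build such a URI and check a request against the listed context size. The function names, the sample folder ID, and the partial lookup table are hypothetical and only illustrate the pattern. The check also assumes that the context value bounds prompt and output tokens combined, which this page does not state.

```python
# Context sizes (in tokens) copied from a few rows of the text generation
# models table. This is a partial, illustrative lookup, not a complete list.
TEXT_MODEL_CONTEXT = {
    "qwen2.5-7b-instruct": 32_768,
    "qwen2.5-72b-instruct": 16_384,
    "llama3.3-70b-instruct": 8_192,
    "phi-4": 16_384,
    "gemma-3-4b-it": 131_072,
}


def build_model_uri(folder_id: str, model_name: str) -> str:
    """Return a URI of the form gpt://<folder_ID>/<model_name>."""
    if not folder_id or "/" in folder_id:
        raise ValueError("folder_id must be a non-empty ID without slashes")
    if model_name not in TEXT_MODEL_CONTEXT:
        raise KeyError(f"model not in the local lookup table: {model_name}")
    return f"gpt://{folder_id}/{model_name}"


def fits_context(model_name: str, prompt_tokens: int, max_output_tokens: int) -> bool:
    """Assumption: the context size covers prompt and output tokens together."""
    return prompt_tokens + max_output_tokens <= TEXT_MODEL_CONTEXT[model_name]


if __name__ == "__main__":
    # "b1gexamplefolder" is a made-up folder ID used only for illustration.
    print(build_model_uri("b1gexamplefolder", "phi-4"))
    print(fits_context("llama3.3-70b-instruct", 8_000, 500))
```

Running a check like this before assembling a batch dataset can flag oversized requests early, for example when moving the same prompts from a 32,768-token model to one with an 8,192-token context.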

Multimodal models

| Model | License | URI | Context (tokens) |
|---|---|---|---|
| Qwen2 VL 7B | Apache 2.0 license | gpt://<folder_ID>/qwen2-vl-7b-instruct/ | 4,096 |
| Qwen2.5 VL 7B | Apache 2.0 license | gpt://<folder_ID>/qwen2.5-vl-7b-instruct/ | 4,096 |
| Qwen 2.5 VL 32B Instruct | Apache 2.0 license | gpt://<folder_ID>/qwen2.5-vl-32b-instruct/ | 4,096 |
| DeepSeek VL2 | DeepSeek license | gpt://<folder_ID>/deepseek-vl2/ | 4,096 |
| DeepSeek VL2 Tiny | DeepSeek license | gpt://<folder_ID>/deepseek-vl2-tiny/ | 4,096 |
| Gemma 3 4B it | Gemma Terms of Use | gpt://<folder_ID>/gemma-3-4b-it/ | 4,096 |
| Gemma 3 12B it | Gemma Terms of Use | gpt://<folder_ID>/gemma-3-12b-it/ | 4,096 |
| Gemma 3 27B it | Gemma Terms of Use | gpt://<folder_ID>/gemma-3-27b-it/ | 4,096 |
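
The multimodal URIs are listed with a trailing slash, and every multimodal entry has a context of 4,096 tokens, including the Gemma 3 models that have larger contexts in the text generation list. The sketch below reproduces the URIs exactly as listed. The function name and sample folder ID are hypothetical, and whether the service treats the trailing slash as significant is not stated on this page.

```python
# Model names copied from the multimodal models table.
MULTIMODAL_MODELS = (
    "qwen2-vl-7b-instruct",
    "qwen2.5-vl-7b-instruct",
    "qwen2.5-vl-32b-instruct",
    "deepseek-vl2",
    "deepseek-vl2-tiny",
    "gemma-3-4b-it",
    "gemma-3-12b-it",
    "gemma-3-27b-it",
)

# Every multimodal entry in the table lists the same context size.
MULTIMODAL_CONTEXT_TOKENS = 4_096


def build_multimodal_uri(folder_id: str, model_name: str) -> str:
    """Return gpt://<folder_ID>/<model_name>/ with the trailing slash as listed."""
    if model_name not in MULTIMODAL_MODELS:
        raise KeyError(f"not listed as a multimodal model: {model_name}")
    return f"gpt://{folder_id}/{model_name}/"


if __name__ == "__main__":
    # "b1gexamplefolder" is a made-up folder ID used only for illustration.
    for name in MULTIMODAL_MODELS:
        print(build_multimodal_uri("b1gexamplefolder", name), MULTIMODAL_CONTEXT_TOKENS)
```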

Use cases

Running a model in batch mode
