Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
  • Blog
  • Pricing
  • Documentation
Yandex project
© 2025 Yandex.Cloud LLC
Yandex Foundation Models
    • Overview
    • API authentication
        • Overview
          • Overview
          • Describe
          • Validate
          • Create
          • Update
          • Delete
          • List
          • ListUploadFormats
          • ListUploadSchemas
          • GetUploadDraftUrl
          • GetDownloadUrls
          • StartMultipartUploadDraft
          • FinishMultipartUploadDraft
          • ListTypes
          • GetPreview
          • ListOperationsIds
  • Yandex Cloud ML SDK
  • Compatibility with OpenAI
  • Access management
  • Pricing policy
  • Public materials
  • Release notes

In this article:

  • gRPC request
  • CreateDatasetRequest
  • CreateDatasetResponse
  • DatasetInfo
  • ValidationError
  1. API references
  2. Dataset API
  3. gRPC
  4. Dataset
  5. Create

Foundation Models Dataset Service API, gRPC: DatasetService.Create

Written by
Yandex Cloud
Updated at January 23, 2025
  • gRPC request
  • CreateDatasetRequest
  • CreateDatasetResponse
  • DatasetInfo
  • ValidationError

Creates dataset.

gRPC requestgRPC request

rpc Create (CreateDatasetRequest) returns (CreateDatasetResponse)

CreateDatasetRequestCreateDatasetRequest

{
  "name": "string",
  "folder_id": "string",
  "description": "string",
  "metadata": "string",
  "task_type": "string",
  "labels": "map<string, string>",
  "upload_format": "string",
  "allow_data_log": "bool"
}

Field

Description

name

string

Required field. Name of the dataset.

folder_id

string

Required field. Folder ID of the dataset.

description

string

Description of the dataset. Optional.

metadata

string

Metadata of the dataset. Optional.

task_type

string

Required field. Task type of the dataset.

labels

object (map<string, string>)

Labels of the dataset. Optional.

upload_format

string

Required field. Upload format of the dataset.
The list of supported upload formats can be retrieved via ListUploadFormats method.

allow_data_log

bool

Allow to use the dataset to improve the models quality. Default false.

CreateDatasetResponseCreateDatasetResponse

{
  "dataset_id": "string",
  "dataset": {
    "dataset_id": "string",
    "folder_id": "string",
    "name": "string",
    "description": "string",
    "metadata": "string",
    "status": "Status",
    "task_type": "string",
    "created_at": "google.protobuf.Timestamp",
    "updated_at": "google.protobuf.Timestamp",
    "rows": "int64",
    "size_bytes": "int64",
    "created_by_id": "string",
    "labels": "map<string, string>",
    "created_by": "string",
    "updated_by": "string",
    "validation_error": [
      {
        "error": "string",
        "error_description": "string",
        "row_numbers": [
          "int64"
        ]
      }
    ],
    "allow_data_log": "bool"
  }
}

Field

Description

dataset_id

string

ID of the created dataset.

dataset

DatasetInfo

Information about the dataset.

DatasetInfoDatasetInfo

Information about the dataset.

Field

Description

dataset_id

string

ID of the dataset.

folder_id

string

Folder ID of the dataset.

name

string

Name of the dataset.

description

string

Description of the dataset.

metadata

string

Metadata of the dataset.

status

enum Status

Status of the dataset.

  • STATUS_UNSPECIFIED
  • DRAFT
  • VALIDATING
  • READY
  • INVALID
  • DELETING

task_type

string

Task type of the dataset.

created_at

google.protobuf.Timestamp

Create dataset timestamp.

updated_at

google.protobuf.Timestamp

Update dataset timestamp.

rows

int64

Number of rows in the dataset.

size_bytes

int64

Size of the dataset.

created_by_id

string

Deprecated. Use created_by instead

labels

object (map<string, string>)

Labels of the dataset

created_by

string

User ID of the dataset's creator.

updated_by

string

User ID of the dataset's last updater.

validation_error[]

ValidationError

allow_data_log

bool

Allow to use the dataset to improve the models quality. Default false.

ValidationErrorValidationError

Information about dataset validation error.

Field

Description

error

string

Name of the validation error.

error_description

string

Description of the validation error.

row_numbers[]

int64

Row numbers in which the error occurred.

Was the article helpful?

Previous
Validate
Next
Update
Yandex project
© 2025 Yandex.Cloud LLC