Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
    • Yandex Cloud Partner program
  • Blog
  • Pricing
  • Documentation
© 2025 Direct Cursus Technology L.L.C.
Yandex DataSphere
  • Getting started
    • About Yandex DataSphere
    • DataSphere resource relationships
    • Communities
    • Cost management
    • Project
    • Computing resource configurations
      • Nodes and aliases
      • Health checks and monitoring
      • Node metrics
    • Foundation models
    • Quotas and limits
    • Special terms for educational institutions
  • Terraform reference
  • Audit Trails events
  • Access management
  • Pricing policy
  • Public materials
  • Release notes

In this article:

  • Nodes
  • Node from a model
  • Node from a Docker image
  • Node statuses
  • Instance statuses
  • Alias
  1. Concepts
  2. DataSphere Inference
  3. Nodes and aliases

DataSphere Inference

Written by
Yandex Cloud
Updated at September 27, 2024
  • Nodes
    • Node from a model
    • Node from a Docker image
    • Node statuses
    • Instance statuses
  • Alias

DataSphere provides tools for releasing services available to third-party resources. You can deploy a model trained in DataSphere and use the same tools to develop a fully featured service based on a Docker image.

To publish a service, DataSphere provides special resources: nodes and aliases.

NodesNodes

A node is an isolated group of specially created VMs (instances) the computing load is distributed across. Node instances are created with a preset environment and fixed interpreter state. Depending on your needs, you can select different instance configurations.

Warning

When deploying and using models, you pay for the uptime of each node instance: from its start to deletion.

If you no longer need the service you deployed, delete the node.

You can access the nodes via the API. API requests can change the state of the node interpreter. To return to the initial state, you will have to recreate the entire node.

Note

The maximum size of a request to and a response from the node API is 16 MB.

To create a node, specify your organization's cloud folder where the node will deploy its instances and store its logs. In the folder, set up a subnet with internet access via a NAT gateway and create a service account with the vpc.user role. Specify this service account in the DataSphere project settings.

Node from a modelNode from a model

With nodes from models, you can deploy your models saved in DataSphere as a service accessible via the API. You can create a node from the following models:

  • LightGBM
  • ONNX
  • TensorFlow.Keras
  • TensorFlow
  • PyTorch
  • XGBoost

If you want to deploy a model that is not supported by the Triton server, you can convert it to ONNX format or deploy a model based on a Docker image.

To create a node from a model trained outside of DataSphere, load the model from the file to a variable in the notebook and then create a DataSphere model from this variable.

DataSphere uses Triton Inference Server to deploy nodes from models. You can deploy multiple models saved to DataSphere on a single node. DataSphere itself will define the input and output parameters, ensure the model API operation, and then provide monitoring for both the node and the Triton server.

Note

When deploying PyTorch models, DataSphere cannot automatically figure out the input and output parameters.

Node from a Docker imageNode from a Docker image

Nodes deployed from a Docker image hosted in a container registry will run as a fully featured service. The Docker image does not have to contain a model trained in DataSphere. You can create any image and place it in any registry you find appropriate. To learn how to push a Docker image to a Yandex Container Registry registry, see Pushing a Docker image to a registry.

Note

To use Yandex Container Registry, the project service account needs the container-registry.images.puller role.

When creating a node from a Docker image, you set the node's API, port you want your service to use, connection time, format of metrics you will collect, and other parameters. Once the node is created, DataSphere will monitor its state, maintain the operation of the instances, and scale the node within the specified instance range as needed. For instances, you can use the ru-central1-a and ru-central1-b availability zones.

To work with a node based on a large model or Docker image, you can connect an additional disk from 10 to 4,096 GB. If a node has multiple instances, a disk will be created for each one.

Node statusesNode statuses

A DataSphere node can have one of the following statuses:

  • Healthy: Number of instances with the Healthy status in the node is equal to the minimum number of required instances.
  • Unhealthy: Number of instances with the Healthy status in the node is below the allowed minimum.
  • Created: Node has just been created.
  • Suspended: Node is paused.
  • Deleting: Node is being deleted.

Instance statusesInstance statuses

Node instances can have one of the following statuses:

  • Healthy: Instance is healthy and available for balancing.
  • Unhealthy: There are issues with the instance and it has been excluded from balancing.
  • Created: VM has been created for the instance.
  • Started: Connection has been established with the instance's VM.
  • Preparing: Instance is getting ready to process requests.
  • Deleting: Instance is being deleted.
  • Undefined: Initial state of the instance; VM is not created yet.

AliasAlias

An alias is a special resource used for publishing and updating a service. It allows you to replace nodes and update the running service without affecting the user experience.

Create an alias and use it as your service endpoint. You can update related nodes, balance the load across them, and remove deprecated Docker image versions without affecting the user experience.

See alsoSee also

  • Creating a node
  • Creating an alias
  • Deploying a service from an ONNX model
  • Deploying a service based on a Docker image
  • Deploying a service based on a Docker image with FastAPI

Was the article helpful?

Previous
Specifics of working with temporary Yandex Data Processing clusters
Next
Health checks and monitoring
© 2025 Direct Cursus Technology L.L.C.