Yandex Cloud
Search
Contact UsGet started
  • Blog
  • Pricing
  • Documentation
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • ML & AI
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Customer Stories
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Education and Science
    • Yandex Cloud Partner program
  • Blog
  • Pricing
  • Documentation
© 2025 Direct Cursus Technology L.L.C.
Yandex DataSphere
  • Getting started
  • Terraform reference
  • Audit Trails events
  • Access management
  • Pricing policy
  • Public materials
  • Release notes

In this article:

  • Getting started
  • Create a project
  • Run the project
  • Set up your environment
  • Upload data to the project
  • Start training
  • What's next

Getting started with DataSphere

Written by
Yandex Cloud
Updated at April 11, 2025
  • Getting started
  • Create a project
  • Run the project
  • Set up your environment
  • Upload data to the project
  • Start training
  • What's next

Yandex DataSphere is an end-to-end ML development environment where you can use well-known IDEs, serverless computing technology, and seamlessly combine a broad range of Yandex Cloud computing resource configurations. Yandex DataSphere is part of the data platform and provides powerful features to work with Yandex Cloud services. As an IDE, DataSphere provides Jupyter® Notebook.

In this section, you will learn how to:

  1. Create a project.
  2. Run projects.
  3. Configure the environment.
  4. Upload data to projects.
  5. Start training.

Getting startedGetting started

  1. Go to the management console and log in to Yandex Cloud or sign up if not signed up yet.
  2. Go to Yandex Cloud Billing and make sure you have a billing account linked and its status is ACTIVE or TRIAL_ACTIVE. If you do not have a billing account yet, create one.
  3. Open the DataSphere home page.
  4. Accept the user agreement.
  5. Select the organization to work with DataSphere in or create a new one.

Create a projectCreate a project

  1. Open the DataSphere home page.
  2. In the left-hand panel, select Communities.
  3. Select the community to create a project in.
  4. On the community page, click Create project.
  5. In the window that opens, enter a name and description (optional) for the project.
  6. Click Create.

Run the projectRun the project

To run a project, click Open project in JupyterLab.

Set up your environmentSet up your environment

Popular packages for data analysis and machine learning are pre-installed and ready for use, see the list.

You can install missing packages using the pip package manager.

To install a package:

  1. Write the following command in the notebook cell:

    %pip install <package_name>
    

    For example, install the seaborn package to visualize statistics:

    %pip install seaborn
    

    You can use various options that the pip install command supports. See usage examples for this command.

  2. Run the cell. To do this, click .

    The package installation result is displayed under the cell.

You can also configure the environment to run your code using Docker images.

Upload data to the projectUpload data to the project

You can upload small data volumes (up to 100 MB) to your DataSphere project through the JupyterLab interface. We recommend uploading larger data volumes from network storages or databases. For large data volumes, datasets make another convenient option.

To upload data to your project through the JupyterLab interface:

  1. In the File Browser section, select a folder for the data.
  2. Click at the top left.
  3. Select the files to upload.

Learn more about project storage.

DataSphere allows you to upload data from different sources:

  • Connecting to S3 using the library
  • Connecting to Google Drive
  • Connecting to a ClickHouse® database
  • Connecting to a PostgreSQL database
  • Connecting to Yandex Disk

Start trainingStart training

To start computations:

  1. Under the File Browser section, select the notebook with the Python or bash code.

  2. Select and run one or more cells with the code by choosing Run → Run Selected Cells, or pressing Shift + Enter.

  3. Wait for the operation to complete.

    The execution result is displayed under the cell.

What's nextWhat's next

  • Learn about service features.
  • See other service guides.
  • Deploy the trained model as a service.

Was the article helpful?

Next
All guides
© 2025 Direct Cursus Technology L.L.C.