Checking GPU load
Yandex DataSphere supports computing resource configurations with GPUs.
You can check GPU performance, load, and resource utilization statistics using TensorFlow or the nvidia-smi utility.
Getting started
Open the DataSphere project:

- Select the relevant project in your community or on the DataSphere homepage in the Recent projects tab.
- Click Open project in JupyterLab and wait for the loading to complete.
- Open the notebook tab.
Checking GPU performance
Checking a connection using TensorFlow
- Select the desired GPU configuration. In our example, we use the g1.1 configuration.
- Enter the following code in the cell:

```python
import tensorflow as tf

tf.config.list_physical_devices('GPU')
```

- Run the cell by clicking the run icon in the notebook toolbar.

This outputs a list of all GPUs available to the notebook.
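With TensorFlow 2, the output is a list of PhysicalDevice entries. On a single-GPU configuration such as g1.1, it typically looks like this (the exact formatting may vary between TensorFlow versions):

```
[PhysicalDevice(name='/physical_device:GPU:0', device_type='GPU')]
```

An empty list means TensorFlow does not see the GPU.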
Checking a connection using nvidia-smi
- Select the desired GPU configuration. In our example, we use the g1.1 configuration.
- Enter the following code in the cell:

```bash
#!:bash
nvidia-smi
```

- Run the cell by clicking the run icon in the notebook toolbar.

This outputs GPU status details, including the driver version, utilization, memory usage, and running processes.
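If you only need specific metrics rather than the full status table, nvidia-smi also has a query mode. A minimal sketch using standard nvidia-smi flags (these field names are part of the regular nvidia-smi CLI, not DataSphere-specific):

```bash
#!:bash
# Print GPU utilization and memory usage as CSV.
nvidia-smi --query-gpu=utilization.gpu,memory.used,memory.total --format=csv
```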
Writing GPU utilization statistics while training a model
- Enter the following code in the cell:

```python
import subprocess

# Write nvidia-smi dmon output to stdout.txt for the duration of the workload.
with open("stdout.txt", "wb") as out:
    proc = subprocess.Popen(["nvidia-smi", "dmon"], stdout=out, stderr=subprocess.STDOUT)
    <GPU_utilization_code>
    # Stop the monitoring process once the workload finishes.
    proc.terminate()
    proc.kill()
```

The code uses the nvidia-smi dmon command, which collects GPU performance statistics every second. Replace <GPU_utilization_code> with the code that loads the GPU, e.g., model training.

- Run the cell by clicking the run icon in the notebook toolbar.

As a result, the stdout.txt file with detailed GPU statistics will appear in the model directory.
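You can also narrow down what dmon records. A minimal sketch using standard dmon flags: -s selects metric groups (u is utilization), -d sets the sampling interval in seconds, and -c limits the number of samples:

```bash
#!:bash
# Sample only utilization metrics every 5 seconds, 10 samples in total.
nvidia-smi dmon -s u -d 5 -c 10
```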
Example of writing GPU utilization statistics
Use a ready-made model to test GPU configurations. When executing the code on the g1.1 and g2.1 configurations, the model uses 18% to 25% of GPU resources. You can view the data in the sm column of the stdout.txt file.
- Enter the following code in the cell:

```python
import subprocess
import tensorflow as tf

# Load and normalize the MNIST dataset.
mnist = tf.keras.datasets.mnist
(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

# A small fully connected classifier.
def create_model():
    return tf.keras.models.Sequential([
        tf.keras.layers.Flatten(input_shape=(28, 28)),
        tf.keras.layers.Dense(512, activation='relu'),
        tf.keras.layers.Dropout(0.2),
        tf.keras.layers.Dense(10, activation='softmax')
    ])

# Record nvidia-smi dmon output to stdout.txt while the model trains.
with open("stdout.txt", "wb") as out:
    proc = subprocess.Popen(["nvidia-smi", "dmon"], stdout=out, stderr=subprocess.STDOUT)
    model = create_model()
    model.compile(optimizer='adam',
                  loss='sparse_categorical_crossentropy',
                  metrics=['accuracy'])
    model.fit(x=x_train, y=y_train, epochs=5, validation_data=(x_test, y_test))
    # Stop the monitoring process once training finishes.
    proc.terminate()
    proc.kill()
```
- Run the cell by clicking the run icon in the notebook toolbar.

As a result, the stdout.txt file with detailed GPU statistics will appear in the model directory.
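To check the utilization figures without scanning the file by hand, you can average the sm column. A minimal sketch, assuming the default nvidia-smi dmon column layout, in which sm is the fifth column; adjust the index if you changed the -s selection:

```python
# Average the sm (streaming multiprocessor) utilization recorded in stdout.txt.
values = []
with open("stdout.txt") as f:
    for line in f:
        if line.startswith("#"):  # skip dmon header lines
            continue
        fields = line.split()
        if len(fields) > 4 and fields[4].isdigit():
            values.append(int(fields[4]))

if values:
    print(f"Average GPU utilization (sm): {sum(values) / len(values):.1f}%")
else:
    print("No samples found in stdout.txt")
```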