Yandex Managed Service for YTsaurus metrics
This section describes the Managed Service for YTsaurus metrics delivered to Monitoring.
The name label contains the metric name.
Labels shared by all Managed Service for YTsaurus metrics:
|
Label |
Value |
|
service |
Service ID: |
|
cluster_id |
Cluster ID |
Cluster resource metrics
Computing resource metrics
|
Name |
Description |
|
|
Maximum number of CPUs available within the scheduler pool. Additional labels:
|
|
|
Number of CPUs allocated by the scheduler for current tasks. Additional labels:
|
|
|
Maximum number of GPUs available within the scheduler pool. Additional labels:
|
|
|
Number of GPUs allocated by the scheduler for current tasks. Additional labels:
|
|
|
Memory allocation limit for user tasks within the scheduler pool. Additional labels:
|
|
|
Amount of memory allocated for user tasks by the scheduler. Additional labels:
|
|
|
Number of unallocated CPUs available to the scheduler. Additional labels:
|
|
|
Amount of unallocated memory for user tasks available to the scheduler. Additional labels:
|
Pool scheduler metrics
|
Name |
Description |
|
|
Number of
|
|
|
Number of lightweight
|
|
|
Maximum allowed number of simultaneously
|
|
|
Total number of all operations within the scheduler pool. Additional labels:
|
|
|
Maximum allowed number of operations within the scheduler pool. Additional labels:
|
|
CPU |
|
|
|
Number of CPUs currently used for tasks within the scheduler pool. Additional labels:
|
|
|
Number of CPUs required to start all pending tasks within the scheduler pool. Additional labels:
|
|
|
Number of CPUs guaranteed to be available within the scheduler pool. Additional labels:
|
|
|
Maximum number of CPUs set in the scheduler pool configuration. Additional labels:
|
|
RAM |
|
|
|
Current user-task memory utilization within the scheduler pool. Additional labels:
|
|
|
Amount of memory for user tasks required to start all pending tasks within the scheduler pool. Additional labels:
|
|
|
Guaranteed user-task memory within the scheduler pool. Additional labels:
|
|
|
Amount of memory for user tasks set in the scheduler pool configuration. Additional labels:
|
|
GPU |
|
|
|
Number of GPUs currently used for tasks within the scheduler pool. Additional labels:
|
|
|
Number of GPUs required to start all pending tasks within the scheduler pool. Additional labels:
|
|
|
Number of GPUs guaranteed to be available within the scheduler pool. Additional labels:
|
|
|
Number of GPUs set in the scheduler pool configuration. Additional labels:
|
Metrics for fault diagnostics
|
Name |
Description |
|
|
Number of nodes in |
|
|
Number of nodes in |
|
|
Number of nodes in |
|
|
Number of active alerts about issues with YTsaurus cluster nodes. The additional
|
CPU and memory metrics
|
Name |
Description |
|
|
Total CPUs in the cluster. Additional labels:
|
|
|
Total time spent on permission checks by the security system during user-initiated write operations. Additional labels:
|
|
|
Total time spent on permission checks by the security system during user-initiated read operations. Additional labels:
|
|
|
Total actual RSS (Resident Set Size) memory usage in the cluster. Additional labels:
|
|
|
Number of user-initiated write requests processed by the security system. Additional labels:
|
|
|
Number of user-initiated read requests processed by the security system. Additional labels:
|
|
|
Current weight throttler value. Additional labels:
|