Managed Service for Trino cluster state monitoring

Written by

Updated at July 16, 2026

Cluster state monitoring
Setting up alerts in Yandex Monitoring
Cluster health and status
- Cluster health states
- Cluster statuses

Data on the cluster and host state is available in the management console. You can view them on the Monitoring tab of the cluster management page or in Yandex Monitoring.

Diagnostic information about cluster states is presented as graphs.

Charts are updated every 15 seconds.

Note

The most appropriate multiple units (MB, GB, and more) are automatically used in charts.

You can configure alerts in Yandex Monitoring to receive notifications about cluster failures. In Yandex Monitoring, there are two alert thresholds: Warning and Alarm. If the specified threshold is exceeded, you will receive alerts via the configured notification channels.

Cluster state monitoring

To view detailed information on the health state of a Managed Service for Trino cluster:

Management console

In the management console, navigate to the relevant folder.
Navigate to Managed Service for Trino.
Click the name of your cluster and select the Monitoring tab.
To get started with Yandex Monitoring metrics, dashboards, or alerts, click Open in Monium in the top panel.

The page displays the following charts:

Under Cluster:
- Total cluster nodes: Number of cluster hosts.
- Total Available Processors: Number of available CPUs.
- Cluster distributed memory (cluster memory pool): Amount of distributed cluster memory.
  - Total Distributed Bytes: Total memory (in GB).
  - Free Distributed Bytes: Available memory (in GB).
Under Queries:
- Queries (executing)
  - Running: At least one part of the query plan is in progress.
  - Queued: Query is received and waiting to run.
  - Progressing: Query is in progress and not blocked.
  - Finishing: Query is almost complete.
- Queries (by status)
  - Submitted: Received queries.
  - Started: Queries in progress.
  - Failed: Queries that resulted in an error.
  - Completed: Completed queries, including failed ones.
  - Canceled: Queries canceled by the user.
  - Abandoned: Completed queries with unclaimed results.
- Errors: Errors and their types. The number of errors is given per second.
  - UserError: User-side errors.
  - Internal: Server-side error.
  - InsufficientResources: Errors caused by insufficient resources.
  - External: External errors.
Under Execution:
- Timings, p95 (One minute): 95th percentile of query execution time with a one minute window.

Setting up alerts in Yandex Monitoring

To configure cluster state indicator alerts:

Management console

In the management console, select the folder with the cluster for which you want to set up alerts.
Navigate to Monitoring.
Under Service dashboards, select Managed Service for Trino — Cluster Overview.
On the chart of interest, click and select Create alert.
If the chart displays multiple metrics, select the data query for the relevant metric and click Continue. Learn more about the query language in this Yandex Monitoring guide.
Set the Alarm and Warning threshold values to trigger the alert.
Click Create alert.

To have other cluster health indicators monitored automatically:

Management console

Create an alert.
Add a status metric.
In the alert parameters, set the alert thresholds.

For a complete list of supported metrics, see this Monitoring guide.

Cluster health and status

A cluster’s State indicates its health, while its Status shows whether the cluster is started, stopped, or in a transitory state.

To view the health state and status of a cluster:

Open the folder dashboard.
Navigate to Managed Service for Trino.
In the cluster row, hover over the indicator in the Availability column.

Cluster health states

State	Description	Suggested actions
ALIVE	The cluster is operating normally.	No action is required.
DEGRADED	The cluster is not running at its full capacity.	Contact support and specify the following: Cluster ID. IDs of the last operations performed on it. Time when the cluster entered the `DEGRADED` state according to availability charts.
DEAD	The cluster is out of order.	Contact support and specify the following: Cluster ID. IDs of the last operations performed on it. Time when the cluster entered the `DEAD` state according to availability charts.
UNKNOWN	The cluster’s state is unknown.	Contact support and specify the following: Cluster ID. IDs of the last operations performed on it. Time when the cluster entered the `UNKNOWN` state according to availability charts.

Cluster statuses

Status	Description	Suggested actions
CREATING	Preparing for the first start	Wait a while and get started. The time it takes to create a cluster depends on the host class.
RUNNING	The cluster is operating normally	No action is required.
STOPPING	The cluster is stopping	After a while, the cluster status will switch to `STOPPED` and the cluster will be disabled. No action is required.
STOPPED	The cluster is stopped	Start the cluster to get it running again.
STARTING	Starting the cluster that was stopped earlier	After a while, the cluster status will switch to `RUNNING`. Wait a while and get started.
UPDATING	Updating the cluster's configuration	Once the update is complete, the cluster will get the status it had prior to the update: `RUNNING` or `STOPPED`.
ERROR	Error when performing an operation with the cluster or during a maintenance window	If the cluster remains in this status for a long time, contact support. You can see whether a cluster is available by its status.
STATUS_UNKNOWN	The cluster is unable to determine its status	If the cluster remains in this status for a long time, contact support.

Managed Service for Trino cluster state monitoring

Cluster state monitoringCluster state monitoring

Setting up alerts in Yandex MonitoringSetting up alerts in Yandex Monitoring

Cluster health and statusCluster health and status

Cluster health statesCluster health states

Cluster statusesCluster statuses

Was the article helpful?

Cluster state monitoring

Setting up alerts in Yandex Monitoring

Cluster health and status

Cluster health states

Cluster statuses