Monitoring Managed Service for GitLab instance status
Note
This feature is in the Preview stage.
You can view instance status data in the Monitoring tab on the instance page or in Yandex Monitoring.
Diagnostic information about the instance status is visualized in charts. The charts automatically use the most appropriate units of measurement (GB, percent, etc.).
You can configure alerts in Yandex Monitoring to receive notifications about instance failures. Yandex Monitoring has two alert thresholds: Warning and Alarm. If a specified threshold is exceeded, you will receive alerts via the configured notification channels.
Monitoring instance status
To view detailed information about the Managed Service for GitLab instance state:
- Go to the folder page and select Managed Service for GitLab.
- Click the instance name and select the Monitoring tab.
- To get started with Yandex Monitoring metrics, dashboards, or alerts, click Open in Monitoring in the top panel.
The following charts will open on the page:
- Data disk: Amount of used and free space on the data storage disk.
- System disk: Amount of used and free space in the system storage.
- CPU: Load on processor cores.
- Memory: RAM usage. At high loads, the value of the Available memory parameter goes down and the Used memory values go up.
Alert settings in Monitoring
- In the management console, select the folder with the instance you want to configure alerts for.
- In the list of services, select Monitoring.
- On the home page, under Service dashboards, select Managed Gitlab.
- In the chart you need, open the chart menu and select Create alert.
- If the selected chart contains multiple metrics, in the Alert configuration section you can keep all data queries or only specific ones. For example, for the System disk metric, you can remove the redundant free storage space query and keep only the used space query.
  To remove a redundant query, open the menu in the query row and select Delete. You can learn more about the query language in the Monitoring documentation.
- Set the alert trigger conditions:
  - Set the name of the query that will be used to calculate the alert status.
  - Select the aggregation function to apply to metric values in the time window.
  - Select the comparison function to compare the aggregation function result with the threshold values.
  - Specify the Warning and Alarm threshold values. Select whether the current chart value should be greater than, less than, or equal to the threshold.
  - Set the alert calculation time window and delay.
- Click Create.
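To make the trigger conditions more concrete, here is a minimal Python sketch of how an aggregation function, a comparison function, and the two thresholds could combine into an alert status for one time window. It illustrates the logic only and is not the Yandex Monitoring implementation or API; the names `evaluate_alert`, `AGGREGATIONS`, and `COMPARISONS`, as well as the status strings, are assumptions made for this example.

```python
from statistics import mean

# Hypothetical tables that mirror the alert form fields; not a Monitoring API.
AGGREGATIONS = {
    "avg": mean,
    "min": min,
    "max": max,
    "last": lambda values: values[-1],
}
COMPARISONS = {
    "gt": lambda value, threshold: value > threshold,
    "gte": lambda value, threshold: value >= threshold,
    "lt": lambda value, threshold: value < threshold,
    "lte": lambda value, threshold: value <= threshold,
    "eq": lambda value, threshold: value == threshold,
}


def evaluate_alert(window, aggregation, comparison, warning, alarm):
    """Return "ALARM", "WARNING", or "OK" for the metric values in one window."""
    # Step 1: reduce all values in the time window to a single number.
    value = AGGREGATIONS[aggregation](window)
    # Step 2: compare that number with the thresholds, most severe first.
    compare = COMPARISONS[comparison]
    if compare(value, alarm):
        return "ALARM"
    if compare(value, warning):
        return "WARNING"
    return "OK"


# Free disk space in percent, sampled over one window (made-up values).
status = evaluate_alert([12, 9, 6], "avg", "lte", warning=10, alarm=7)
```

In this sketch the Alarm threshold is checked before Warning, so a value that satisfies both conditions is reported with the more severe status.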
You can also create alerts for specific Managed Service for GitLab metrics.
The recommended thresholds for the Remaining free disk space (sys.filesystem.FreeB) metric are as follows:
- Aggregation function: Average.
- Comparison function: Less than or equal to.
- Alarm: 7% of the storage size.
- Warning: 10% of the storage size.
- Evaluation window: 5 minutes.
- Evaluation delay: 30 seconds.
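The thresholds above are given as a percentage of the storage size, so they need to be converted if the threshold fields expect absolute values. The sketch below assumes that the metric is reported in bytes (the `FreeB` suffix suggests this, but the source does not state it) and uses a hypothetical helper, `free_space_thresholds`, to do the arithmetic.

```python
GIB = 1024 ** 3  # bytes in one gibibyte


def free_space_thresholds(storage_size_bytes, warning_pct=10, alarm_pct=7):
    """Convert the percentage thresholds into absolute byte values.

    The defaults are the recommended Warning (10%) and Alarm (7%) levels.
    Assumption: the metric and the threshold fields both use bytes.
    """
    return {
        "warning": storage_size_bytes * warning_pct // 100,
        "alarm": storage_size_bytes * alarm_pct // 100,
    }


# Example: an instance with a 100 GiB disk (illustrative size).
thresholds = free_space_thresholds(100 * GIB)
```

For a 100 GiB disk, this gives a Warning threshold of 10 GiB and an Alarm threshold of 7 GiB of remaining free space.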
Instance statuses
The status shows whether the instance is started, stopped, or in an intermediate state. You can view the status in the list of Managed Service for GitLab instances or on each instance's page.
The following statuses are possible:
| Status | Description | Suggested actions |
| --- | --- | --- |
| CREATING | Preparing for the first launch. | Wait a while and get started. The time it takes to create an instance depends on the host class. |
| RUNNING | The instance is running normally. | No action is required. |
| STOPPING | Stopping the instance. | After a while, the instance status will change. |
| STOPPED | The instance is stopped. | Start the instance to get it running again. |
| STARTING | Starting the instance that was stopped earlier. | After a while, the instance status will change. |
| UPDATING | Updating the instance. | After the update is completed, the instance status will change. |
| ERROR | An error occurred that does not allow the instance to continue working. | Run the initial diagnostics. If you cannot find the cause of the error yourself, contact support. |
| STATUS_UNKNOWN | The instance is unable to determine its own status. | Run the initial diagnostics. If you cannot find the cause of the error yourself, contact support. |
| DELETING | Deleting the instance. | Once deleted, the instance will be removed from the Instances list page and will no longer be accessible. No action is required. When you delete an instance, a backup is automatically created and stored for two weeks. If you need to restore the instance after deletion, contact support. |
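If you process instance statuses in your own scripts, the table above can be condensed into a small lookup. The following Python sketch is only an illustration: the `InstanceStatus` enum, the grouping into transitional and problem statuses, and the `next_step` helper are assumptions made for this example and are not part of any Managed Service for GitLab API.

```python
from enum import Enum


class InstanceStatus(Enum):
    CREATING = "CREATING"
    RUNNING = "RUNNING"
    STOPPING = "STOPPING"
    STOPPED = "STOPPED"
    STARTING = "STARTING"
    UPDATING = "UPDATING"
    ERROR = "ERROR"
    STATUS_UNKNOWN = "STATUS_UNKNOWN"
    DELETING = "DELETING"


# Intermediate statuses: the only suggested action is to wait.
TRANSITIONAL = {
    InstanceStatus.CREATING,
    InstanceStatus.STOPPING,
    InstanceStatus.STARTING,
    InstanceStatus.UPDATING,
    InstanceStatus.DELETING,
}
# Statuses that call for diagnostics and, if needed, contacting support.
NEEDS_ATTENTION = {InstanceStatus.ERROR, InstanceStatus.STATUS_UNKNOWN}


def next_step(status):
    """Map a status to a short action label (hypothetical labels)."""
    if status in TRANSITIONAL:
        return "wait"
    if status in NEEDS_ATTENTION:
        return "diagnose"
    if status is InstanceStatus.STOPPED:
        return "start"
    return "none"  # RUNNING: no action is required


step = next_step(InstanceStatus("STOPPING"))
```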