Yandex Cloud
Search
Contact UsGet started
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
  • All Services
  • System Status
    • Featured
    • Infrastructure & Network
    • Data Platform
    • Containers
    • Developer tools
    • Serverless
    • Security
    • Monitoring & Resources
    • AI for business
    • Business tools
  • All Solutions
    • By industry
    • By use case
    • Economics and Pricing
    • Security
    • Technical Support
    • Start testing with double trial credits
    • Cloud credits to scale your IT product
    • Gateway to Russia
    • Cloud for Startups
    • Center for Technologies and Society
    • Yandex Cloud Partner program
  • Pricing
  • Customer Stories
  • Documentation
  • Blog
© 2025 Direct Cursus Technology L.L.C.
Yandex Application Load Balancer
  • Getting started
    • Overview
    • Load balancers
    • HTTP routers
    • Limit on the number of requests
    • Backend groups
    • Target groups
    • Monitoring and logging
    • Monitoring charts
    • Quotas and limits
    • Tips for configuring health checks
  • Access management
  • Pricing policy
  • Terraform reference
  • Monitoring metrics
  • Audit Trails events
  • L7 load balancer logs
  • Release notes

In this article:

  • HTTP statistics
  • RPS
  • 5xx
  • 4xx
  • Latency
  • Request size
  • Response size
  • Scaling statistics
  • Active connections
  • Connections per second
  • Requests per second
  • Bytes per second
  1. Concepts
  2. Monitoring charts

Monitoring charts

Written by
Yandex Cloud
Updated at November 11, 2025
  • HTTP statistics
    • RPS
    • 5xx
    • 4xx
    • Latency
    • Request size
    • Response size
  • Scaling statistics
    • Active connections
    • Connections per second
    • Requests per second
    • Bytes per second

Note

For Stream listeners, the system does not collect statistics on individual HTTP requests.

Monitoring charts give the real-time status of your load balancer and related metrics. Use them for anomaly detection and performance analysis. Load balancer statistics are automatically logged in the Yandex Monitoring metrics. For a full list of metrics delivered to Yandex Monitoring, see the reference.

HTTP statisticsHTTP statistics

HTTP charts show traffic intensity, backend response stability, and request processing times. Use them to identify load anomalies, rising error rates, and performance degradation.

RPSRPS

RPS (Requests Per Second): Number of HTTP requests to the load balancer per second. The chart reflects the intensity of incoming traffic. Review the chart at regular intervals to understand the load balancer's average workload at different days and times. The average value will help you detect anomalies early.

  • A spike in RPS may indicate increased user activity or a DDoS attack. Compare this with the 4xx or 5xx error charts: if there are no errors, the traffic is more likely legitimate than not. However, if there are errors, your backends may be overwhelmed or under a DDoS attack.
  • A sharp drop in RPS during normally active periods could mean your load balancer has become unreachable. Check its condition, availability zones, and DNS records.

5xx5xx

5xx (Server Errors): Number of 5xx responses returned by backends through the load balancer. This is the key indicator of your server side stability.

  • A spike in 5xx errors may indicate that backends are overwhelmed due to infrastructure issues. To track down the issue, inspect backend logs.
  • A gradual increase in 5xx errors may indicate memory leaks or resource degradation on backends. Health-check your infrastructure and consider restarting your instances.

4xx4xx

4xx (Client Errors): Number of 4xx responses.

  • 4xx errors typically rise due to user errors, invalid API calls, or authentication issues. Look up request logs, check whether your access tokens are current.

LatencyLatency

Latency: Time from when the load balancer receives the first byte of the request to when it sends the last byte of the response. The chart displays values from the 50th to the 99th percentile.

  • Correlate latency with RPS, 5xx errors, and backend resource metrics to find out why latency increases.

Request sizeRequest size

Request size: Total volume of load balancer requests per second. Use it to assess network load and the impact of large requests.

  • A spike in request size at the same RPS will increase bytes per second. Check your network bandwidth and compression settings.

Response sizeResponse size

Response size: Total volume of load balancer responses per second.

  • Growing response size impacts latency and increases network load. Review your compression and caching settings or change response format.

Scaling statisticsScaling statistics

Scaling charts illustrate the behavior of connections and the volume of processed data. Use them to configure autoscaling and analyze resource unit utilization.

Active connectionsActive connections

Active connections: Number of active connections.

  • A spike may indicate more user sessions, long-lived connections, or issues with termination of connections. Compare with requests per second: many active connections at low RPS suggests stalled connections.

Connections per secondConnections per second

Connections per second: Number of new connections per second.

  • A high value at stable RPS means clients are not reusing connections or are frequently reconnecting. Peak connections per second rates strain the infrastructure and may require additional resource units.

Requests per secondRequests per second

Requests per second: Number of requests to the load balancer per second.

  • Use this metric to calculate required resource units and set up autoscaling thresholds.

Bytes per secondBytes per second

Bytes per second: Total volume of requests and responses processed by the load balancer per second.

  • A rise in bytes per second at the same RPS indicates that the average size of requests or responses is increasing. Keep an eye on this metric for network capacity planning.

Tip

To detect infrastructure failures early, create a dashboard that consolidates your most critical metrics: RPS, 99th-percentile latency, and 5xx error rate.

See alsoSee also

  • Monitoring and logging
  • Setting up alerts

Was the article helpful?

Previous
Monitoring and logging
Next
Quotas and limits
© 2025 Direct Cursus Technology L.L.C.