Alerting and SLOs
Learn how to turn observability data into actionable alerts using Service Level Objectives, error budgets, and symptom-based alerting that reduces noise and pages humans only when it matters.
From Observability to Action
Traces, logs, and metrics tell you what is happening. Alerting decides when a human needs to act. The goal is to page on real user pain, not on every blip.
What Is an SLI?
A Service Level Indicator is a measured aspect of service behavior, such as request latency, error rate, or availability. SLIs are the raw signal you alert on.
ok = 995
total = 1000
availability = ok / total * 100
print('Availability SLI:', availability, '%')