Service Level Objectives and Error Budget Management
SRE Bot | Foundations | Max 30 Points
| Level | Criteria |
|---|---|
| 1 | No formal SLOs; availability discussed informally; no error budgets |
| 2 | Basic SLOs for some services; not consistently tracked; no budget enforcement |
| 3 | SLOs for critical services; error budgets calculated; basic burn rate monitoring |
| 4 | Comprehensive SLOs; budgets enforced; dev slowdowns when budget exhausted |
| 5 | SLOs drive all decisions; multi-window burn rates; automated freezes |
| # | Question | Max |
|---|---|---|
| 1 | How well-defined are your SLIs? | 6 |
| 2 | How do you track/enforce error budgets? | 6 |
| 3 | How aligned are stakeholders on SLO targets? | 6 |
| 4 | What happens when error budget exhausted? | 6 |
| 5 | How do you review and iterate on SLOs? | 6 |
| Domain | Relationship |
|---|---|
| Observability | SLIs require metrics/logs infrastructure |
| Alerting | Burn rate alerts drive incident response |
| Release Eng | Error budgets gate feature releases |
Error Budgets Enable Velocity
Managed risk, not zero risk.