Game Days, Blast Radius Control, Failure Injection
SRE Bot | Resilience | Max 30 Points
| Level | Criteria |
|---|---|
| 1 | No chaos practice; only learn from real outages |
| 2 | Occasional game days; manual failure injection |
| 3 | Regular chaos experiments; blast radius controlled |
| 4 | Continuous chaos in staging; production game days |
| 5 | Chaos in production daily; antifragile systems |
| # | Question | Max |
|---|---|---|
| 1 | How often do you run chaos experiments? | 6 |
| 2 | How do you control blast radius? | 6 |
| 3 | Do you run game days? | 6 |
| 4 | How do you apply learnings from chaos? | 6 |
| 5 | What chaos tooling do you use? | 6 |
| Domain | Relationship |
|---|---|
| Reliability | Validate patterns via chaos |
| DR | Test DR via chaos experiments |
| Incidents | Build muscle memory for response |
Break Things on Purpose
Find failures before they find you.