HRO Pattern Recognition

Learning from High-Reliability Organizations

Resilience Patterns | Technical Operations Excellence

5
HRO Principles
10
Root Cause Categories
4
Swiss Cheese Layers
10-6
Aviation Error Rate

5 HRO Principles Deep Dive

PrincipleApplication
Preoccupation with FailureTreat near-misses as failures; never assume safety
Reluctance to SimplifyResist simple explanations; embrace complexity
Sensitivity to OperationsMaintain situational awareness at all times
Commitment to ResilienceFocus on recovery, not just prevention
Deference to ExpertiseAuthority migrates to knowledge in crisis

Big 10 Root Causes

#CategoryExample
1Config ChangeBad deploy, wrong flag
2CapacityResource exhaustion
3DependencyUpstream/downstream fail
4HardwareDisk, network, memory
5SecurityAttack, credential leak
6Human ErrorTypo, wrong command
7Software BugRace condition, logic error
8DataCorruption, schema drift
9NetworkPartition, DNS, latency
10ExternalCloud provider, 3rd party

Swiss Cheese Model

Accidents occur when holes in multiple defense layers momentarily align.

- James Reason

  • Layer 1: Organizational controls
  • Layer 2: Technical safeguards
  • Layer 3: Monitoring & detection
  • Layer 4: Human operators

Pattern Recognition Table

SignalPatternAction
Latency spikeCapacity/DependencyScale or isolate
Error burstDeploy/ConfigRollback
Gradual degradeResource leakRestart/investigate
Cascading failMissing circuit breakerShed load
Partial outageNetwork partitionFailover

HRO vs Traditional Orgs

AspectTraditionalHRO
FailuresHide/blameLearn/share
ComplexitySimplify awayEmbrace
AuthorityHierarchyExpertise
FocusEfficiencyReliability

Industries We Learn From

IndustryKey Practice
AviationChecklists, crew resource mgmt
NuclearDefense in depth, safety culture
HealthcareRoot cause analysis, just culture
MilitaryAfter-action reviews, command

Failure Taxonomy

  • Active failures: Immediate triggers (human error)
  • Latent conditions: Dormant system weaknesses
  • Error-provoking: Conditions that invite mistakes

Failures Are Teachers

Every incident is a window into system weaknesses.