Observability 2.0

Wide Events, High Cardinality, and Beyond the Three Pillars

Observability | Technical Operations Excellence

100s
Fields Per Event
106
High Cardinality
1
Unified Format
<10s
Query Response

Observability 1.0 vs 2.0

Aspect1.02.0
Data3 pillars (siloed)Wide structured events
CardinalityLow (pre-aggregated)High (millions)
QuestionsKnown unknownsUnknown unknowns
DebugCorrelate across toolsSingle pane of glass

Wide Structured Events

Emit one wide event per unit of work, with all relevant context attached.

- Charity Majors

  • Request context: user_id, tenant_id, request_id
  • Timing: duration, queue_time, db_time
  • Result: status, error_type, cache_hit
  • Environment: version, host, region, pod

High Cardinality Fields

FieldCardinality
user_idMillions
trace_idBillions
request_idBillions
build_idThousands
endpointHundreds

Traditional metrics explode with high cardinality

Charity Majors Principles

  • Observability is about understanding new problems
  • Debug from production, not staging
  • Instrument at the code level, not infrastructure
  • Exploratory investigation over dashboards

Core Practices

Instrument Everything

Every service emits structured events on every request

Query Interactively

Ad-hoc questions, slice and dice by any field

SLO Integration

Events feed SLI calculations directly

Event Schema Example

FieldExample Value
serviceapi-gateway
endpoint/v2/users/:id
duration_ms47.3
status_code200
user_idu_abc123
cache_hittrue
db_queries3

Tools for Observability 2.0

ToolStrength
HoneycombQuery-first, high cardinality
Grafana + LokiOpen-source ecosystem
OpenTelemetryVendor-neutral instrumentation

Key Question

Can you debug problems you've never seen before, without adding new instrumentation?

Ask New Questions

True observability answers questions you haven't thought to ask yet.