Root Cause Analysis Patterns

RCA patterns from real incidents and build failures. Every entry has a date and context.

2026-03-19: Document the Prevention, Not Just the Fix

Situation: Fixed an Antora xref resolution bug but initially only documented the fix command, not how to prevent recurrence

Context: RCA-2026-03-19-001, Don Quijote xrefs unresolved

The Pattern: Every RCA must answer: (1) What happened, (2) Why it happened, (3) What was fixed, (4) How to PREVENT it. The prevention step is the most valuable — it’s the pattern that goes in the journal.

Principle: A fix without prevention is just firefighting. The RCA format forces you to extract the transferable lesson.

Source: RCA-2026-03-19-001, formalized as RCA template standard


2026-02-26: Attribute Assumptions Break Builds

Situation: Used 10.50.1.100 when the actual attribute was {kvm-ip} — 6 missing attributes broke rendering

Context: RCA-2026-02-26-001, documentation build failure

The Pattern: Assumed attribute names without checking. The fix was simple (grep antora.yml), but the prevention required a mandatory pre-write checklist: verify attributes exist before using them.

Principle: Assumptions are the root cause of most preventable failures. The checklist isn’t bureaucracy — it’s pattern recognition formalized.

Source: RCA-2026-02-26-001