role · on-call
On-call
On-call is the cycle's last and first reader. They are awake when reality answers. Their job is to keep the timeline honest, to communicate to the people watching, and to hand the postmortem something it can actually use.
What good looks like
A competent on-call shift produces three artefacts every cycle they are summoned:
- A timeline — minute-by-minute, written during, that the postmortem can read without rewriting.
- A comms thread — what was told to whom and when. Engineering thread. Client thread if needed. No surprises in the morning.
- A runbook entry or change — every shift either uses a runbook (and notes the friction) or writes one. No two pages on the same scenario.
An on-call shift that produces these has the operation chain working. A shift that skips the timeline leaves the postmortem inventing history; one that skips the runbook update locks the knowledge in one person.
The on-call's stance
| On-call is responsible for | On-call is not responsible for |
|---|---|
| The timeline being written during, not after | Solving the root cause alone |
| The comms thread to engineering, leadership, client | Being the only person on the bridge |
| Following the runbook or noting why they didn't | Rewriting the architecture at 2am |
| The first 48-hour watch on every cycle's first release | The full release decision |
| Handing the postmortem material it can use | Owning the postmortem's fix |
On-call holds the chain by holding the moment the system was under stress and the record of what happened.
Three artefacts to read first
See also
- Skill path — On-call foundations
- Canon — As We Build · After We Build
- Practice — First 48 hours watch · Release gate · Postmortem
- Templates — Runbook · Release brief · Postmortem
- Checklists — First 48 hours · agenda · Release gate · Postmortem
- Clinics — A flag that never got cleaned up · A postmortem that produced a feeling · A runbook that doesn't run
- Areas — 8 · Pipeline & Operations · 10 · Post-Release & Learning