CI Failure Response Runbook
Trigger
Section titled “Trigger”Use this runbook when a pull request or main branch pipeline fails.
- Identify the failing job and capture the first actionable error.
- Reproduce locally with the closest equivalent command.
- Determine whether failure is code regression, flaky test, or environment issue.
- Apply the smallest safe fix and rerun relevant checks.
- If flaky, document evidence and open a follow-up task for stabilization.
- Confirm all required checks pass before merge/release.
Escalation
Section titled “Escalation”- Escalate immediately if failure blocks releases or affects multiple repositories.
- Include failing job URL, error summary, and attempted mitigations in escalation.