Back to skills
SkillHub ClubRun DevOpsDevOps

incident-response

Provides structured incident response workflows for production outages, including timeline tracking, RCA documentation, and runbook updates. Integrates with debugging and monitoring skills for comprehensive SRE practices.

Packaged view

This page reorganizes the original catalog entry around fit, installability, and workflow context first. The original raw source lives below.

Stars
21
Hot score
87
Updated
March 20, 2026
Overall rating
A7.8
Composite score
5.7
Best-practice grade
A92.0

Install command

npx @skill-hub/cli install greyhaven-ai-claude-code-config-incident-response
incident-managementsrepostmortemrunbook

Repository

greyhaven-ai/claude-code-config

Skill path: grey-haven-plugins/incident-response/skills/incident-response

Provides structured incident response workflows for production outages, including timeline tracking, RCA documentation, and runbook updates. Integrates with debugging and monitoring skills for comprehensive SRE practices.

Open repository

Best for

Primary workflow: Run DevOps.

Technical facets: DevOps.

Target audience: Site Reliability Engineers, DevOps engineers, and on-call developers handling production incidents.

License: Unknown.

Original source

Catalog source: SkillHub Club.

Repository owner: greyhaven-ai.

This is still a mirrored public skill entry. Review the repository before installing into production workflows.

What it helps with

  • Install incident-response into Claude Code, Codex CLI, Gemini CLI, or OpenCode workflows
  • Review https://github.com/greyhaven-ai/claude-code-config before adding incident-response to shared team environments
  • Use incident-response for devops workflows

Works across

Claude CodeCodex CLIGemini CLIOpenCode

Favorites: 0.

Sub-skills: 0.

Aggregator: No.

Original source / Raw SKILL.md

---
name: grey-haven-incident-response
description: "Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems. Use when dealing with production outages, SEV1/SEV2 incidents, creating postmortems, or updating runbooks."
# v2.0.43: Skills to auto-load for incident response
skills:
  - grey-haven-code-style
  - grey-haven-observability-monitoring
  - grey-haven-smart-debugging
# v2.0.74: Tools for incident response
allowed-tools:
  - Read
  - Write
  - Bash
  - Grep
  - Glob
  - TodoWrite
  - WebFetch
---

# Incident Response Skill

Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems.

## Description

Production incident response following SRE methodologies with incident timeline tracking, RCA documentation, and runbook updates.

## What's Included

- **Examples**: SEV1 incident handling, postmortem templates
- **Reference**: SRE best practices, incident severity levels
- **Templates**: Incident reports, RCA documents, runbook updates

## Use When

- Production outages
- SEV1/SEV2 incidents
- Postmortem creation
- Runbook updates

## Related Agents

- `incident-responder`

**Skill Version**: 1.0
incident-response | SkillHub