All resources AI Security for the C-Suite

The AI Incident Response Playbook You Need Before You Need It

A specific 7-step playbook for the first 60 minutes of an AI incident. The patterns of recent public failures suggest most companies are not ready.

TL;DR

The 7-step playbook for the first 60 minutes of an AI incident:

  1. Detect and confirm (0–5 min): is this real, or noise?
  2. Pause the agent (5–10 min): kill switch within reach.
  3. Assess scope (10–25 min): who’s affected, what data, what actions taken.
  4. Notify internally (25–35 min): security, legal, comms, exec on call.
  5. Decide on customer notification (35–50 min): regulatory and brand calculus.
  6. Begin remediation (50–60 min): triage and parallel-track.
  7. Document everything (continuous): audit-grade record from minute one.

Most organizations don’t have this playbook. Most incidents go badly because the first 60 minutes are spent figuring out what to do instead of doing.


A specific 7-step playbook for the first 60 minutes of an AI incident. The patterns of recent public failures suggest most companies are not ready.

The AI incident-response playbook doesn’t exist at most companies in 2026. The traditional incident-response playbook is for cyber incidents — breaches, ransomware, DDoS — and the steps don’t quite map to AI incidents like agent misbehavior, prompt injection compromise, or hallucination-driven harm. This piece is the AI-specific playbook to have ready before you need it.

Why AI incidents are different

Three differences from traditional incident response.

1. Detection is harder. Cyber incidents have signatures. AI incidents often don’t — an agent that starts giving subtly wrong answers may not trigger any alert until customers complain. The detection window can be days or weeks instead of minutes.

2. Containment requires different mechanisms. Cyber incidents are contained by network segmentation, account lockdown, system isolation. AI incidents are contained by pausing agents, throttling tool access, or rolling back model versions. The mechanisms exist but are usually less practiced.

3. Communication is harder. Cyber incidents have well-understood narratives (a breach happened, here’s what we know). AI incidents often involve subtle harm to specific customers that’s hard to characterize quickly. The communication challenge is bigger.

These differences mean the playbook needs AI-specific steps, not just an adapted cyber playbook.

The 7-step playbook

Step 1: Detect and confirm (0–5 minutes)

The on-call engineer or supervisor sees an alert (drift, anomalous behavior, customer complaint, security event). Before doing anything else: confirm this is real.

Specific actions:

  • Pull recent agent traces. Is the behavior visibly off?
  • Check the eval dashboard. Are accuracy metrics regressing?
  • Look for the original signal. If a customer reported, what specifically did they report?

Common mistake: jumping to action on a noisy alert. False alarms are common in AI; confirm before triggering the rest of the playbook.

Step 2: Pause the agent (5–10 minutes)

Once confirmed: stop the agent from causing more harm. The kill switch should be exercisable by the on-call without an engineering ticket.

Specific actions:

  • Disable the agent in production.
  • Communicate the disable to the team (Slack channel, status page if external-facing).
  • Estimate how many transactions / customers are affected per hour the agent is paused.

Common mistake: hesitating to pause because of business impact. The cost of continued operation usually exceeds the cost of pause; default to pause and reassess.

Step 3: Assess scope (10–25 minutes)

How big is this? The answer drives every subsequent decision.

Specific actions:

  • Query the audit log for affected transactions. Time range, customer impact, action surface.
  • Identify affected systems (downstream of the agent’s outputs).
  • Estimate financial, customer, and regulatory impact.

Common mistake: scope estimates are usually too narrow in the first hour. The audit log shows what you can see; the unknown unknowns are bigger. Plan for scope to expand.

Step 4: Notify internally (25–35 minutes)

Specific people need to know. The list depends on incident type and severity.

Always notify:

  • Security leadership.
  • Engineering leadership for the affected system.
  • An executive sponsor (CTO, CIO, COO depending on org).

Conditional notifications:

  • Legal: if customer data, regulatory exposure, or material customer harm.
  • Communications: if customer-facing or potential PR exposure.
  • Customer success: if specific customers are affected.

Common mistake: cascading notifications too widely too fast (creates rumor) or too narrowly too slow (creates surprise later). The 25–35 minute window is for inner-circle notification.

Step 5: Decide on customer notification (35–50 minutes)

Two decisions: should customers be notified, and if so, when.

Specific actions:

  • Legal review: do regulatory rules require notification?
  • Affected customer review: do specific customers need to know to take action?
  • Communications draft: what’s the message, when does it go.

Common mistake: defaulting to “wait until we have all the facts.” Some incidents require immediate notification; over-delaying is usually worse than under-delaying.

Step 6: Begin remediation (50–60 minutes)

In parallel with the above, the engineering team starts diagnosis and remediation.

Specific actions:

  • Root cause analysis (what specifically went wrong?).
  • Fix path (what gets changed to make this not happen again?).
  • Rollback option (do we revert to a known-good version?).
  • Recovery for affected customers (what do we do to make them whole?).

Common mistake: trying to do remediation without doing scope assessment first. You can’t remediate effectively without knowing what you’re remediating.

Step 7: Document everything (continuous)

From minute one, everything is documented. Audit-grade record. This is the source of truth for the post-incident review and for any regulatory or legal proceedings.

Specific artifacts:

  • Timeline (what happened when).
  • Decisions log (who decided what, with what justification).
  • Communication log (what was said to whom, when).
  • Affected-systems log (what was changed, by whom).

Common mistake: documenting after the fact. Memory is unreliable; documents are durable. Write as you go.

What to do this quarter

  1. Write the playbook. Use the 7 steps as the template. Fill in your specifics — names, contacts, systems, policies.
  2. Run a tabletop exercise. Walk through a simulated AI incident with the team. Find the gaps.
  3. Test the kill switch. Make sure it actually works. Make sure non-engineers can exercise it.
  4. Schedule the next exercise. Quarterly minimum.

FAQ

How is AI incident response different from cyber incident response? Different attack surfaces, different mitigations, different communication challenges. Most cyber playbooks need significant adaptation for AI. The 7-step structure is similar; the specifics differ.

Who should own AI incident response? The CISO, with the AI program lead as deputy. The CISO has the IR infrastructure and the executive relationships; the AI program lead has the domain context. Together they have the right coverage.

What’s the typical AI incident severity distribution? For Level 2 deployments: ~80% minor, ~15% medium, ~5% major. For Level 3: ~60% minor, ~30% medium, ~10% major. The major incidents are rare but require the playbook to be tested.

Should we have an external IR partner for AI incidents? For high-stakes deployments, yes. The specialized expertise on AI-specific failures is rare in-house. Retainer agreements with AI-specialized IR firms are emerging in 2026.

Will regulators require specific AI incident reporting? Yes, increasingly. The EU AI Act, state-level disclosure laws, and SEC guidance all point toward formal reporting requirements. Build the playbook to produce the records you’ll need to file.


Working with JAIN on AI incident response? We help security and AI program leaders write the playbook and run the tabletop exercises that prepare the team for the actual incident. Book a 30-minute call.

Related reading:

Want to talk through this for your team?

30 minutes, no slides. We'll work the specific call your company is facing.