Back to Labs
Incident room sim

Recover a payments outage

Use the live feed, pick actions, and stabilize the system before the score drops.

Topic intro

Incident response is a balance of speed and safety. The best first action stops the failure and protects customers. This sim lets you see how actions change the system.

How to find the answer

Watch the log feed for what changed right after the deploy. Look for signs of a missing secret or a config gap, then act fast.

Live status

Score drops as the incident worsens.

Score 15
Error rate
4.20%
Latency
1400 ms
Queue depth
980
Revenue loss
$0
Log feed
Waiting for signals...

Actions

Actions are one way. Pick the safest fix first.

Incident note

Write a short summary with cause, fix, and follow up.