Incident room sim
Recover a payments outage
Use the live feed, pick actions, and stabilize the system before the score drops.
Topic intro
Incident response is a balance of speed and safety. The best first action stops the failure and protects customers. This sim lets you see how actions change the system.
How to find the answer
Watch the log feed for what changed right after the deploy. Look for signs of a missing secret or a config gap, then act fast.
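As a rough illustration of that scan, the sketch below filters a log feed for lines that appear after a deploy and mention a missing secret or config value. The sample log lines, the pattern list, and the names `SUSPECT_PATTERNS` and `suspects_after_deploy` are all hypothetical; the sim's actual log wording may differ.

```python
import re

# Hypothetical patterns; adjust them to the wording the real feed uses.
SUSPECT_PATTERNS = [
    re.compile(r"secret.*(missing|not found)", re.IGNORECASE),
    re.compile(r"(missing|undefined|unset).*(config|env)", re.IGNORECASE),
]


def suspects_after_deploy(lines, deploy_marker="deploy"):
    """Return lines after the first deploy line that match a suspect pattern."""
    seen_deploy = False
    hits = []
    for line in lines:
        if not seen_deploy:
            # Skip everything up to and including the first deploy line.
            if deploy_marker in line.lower():
                seen_deploy = True
            continue
        if any(p.search(line) for p in SUSPECT_PATTERNS):
            hits.append(line)
    return hits


# Invented sample feed, for illustration only.
sample_feed = [
    "12:00:01 INFO checkout healthy",
    "12:01:10 INFO deploy payments-api started",
    "12:01:42 ERROR secret PAYMENT_KEY not found",
    "12:01:43 WARN retry queue growing",
    "12:01:50 ERROR missing config value for gateway timeout",
]

for hit in suspects_after_deploy(sample_feed):
    print(hit)
```

Only lines after the deploy are considered, which mirrors the advice to focus on what changed right after the deploy and ignore earlier noise.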
Live status
Score drops as the incident worsens.
Score: 15
Error rate: 4.20%
Latency: 1400 ms
Queue depth: 980
Revenue loss: $0
Log feed
Waiting for signals...
Actions
Actions are one-way: once taken, they cannot be undone. Pick the safest fix first.
Incident note
Write a short summary with cause, fix, and follow-up.
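One way to keep the note short and complete is to treat the three parts as required fields. The sketch below is a minimal example under that assumption; the `IncidentNote` class and the example values are hypothetical and do not come from the sim.

```python
from dataclasses import dataclass


@dataclass
class IncidentNote:
    """A three-part incident summary: cause, fix, and follow-up."""

    cause: str
    fix: str
    follow_up: str

    def render(self) -> str:
        # One labeled line per part keeps the note easy to skim.
        return (
            f"Cause: {self.cause}\n"
            f"Fix: {self.fix}\n"
            f"Follow-up: {self.follow_up}"
        )


# Illustrative values only; write what you actually observed in the sim.
note = IncidentNote(
    cause="Deploy shipped without a required secret.",
    fix="Rolled back the deploy.",
    follow_up="Add a pre-deploy check for required secrets.",
)
print(note.render())
```

Because every field is required, the note cannot be created with a part left out.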