The Problem with MTTR: Learning from Incident Reports | Courtney Nash

Dev Interrupted - A podcast by LinearB - Tuesdays

Tracking Mean Time To Restore (MTTR) is standard industry practice for incident response and analysis, but should it be? Courtney Nash, an Internet Incident Librarian, argues that MTTR is not a reliable metric - and we think she's got a point. We caught up with Courtney at the DevOps Enterprise Summit in Las Vegas, where she was making her case against MTTR in favor of alternative metrics (SLOs and cost of coordination data), practices (Near Miss analysis), and mindsets (humans are the ...
