Zachary Flower April 14, 2020
Stuff breaks; it’s inevitable. People make mistakes, technology breaks down, and processes aren’t infallible. But when incidents happen, what can we do about them? What...
Chris Tozzi March 04, 2020
The discussion about incident management tends to focus on what happens in real-time, when an incident is actually occurring. To a degree, that makes sense;...
Chris Riley February 14, 2020
Every year I get on some technical kick. These fascinations usually end up being some sort of design pattern or process. In 2020, I’m really...
Dan Holloran October 07, 2019
DevOps and IT operations teams rely on visibility across disparate applications and infrastructure in order to know when a complete service is healthy and when...
Dan Holloran August 16, 2019
Modern Agile practices and DevOps methodologies are leading to faster feature releases even though systems are becoming more complex. With high velocity comes more change...
Nell Gable August 12, 2019
In the traditional IT Infrastructure Library (ITIL) approach to IT service management (ITSM) and IT operations, root cause analysis is required for effective incident management....
Brad Griffith August 01, 2019
Ishikawa’s fishbone diagram is a method for visualizing and analyzing nearly any problem to find the root cause of an issue. According to TechTarget, the...
Brad Griffith April 17, 2019
IT incidents from Active Directory, account deletion, printer not printing, and monitor flickering to software development incidents such as application delivery and code merge issues...
Dan Holloran March 25, 2019
In many organizations, DevOps, IT, SRE and operations teams can become laser-focused on reducing MTTA through improvements to real-time collaboration and visibility. While optimizing the...
Dan Holloran February 14, 2019
Post-incident reviews, commonly called post-mortem reports, are a critical and highly understated process of the incident lifecycle. DevOps-centric teams simply can’t improve without retrospective,...