See You at Velocity New York

Velocity New York is coming up from October 1 through 4, and we can’t wait to see you there! The O’Reilly Velocity Conference is the best place on the planet for web ops and systems engineering professionals to learn from their peers, exchange ideas with other experts, and share best practices for handling modern systems and applications. Don’t forget to stop by the VictorOps booth to pick up a copy of Jason Hand’s new O’Reilly book, Post-Incident Reviews. Attend this talk by Maggie Gourlay,…

On-Call Horror Story Number One: This Wins the Most Embarrassing Award

Let’s face it. Being on-call is never a neutral experience. We asked five VictorOps employees to share their “on-call horror stories,” or the worst outage stories of their tech careers. Most of them responded, “Which story do you want to hear?” My interviewees were haunted by memories of getting paged in the middle of the night, driving into the office, and spending hours, or even days, trying to fix a problem. If you’ve worked in technology or engineering, you can probably relate to these stories…

Post-Incident Reviews Part Four: A Post-Incident Review Guide and Next Best Steps

Over the course of the last three blog posts on my eBook, Post-Incident Reviews: Learning from Failure for Improved Incident Response, I have shared some recommended methods for incident management and conveyed how the old ways of conducting post-incident Root Cause Analysis are outdated and ineffective. It is critical for companies to understand that incidents result not from a single root cause, but from a complex interplay of technology and the humans managing it: a socio-technical system that comprises the reality of modern software. If…

Post-Incident Reviews Part Three: Identifying Lifecycle and Conducting a Review

As we dive further into the eBook, “Post-Incident Reviews: Learning from Failure for Improved Incident Response,” we’ll explore how using analysis as an avenue for learning is the key to developing a successful post-incident plan. If you haven’t read the previous posts in this series, you can find them here: “Learning from Failure for Improved Incident Response” and “The Demise of Root Cause Analysis.” For a free copy of the full eBook, click here.
Discovering Areas of Improvement
Oftentimes, it’s easy for engineers, product owners,…

Post-Incident Reviews Part Two: The Demise of Root Cause Analysis

When incidents occur, the natural response is to investigate and pinpoint the cause before looking for a solution. However, this traditional approach assumes that causality is determinable. What the modern IT professional needs to understand is that problems stem not from one primary cause but from the complex interplay of our systems and the teams tasked with managing them. As we dive further into my recent eBook, Post-Incident Reviews: Learning from Failure for Improved Incident Response, I’ll explore how choosing appropriate models can…

Post-Incident Reviews: Learning from Failure for Improved Incident Response

I think we can all agree that as technology advances at a breakneck pace, expectations and challenges increase along with it. Clients expect flawless service, 24/7 support, and quick, easy-to-implement solutions. As an IT professional and DevOps evangelist, I’ve come to understand that to manage these expectations, the methods for detecting, resolving, and improving systems need to evolve too. In my recent eBook, “Post-Incident Reviews: Learning from Failure for Improved Incident Response,” I reveal modern methods for implementing a successful post-incident strategy. For a free copy…

We Just Launched Live Call Routing Using Twilio Functions

As of today, Live Call Routing is available to all VictorOps users. Read the press release here. This functionality connects callers with the right responder immediately, using VictorOps on-call schedules and escalation policies to make sure that the caller gets to the right people every time. If the primary responder does not respond, the call will follow the escalation steps until someone is available. When the call is answered, an ACKNOWLEDGED incident will be posted to the timeline with important information about the call. If…

An Interview with Jason Hand, Author of New O’Reilly Book, Post-Incident Reviews

Jason Hand is DevOps Evangelist at VictorOps and author of Post-Incident Reviews, a new ebook about learning from failure to improve incident response. Jason spoke with Marlo Vernon about the writing experience. This interview has been edited and condensed.
MV: What prompted you to write this book?
JH: Since I’ve been working at VictorOps, the topics covered in the book, especially blameless postmortems, have been not only the most relevant to the company, but also the most interesting to me. It’s a very different perspective…

Post-Incident Reviews: New Book, Updated Feature, Good Times Ahead

I’m excited to announce that VictorOps has updated the feature formerly known as the Postmortem Report, which is now called the Post-Incident Review. This updated feature aligns with our colleague Jason Hand’s new O’Reilly book: Post-Incident Reviews.
Using VictorOps to continuously improve
When we first launched the Postmortem Report over three years ago, we were excited to see how enthusiastically our customers leveraged this feature to improve as a team. Since then, we have seen many permutations of post-incident reviews, but the consistent formula for success…

How I Built My Own Homebrew Alarm System Using Particle and VictorOps

I’m a Maker. I like to make things. A few months ago, Greg Frank, a VictorOps developer, got me interested in Particle.io Photon Microcontrollers. Greg has completed several home automation projects using Arduino and Photon devices, as documented in this blog post on his Smart Home experiment. Greg’s projects inspired me to consider my home alarm system, and how I could use VictorOps to alert me in a more sophisticated and cool way. Check out my 19-second video to see how it works: My home…