The rapid pace of growth proved a challenge for Skyscanner’s existing monitoring systems
As Skyscanner expanded globally, they found they were rapidly outgrowing their existing monitoring solutions and began looking for tooling that would let them both scale effectively and monitor availability worldwide. The challenge of maintaining availability across a high-traffic website was compounded by a growing number of smaller services and the resulting volume of simultaneous deployments.
Given the complexity of running a website that processes more than 42,000 events per second across hundreds of services, the Skyscanner team needed a scalable solution to efficiently detect and alert on outages. This is when Skyscanner started to investigate potential solutions.
No More “Round-Robin” location checks
As Skyscanner evaluated monitoring solutions that would support global expansion, Monitis made it to the top of the list because the company offers a widespread distribution of monitoring locations across the world.
Unlike alternative products, Monitis offered a static selection of test node locations rather than a regional pool, which enabled consistent responses for trending of both uptime and endpoint latency. Monitis, a TeamViewer company, with clientele in over 150 countries, provides cloud-based monitoring systems to test the functionality of 300,000+ websites, servers, and applications.
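The difference between fixed test nodes and a rotating regional pool can be illustrated with a small sketch. The location names and latency figures below are made-up illustrative values, not Skyscanner or Monitis data, and the function names are hypothetical. The point is only that a series sampled from one fixed node varies far less than one that rotates through nodes, so genuine regressions are easier to spot in the trend.

```python
from statistics import pstdev

# Illustrative latencies in milliseconds per test node (made-up numbers).
LATENCY_MS = {
    "london": [82, 85, 80, 84],
    "sao_paulo": [210, 215, 208, 212],
    "tokyo": [150, 148, 153, 151],
}


def fixed_node_series(location):
    """Every sample comes from the same node, so samples are comparable over time."""
    return list(LATENCY_MS[location])


def round_robin_series(locations):
    """Each successive check runs from the next node in the pool.

    Geographic distance is mixed into the series, which hides real changes
    in endpoint latency behind location-to-location differences.
    """
    sample_count = len(LATENCY_MS[locations[0]])
    return [
        LATENCY_MS[locations[i % len(locations)]][i]
        for i in range(sample_count)
    ]


fixed = fixed_node_series("london")
mixed = round_robin_series(["london", "sao_paulo", "tokyo"])

# The fixed-node series has a much smaller spread than the rotating one.
print("fixed node spread:", round(pstdev(fixed), 1))
print("round-robin spread:", round(pstdev(mixed), 1))
```

With a fixed node, a sudden jump in the series points to a change in the endpoint itself rather than to a change in where the check happened to run.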
Alert routing and incident notification integration
Skyscanner has 700 uptime tests for monitoring service endpoints, along with transaction monitors that test the most important customer journeys. The endpoints for uptime tests cover API and backend health, region-specific testing, B2B products and, most importantly, user-facing endpoints.
They use Full Page Load for benchmarking their sites (e.g. http://www.skyscanner.com.br) against the sites of other technology leaders.
Adding VictorOps to quickly route alerts to the right responder
Skyscanner also uses VictorOps, a solution that provides teams with a virtual environment where they can prepare for, react to and recover from each incident regardless of location or device. The Monitis/VictorOps integration further improves Skyscanner’s time to both detect and resolve performance issues, ensuring that their services are always available and performing for their users.
Skyscanner teams now rely heavily on VictorOps for incident management, using its Annotations to add external URLs to alerts and to the Monitis shared dashboard. VictorOps’s notifications solved the problem of routing Monitis alerts to the right on-call team member, who could ensure that any performance issues would be resolved.
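As a rough illustration of the routing idea described here, the sketch below maps an incoming alert to an owning on-call team and attaches an external URL as an annotation. It does not use the Monitis or VictorOps APIs; the service names, team names, URL, and the `route_alert` function are all hypothetical and exist only to show the shape of the logic.

```python
from dataclasses import dataclass, field

# Hypothetical mapping from a monitored service to its owning on-call team.
TEAM_BY_SERVICE = {
    "flights-api": "flights-oncall",
    "hotels-web": "hotels-oncall",
}
DEFAULT_TEAM = "platform-oncall"  # assumed fallback for unmapped services


@dataclass
class Alert:
    service: str
    message: str
    annotations: dict = field(default_factory=dict)


def route_alert(alert, runbook_urls):
    """Return the team that should be paged and annotate the alert in place.

    If a runbook URL is known for the service, it is attached so the
    responder has context without searching for it.
    """
    team = TEAM_BY_SERVICE.get(alert.service, DEFAULT_TEAM)
    url = runbook_urls.get(alert.service)
    if url is not None:
        alert.annotations["runbook"] = url
    return team


# Placeholder URL on a reserved example domain.
runbooks = {"flights-api": "https://example.com/runbooks/flights-api"}

known = Alert(service="flights-api", message="uptime check failed")
unknown = Alert(service="cars-api", message="uptime check failed")

known_team = route_alert(known, runbooks)
unknown_team = route_alert(unknown, runbooks)
print(known_team, known.annotations)
print(unknown_team, unknown.annotations)
```

Keeping a default team in the mapping means an alert for a newly deployed service still reaches someone, which matters when many small services are released at once.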
Expanding use of Monitis and VictorOps to teams around the world
Today, Skyscanner teams around the world are working with Monitis and VictorOps to drive business value. Both solutions shorten the time Skyscanner takes to detect and resolve performance issues, ensuring their services are always available for their users without the initial complexities of ownership and implementation that often come with new tooling.