Loading…
DOES US 2021 has ended
Back To Schedule
Thursday, October 7 • 2:20pm - 2:50pm
Chaos and Reliability: A Surprising Friendship in the Enterprise

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Chaos Engineering is often characterized as “breaking things in production,” which lends it an air of something only feasible for technologically elite or sophisticated organizations. In practice, it’s been a key element in digital transformation from the ground up for a number of companies ranging from pre-streaming Netflix to those in highly regulated industries like healthcare, telecommunications, and financial services. 
Many enterprises are grappling with application modernization at an ever-increasing scale, and leveraging chaos-informed experimentation as a facet of their SRE practices can help them get their arms around the complexity of their systems. Understanding the complexity of distributed systems is foundational but critical to true observability.  These practices can inevitably lead to clarity in metrics like SLOs, grounded in reality instead of guesswork. 
In this talk, Troy Koss (Director of SRE at Capital One) joins Courtney Nash (researcher at Verica) to explore some of the myths of Chaos Engineering, and how he’s put it into practice at multiple enterprise companies to foster a culture focused on reliability. Join them to learn how not chaotic it can be to adopt chaos engineering and how effective it can be at accelerating your SRE journey. You might be surprised to find out how close you already are to getting started...

Speakers
avatar for Troy Koss

Troy Koss

Director, Site Reliability Engineering (SRE), Capital One
With what seems to be a natural attraction towards reliability, Troy has constantly found himself involved in making things...well...more reliable. After working in software development, he stumbled into operations and saw a clear opportunity to use software to orchestrate such efforts... Read More →
avatar for Courtney Nash

Courtney Nash

Internet Incident Librarian, Verica
Courtney Nash is a researcher focused on system safety and failures in complex socio-technical systems. An erstwhile cognitive neuroscientist, she has always been fascinated by how people learn, and the ways memory influences how they solve problems. Over the past two decades, she’s... Read More →


Thursday October 7, 2021 2:20pm - 2:50pm CDT
Track 2