DOP208-R1: Amazon's approach to failing successfully - a podcast by AWS

from 2021-01-31T22:10:42.023393

:: ::

Welcome to the real world, where things don't always go your way. Systems can fail despite being designed to be highly available, scalable, and resilient. These failures, if used correctly, can be a powerful lever for gaining a deep understanding of how a system actually works, as well as a tool for learning how to avoid future failures. In this session, we cover Amazon's favorite techniques for defining and reviewing metrics-watching the systems before they fail-as well as how to do an effective postmortem that drives both learning and meaningful improvement.

Further episodes of AWS re:Invent 2019

Further podcasts by AWS

Website of AWS