Episode 98 - Histograms and Service Level Objectives - a podcast by Breandan Dezendorf, Ken Mink, Jack Neely, and Jarod Watkins
from 2020-09-11T09:00
Where we talk about what Service Level Objectives actually are and why they are so important in the field of Site Reliability Engineering. We cover the definition of an SLO, how they relate to error budgets, and take a look at various implementations of time series databases' support for calculating accurate percentiles.
Comments for the episode are welcome - at the bottom of the show notes for the episode there is a Disqus setup, or you can email us at feedback@operations.fm.
Sponsors for Episode 98:
42 Lines is a DevOps consulting firm specializing in
Observability, Cloud Migration, Cost Control, Security Practices, and Team
Mentoring.
Links for Episode 98:
- Atlassian Incident Management
- High Availibility Percentage Calculation
- Google SRE Book: Embracing Risk
- Quantile Definition
- Four Golden Signals
- Histograms at Scale
- VictoriaMetrics Histograms
- Circonus Log-Linear Histograms
- T-Digests
Further episodes of Practical Operations Podcast Episode Feed
Further podcasts by Breandan Dezendorf, Ken Mink, Jack Neely, and Jarod Watkins
Website of Breandan Dezendorf, Ken Mink, Jack Neely, and Jarod Watkins