Welcome to episode 22 of the Full Stack Journey podcast! This month, Scott is joined by Michael Kehoe (Twitter, LinkedIn, and web site) to talk about site reliability engineering (SRE). Scott and Michael tackle topics like:

  • What is site reliability engineering?
  • The importance of automation within SRE
  • The key skills an SRE practitioner needs
  • Commonly-encountered programming languages
  • The relationship between SRE and DevOps
  • Learning from failures and using blameless post-mortems to continually improve (“measure everything” and closing the feedback loop)
  • Comparing SRE and the idea of a “full stack engineer”
  • How to build SRE skills

Show Links:

Michael’s Awesome SRE Cheatsheets – GitHub

LinkedIn Engineering blog

High Scalability.com

A Tour of Go – golang.org

SRE Weekly.com

Site Reliability Engineering by O’Reilly

Share this episode

Grab a Packet Capture!

Get a weekly log of all the newest content across the network in the Packet Capture newsletter.

Subscribe

Because you need maintenance too.

Human Infrastructure is a weekly newsletter about life in IT.

Subscribe

Leave a Comment