Artwork

Content provided by Michael Budd, Fraser Hart, Lewis Cains, Edd Mann, Michael Budd, Fraser Hart, Lewis Cains, and Edd Mann. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Michael Budd, Fraser Hart, Lewis Cains, Edd Mann, Michael Budd, Fraser Hart, Lewis Cains, and Edd Mann or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

148: Site Reliability Engineering with Niall Murphy

59:30
 
Share
 

Manage episode 214305866 series 2410493
Content provided by Michael Budd, Fraser Hart, Lewis Cains, Edd Mann, Michael Budd, Fraser Hart, Lewis Cains, and Edd Mann. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Michael Budd, Fraser Hart, Lewis Cains, Edd Mann, Michael Budd, Fraser Hart, Lewis Cains, and Edd Mann or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

In this week’s episode we are lucky to be joined by Niall Murphy to discuss the discipline of Site Reliability Engineering. We start off by speaking about how he got into computing, how the SRE role came to be and what drew him to it. From here, we highlight the position of an SRE within a company/group, what SLA’s are, the positives of having 50% operations work caps and blameless postmortems. This leads us to talk about the reasoning behind striving for 100% uptime is actually detrimental to the product, and the benefits of having an Error Budget. Finally, we discuss how the role has evolved since its inception, the Wheel of Misfortune and what drew him to contribute to the seminal SRE book.

Show Links

  continue reading

164 episodes

Artwork
iconShare
 
Manage episode 214305866 series 2410493
Content provided by Michael Budd, Fraser Hart, Lewis Cains, Edd Mann, Michael Budd, Fraser Hart, Lewis Cains, and Edd Mann. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Michael Budd, Fraser Hart, Lewis Cains, Edd Mann, Michael Budd, Fraser Hart, Lewis Cains, and Edd Mann or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://player.fm/legal.

In this week’s episode we are lucky to be joined by Niall Murphy to discuss the discipline of Site Reliability Engineering. We start off by speaking about how he got into computing, how the SRE role came to be and what drew him to it. From here, we highlight the position of an SRE within a company/group, what SLA’s are, the positives of having 50% operations work caps and blameless postmortems. This leads us to talk about the reasoning behind striving for 100% uptime is actually detrimental to the product, and the benefits of having an Error Budget. Finally, we discuss how the role has evolved since its inception, the Wheel of Misfortune and what drew him to contribute to the seminal SRE book.

Show Links

  continue reading

164 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide