Real-World SRE Perspectives

35:27
 
Share
 

Manage episode 230242475 series 2285741
By Discovered by Player FM and our community — copyright is owned by the publisher, not Player FM, and audio streamed directly from their servers.

SHOW: 392
DESCRIPTION: Brian talks with Gustavo Franco (@stratus, Customer Reliability Engineer at Google) about real-world experience as SRE/SRE Manager and CRE Manager, a discussion about how to measure SRE success, as well as how to onboard the SRE/CRE concepts and processes to new teams.

SHOW SPONSOR LINKS:

CLOUD NEWS OF THE WEEK:

SHOW INTERVIEW LINKS:

Gustavo's Background: https://conferences.oreilly.com/velocity/vl-ca/public/schedule/speaker/150125

SHOW NOTES:

Topic 1 - Welcome to the show. Tell us about your background, and some of the things you work on today as it relates to SRE and CRE teams.

Topic 2 - Let's talk about what SRE is intended to do, and maybe how it differs (or is the same) from existing teams that might be labeled "Ops" or "DevOps". Maybe we can also talk about some of the types of skills that highlight what SRE does.

Topic 3 - What are some of the ways to avoid an SRE (or CRE) team just becoming the band-aid team to fix all the things that developers don't want to put into code because they are under deadlines (security, bug fixed, scalability, etc.)?

Topic 4 - We're hearing more about these terms "AIOps" and "ChaosEngineering". How much can SRE/CRE teams augment applications through tools that either bring deeper insight (e.g. AIOps) or create scenarios that developers can't emulate (e.g. Chaos)?

Topic 5 - You've been around SRE/CRE for a while now. What are some of the positive and negative lessons you've learned and could share with the audience?
FEEDBACK?

442 episodes available. A new episode about every 6 days averaging 32 mins duration .