Radio and PodcastRadio and PodcastLive Radio & Podcasts
Incidents & Operations with Dan Slimmon artwork
Technology

Incidents & Operations with Dan Slimmon

Software Delivery in Small Batches by Adam Hawkins

Aug 11, 202461:48Technology

In this episode, Adam welcomes Dan Slimmon, an experienced Site Reliability Engineer (SRE) to discuss aspects of incident response and troubleshooting in software engineering. Dan explains his methodology for clinical tr...

About This Episode

Incidents & Operations with Dan Slimmon is an episode from Software Delivery in Small Batches by Adam Hawkins. In this episode, Adam welcomes Dan Slimmon, an experienced Site Reliability Engineer (SRE) to discuss aspects of incident respons...

Podcast

This episode belongs to Software Delivery in Small Batches.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Aug 11, 2024, 61:48 long, audio available.

Questions About This Episode

What is Incidents & Operations with Dan Slimmon about?

In this episode, Adam welcomes Dan Slimmon, an experienced Site Reliability Engineer (SRE) to discuss aspects of incident response and troubleshooting in software engineering. Dan explains his methodology for clinical troubleshooting, the importance of maintaining a common mental model, and techniques for leading effective incident response efforts. They also delve into the value of continuous ops reviews and ongoing mental model updates to prevent issues, emphasizing the need for structured processes and effective communication. Want more? 🚀 New listener? Start with the introduction . 🎁 Enter the FREE giveaway for a copy of "Release It!" 🧭 Get the Small Batches Way guide to software delivery excellence 🥋 Software Kaizen: My One-on-One System for Engineering Leadership 🧑‍🎓 Dan's course on leading incidents (Code SMALLBATCHES24 for 24% off!) Chapters (00:00) - Incidents & Operations (01:14) - Guest Welcome (01:40) - Dan's Career Journey (02:33) - Evolution of Tech Stacks (04:59) - Clinical Troubleshooting Explained (11:53) - Incident Response Fundamentals (17:41) - Effective Communication in Incidents (26:09) - Training for Incident Response (33:22) - The Essence of Incident Response (33:53) - Balancing Short-Term and Long-Term Fixes (35:01) - The Firefighting Analogy in Software Incidents (37:11) - Postmortems: Learning from Incidents (42:14) - Building a Shared Mental Model (42:41) - Looking for Trouble: Proactive System Monitoring (47:59) - Ops Reviews: Continuous Improvement (54:37) - The Importance of Closing the Feedback Loop (59:40) - Final Thoughts and Resources ★ Support this podcast on Patreon ★

Where can I listen to Incidents & Operations with Dan Slimmon?

You can listen to Incidents & Operations with Dan Slimmon online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is Incidents & Operations with Dan Slimmon from?

Incidents & Operations with Dan Slimmon is an episode from Software Delivery in Small Batches by Adam Hawkins.

How long is this episode?

This episode is 61:48 long.

When was this episode published?

This episode was published on Aug 11, 2024.

Can I save Incidents & Operations with Dan Slimmon for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Software Delivery in Small Batches?

Yes. This page shows related episodes from Software Delivery in Small Batches when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to Incidents & Operations with Dan Slimmon?

You can listen to Incidents & Operations with Dan Slimmon on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

Incidents & Operations with Dan Slimmon is from Software Delivery in Small Batches by Adam Hawkins.

What are the episode details?

Published Aug 11, 2024 and 61:48 long