Radio and PodcastRadio and PodcastLive Radio & Podcasts
How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 artwork
Technology

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767

This Week in Machine Learning & Artificial Intelligence (AI) Podcast by TWIML

May 7, 202653:19Technology

In this episode, Scott Clark, co-founder and CEO of Distributional, joins us to explore how teams can reliably operate and improve complex LLM systems and agents in production. Scott introduces a Maslow’s hierarchy of ob...

About This Episode

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 is an episode from This Week in Machine Learning & Artificial Intelligence (AI) Podcast by TWIML. In this episode, Scott Clark, co-founder and CEO of Distributional, joi...

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published May 7, 2026, 53:19 long, audio available.

Questions About This Episode

What is How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 about?

In this episode, Scott Clark, co-founder and CEO of Distributional, joins us to explore how teams can reliably operate and improve complex LLM systems and agents in production. Scott introduces a Maslow’s hierarchy of observability: telemetry for logging, monitoring for known signals, and post-production or online analytics to surface unknown unknowns. We dig into examples of real-world failures Scott’s team has seen in production systems, such as “lazy” tool-use hallucinations that standard evals miss, and how mapping traces into vector fingerprints enables clustering and topic discovery to uncover emergent behaviors. Scott explains how analytics can feed the data flywheel by generating evals, guardrails, and training data, and why online, adaptive approaches are essential for non-stationary models. We also touch on practical how-to’s such as instrumentation with OpenTelemetry, the GenAI semantic conventions, and the role of dedicated analytics tools. The complete show notes for this episode can be found at

Where can I listen to How to Find the Agent Failures Your Evals Miss with Scott Clark - #767?

You can listen to How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 from?

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 is an episode from This Week in Machine Learning & Artificial Intelligence (AI) Podcast by TWIML.

How long is this episode?

This episode is 53:19 long.

When was this episode published?

This episode was published on May 7, 2026.

Can I save How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from This Week in Machine Learning & Artificial Intelligence (AI) Podcast?

Yes. This page shows related episodes from This Week in Machine Learning & Artificial Intelligence (AI) Podcast when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to How to Find the Agent Failures Your Evals Miss with Scott Clark - #767?

You can listen to How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767 is from This Week in Machine Learning & Artificial Intelligence (AI) Podcast by TWIML.

What are the episode details?

Published May 7, 2026 and 53:19 long