Radio and PodcastRadio and PodcastLive Radio & Podcasts
EVA - A Framework for Evaluating Voice Agents by ServiceNow artwork
Technology

EVA - A Framework for Evaluating Voice Agents by ServiceNow

ServiceNow TechBytes by ServiceNow Community TechBytes

Apr 29, 202629:37Technology

Voice AI agent evaluation — why it's fundamentally harder than text, how cascade failures derail conversations invisibly, and ServiceNow's open-source framework to establish industry evaluation standards. Featuring...

About This Episode

EVA - A Framework for Evaluating Voice Agents by ServiceNow is an episode from ServiceNow TechBytes by ServiceNow Community TechBytes. Voice AI agent evaluation — why it's fundamentally harder than text, how cascade failures derail co...

Podcast

This episode belongs to ServiceNow TechBytes.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Apr 29, 2026, 29:37 long, audio available.

Questions About This Episode

What is EVA - A Framework for Evaluating Voice Agents by ServiceNow about?

Voice AI agent evaluation — why it's fundamentally harder than text, how cascade failures derail conversations invisibly, and ServiceNow's open-source framework to establish industry evaluation standards. Featuring real audio examples showing authentication failures, leaked reasoning, and latency problems. WHAT WE COVER TARA BOGAVELLI — Research Engineer, ServiceNow Leading the open-source voice agent evaluation framework. Explains why existing benchmarks don't measure what matters and what ServiceNow is releasing to establish industry standards. KATRINA STANKIEWICZ — Staff Machine Learning Engineer, ServiceNow Cascade model architecture expert. Breaks down STT → LLM → TTS failure modes, named entity transcription challenges, and real audio example analysis. GABRIELLE GAUTHIER MELANÇON — Staff Applied Research Scientist, ServiceNow Multi-language evaluation specialist. Reveals why Large Audio Language Models lag behind, the native speaker requirement, and bot-to-bot simulation methodology. CHAPTERS 0:00 Introduction — The evaluation gap 1:11 ServiceNow's Open-Source Framework Announcement — Tara Bogavelli 2:43 Meet the Researchers 3:43 Voice-Specific Challenges — Tara Bogavelli 5:03 Cascade Architecture: STT → LLM → TTS — Katrina Stankiewicz 7:57 The Named Entity Problem — Katrina Stankiewicz 10:06 Evaluation Metrics: Accuracy vs Experience — Gabrielle Gauthier Melançon 11:23 Bot-to-Bot Testing at Scale — Gabrielle Gauthier Melançon 14:30 The LALM Gap: Why Audio AI Judges Struggle — Tara Bogavelli 16:57 Real Audio Example: Flight Rebooking Gone Wrong 21:58 Breaking Down the Failures — Katrina Stankiewicz 28:30 Wrap-Up & Resources KEY INSIGHTS The Cascade Failure Problem: STT → LLM → TTS errors propagate invisibly Named Entity Transcription: The enterprise blocker—names, confirmation codes, emails break authentication Accuracy vs Experience: Perfect task completion means nothing if users hang up due to poor experience LALM Gap: Large Audio Language Models lag behind text LLMs—human evaluators remain essential Latency Kills Conversations: Five-second pauses make users think the call dropped, breaking the experience even when tasks complete Open-Source Framework: ServiceNow releasing evaluation tools, metrics, and bot-to-bot simulation methodology for the industry. LEARN MORE Website: GitHub: Blog Post: Dataset: ABOUT Hosted by Bobby Brill. ServiceNow Insights podcast explores AI research, real-world applications, and the people building the future of work.

Where can I listen to EVA - A Framework for Evaluating Voice Agents by ServiceNow?

You can listen to EVA - A Framework for Evaluating Voice Agents by ServiceNow online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is EVA - A Framework for Evaluating Voice Agents by ServiceNow from?

EVA - A Framework for Evaluating Voice Agents by ServiceNow is an episode from ServiceNow TechBytes by ServiceNow Community TechBytes.

How long is this episode?

This episode is 29:37 long.

When was this episode published?

This episode was published on Apr 29, 2026.

Can I save EVA - A Framework for Evaluating Voice Agents by ServiceNow for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from ServiceNow TechBytes?

Yes. This page shows related episodes from ServiceNow TechBytes when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to EVA - A Framework for Evaluating Voice Agents by ServiceNow?

You can listen to EVA - A Framework for Evaluating Voice Agents by ServiceNow on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

EVA - A Framework for Evaluating Voice Agents by ServiceNow is from ServiceNow TechBytes by ServiceNow Community TechBytes.

What are the episode details?

Published Apr 29, 2026 and 29:37 long