
AI Insights - Ep.3: Rethinking AI Performance Metrics is an episode from Cisco Podcast Network.
Published Mar 26, 2026. Length: 00:27:26.
In the latest episode of the Cisco AI Insights podcast, hosts Rafael Herrera and Sonia Marques are joined by Dr. Catarina Carvalho, a Cisco leader in machine learning engineering. Together, they unpack the academic paper "Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following," developed by researchers from the University of Maryland and the University of Waterloo. As the industry moves toward more reliable multimodal models, traditional pass-or-fail evaluation is no longer sufficient. The paper introduces a hierarchical framework that uses "LLM-as-a-judge" to evaluate outputs across five distinct criteria: visual grounding, logical coherence, factuality, reflection, and conciseness. Dr. Carvalho guides the discussion through the nuances of this "judge of judges" approach, exploring why human alignment remains the gold standard even as evaluation processes are automated. A special thank you to the teams at both the University of Waterloo and the University of Maryland, College Park, for developing this month's paper. If you are interested in reading the paper yourself, please visit this link: