Technology

Benchmarking AI Models

Linear Digressions by Ben Jaffe and Katie Malone

Mar 30, 202600:29:55Technology

How do you know if a new AI model is actually better than the last one? It turns out answering that question is a lot messier than it sounds. This week we dig into the world of LLM benchmarks — the standardized tests use...

About This Episode

Benchmarking AI Models is an episode from Linear Digressions by Ben Jaffe and Katie Malone. How do you know if a new AI model is actually better than the last one? It turns out answering that question is a lot messier than it sounds. This w...

Podcast

This episode belongs to Linear Digressions.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Mar 30, 2026, 00:29:55 long, audio available.

Benchmarking AI Models

About This Episode

Related Episodes