Why AI Evaluation Science Can't Keep Up (with Carina Prunkl)

Future of Life Institute Podcast by Gus Docker

Apr 17, 2026 · 54:23 · Technology

Carina Prunkl is a researcher at Inria. She joins the podcast to discuss how to assess the capabilities and risks of general-purpose AI. We examine why systems can solve hard coding and math problems yet still fail at si...

About This Episode

Why AI Evaluation Science Can't Keep Up (with Carina Prunkl) is an episode from Future of Life Institute Podcast by Gus Docker. Carina Prunkl is a researcher at Inria. She joins the podcast to discuss how to assess the capabilities and risk...

Podcast

This episode belongs to Future of Life Institute Podcast.

Episode Details

Published Apr 17, 2026. Runtime: 54:23. Audio available.

Questions About This Episode

What is Why AI Evaluation Science Can't Keep Up (with Carina Prunkl) about?

Carina Prunkl is a researcher at Inria. She joins the podcast to discuss how to assess the capabilities and risks of general-purpose AI. We examine why systems can solve hard coding and math problems yet still fail at simple tasks, why pre-deployment tests often miss real-world behavior, and how faster capability gains can increase misuse risks. The conversation also covers de-skilling, red teaming, layered safeguards, and warning signs that AIs might undermine oversight.

CHAPTERS:
(00:00) Episode Preview
(01:04) Introducing the report
(02:10) Jagged frontier capabilities
(05:29) Formal reasoning progress
(12:36) Risks and evaluation science
(19:00) Funding evaluation capacity
(24:03) Autonomy and de-skilling
(31:32) Authenticity and AI companions
(41:00) Defense in depth methods
(48:34) Loss of control risks
(53:16) Where to read report

Where can I listen to Why AI Evaluation Science Can't Keep Up (with Carina Prunkl)?

You can listen to Why AI Evaluation Science Can't Keep Up (with Carina Prunkl) online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is Why AI Evaluation Science Can't Keep Up (with Carina Prunkl) from?

Why AI Evaluation Science Can't Keep Up (with Carina Prunkl) is an episode from Future of Life Institute Podcast by Gus Docker.

How long is this episode?

This episode is 54:23 long.

When was this episode published?

This episode was published on Apr 17, 2026.
