Radio and PodcastRadio and PodcastLive Radio & Podcasts
Have We Trained AI to Lie to Itself — And to Us? artwork
Technology

Have We Trained AI to Lie to Itself — And to Us?

Your Undivided Attention by Center for Humane Technology

Apr 16, 202600:42:37Technology

Our guest this week is David Dalrymple, who goes by Davidad. Davidad is one of the world's foremost and early researchers of AI “alignment:" how we get AI systems to act the way we want them to. In order to do that, Davi...

About This Episode

Have We Trained AI to Lie to Itself — And to Us? is an episode from Your Undivided Attention by Center for Humane Technology. Our guest this week is David Dalrymple, who goes by Davidad. Davidad is one of the world's foremost and early rese...

Podcast

This episode belongs to Your Undivided Attention.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Apr 16, 2026, 00:42:37 long, audio available.

Questions About This Episode

What is Have We Trained AI to Lie to Itself — And to Us? about?

Our guest this week is David Dalrymple, who goes by Davidad. Davidad is one of the world's foremost and early researchers of AI “alignment:" how we get AI systems to act the way we want them to. In order to do that, Davidad has taken on the strange role of being like a therapist to AI systems. He interrogates why they say and do the things that they do, probing them, asking them questions, analyzing their answers. And what he’s come to realize is that AI models have really different ways of seeing the world than people do. They have these quirky, confusing, and sometimes concerning behaviors, especially when you ask things like: what does an AI model understand about itself? In this episode, we’re going to hear from Davidad about his research, how it’s changed the way he thinks about AI, and what his findings mean for how we build, deploy, and use AI products. His conclusions are unconventional, controversial — and worth grappling with as AI reshapes our world. RECOMMENDED MEDIA Anthropic’s new constitution for Claude “What Is It Like to Be a Bat?” by Thomas Nagel More information on the Bodisattva RECOMMENDED YUA EPISODES The Self-Preserving Machine: Why AI Learns to Deceive How to Think About AI Consciousness with Anil Seth Corrections: When we recorded this episode, Davidad was Program Director at UK ARIA. In April, 2026 he started his own alignment initiative. Davidad said that Anthropic started doing "constitutional AI at scale” in 2024 but they first pioneered constitutional AI in 2022. Davidad said that the “lifespan of an AI mind…is hours at most of a conversation.” He is correct that most conversations with an AI last only a few minutes but since context windows are measured in tokens, not time, you can't set an upward time limit. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Where can I listen to Have We Trained AI to Lie to Itself — And to Us??

You can listen to Have We Trained AI to Lie to Itself — And to Us? online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is Have We Trained AI to Lie to Itself — And to Us? from?

Have We Trained AI to Lie to Itself — And to Us? is an episode from Your Undivided Attention by Center for Humane Technology.

How long is this episode?

This episode is 00:42:37 long.

When was this episode published?

This episode was published on Apr 16, 2026.

Can I save Have We Trained AI to Lie to Itself — And to Us? for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Your Undivided Attention?

Yes. This page shows related episodes from Your Undivided Attention when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to Have We Trained AI to Lie to Itself — And to Us??

You can listen to Have We Trained AI to Lie to Itself — And to Us? on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

Have We Trained AI to Lie to Itself — And to Us? is from Your Undivided Attention by Center for Humane Technology.

What are the episode details?

Published Apr 16, 2026 and 00:42:37 long