Lost in the Middle (The Agents Season, Episode 3)
May 4, 2026 - 00:19:44
Radio and PodcastLive Radio & PodcastsModern AI chatbots have a few different things that go into creating them. Today we're going to talk about a really important part of the process: the alignment training, where the chatbot goes from being just a pre-trai...
A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences is an episode from Linear Digressions by Ben Jaffe and Katie Malone. Modern AI chatbots have a few different things that go into creating them. Today we're go...
This episode belongs to Linear Digressions.
Use the player on this page to stream the episode online.
Published Feb 14, 2026, 00:19:13 long, audio available.
Modern AI chatbots have a few different things that go into creating them. Today we're going to talk about a really important part of the process: the alignment training, where the chatbot goes from being just a pre-trained model—something that's kind of a fancy autocomplete—to something that really gives responses to human prompts that are more conversational, that are closer to the ones that we experience when we actually use a model like ChatGPT or Gemini or Claude. To go from the pre-trained model to one that's aligned, that's ready for a human to talk with, it uses reinforcement learning. And a really important step in figuring out the right way to frame the reinforcement learning problem happened in 2017 with a paper that we're going to talk about today: Deep Reinforcement Learning from Human Preferences. You are listening to Linear Digressions. The paper discussed in this episode is Deep Reinforcement Learning from Human Preferences
You can listen to A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences online on Radio and Podcast. Open the player on this page to stream the available audio.
A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences is an episode from Linear Digressions by Ben Jaffe and Katie Malone.
This episode is 00:19:13 long.
This episode was published on Feb 14, 2026.
Yes. Use the heart button on the episode page to add it to your favorite episodes list.
Yes. This page shows related episodes from Linear Digressions when more episodes are available from the podcast feed.
You can listen to A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences on this page when the episode audio is available from the podcast feed.
A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences is from Linear Digressions by Ben Jaffe and Katie Malone.
Published Feb 14, 2026 and 00:19:13 long