Radio and PodcastRadio and PodcastLive Radio & Podcasts
A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences artwork
Technology

A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences

Linear Digressions by Ben Jaffe and Katie Malone

Feb 14, 202600:19:13Technology

Modern AI chatbots have a few different things that go into creating them. Today we're going to talk about a really important part of the process: the alignment training, where the chatbot goes from being just a pre-trai...

About This Episode

A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences is an episode from Linear Digressions by Ben Jaffe and Katie Malone. Modern AI chatbots have a few different things that go into creating them. Today we're go...

Podcast

This episode belongs to Linear Digressions.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Feb 14, 2026, 00:19:13 long, audio available.

Questions About This Episode

What is A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences about?

Modern AI chatbots have a few different things that go into creating them. Today we're going to talk about a really important part of the process: the alignment training, where the chatbot goes from being just a pre-trained model—something that's kind of a fancy autocomplete—to something that really gives responses to human prompts that are more conversational, that are closer to the ones that we experience when we actually use a model like ChatGPT or Gemini or Claude. To go from the pre-trained model to one that's aligned, that's ready for a human to talk with, it uses reinforcement learning. And a really important step in figuring out the right way to frame the reinforcement learning problem happened in 2017 with a paper that we're going to talk about today: Deep Reinforcement Learning from Human Preferences. You are listening to Linear Digressions. The paper discussed in this episode is Deep Reinforcement Learning from Human Preferences

Where can I listen to A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences?

You can listen to A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences from?

A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences is an episode from Linear Digressions by Ben Jaffe and Katie Malone.

How long is this episode?

This episode is 00:19:13 long.

When was this episode published?

This episode was published on Feb 14, 2026.

Can I save A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Linear Digressions?

Yes. This page shows related episodes from Linear Digressions when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences?

You can listen to A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences is from Linear Digressions by Ben Jaffe and Katie Malone.

What are the episode details?

Published Feb 14, 2026 and 00:19:13 long