Technology

A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences

Linear Digressions by Ben Jaffe and Katie Malone

Feb 14, 202600:19:13Technology

Modern AI chatbots have a few different things that go into creating them. Today we're going to talk about a really important part of the process: the alignment training, where the chatbot goes from being just a pre-trai...

About This Episode

A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences is an episode from Linear Digressions by Ben Jaffe and Katie Malone. Modern AI chatbots have a few different things that go into creating them. Today we're go...

Podcast

This episode belongs to Linear Digressions.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Feb 14, 2026, 00:19:13 long, audio available.

A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences

About This Episode

Related Episodes