
Reinforcement Fine-Tuning and the Future of Specialized AI Models
Aug 5, 2025 - 40:24
Radio and PodcastLive Radio & Podcasts
In this episode, Brandon Cui, Research Scientist at MosaicML and Databricks, dives into cutting-edge advancements in AI model optimization, focusing on Reward Models and Reinforcement Learning from Human Feedback (RLHF)....
Reward Models Data Brew Episode 40 is an episode from Data Brew by Databricks by Databricks. In this episode, Brandon Cui, Research Scientist at MosaicML and Databricks, dives into cutting-edge advancements in AI model optimization, focusin...
This episode belongs to Data Brew by Databricks.
Use the player on this page to stream the episode online.
Published Mar 20, 2025, 39:58 long, audio available.