
Reinforcement Fine-Tuning and the Future of Specialized AI Models
Aug 5, 2025 - 40:24
Radio and PodcastLive Radio & Podcasts
In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, discuss SWE-bench and SWE-agent, two groundbreaking tools for evaluating and enhanci...
SWE-bench & SWE-agent Data Brew Episode 44 is an episode from Data Brew by Databricks by Databricks. In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, dis...
This episode belongs to Data Brew by Databricks.
Use the player on this page to stream the episode online.
Published Apr 17, 2025, 36:22 long, audio available.
In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, discuss SWE-bench and SWE-agent, two groundbreaking tools for evaluating and enhancing AI in software engineering. Highlights include: - SWE-bench: A benchmark for assessing AI models on real-world coding tasks. - Addressing data leakage concerns in GitHub-sourced benchmarks. - SWE-agent: An AI-driven system for navigating and solving coding challenges. - Overcoming agent limitations, such as getting stuck in loops. - The future of AI-powered code reviews and automation in software engineering.
You can listen to SWE-bench & SWE-agent Data Brew Episode 44 online on Radio and Podcast. Open the player on this page to stream the available audio.
SWE-bench & SWE-agent Data Brew Episode 44 is an episode from Data Brew by Databricks by Databricks.
This episode is 36:22 long.
This episode was published on Apr 17, 2025.
Yes. Use the heart button on the episode page to add it to your favorite episodes list.
Yes. This page shows related episodes from Data Brew by Databricks when more episodes are available from the podcast feed.
You can listen to SWE-bench & SWE-agent Data Brew Episode 44 on this page when the episode audio is available from the podcast feed.
SWE-bench & SWE-agent Data Brew Episode 44 is from Data Brew by Databricks by Databricks.
Published Apr 17, 2025 and 36:22 long