Radio and PodcastRadio and PodcastLive Radio & Podcasts
#335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models artwork
Technology

#335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models

Eye On A.I. by Craig S. Smith

Apr 19, 202601:00:21Technology

Why IBM Is Betting Everything on Small AI Models In this episode of Eye on AI, Craig Smith sits down with Sriram Raghavan, Vice President of AI at IBM Research, to explore one of the most important debates in enterprise...

About This Episode

#335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models is an episode from Eye On A.I. by Craig S. Smith. Why IBM Is Betting Everything on Small AI Models In this episode of Eye on AI, Craig Smith sits down with Sriram Raghav...

Podcast

This episode belongs to Eye On A.I..

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Apr 19, 2026, 01:00:21 long, audio available.

Questions About This Episode

What is #335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models about?

Why IBM Is Betting Everything on Small AI Models In this episode of Eye on AI, Craig Smith sits down with Sriram Raghavan, Vice President of AI at IBM Research, to explore one of the most important debates in enterprise AI right now. Do you actually need a massive model to get world class results? IBM's answer is no, and Sriram breaks down exactly why. Sriram explains why IBM chose to train its Granite models directly using reinforcement learning rather than distilling from larger models like most of the industry. The reason goes beyond performance. It comes down to data lineage, safety alignment, and a belief that small, efficient models are the only sustainable path for enterprises running AI across hybrid cloud environments. We get into the full technical stack behind that bet. How data quality has replaced model size as the real competitive advantage. Why parameter count is becoming the wrong metric entirely. How IBM's inference time scaling techniques allow an 8 billion parameter model to match the performance of GPT-4o and Claude 3.5 on code and math benchmarks. And why IBM is pioneering a new concept called Generative Computing, which treats AI models not as prompt receivers but as programmable computing elements with runtimes, modular LoRA adapters, and proper programming abstractions. Sriram also shares where IBM Research is headed next, including breakthroughs in continuous learning, agent orchestration, and making unstructured enterprise data actually usable at scale.

Where can I listen to #335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models?

You can listen to #335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is #335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models from?

#335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models is an episode from Eye On A.I. by Craig S. Smith.

How long is this episode?

This episode is 01:00:21 long.

When was this episode published?

This episode was published on Apr 19, 2026.

Can I save #335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Eye On A.I.?

Yes. This page shows related episodes from Eye On A.I. when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to #335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models?

You can listen to #335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

#335 Sriram Raghavan: Why IBM Is Betting Everything on Small AI Models is from Eye On A.I. by Craig S. Smith.

What are the episode details?

Published Apr 19, 2026 and 01:00:21 long