Radio and PodcastRadio and PodcastLive Radio & Podcasts
SE Radio 703: Sahaj Garg on Low Latency AI artwork
Technology

SE Radio 703: Sahaj Garg on Low Latency AI

Software Engineering Radio - The Podcast for Professional Software Developers by SE-Radio Team

Jan 14, 202654:50Technology

In this episode, Sahaj Garg , CTO of wispr.ai, joins SE Radio host Robert Blumen to talk about the challenges of building low-latency AI applications. They discuss latency's effect on consumer behavior as well as interac...

About This Episode

SE Radio 703: Sahaj Garg on Low Latency AI is an episode from Software Engineering Radio - The Podcast for Professional Software Developers by SE-Radio Team. In this episode, Sahaj Garg , CTO of wispr.ai, joins SE Radio host Robert Blumen t...

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Jan 14, 2026, 54:50 long, audio available.

Questions About This Episode

What is SE Radio 703: Sahaj Garg on Low Latency AI about?

In this episode, Sahaj Garg , CTO of wispr.ai, joins SE Radio host Robert Blumen to talk about the challenges of building low-latency AI applications. They discuss latency's effect on consumer behavior as well as interactive applications. The conversation explores how to measure latency and how scale impacts it. Then Sahaj and Robert shift to themes around AI, including whether "AI" means LLMs or something broader, as they look at latency requirements and challenges around subtypes of AI applications. The final part of the episode explores techniques for managing latency in AI: speed vs accuracy trade-offs; speed vs cost; latency vs cost; choosing the right model; reducing quantization; distillation; and guessing + validating. Brought to you by IEEE Computer Society and IEEE Software magazine .

Where can I listen to SE Radio 703: Sahaj Garg on Low Latency AI?

You can listen to SE Radio 703: Sahaj Garg on Low Latency AI online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is SE Radio 703: Sahaj Garg on Low Latency AI from?

SE Radio 703: Sahaj Garg on Low Latency AI is an episode from Software Engineering Radio - The Podcast for Professional Software Developers by SE-Radio Team.

How long is this episode?

This episode is 54:50 long.

When was this episode published?

This episode was published on Jan 14, 2026.

Can I save SE Radio 703: Sahaj Garg on Low Latency AI for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Software Engineering Radio - The Podcast for Professional Software Developers?

Yes. This page shows related episodes from Software Engineering Radio - The Podcast for Professional Software Developers when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to SE Radio 703: Sahaj Garg on Low Latency AI?

You can listen to SE Radio 703: Sahaj Garg on Low Latency AI on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

SE Radio 703: Sahaj Garg on Low Latency AI is from Software Engineering Radio - The Podcast for Professional Software Developers by SE-Radio Team.

What are the episode details?

Published Jan 14, 2026 and 54:50 long