Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.

The New Stack Podcast by The New Stack Podcast

Apr 29, 2026 | 00:28:06 | Technology

In this episode of The New Stack Makers, AWS developer advocate Morgan Willis demonstrates Strands Agents, an open source agentic framework with rapid adoption since its launch. Using a simple accounting API, she walks t...

Podcast

This episode belongs to The New Stack Podcast.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Apr 29, 2026. Runtime: 00:28:06. Audio available.

Questions About This Episode

What is “Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.” about?

In this episode of The New Stack Makers, AWS developer advocate Morgan Willis demonstrates Strands Agents, an open source agentic framework that has seen rapid adoption since its launch. Using a simple accounting API, she walks through three approaches to retrieving a customer’s latest invoice, showing how strongly tool design affects efficiency.

The first approach maps each API endpoint to a separate tool, which requires five chained calls and consumes about 52,000 tokens. The second switches to intent-based tools, which are built around outcomes rather than individual data operations: the same task completes in a single call using just 2,000 tokens, roughly 96% fewer, improving both efficiency and reasoning. In the third iteration, the tools are hosted on a remote MCP server via AWS Agent Core Gateway, and semantic search limits the agent’s toolset to only what is relevant to each query, further reducing token usage.

Willis emphasizes that narrowly scoped agents outperform general-purpose ones, delivering better speed, accuracy, and context efficiency. As tool ecosystems expand, designing smaller, specialized agents with tailored tools is key.

Learn more from The New Stack about the latest with Strands and MCP:

AWS Launches Its Take on an Open Source AI Agents SDK
What Is MCP? Game Changer or Just More Hype?
MCP’s biggest growing pains for production use will soon be solved
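The contrast between the first two approaches in the episode summary can be sketched in plain Python. This is an illustrative sketch only, not code from the episode and not the Strands Agents API: the in-memory data, the function names (find_customer_id, list_invoice_ids, get_invoice, get_latest_invoice), and the call-recording list are all assumptions made for the example. It shows why one outcome-focused tool needs fewer tool calls than a chain of endpoint-shaped tools, and it checks the arithmetic behind the 96% figure using the token counts quoted in the summary.

```python
"""Sketch: endpoint-per-tool vs. intent-based tools (all names and data are hypothetical)."""

# In-memory stand-in for the accounting API described in the episode summary.
CUSTOMERS = {"c-1": "Acme Corp", "c-2": "Globex"}
INVOICES = {
    "inv-1": {"customer_id": "c-1", "date": "2026-01-15", "total": 120.0},
    "inv-2": {"customer_id": "c-1", "date": "2026-03-02", "total": 340.5},
    "inv-3": {"customer_id": "c-2", "date": "2026-02-20", "total": 75.0},
}


# --- Approach 1: one tool per endpoint. Each function records its own call. ---
def find_customer_id(calls, name):
    calls.append("find_customer_id")
    return next(cid for cid, cname in CUSTOMERS.items() if cname == name)


def list_invoice_ids(calls, customer_id):
    calls.append("list_invoice_ids")
    return [iid for iid, inv in INVOICES.items() if inv["customer_id"] == customer_id]


def get_invoice(calls, invoice_id):
    calls.append("get_invoice")
    return INVOICES[invoice_id]


def latest_invoice_via_endpoints(calls, name):
    """Chain the endpoint tools step by step, as an agent would have to."""
    customer_id = find_customer_id(calls, name)
    invoice_ids = list_invoice_ids(calls, customer_id)
    invoices = [get_invoice(calls, iid) for iid in invoice_ids]
    # ISO-formatted dates sort correctly as plain strings.
    return max(invoices, key=lambda inv: inv["date"])


# --- Approach 2: one intent-based tool that returns the outcome directly. ---
def get_latest_invoice(calls, customer_name):
    calls.append("get_latest_invoice")
    customer_id = next(cid for cid, cname in CUSTOMERS.items() if cname == customer_name)
    owned = [inv for inv in INVOICES.values() if inv["customer_id"] == customer_id]
    return max(owned, key=lambda inv: inv["date"])


def reduction_percent(before, after):
    """Percentage reduction going from `before` to `after`."""
    return (before - after) / before * 100


if __name__ == "__main__":
    chained_calls, intent_calls = [], []
    print(latest_invoice_via_endpoints(chained_calls, "Acme Corp"), len(chained_calls))
    print(get_latest_invoice(intent_calls, "Acme Corp"), len(intent_calls))
    # Token counts quoted in the episode summary: about 52,000 vs. 2,000.
    print(round(reduction_percent(52_000, 2_000), 1))
```

Both approaches return the same invoice, but the chained version makes one tool call per step while the intent-based version makes one call in total. The number of chained calls in this sketch depends on the sample data and is not meant to match the five calls reported in the episode. In an agent loop, every extra tool call usually means another model round trip with the accumulated context, which is the likely reason the endpoint-per-tool design costs so many more tokens.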

Where can I listen to “Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.”?

You can listen to “Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.” online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is “Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.” from?

“Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.” is an episode from The New Stack Podcast by The New Stack Podcast.

How long is this episode?

This episode is 00:28:06 long.

When was this episode published?

This episode was published on Apr 29, 2026.

