Radio and PodcastRadio and PodcastLive Radio & Podcasts
153. LLM Inference with Bedrock artwork
Technology

153. LLM Inference with Bedrock

AWS Bites by AWS Bites

Mar 6, 202600:43:25Technology

If you’re curious about building with LLMs, but you want to skip the hype and learn what it takes to ship something reliable in production, this episode is for you.We share our real-world experience building AI-powered a...

About This Episode

153. LLM Inference with Bedrock is an episode from AWS Bites by AWS Bites. If you’re curious about building with LLMs, but you want to skip the hype and learn what it takes to ship something reliable in production, this episode is for you.W...

Podcast

This episode belongs to AWS Bites.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Mar 6, 2026, 00:43:25 long, audio available.

Questions About This Episode

What is 153. LLM Inference with Bedrock about?

If you’re curious about building with LLMs, but you want to skip the hype and learn what it takes to ship something reliable in production, this episode is for you.We share our real-world experience building AI-powered apps and the gotchas you hit after the demo: tokens and cost, quotas and throttling, IAM and access friction, marketplace subscriptions, and structured outputs that do not break your JSON parser.We focus on Amazon Bedrock as AWS’s managed inference layer: how to get started with the current access model, how to choose models, how pricing works, and what to watch for in production.We also go deep on structured outputs: constrained decoding, schema design that improves output quality, and how to avoid “grammar compilation timed out”. In this episode, we mentioned the following resources: fourTheorem: Bedrock structured outputs guide Amazon Bedrock Bedrock docs Bedrock pricing Structured outputs Cross-region inference Quotas Throttling help Prompt caching Troubleshooting error codes Do you have any AWS questions you would like us to address? Leave a comment here or connect with us on X/Twitter, BlueSky or LinkedIn: - ⁠ | ⁠ | ⁠ - ⁠ | ⁠ | ⁠

Where can I listen to 153. LLM Inference with Bedrock?

You can listen to 153. LLM Inference with Bedrock online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is 153. LLM Inference with Bedrock from?

153. LLM Inference with Bedrock is an episode from AWS Bites by AWS Bites.

How long is this episode?

This episode is 00:43:25 long.

When was this episode published?

This episode was published on Mar 6, 2026.

Can I save 153. LLM Inference with Bedrock for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from AWS Bites?

Yes. This page shows related episodes from AWS Bites when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to 153. LLM Inference with Bedrock?

You can listen to 153. LLM Inference with Bedrock on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

153. LLM Inference with Bedrock is from AWS Bites by AWS Bites.

What are the episode details?

Published Mar 6, 2026 and 00:43:25 long