Radio and PodcastRadio and PodcastLive Radio & Podcasts
303 - How LLMs Work - the 20 minute explainer artwork
Technology

303 - How LLMs Work - the 20 minute explainer

Fragmented - The Software Podcast by Kaushik Gopal

Feb 2, 202600:25:45Technology

Ever get asked "how do LLMs work?" at a party and freeze? We walk through the full pipeline: tokenization, embeddings, inference — so you understand it well enough to explain it. Walk away with a mental model that you ca...

About This Episode

303 - How LLMs Work - the 20 minute explainer is an episode from Fragmented - The Software Podcast by Kaushik Gopal. Ever get asked "how do LLMs work?" at a party and freeze? We walk through the full pipeline: tokenization, embeddings, infe...

Podcast

This episode belongs to Fragmented - The Software Podcast.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Feb 2, 2026, 00:25:45 long, audio available.

Questions About This Episode

What is 303 - How LLMs Work - the 20 minute explainer about?

Ever get asked "how do LLMs work?" at a party and freeze? We walk through the full pipeline: tokenization, embeddings, inference — so you understand it well enough to explain it. Walk away with a mental model that you can use for your next dinner party. _ Full shownotes at fragmentedpodcast.com . Show Notes Words -> Tokens: OpenAI Tokenizer visualizer - Visualize how text becomes tokens Tokens -> Embeddings: RGB Color model - wikipedia Word2Vec technique - wikipedia Efficient Estimation of Word Representation - original Word2Vec paper by Mikolov et al. Embeddings -> Inference: Word embedding Temperature, Top-k, Top-p samping Get in touch We'd love to hear from you. Email is the best way to reach us or you can check our contact page for other ways. We want to hear all the feedback: what's working, what's not, topics you'd like to hear more on. We want to make the show better for you so let us know! Contact us Newsletter Youtube Website Co-hosts: Kaushik Gopal Iury Souza [!fyi] We transitioned from Android development to AI starting with Ep. . Listen to that episode for the full story behind our new direction.

Where can I listen to 303 - How LLMs Work - the 20 minute explainer?

You can listen to 303 - How LLMs Work - the 20 minute explainer online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is 303 - How LLMs Work - the 20 minute explainer from?

303 - How LLMs Work - the 20 minute explainer is an episode from Fragmented - The Software Podcast by Kaushik Gopal.

How long is this episode?

This episode is 00:25:45 long.

When was this episode published?

This episode was published on Feb 2, 2026.

Can I save 303 - How LLMs Work - the 20 minute explainer for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Fragmented - The Software Podcast?

Yes. This page shows related episodes from Fragmented - The Software Podcast when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to 303 - How LLMs Work - the 20 minute explainer?

You can listen to 303 - How LLMs Work - the 20 minute explainer on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

303 - How LLMs Work - the 20 minute explainer is from Fragmented - The Software Podcast by Kaushik Gopal.

What are the episode details?

Published Feb 2, 2026 and 00:25:45 long