Radio and PodcastRadio and PodcastLive Radio & Podcasts
636: Red Hat's James Huang artwork
Technology

636: Red Hat's James Huang

Coder Radio by The Mad Botter

Dec 19, 202520:53Technology

Links James on LinkedIn Mike on LinkedIn Mike's Blog Show on Discord Alice Promo AI on Red Hat Enterprise Linux (RHEL) Trust and Stability: RHEL provides the mission-critical foundation needed for workloads where securit...

About This Episode

636: Red Hat's James Huang is an episode from Coder Radio by The Mad Botter. Links James on LinkedIn Mike on LinkedIn Mike's Blog Show on Discord Alice Promo AI on Red Hat Enterprise Linux (RHEL) Trust and Stability: RHEL provides the missi...

Podcast

This episode belongs to Coder Radio.

Listen Online

Use the player on this page to stream the episode online.

Episode Details

Published Dec 19, 2025, 20:53 long, audio available.

Questions About This Episode

What is 636: Red Hat's James Huang about?

Links James on LinkedIn Mike on LinkedIn Mike's Blog Show on Discord Alice Promo AI on Red Hat Enterprise Linux (RHEL) Trust and Stability: RHEL provides the mission-critical foundation needed for workloads where security and reliability cannot be compromised. Predictive vs. Generative: Acknowledging the hype of GenAI while maintaining support for traditional machine learning algorithms. Determinism: The challenge of bringing consistency and security to emerging AI technologies in production environments. Rama-Llama & Containerization Developer Simplicity: Rama-Llama helps developers run local LLMs easily without being "locked in" to specific engines; it supports Podman, Docker, and various inference engines like Llama.cpp and Whisper.cpp. Production Path: The tool is designed to "fade away" after helping package the model and stack into a container that can be deployed directly to Kubernetes. Behind the Firewall: Addressing the needs of industries (like aircraft maintenance) that require AI to stay strictly on-premises. Enterprise AI Infrastructure Red Hat AI: A commercial product offering tools for model customization, including pre-training, fine-tuning, and RAG (Retrieval-Augmented Generation). Inference Engines: James highlights the difference between Llama.cpp (for smaller/edge hardware) and vLLM, which has become the enterprise standard for multi-GPU data center inferencing.

Where can I listen to 636: Red Hat's James Huang?

You can listen to 636: Red Hat's James Huang online on Radio and Podcast. Open the player on this page to stream the available audio.

Which podcast is 636: Red Hat's James Huang from?

636: Red Hat's James Huang is an episode from Coder Radio by The Mad Botter.

How long is this episode?

This episode is 20:53 long.

When was this episode published?

This episode was published on Dec 19, 2025.

Can I save 636: Red Hat's James Huang for later?

Yes. Use the heart button on the episode page to add it to your favorite episodes list.

Are there related episodes from Coder Radio?

Yes. This page shows related episodes from Coder Radio when more episodes are available from the podcast feed.

Quick Answers About This Episode

Where can I listen to 636: Red Hat's James Huang?

You can listen to 636: Red Hat's James Huang on this page when the episode audio is available from the podcast feed.

Which podcast is this episode from?

636: Red Hat's James Huang is from Coder Radio by The Mad Botter.

What are the episode details?

Published Dec 19, 2025 and 20:53 long