MLOps.community

MLOps.community thumbnail

Machine Learning into Production podcast, interview style

Categories:

Links

Episodes

Showing 1-10 of 27

The Latency Goldilocks Zone Explained thumbnail

The Latency Goldilocks Zone Explained

12 May 2026

iFood's ILO AI agent leverages a Learning Context Model to deliver hyper-personalized food recommendations by integrating diverse AI techniques, navigating cultural nuances, and balancing familiar and novel choices while addressing multi-channel design, latency, scalability, data alignment, and experimental innovation challenges.

Open episode
Building MCP Before MCP Existed: Inside Despegar's Sofia Agent thumbnail

Building MCP Before MCP Existed: Inside Despegar's Sofia Agent

8 May 2026

Sophia, an AI-powered travel concierge using a multi-agent system and decentralized collaboration, aims to streamline bookings, in-trip services, and personalized experiences through AI-driven automation, chat/voice interfaces, and orchestration layers, while expanding capabilities and reducing friction in travel processes.

Open episode
Voice Agent Use Cases thumbnail

Voice Agent Use Cases

1 May 2026

Designing voice-based AI systems involves balancing user control with automation, addressing speech quality-latency trade-offs, creating intuitive non-technical interfaces, overcoming transcription and turn-taking challenges in real-world environments, integrating hybrid models and domain-specific tuning, while ensuring compliance, user trust, and ethical considerations in applications like customer support and dynamic environments through feedback loops.

Open episode
The Creator of Superpowers: Why Real Agentic Engineering Beats Vibe Coding thumbnail

The Creator of Superpowers: Why Real Agentic Engineering Beats Vibe Coding

24 Apr 2026

The text discusses using the Greenfield toolset to convert legacy code into structured specifications and the Superpowers framework to enhance AI agents through psychological persuasion techniques, emphasizing task decomposition, subagent roles, challenges in consistency and security, and future trends in agentic problem-solving and ethical AI development.

Open episode
It's 2026, and We're Still Talking Evals thumbnail

It's 2026, and We're Still Talking Evals

21 Apr 2026

Evaluations in AI product development must be integrated early, address real-world complexities, use nuanced metrics beyond accuracy, employ user-centric and iterative testing, leverage post-deployment data, and adapt tailored strategies to balance quality, domain-specific metrics, and system reliability.

Open episode
Why Agents are Driving Software Development to the Cloud thumbnail

Why Agents are Driving Software Development to the Cloud

17 Apr 2026

The text promotes transitioning from isolated AI agents to cloud-native platforms that treat agents as autonomous team members with defined roles, emphasizing structured governance, transparency, and natural language interaction to streamline collaboration and workflows like code review and data analysis.

Open episode
The Modern Software Engineer thumbnail

The Modern Software Engineer

14 Apr 2026

Recommended: A throughtful overview on the impact of AI covering the impact on learning and skill aquisition.

AI transforms learning and workflows through tools like Claude, accelerating skill acquisition and bridging knowledge gaps, while raising concerns about job obsolescence, ethical dilemmas, and the need for human oversight, standardized practices, and collaborative approaches in an era of rapid tech advancement.

Open episode
How We Cut LLM Latency 70% With TensorRT in Production thumbnail

How We Cut LLM Latency 70% With TensorRT in Production

10 Apr 2026

Optimizing AI systems via TensorRT LLM, efficient GPU use, cold start management with AWS FSX, and model quantization, while addressing challenges in in-house development, scaling strategies, hidden scaling complexities ("AI iceberg"), and balancing technical efficiency with organizational alignment through frameworks like Flywheel and responsible AI practices.

Open episode
Getting Humans Out of the Way: How to Work with Teams of Agents thumbnail

Getting Humans Out of the Way: How to Work with Teams of Agents

7 Apr 2026

Recommended: An optimistic view of using Agentic AI with safeguards.

AI agents streamline software development through tools like pixel diff analysis, automated reporting, and annotated walkthroughs, addressing challenges in accuracy, code quality, and workflow adaptation while redefining human roles as validation overseers and collaborators in autonomous systems.

Open episode
Fixing GPU Starvation in Large-Scale Distributed Training thumbnail

Fixing GPU Starvation in Large-Scale Distributed Training

3 Apr 2026

Optimizing ML workflows requires addressing data bottlenecks through caching, efficient structuring, and hardware-aware strategies to reduce remote data calls, minimize GPU-CPU overhead, and prioritize infrastructure over model tuning, while managing trade-offs between training efficiency and serving latency.

Open episode

Showing 1-10 of 27