More MLOps.community episodes

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs thumbnail

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

Published 24 Feb 2026

Duration: 01:25:49

The podcast dives into AI development, software engineering, and GPU innovation, focusing on efficient workloads and the trade-offs between quick solutions and scalable systems.

Episode Description

March 3rd, Computer History Museum CODING AGENTS CONFERENCE, come join us while there are still tickets left.https://luma.com/codingagentsChris Fregly...

Overview

The podcast covers multiple topics related to product innovation, software development, and engineering practices, focusing on trends and challenges in AI and machine learning. It examines the use of SageMaker HyperPods, which employ pre-warmed GPUs to improve efficiency for AI workloads, and discusses the rise of "throwaway" applications designed for specific, short-term needs without long-term maintenance. The conversation also addresses the trade-off between quick, functional solutions and scalable, robust systems, emphasizing the role of software engineers in developing reliable production-grade applications. Additionally, the episode explores the use of AI tools in code generation and debugging, including a feature called the "playground skill" for visualizing code flow.

The discussion extends to issues with GPU hardware and limitations in AI infrastructure, highlighting the need for better documentation and transparency. It also touches on the growing interest in optimizing AI models for specific applications and newer hardware such as NVIDIA's Blackwell. The author reflects on writing a book focused on co-design principles that integrate hardware, software, and algorithms, and underscores the importance of open-source tools and community collaboration in advancing AI development and deployment.

Recent Episodes of MLOps.community

19 Jun 2026 Sandboxing, Agent Harnesses, and Agent Teamwork

The text examines "Harness" componentsprompts, tools, and feedback systemsthat balance AI agent autonomy with control through adaptive strategies, human oversight, and iterative testing to improve reliability and alignment with human judgment in dynamic tasks.

16 Jun 2026 MCP Servers Are Becoming the UI for AI Agents

Gateways as proxies for AI via MCP address security, traffic control, and cost management while tackling server development challenges, optimization of tool calls, microservices scaling, protocol tracing limitations, ownership shifts, and the need for unbiased evaluations and agent-driven usability assessments.

12 Jun 2026 MCP, Agents & the $40M Bet on Multiplayer AI

Recommended: Multiplayer Bots as a Action Paradigm

The integration of AI into work practices shifts toward collaborative "multiplayer" systems using flocking-inspired dynamics, addressing challenges like limited AI time horizons, technical tools for shared collaboration, balancing human-AI roles, infrastructure scaling, and the need for adaptive governance and futureproofing.

9 Jun 2026 From Single-Player to Multi-Player: Operating AI Agents at Scale

AI agent infrastructure and governance require control planes for security, compliance, and risk mitigation, addressing operational challenges, productivity gains, and the need for standardized frameworks, modular designs, and transparent collaboration.

5 Jun 2026 The Control-vs-Magic Spectrum Building Agents

iFood Pago leverages AI-driven tools like ChatBank to automate financial services for Brazilian restaurants, balancing automation with personalization while addressing challenges in scaling AI, risk management, and the impact of declining training costs on software accessibility.

More MLOps.community episodes