More MLOps.community episodes

Durable Execution and Modern Distributed Systems thumbnail

Durable Execution and Modern Distributed Systems

Published 17 Mar 2026

Duration: 01:00:36

Temporal enhances developer productivity by enabling crash-proof workflows through deterministic programming models, separating business logic from fault tolerance, and simplifying distributed systems with durable execution, workflows, activities, and persistence layers like Cassandra/Postgres.

Episode Description

Johann Schleier-Smith is the Technical Lead for AI at Temporal Technologies, working on reliable infrastructure for production AI systems and long-run...

Overview

The podcast focuses on enhancing developer productivity through tools and frameworks, particularly agentic systems that interact with the world asynchronously, reliably, and durably. It emphasizes durable execution, a methodology ensuring software tasks complete reliably despite failures like cloud outages or rate limits. Key principles include crash-proof software, separation of reliability mechanisms from business logic, and the use of platforms like Temporal, an open-source solution that decouples durability from application code. Temporal employs a programming model distinguishing between workflows (deterministic, business-logic-heavy code) and activities (IO-heavy tasks), enabling deterministic execution and cross-region failover. This approach abstracts complexity in retries and fault tolerance, allowing developers to focus on core logic while ensuring resilience in distributed systems.

The discussion extends to challenges and comparisons, such as the learning curve of deterministic workflows and the limitations of legacy systems lacking built-in durability. It contrasts durable execution with checkpointing, noting the latters limitations in handling complex, concurrent scenarios. Temporals "continue as new" feature allows long-running agents to maintain state through snapshots, combining checkpointing with durable execution. Use cases span agentic systems (e.g., LLM-driven workflows), transaction management, and serverless architecture, highlighting Temporals role in simplifying cloud workflows by managing state persistence and recovery internally. Key benefits include abstracting infrastructure complexity, supporting both linear workflows and branched, concurrent processes, and enabling scalability through auto-scaling and serverless integration.

The podcast also addresses advanced features like dynamic workflow control (signals, updates, pausing), handling large payloads via external storage, and future developments in streaming and agent interaction. It underscores the importance of separating Temporals state management from external systems (e.g., databases) while ensuring security through encryption and trusted execution environments. Ultimately, the focus is on Temporals role in modernizing cloud workflows, reducing developer overhead, and enabling scalable, reliable systems for agentic and non-agentic applications alike.

Recent Episodes of MLOps.community

12 May 2026 The Latency Goldilocks Zone Explained

iFood's ILO AI agent leverages a Learning Context Model to deliver hyper-personalized food recommendations by integrating diverse AI techniques, navigating cultural nuances, and balancing familiar and novel choices while addressing multi-channel design, latency, scalability, data alignment, and experimental innovation challenges.

8 May 2026 Building MCP Before MCP Existed: Inside Despegar's Sofia Agent

Sophia, an AI-powered travel concierge using a multi-agent system and decentralized collaboration, aims to streamline bookings, in-trip services, and personalized experiences through AI-driven automation, chat/voice interfaces, and orchestration layers, while expanding capabilities and reducing friction in travel processes.

1 May 2026 Voice Agent Use Cases

Designing voice-based AI systems involves balancing user control with automation, addressing speech quality-latency trade-offs, creating intuitive non-technical interfaces, overcoming transcription and turn-taking challenges in real-world environments, integrating hybrid models and domain-specific tuning, while ensuring compliance, user trust, and ethical considerations in applications like customer support and dynamic environments through feedback loops.

24 Apr 2026 The Creator of Superpowers: Why Real Agentic Engineering Beats Vibe Coding

The text discusses using the Greenfield toolset to convert legacy code into structured specifications and the Superpowers framework to enhance AI agents through psychological persuasion techniques, emphasizing task decomposition, subagent roles, challenges in consistency and security, and future trends in agentic problem-solving and ethical AI development.

21 Apr 2026 It's 2026, and We're Still Talking Evals

Evaluations in AI product development must be integrated early, address real-world complexities, use nuanced metrics beyond accuracy, employ user-centric and iterative testing, leverage post-deployment data, and adapt tailored strategies to balance quality, domain-specific metrics, and system reliability.

More MLOps.community episodes