More MLOps.community episodes

Logs Are All You Need: Rethinking Observability with AI Agents thumbnail

Logs Are All You Need: Rethinking Observability with AI Agents

Published 2 Jun 2026

Duration: 00:46:39

The text explores using genetic Pareto principles for parallel agent optimization and introduces Sazabi, an AI-native observability platform that replaces traditional telemetry with log-based analysis, natural language queries, and AI-driven alerts, emphasizing log-centric simplicity and secure, dynamic agent testing.

Episode Description

Sherwood Callaway is the founder of Sazabi, an AI-native observability platform built for engineering teams that ship fast. Before this, he started th...

Overview

The podcast explores the application of "genetic Pareto" principles in agentic systems, emphasizing parallel testing of multiple agent versions to optimize outcomes. It introduces Sazabi, an AI-native observability platform designed to streamline modern software engineering workflows by moving away from traditional, cumbersome observability tools. Sazabi distinguishes itself through three key features: eliminating direct telemetry access in favor of natural language queries, prioritizing logs over the traditional "three pillars" (metrics, logs, traces) to simplify instrumentation, and leveraging AI to automate log analysis. The platform critiques existing tools like Datadog for failing to adapt to workflows involving agents and AI, positioning itself as a solution to bridge gaps between evolving software practices and outdated observability infrastructure.

The discussion delves into Sazabis technical architecture, which integrates AI agents with sandboxes for safe execution, using Vercel AI SDKs and a SQL-like log query interface. It emphasizes log-based reconstruction of metrics and traces, minimizing complexity by avoiding traditional monitoring setups. Challenges include evaluating agent performance through dynamic, context-aware alerts and managing shared memory via Git-based state tracking. The platform also addresses difficulties in assessing agent behavior, such as ambiguous CLI tool usage and the impracticality of mocking all external services for testing. Development practices focus on local infrastructure and homegrown tools, favoring simplicity and security through read-only systems and Git-managed memory. The system prioritizes logs as the core data source for observability, enabling actionable insights while avoiding overreliance on code generation or external dependencies.

What If

  • What if you used genetic Pareto parallelism to optimize your agent workflows?

    • Move: Spawn multiple agent versions in parallel for the same task (e.g., generating code that passes tests) and evaluate outputs against defined metrics like test coverage or speed.
    • Why Now?: Modern AI agent development favors rapid iteration, and parallel testing reduces time-to-optimization by leveraging computational resources efficiently.
    • Expected Upside: Achieve higher-quality outcomes faster by selecting the best-performing agent version, reducing manual optimization effort.
  • What if you rearchitected your observability system around Sazabis log-centric, chat-first model?

    • Move: Replace traditional dashboards with a natural language interface for querying logs; eliminate the need for metrics/traces by centralizing all telemetry in logs.
    • Why Now?: Teams are moving toward AI-native workflows, and Sazabis approach simplifies observability for non-technical users, aligning with modern DevOps trends.
    • Expected Upside: Reduce instrumentation overhead and improve accessibility, enabling faster issue resolution and democratizing access to observability insights.
  • What if you integrated Git-backed sandboxes for persistent agent state and collaboration?

    • Move: Use Git repositories to store agent memory (e.g., user preferences) and synchronize shared state across sandboxes, ensuring consistency across parallel agent runs.
    • Why Now?: Managing state in distributed environments is a growing challenge, and Git provides a familiar, version-controlled solution for persistence and collaboration.
    • Expected Upside: Streamline agent coordination, avoid data silos, and enable seamless sharing of findings between subagents without relying on external databases.

Takeaway

  • Adopt Natural Language Observability with Sazabi: Replace traditional log dashboards with Sazabis chat-based interface to query production issues using natural language, reducing the need for manual telemetry configuration or technical expertise.
  • Simplify Instrumentation with Log-Centric Tools: Focus solely on logging (vs. metrics/traces) to minimize instrumentation complexity, eliminating dependencies on tools like Prometheus or trace propagation systems.
  • Implement AI-Driven Alerting: Replace static alert thresholds (e.g., CPU > 80%) with dynamic, context-aware alerts generated by AI from real-time logs, paired with actionable remediation suggestions (e.g., Slacks, code fixes).
  • Run Parallel Agent Evaluations in Sandboxes: Use sandboxed environments to execute multiple agent versions in parallel (e.g., 100 variants) for tasks like code generation, selecting the best output based on criteria like test results or accuracy.
  • Leverage Git for Persistent Agent State: Store agent memory, user preferences, or shared data in Git-backed file systems to enable persistent state across sandboxed runs, ensuring consistency in shared workflows or agent collaboration.

Recent Episodes of MLOps.community

5 Jun 2026 The Control-vs-Magic Spectrum Building Agents

iFood Pago leverages AI-driven tools like ChatBank to automate financial services for Brazilian restaurants, balancing automation with personalization while addressing challenges in scaling AI, risk management, and the impact of declining training costs on software accessibility.

29 May 2026 AI Is Fast. AI Projects Are Slow. Let's Fix That.

AI reshapes software engineering by shifting to AI-integrated workflows, demanding balance between efficiency and productivity, maintaining code quality, mastering new tools like RocketRide, ensuring observability, and managing integration complexities across models and pipelines.

28 May 2026 Architecting Modern AI Systems: Platforms, Agents, and Integration

Modern AI architecture, infrastructure challenges, open-source vs. proprietary models, and safety-critical conversational agents for mental health via Bell and Kids Help Phone's hackathon, alongside GPU efficiency, scalable frameworks, and balancing innovation with control in deployment.

28 May 2026 [Special Announcement] MLOps Community Linux Foundation

The MLOps community is forming the Agentic AI Foundation under the Linux Foundation to govern open-source projects like MCP and agents.md, maintain existing community activities, launch an ambassador program, and rebrand the podcast as "Agentic Conversations."

26 May 2026 Inside Just Eat's AI Lab: Voice Agents & Agentic Commerce

Just Eat Takeaway evolves through AI-driven innovation, voice interfaces, and wearables, focusing on agentic commerce agents, super apps, and no-app models while addressing privacy, device continuity, and logistics challenges like autonomous delivery.

More MLOps.community episodes