More MLOps.community episodes

Conversation with the MLflow Maintainers

Published 16 Jan 2026

Duration: 00:58:23

MLflow is adapting to support both traditional machine learning and emerging agent-based AI systems, tackling challenges in chatbot development such as observability and effective memory management.

Episode Description

Corey Zumar is a Product Manager at Databricks, working on MLflow and LLM evaluation, tracing, and lifecycle tooling for generative AI. Jules Damji is...

Overview

The podcast explores how MLflow is expanding its capabilities to support not only traditional machine learning but also agent-based AI systems, which are becoming increasingly common in production environments. Chatbots, in particular, are evolving from simple text-based interfaces to more complex, multimodal systems capable of tool calling and image processing. To keep pace with these advancements, MLflow is being adapted to accommodate new agent paradigms while staying true to its open-source ethos. Challenges in this space include improving observability, developing multi-turn evaluation frameworks, and managing memory to ensure coherent, context-aware interactions. Techniques like multi-turn judges from systems such as DeepEval and Ragas are being used to assess agent performance across entire conversations.

The discussion also touches on key areas such as prompt engineering, feedback loops, and governance in AI systems, emphasizing the need for secure data handling and access control. There is a growing need for tools that can support the evaluation, feedback collection, and development of both models and agents within a single platform. As the boundaries between data science and agent development blur, MLflow and similar tools are playing a crucial role in unifying these workflows and addressing the complex challenges of building and maintaining advanced AI systems.

Recent Episodes of MLOps.community

31 Mar 2026 This One Shift Makes Developers Obsolete

Processing live stream data involves transcription, AI-driven skill categorization, GitHub organization, multimedia-comment correlation, and knowledge graphs, while addressing redundancy and AI costs. Other topics include MLOps trends, AI agent debates, adversarial workflows, security risks, and tooling like Open Claw and Agent Zero.

30 Mar 2026 Operationalizing AI Agents: From Experimentation to Production // Databricks Roundtable

Deploying AI agents in real-world systems demands robust safety protocols, human oversight, and structured testing to address risks like errors and vulnerabilities, while balancing innovation with responsibility through observability, governance, domain expertise, and tools like MLflow, across use cases from workflow automation to critical system reliability.

27 Mar 2026 arrowspace: Vector Spaces and Graph Wiring

Epiplexity introduces a framework redefining entropy and complexity with structural information, while topological search and graph-based methods enhance semantic accuracy in machine learning by preserving data through high-dimensional embeddings and hybrid geometric-topological analysis, outperforming traditional approaches in retrieval and reasoning tasks.

20 Mar 2026 Agentic Marketplace

AI-driven agent systems in OLX's classifieds marketplace aim to innovate user experiences by overcoming UI constraints through dynamic intent extraction, hybrid chat/UI models, and trust-building in real estate and motors, with future focus on logistics automation, secure transactions, and human-agent integration.

17 Mar 2026 Durable Execution and Modern Distributed Systems

Temporal enhances developer productivity by enabling crash-proof workflows through deterministic programming models, separating business logic from fault tolerance, and simplifying distributed systems with durable execution, workflows, activities, and persistence layers like Cassandra/Postgres.
