More Engineering Culture by InfoQ episodes

Tiger Teams, Evals and Agents: The New AI Engineering Playbook thumbnail

Tiger Teams, Evals and Agents: The New AI Engineering Playbook

Published 10 Apr 2026

Duration: 00:22:35

The text examines open-source software's role in innovation and collaboration challenges, then transitions to AI engineering's emergence as a field requiring new workflows, tools, and interdisciplinary approaches to address automation, data integration, and cultural shifts in rapidly evolving tech landscapes.

Episode Description

This is the Engineering Culture Podcast, from the people behind InfoQ.com and the QCon conferences. In this podcast Shane Hastie, Lead Editor for Cult...

Overview

The podcast discusses the critical importance of mobile application security, emphasizing that inadequate measures can leave vulnerabilities, and highlights GuardSquare as a provider of advanced security solutions for Android and iOS apps. It then shifts focus to open-source development and its challenges, as explored in an interview with Sam Bhagwat. Bhagwat reflects on open-source philosophy, noting its role in fostering global collaboration and innovation but also addressing tensions between flexibility, community expectations, and commercial interests. He underscores the need for founders to adapt their initial visions based on community feedback and the complexities of balancing open-source purism with profitability. The conversation also delves into the growing field of AI engineering, where traditional development practices must evolve to accommodate new paradigms, requiring adaptability, cross-disciplinary teamwork, and a focus on statistical and probabilistic reasoning.

The discussion extends to the integration of AI in both open-source projects and engineering workflows. Generative AI is presented as a tool to automate tasks like bug triaging, code generation, and PR evaluation, though maintaining quality requires rigorous testing and custom evaluations tailored to organizational needs. AI agents are explored as autonomous systems capable of handling complex tasks through structured workflows and memory management, with challenges in ensuring accuracy and contextual relevance, especially when dealing with proprietary data. The podcast also touches on the practical development of AI agents, emphasizing iterative refinement, data curation, and staged deployment to manage risks. Finally, it addresses broader cultural and structural shifts in engineering teams, including the need for cross-functional collaboration, embracing discomfort in learning new fields, and the importance of fostering enthusiasm for innovation in AI engineering.

Recent Episodes of Engineering Culture by InfoQ

More Engineering Culture by InfoQ episodes