More Podcasts by InfoQ episodes

From MCP and Vibe Coding to Harness Engineering: How Did AI Native Engineering Evolve in One Year thumbnail

From MCP and Vibe Coding to Harness Engineering: How Did AI Native Engineering Evolve in One Year

Published 8 Jun 2026

Duration: 00:41:23

The evolving AI adoption in software delivery involves architecture, collaboration, and rapid advancements, highlighting shifts in coding tools from autocomplete to agentic modes, context engineering challenges, hybrid tool use, local model limitations, privacy concerns, and the need for formal validation and industry-academia collaboration to enhance agent autonomy and address reliability gaps.

Episode Description

Birgitta Bockeler, Distinguished Engineer at Thoughtworks, returns to discuss the rapid evolution of AI in software delivery. She touches on the evolu...

Overview

The podcast explores the rapid evolution of AI adoption in software delivery, emphasizing how decisions on architecture, collaboration, and long-term design are critical. It highlights the shifting landscape of coding assistants, with tools like Cloud Code and Cursor evolving in capabilities and user preferences, where terminal-based and IDE-based solutions cater to different workflows. The discussion extends to the transition from basic autocomplete to more advanced "agentic" modes, though full agent capabilities remain under development. Practical challenges include balancing functionality with code quality, the use of "throwaway code," and emerging practices like context engineering to integrate design systems, business logic, and coding conventions into agents. Tools like MCPs and CLIs are being replaced by skills-based interfaces for efficiency, while hybrid workflows combining multiple assistants are common.

Context engineering is positioned as a key practice to enhance agent reliability, using pre-defined inputs and post-generation feedback loops (e.g., static analysis, test suites) to refine outputs autonomously. The role of "harness engineering" is emphasized, which structures agent behavior through feedforward guidance and feedback mechanisms to reduce human oversight. However, local models face performance and tool-calling limitations compared to cloud-based alternatives, and privacy concerns drive regional shifts toward localized AI models. The podcast also addresses risks of rapid AI tooling, such as fragmented development practices ("vibe coding") and the need for structured risk assessment frameworks. Future directions include expanding harness systems to include custom static analysis and architecture fitness checks, alongside academic efforts to bridge validation gaps through formal methods and deterministic testing.

The discussion underscores the tension between speed of AI-driven development and maintaining code quality and security, advocating for transparency in AI failures to improve safety. Emerging tools for graph-based code analysis, enhanced code navigation, and privacy-conscious integration are highlighted, alongside debates over the sustainability of AI cost models and geopolitical influences on development. Overall, the focus is on refining workflows that balance automation with human oversight, ensuring adaptability in a rapidly changing AI ecosystem while addressing practical challenges like tool obsolescence, context alignment, and the spectrum of agent autonomy.

What If

  • What if you integrated hybrid coding assistant workflows to maximize context and flexibility?

    • Move: Adopt a hybrid approach, using Cloud Code for terminal-based automation and Cursor for IDE-based interactive debugging and design tool integration.
    • Why Now?: Cloud Codes terminal capabilities align with CI/CD pipelines, while Cursors IDE features (e.g., Figma integration) streamline design-to-code workflows.
    • Expected Upside: Reduced context switching, faster prototyping, and better alignment with your tech stack and design tools.
  • What if you built a context engineering framework to improve agent reliability?

    • Move: Create a structured "context directory" with markdown files outlining coding conventions, architectural constraints, and business rules for your agents.
    • Why Now?: Context engineering is critical for guiding agents, and manual specs help mitigate errors in auto-generated code or agentic workflows.
    • Expected Upside: Agents produce more accurate, maintainable code, reducing rework and enabling safe deployment of AI-generated features.
  • What if you replaced Monolithic Context Providers (MCPs) with skills/CLI-based integrations for system interactions?

    • Move: Refactor your integration workflows to use Figma CLI or custom scripts instead of MCPs for accessing design assets or external APIs.
    • Why Now?: CLI-based tools are more efficient and scalable than MCPs, and skills allow lazy loading of context, reducing overhead.
    • Expected Upside: Faster, more reliable system interactions (e.g., Figma or GitHub) and better alignment with modern tooling ecosystems.

Takeaway

  • Adopt Hybrid Coding Assistant Workflows: Use both terminal-based (e.g., Cloud Code) and IDE-based (e.g., Cursor) tools depending on your task. For headless pipelines, leverage terminal-based tools, while using IDE-based assistants for interactive debugging and design tool integration (e.g., Figma).
  • Implement Context Engineering Practices: Prepare environments with pre-defined coding conventions, architecture constraints, and business context to guide coding agents. Use tools like Replet to align prototypes with production-ready tech stacks, reducing rework later in development.
  • Integrate Static Analysis and Feedback Loops: Automate immediate post-generation feedback using linters, test suites, and type checkers within your workflows. Use mutation testing and architecture fitness functions to validate agent outputs iteratively, ensuring code quality without manual intervention.
  • Prioritize Privacy-Compliant Tooling: Avoid relying on cloud-based models (e.g., GitHub Copilot) for sensitive codebases. Opt for local models or privacy-focused alternatives (e.g., Chinese models) and ensure user controls for data exposure, especially if handling proprietary or regulated data.
  • Build a "Harness" for Agent Autonomy: Design a system framework that combines feed-forward inputs (e.g., documentation, specifications) and automated feedback (e.g., static analysis results, test coverage) to guide agents in refactoring, error correction, and self-validation, reducing reliance on manual oversight.

Recent Episodes of Podcasts by InfoQ

1 Jun 2026 Requirements Analysis for Architects: A Conversation with Sonya Natanzon

Architects must balance technical and business priorities, prioritize user satisfaction and organizational goals, navigate communication challenges, apply domain-driven design principles, address AI's impact on software development, and adapt to evolving technologies while emphasizing creativity and strategic alignment.

18 May 2026 Context is the Key to the Agentic Architecture Revolution: A Conversation with Baruch Sadogursky

AI adoption in architectural decision-making emphasizes trade-offs between efficiency and complexity, challenges of ambiguous requirements, context-driven engineering, frameworks like the Intent Integrity Kit for iterative clarity, architect roles in managing systems and stakeholder dynamics, and the need to balance AI capabilities with human oversight amid ethical and technical limitations.

4 May 2026 Roq: Leveraging Quarkus to Build Static Sites at the Speed of Go

Java's resurgence is fueled by performance gains, modern frameworks like Quarkus, and native compilation, exemplified by Rooka lightweight static site generator leveraging Quarkus for dynamic rendering, Markdown content, and streamlined workflows, with future AI integration and open-source advancements.

More Podcasts by InfoQ episodes