From MCP and Vibe Coding to Harness Engineering: How Did AI Native Engineering Evolve in One Year

Published 8 Jun 2026

Show Notes: soundcloud.com/infoq-channel/from-mcp-and-vibe-coding-to

Duration: 00:41:23

The evolving AI adoption in software delivery involves architecture, collaboration, and rapid advancements, highlighting shifts in coding tools from autocomplete to agentic modes, context engineering challenges, hybrid tool use, local model limitations, privacy concerns, and the need for formal validation and industry-academia collaboration to enhance agent autonomy and address reliability gaps.

Episode Description

Birgitta Bockeler, Distinguished Engineer at Thoughtworks, returns to discuss the rapid evolution of AI in software delivery. She touches on the evolu...

Overview

The podcast explores the rapid evolution of AI adoption in software delivery, emphasizing how decisions on architecture, collaboration, and long-term design are critical. It highlights the shifting landscape of coding assistants, with tools like Cloud Code and Cursor evolving in capabilities and user preferences, where terminal-based and IDE-based solutions cater to different workflows. The discussion extends to the transition from basic autocomplete to more advanced "agentic" modes, though full agent capabilities remain under development. Practical challenges include balancing functionality with code quality, the use of "throwaway code," and emerging practices like context engineering to integrate design systems, business logic, and coding conventions into agents. Tools like MCPs and CLIs are being replaced by skills-based interfaces for efficiency, while hybrid workflows combining multiple assistants are common.

Context engineering is positioned as a key practice to enhance agent reliability, using pre-defined inputs and post-generation feedback loops (e.g., static analysis, test suites) to refine outputs autonomously. The role of "harness engineering" is emphasized, which structures agent behavior through feedforward guidance and feedback mechanisms to reduce human oversight. However, local models face performance and tool-calling limitations compared to cloud-based alternatives, and privacy concerns drive regional shifts toward localized AI models. The podcast also addresses risks of rapid AI tooling, such as fragmented development practices ("vibe coding") and the need for structured risk assessment frameworks. Future directions include expanding harness systems to include custom static analysis and architecture fitness checks, alongside academic efforts to bridge validation gaps through formal methods and deterministic testing.

The discussion underscores the tension between speed of AI-driven development and maintaining code quality and security, advocating for transparency in AI failures to improve safety. Emerging tools for graph-based code analysis, enhanced code navigation, and privacy-conscious integration are highlighted, alongside debates over the sustainability of AI cost models and geopolitical influences on development. Overall, the focus is on refining workflows that balance automation with human oversight, ensuring adaptability in a rapidly changing AI ecosystem while addressing practical challenges like tool obsolescence, context alignment, and the spectrum of agent autonomy.

What If

What if you integrated hybrid coding assistant workflows to maximize context and flexibility?
- Move: Adopt a hybrid approach, using Cloud Code for terminal-based automation and Cursor for IDE-based interactive debugging and design tool integration.
- Why Now?: Cloud Codes terminal capabilities align with CI/CD pipelines, while Cursors IDE features (e.g., Figma integration) streamline design-to-code workflows.
- Expected Upside: Reduced context switching, faster prototyping, and better alignment with your tech stack and design tools.
What if you built a context engineering framework to improve agent reliability?
- Move: Create a structured "context directory" with markdown files outlining coding conventions, architectural constraints, and business rules for your agents.
- Why Now?: Context engineering is critical for guiding agents, and manual specs help mitigate errors in auto-generated code or agentic workflows.
- Expected Upside: Agents produce more accurate, maintainable code, reducing rework and enabling safe deployment of AI-generated features.
What if you replaced Monolithic Context Providers (MCPs) with skills/CLI-based integrations for system interactions?
- Move: Refactor your integration workflows to use Figma CLI or custom scripts instead of MCPs for accessing design assets or external APIs.
- Why Now?: CLI-based tools are more efficient and scalable than MCPs, and skills allow lazy loading of context, reducing overhead.
- Expected Upside: Faster, more reliable system interactions (e.g., Figma or GitHub) and better alignment with modern tooling ecosystems.

Takeaway

Adopt Hybrid Coding Assistant Workflows: Use both terminal-based (e.g., Cloud Code) and IDE-based (e.g., Cursor) tools depending on your task. For headless pipelines, leverage terminal-based tools, while using IDE-based assistants for interactive debugging and design tool integration (e.g., Figma).
Implement Context Engineering Practices: Prepare environments with pre-defined coding conventions, architecture constraints, and business context to guide coding agents. Use tools like Replet to align prototypes with production-ready tech stacks, reducing rework later in development.
Integrate Static Analysis and Feedback Loops: Automate immediate post-generation feedback using linters, test suites, and type checkers within your workflows. Use mutation testing and architecture fitness functions to validate agent outputs iteratively, ensuring code quality without manual intervention.
Prioritize Privacy-Compliant Tooling: Avoid relying on cloud-based models (e.g., GitHub Copilot) for sensitive codebases. Opt for local models or privacy-focused alternatives (e.g., Chinese models) and ensure user controls for data exposure, especially if handling proprietary or regulated data.
Build a "Harness" for Agent Autonomy: Design a system framework that combines feed-forward inputs (e.g., documentation, specifications) and automated feedback (e.g., static analysis results, test coverage) to guide agents in refactoring, error correction, and self-validation, reducing reliance on manual oversight.

Recent Episodes of Podcasts by InfoQ

20 Jul 2026 Strands Agents with Clare Liguori

"Explores AI adoption challenges, Strands Agents SDK's model-driven approach, responsible AI practices, and future scaling of advanced, multi-step agent systems."

13 Jul 2026 Governance in the Age of AI: A Conversation with Sarah Wells

"AI and architecture decisions shape long-term systems, requiring governance to balance standardization and flexibility, while platform engineering and DevOps aim to reduce friction; challenges include fragmentation, compliance, and risks like security and cost, with AI's role evolving alongside human oversight, mentorship, and adaptability in software development."

6 Jul 2026 Spite-Driven Engineering: A New Blueprint for Cloud Security in the AI Native Era

Examines challenges in AI adoption, system architecture, and kernel security, critiquing Linux and other systems' shortcomings in isolation and scalability, while addressing trade-offs in development speed vs. understanding, hardware limitations, and proposals for improved design and collaboration.

29 Jun 2026 Architectural Patterns: Moving Beyond Cloud-Native to Local-First - Insights from Adam Wiggins

Challenges in AI adoption highlight trade-offs between cloud-centric collaboration and local-first computing, with debates over data ownership, CRDTs for decentralized sync, hybrid workflows, and integrating AI with user-centric, offline-capable tools.

15 Jun 2026 Increasing Users Data Agency: From BlueSky's AT Protocol to the Local-First Software Movement

Discusses challenges in AI integration, the shift to modular cloud-native systems using Apache Parquet, decentralized infrastructures like Blue Skys AT Protocol, the Local First movement prioritizing local data storage, AutoMerge for collaborative non-text files, retrofitting hurdles, and open standards to combat vendor lock-in.

More Podcasts by InfoQ episodes