More Dev Interrupted episodes

Scaffolding is coping not scaling, and other lessons from Codex | OpenAIs Thibault Sottiaux thumbnail

Scaffolding is coping not scaling, and other lessons from Codex | OpenAIs Thibault Sottiaux

Published 27 Jan 2026

Duration: 2424

Software development experts discuss the importance of general-purpose AI agents, scalable solutions, and simplicity in design, with a focus on expanding AI capabilities beyond coding to tackle broader software engineering challenges.

Episode Description

If you rely on complex scaffolding to build AI agents you aren't scaling you are coping. Thibault Sottiaux from OpenAIs Codex team joins us to explain...

Overview

The podcast explores the transformation of OpenAI's Codex into a general-purpose AI agent designed to adapt across various software development scenarios. It emphasizes the value of building scalable, foundational systems over focusing on specific products, reflecting the "bitter lesson" in AI that scalable solutions tend to prevail over domain-specific optimizations. The discussion advocates for simplicity in design to avoid unnecessary complexity and outlines a vision for the agent to evolve beyond coding into broader software engineering tasks.

Looking ahead, the team aims to develop multi-agent networks that could significantly enhance productivity. Open-sourcing the agents repository is seen as a way to encourage innovation and collaboration within the community. Insights from earlier open-source efforts and language transitions, such as moving from TypeScript to Rust, underscore the importance of adaptability and user feedback. Future challenges include improving the agent's efficiency in long-running tasks and tailoring its behavior to better match individual user preferences and workflows.

Recent Episodes of Dev Interrupted

24 Mar 2026 Why AI-assisted PRs merge at half the rate of human code | LinearBs 2026 Benchmarks

The 2026 Engineering Benchmark Report reveals that while 88.3% of developers use AI regularly, AI-generated pull requests face low merge rates (32.7%), larger sizes, and prolonged reviews due to systemic issues like poor data quality, inadequate policies, and organizational gaps, emphasizing the need for governance, smaller focused PRs, and foundational practices to optimize AI's potential in engineering workflows.

More Dev Interrupted episodes