The podcast discusses a project exploring agentic engineering through the development of a cross-platform (web, iOS, Android) team management app using LLMs like Claude. The goal was to test new technologies with minimal manual code review, prioritizing prompting strategies to build a minimum viable product (MVP) quickly. The team shifted from solo development to a two-person collaborative workflow, using Expo as the framework for cross-platform development. Expo simplified the process by abstracting React Native complexities and offering built-in features like serverless deployment and cloud-based builds. The project involved integrating the LLM (Claude) to guide architecture, using tools like Terraform for infrastructure and Scaleway for serverless SQL databases. Challenges included UI/UX design issues, such as misaligned navigation elements and input validation, which required manual debugging and iterative prompting. The LLM excelled in backend tasks like API testing, security, and infrastructure setup but struggled with UI-level validation and layout adjustments, highlighting the need for automated tools like Playwright and specialized human expertise.
Key challenges revolved around balancing the LLMs strengths in rapid code generation with its limitations in testing, debugging, and UI refinement. The team relied on manual error correction, copying terminal logs into prompts to iteratively fix issues, while emphasizing the need for structured workflows and phased development to manage complexity. The MVP was functional but lacked polish, underscoring the importance of rigorous testing and user experience design. The project also highlighted the limitations of current LLMs in handling repetitive tasks and maintaining consistency across platforms, with suggestions for role-specific agent systems or improved self-audit capabilities to enhance reliability. Overall, the experiment demonstrated the potential of agentic engineering to streamline development but underscored the necessity of human oversight, clear specifications, and integration of automation tools to address gaps in AI-driven workflows.