More The BugBash Podcast episodes

From Scale to Rigor: An Engineering Journey at Meta and Oxide thumbnail

From Scale to Rigor: An Engineering Journey at Meta and Oxide

Published 11 Mar 2026

Duration: 01:01:30

Software correctness and reliability are discussed in the context of transitioning data-driven environments to hardware systems, with a focus on property-based testing and formal methods.

Episode Description

Today were talking with Rain from Oxide Computing, tracing their journey from the massive, data-driven scale of Meta to the high-stakes, air-gapped wo...

Overview

The podcast discusses challenges and best practices in software development, emphasizing software correctness, reliability, and the transition from large-scale data-driven environments (e.g., Meta) to high-stakes, air-gapped hardware systems (e.g., Oxide). Key technical topics include the importance of tooling and testing, with a focus on property-based testing and its role in uncovering bugs early in development. Examples include its application in the Move virtual machine (part of Facebooks Libra blockchain project), where oracles were used to validate system behavior and ensure consistency between serial and parallel execution models.

The discussion highlights technical writing as a critical practice for aligning teams and ensuring clarity, particularly in complex systems. It also addresses the cultural differences between large organizations (like Meta) and smaller, specialized companies (like Oxide), where tooling and testing efforts are more visible and impactful.

Hardware and infrastructure management at Oxide is explored, including the replacement of legacy components like Baseboard Management Controllers (BMCs) with modern service processors that use APIs for system management, simplifying maintenance and avoiding manual scripting. Testing strategies for these systems involve automated simulations (e.g., using the Hubris OS in Rust) and manual lab validation.

The role of LLMs (large language models) in software development is also examined, with a focus on their potential for rapid prototyping, code generation, and testing, though the need for rigorous human oversight and validation is stressed. Balancing speed and quality in development is emphasized, with a focus on achieving higher levels of correctness and rigor through collaboration between AI tools and human expertise.

Finally, the discussion touches on customer-driven development, where feedback informs product roadmaps and prioritization of improvements, and the importance of collaborative problem-solving with technical customers to address system gaps.

Recent Episodes of The BugBash Podcast

18 Mar 2026 Semmathesy and the Agentic Era: Learning Systems in 2026

AI reshapes software development through Semathesis systems, enhancing productivity and collaboration while navigating challenges like legacy code limitations, the shift from deterministic coding to adaptive AI agents, and balancing automation with human oversight in complex, dynamic environments.

More The BugBash Podcast episodes