Retrieval After RAG: Hybrid Search, Agents, and Database Design Simon Hrup Eskildsen of Turbopuffer

Published 12 Mar 2026

Duration: 3632

TurboPuffer is a next-gen database platform focusing on AI-powered vector search, full-text search, and scalability, with a strong emphasis on hiring top talent and customer-driven innovation.

Episode Description

Turbopuffer came out of a reading app.In 2022, Simon was helping his friends at Readwise scale their infra for a highly requested feature: article rec...

Overview

The podcast discusses the challenges and goals of achieving Product-Market Fit (PMF) for a tech product, emphasizing the need for transparency and heavy investment in hiring to ensure success. It highlights the role of Turbo Farfar, a search engine designed for unstructured data, which aims to bridge AI models with vast external data sources. The company distinguishes itself by focusing on scalability and integration with AI workloads, contrasting with competitors like Elasticsearch. Technical discussions center on modern storage solutions like NVMe SSDs and obiX storage, which enable reliability and simplified architecture, while legacy systems struggle with performance. The podcast explores the parallels between Turbo Farfar's mission and the success factors of past database companies, such as addressing new workloads and storage innovations. Key conditions for success include solving critical workloads, achieving storage breakthroughs, and supporting evolving query demands.

The narrative also delves into the technical challenges of building a scalable database, including cost constraints, infrastructure scaling, and the trade-offs between storage latency and retrieval efficiency. Collaborations with companies like Notion and Cursor are highlighted, with Turbo Farfars role in reducing costs and improving performance metrics for these clients. The teams approach emphasizes leveraging cloud storage (S3) and avoiding traditional consensus layers, prioritizing compute-storage separation to reduce costs. Future roadmap plans include expanding into full-text search, enhancing vector search capabilities, and exploring hybrid workloads. The discussion underscores the tension between innovation and budget limitations, with deferred features due to economic constraints and the need for cost-effective storage solutions.

Additionally, the podcast touches on the companys operational philosophy, such as hiring high-caliber engineers, maintaining a "talent-dense" team, and the importance of transparent communication with stakeholders. While the technical and strategic aspects dominate, the conversation also references early-stage challenges, including infrastructure testing, pricing strategies, and the impact of AI-driven search demands on database design. The overarching goal remains achieving PMF by the end of the year, with a commitment to return investor funds if unsuccessful, emphasizing a bold yet transparent approach to scaling the product.

Recent Episodes of Latent Space

5 May 2026 Doing Vibe Physics Alex Lupsasca, OpenAI

AI is advancing theoretical physics by rapidly solving complex problems like quantum field theory calculations and simulating models such as SYK, though it still relies on human collaboration for original insights and contextual validation, reshaping research methodologies and education.

27 Apr 2026 Physical AI that Moves the World Qasar Younis & Peter Ludwig, Applied Intuition

Applied Intuition develops safety-critical physical AI for automotive, construction, mining, and defense sectors, selling AI technology to manufacturers and governments through simulation, infrastructure, and proprietary systems to advance industrial innovation with reliable autonomy.

23 Apr 2026 AIE Europe Debrief + Agent Labs Thesis: Unsupervised Learning x Latent Space Crossover Special (2026)

The text discusses AI's evolving landscape, focusing on experimental agents potentially breaking containment by 2026, market disruptions from foundation models, infrastructure advancements like RAG, debates between infrastructure and application firms, outsourcing strategies, pre-2023 training data advantages, competitive coding AI sectors, and future trends in personalization and industry transformation amid scalability and quality challenges.

22 Apr 2026 Shopifys AI Phase Transition: 2026 Usage Explosion, Unlimited Opus-4.6 Token Budget, Tangle, Tangent, SimGym with Mikhail Parakhin, Shopify CTO

Shopify's AI strategies involve in-house tools like Tangled and QMD to automate workflows, collaborate with the AI community, address challenges in token usage and code quality, and explore applications in e-commerce, CI/CD optimization, and scalable AI experimentation.

15 Apr 2026 Notions Token Town: 5 Rebuilds, 100+ Tools, MCP vs CLIs and the Software Factory Future Simon Last & Sarah Sachs of Notion

CLIs and MCPs are emphasized for enterprise efficiency, alongside challenges in early AI integration, custom agent development for automation, strategic AGI management, and balancing automation with oversight, pricing, and collaboration tools like Notion.

More Latent Space episodes