More Latent Space episodes

Retrieval After RAG: Hybrid Search, Agents, and Database Design  Simon Hrup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design Simon Hrup Eskildsen of Turbopuffer

Published 12 Mar 2026

Duration: 3632

TurboPuffer is a next-gen database platform focusing on AI-powered vector search, full-text search, and scalability, with a strong emphasis on hiring top talent and customer-driven innovation.

Episode Description

Turbopuffer came out of a reading app.In 2022, Simon was helping his friends at Readwise scale their infra for a highly requested feature: article rec...

Overview

The podcast discusses the challenges and goals of achieving Product-Market Fit (PMF) for a tech product, emphasizing the need for transparency and heavy investment in hiring to ensure success. It highlights the role of Turbo Farfar, a search engine designed for unstructured data, which aims to bridge AI models with vast external data sources. The company distinguishes itself by focusing on scalability and integration with AI workloads, contrasting with competitors like Elasticsearch. Technical discussions center on modern storage solutions like NVMe SSDs and obiX storage, which enable reliability and simplified architecture, while legacy systems struggle with performance. The podcast explores the parallels between Turbo Farfar's mission and the success factors of past database companies, such as addressing new workloads and storage innovations. Key conditions for success include solving critical workloads, achieving storage breakthroughs, and supporting evolving query demands.

The narrative also delves into the technical challenges of building a scalable database, including cost constraints, infrastructure scaling, and the trade-offs between storage latency and retrieval efficiency. Collaborations with companies like Notion and Cursor are highlighted, with Turbo Farfars role in reducing costs and improving performance metrics for these clients. The teams approach emphasizes leveraging cloud storage (S3) and avoiding traditional consensus layers, prioritizing compute-storage separation to reduce costs. Future roadmap plans include expanding into full-text search, enhancing vector search capabilities, and exploring hybrid workloads. The discussion underscores the tension between innovation and budget limitations, with deferred features due to economic constraints and the need for cost-effective storage solutions.

Additionally, the podcast touches on the companys operational philosophy, such as hiring high-caliber engineers, maintaining a "talent-dense" team, and the importance of transparent communication with stakeholders. While the technical and strategic aspects dominate, the conversation also references early-stage challenges, including infrastructure testing, pricing strategies, and the impact of AI-driven search demands on database design. The overarching goal remains achieving PMF by the end of the year, with a commitment to return investor funds if unsuccessful, emphasizing a bold yet transparent approach to scaling the product.

Recent Episodes of Latent Space

20 Mar 2026 Dreamer: the Personal Agent OS David Singleton

Dreamer is an AI platform democratizing access to agentic tools for non-technical users via customizable AI assistants, community-built apps, cross-device integration, and privacy-focused features, with a beta emphasis on accessibility, real-world productivity use cases, and third-party developer opportunities.

More Latent Space episodes