More Latent Space episodes

Retrieval After RAG: Hybrid Search, Agents, and Database Design  Simon Hrup Eskildsen of Turbopuffer thumbnail

Retrieval After RAG: Hybrid Search, Agents, and Database Design Simon Hrup Eskildsen of Turbopuffer

Published 12 Mar 2026

Duration: 3632

TurboPuffer is a next-gen database platform focusing on AI-powered vector search, full-text search, and scalability, with a strong emphasis on hiring top talent and customer-driven innovation.

Episode Description

Turbopuffer came out of a reading app.In 2022, Simon was helping his friends at Readwise scale their infra for a highly requested feature: article rec...

Overview

The podcast discusses the challenges and goals of achieving Product-Market Fit (PMF) for a tech product, emphasizing the need for transparency and heavy investment in hiring to ensure success. It highlights the role of Turbo Farfar, a search engine designed for unstructured data, which aims to bridge AI models with vast external data sources. The company distinguishes itself by focusing on scalability and integration with AI workloads, contrasting with competitors like Elasticsearch. Technical discussions center on modern storage solutions like NVMe SSDs and obiX storage, which enable reliability and simplified architecture, while legacy systems struggle with performance. The podcast explores the parallels between Turbo Farfar's mission and the success factors of past database companies, such as addressing new workloads and storage innovations. Key conditions for success include solving critical workloads, achieving storage breakthroughs, and supporting evolving query demands.

The narrative also delves into the technical challenges of building a scalable database, including cost constraints, infrastructure scaling, and the trade-offs between storage latency and retrieval efficiency. Collaborations with companies like Notion and Cursor are highlighted, with Turbo Farfars role in reducing costs and improving performance metrics for these clients. The teams approach emphasizes leveraging cloud storage (S3) and avoiding traditional consensus layers, prioritizing compute-storage separation to reduce costs. Future roadmap plans include expanding into full-text search, enhancing vector search capabilities, and exploring hybrid workloads. The discussion underscores the tension between innovation and budget limitations, with deferred features due to economic constraints and the need for cost-effective storage solutions.

Additionally, the podcast touches on the companys operational philosophy, such as hiring high-caliber engineers, maintaining a "talent-dense" team, and the importance of transparent communication with stakeholders. While the technical and strategic aspects dominate, the conversation also references early-stage challenges, including infrastructure testing, pricing strategies, and the impact of AI-driven search demands on database design. The overarching goal remains achieving PMF by the end of the year, with a commitment to return investor funds if unsuccessful, emphasizing a bold yet transparent approach to scaling the product.

Recent Episodes of Latent Space

22 Jun 2026 Red-Teaming after Mythos Zico Kolter & Matt Fredrikson, Gray Swan

AI security challenges in large language models, such as data leakage and prompt injection, require adversarial testing, red teaming, tools like *Shade* and *Signal*, and structured frameworks to address integration risks, robustness gaps, and enterprise-specific security demands.

3 Jun 2026 Scaling Past Informal AI - Carina Hong, Axiom Math

Formal verification is positioned as a critical tool for advancing AI by ensuring system correctness through mathematical rigor, exemplified by Axiom Math's achievements, tools like Lean, challenges in AI generalization, and the vision of AI as a "superhuman mathematician" through verified reasoning.

3 Jun 2026 Satya Nadella: No Priors x Latent Space Crossover Special at Microsoft Build

Strategic AI development shifts to ecosystem-driven frameworks prioritizing value creation, covering Microsoft's rigorous model training, agent-driven workflow management, real-world impact challenges, innovative business models, inclusive AI participation, and redefining work through agentic systems.

2 Jun 2026 GitHub's plan for Agents Kyle Daigle, GitHub

Advanced AI integration in developer workflows leverages tools like GitHub Copilot and agentic systems to automate tasks and boost productivity, while addressing challenges like skill bloat, security, open-source trust issues, and the shift to modular AI capabilities in enterprise and collaborative environments.

1 Jun 2026 Why Video Agent models are next Ethan He, xAI Grok Imagine

Advancements in AI research through community-driven knowledge sharing, challenges in scaling video models, technical innovations like vision transformers and diffusion models, and the integration of language models in generative media, alongside hurdles in training efficiency and sustainable development.

More Latent Space episodes