More No Priors - AI, Technology, Startups episodes

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud thumbnail

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud

Published 1 May 2026

Duration: 00:43:01

Base 10's 30x growth and $1B+ revenue projections underscore AI inference market challenges like infrastructure gaps, open-source model shifts, domain-specific customization needs, geopolitical tensions, compute constraints, and emerging trends in AI-as-a-service and societal integration.

Episode Description

Baseten CEO and co-founder Tuhin Srivastava sits down with Sarah Guo and Elad Gil to discuss the rapid growth of AI inference demand, Basetens 30x gro...

Overview

The podcast discusses the rapid scaling of Base 10, which has achieved 30x growth in 24 months with projected annual revenue exceeding $1 billion, driven by rising demand for AI inference across industries. It emphasizes the complexities of AI inference as a lagging market compared to training, noting advancements in open-source models and post-training techniques like reinforcement learning, which enable companies to customize and own their inference processes. The distinction between application-specific AI (e.g., healthcare workflows, customer support systems) and generic "frontier" models is highlighted, with examples of startups like Bridge and Abridge demonstrating the need for domain-specific solutions. The market is dominated by a "long tail" of models and enterprises, but AI-native startups are growing rapidly, pushing infrastructure providers to balance serving both startups and enterprise-scale needs.

Key challenges include compute constraints, supplier difficulties, and the need for strategic infrastructure investments, such as deploying 90 global compute clusters and developing a unified runtime fabric. The discussion underscores the importance of inference and post-training capabilities as core competencies, with post-training optimization and feedback loops critical for refining models. Geopolitical concerns around open-source models, particularly Chinese models, are acknowledged, but the focus remains on leveraging cost-effective options like DeepSeek while prioritizing model performance. The podcast also touches on the evolving role of compute as a strategic asset, the diversification of chip technology, and the need for runtime innovations to address scaling, security, and performance bottlenecks in large language models. Future trends point to a shift toward AI as a service, with AI-driven "units of cognition" reshaping industries and consumer expectations.

Recent Episodes of No Priors - AI, Technology, Startups

9 Apr 2026 The Agentic Economy: How AI Agents Will Transform the Financial System with Circle Co-Founder and CEO Jeremy Allaire

Circle aims to build a digital financial infrastructure using programmable money and blockchain, inspired by Bitcoin, emphasizing stablecoins like USDC, Austrian economics principles, full-reserve banking, and the integration of blockchain, AI, and tokenized assets to enable global value transfer, microtransactions, and new economic paradigms while addressing regulatory and scalability challenges.

20 Mar 2026 Andrej Karpathy on Code Agents, AutoResearch, and the Loopy Era of AI

The shift from traditional coding to agent-centric workflows redefines development by having AI agents handle 80% of tasks, emphasizing intent-driven "manifestation" over code, while addressing challenges like agent limitations, real-world applications in automation and smart homes, and broader implications for education, ethics, and AI ecosystem dynamics.

More No Priors - AI, Technology, Startups episodes