More No Priors - AI, Technology, Startups episodes

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud thumbnail

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud

Published 1 May 2026

Duration: 00:43:01

Base 10's 30x growth and $1B+ revenue projections underscore AI inference market challenges like infrastructure gaps, open-source model shifts, domain-specific customization needs, geopolitical tensions, compute constraints, and emerging trends in AI-as-a-service and societal integration.

Episode Description

Baseten CEO and co-founder Tuhin Srivastava sits down with Sarah Guo and Elad Gil to discuss the rapid growth of AI inference demand, Basetens 30x gro...

Overview

The podcast discusses the rapid scaling of Base 10, which has achieved 30x growth in 24 months with projected annual revenue exceeding $1 billion, driven by rising demand for AI inference across industries. It emphasizes the complexities of AI inference as a lagging market compared to training, noting advancements in open-source models and post-training techniques like reinforcement learning, which enable companies to customize and own their inference processes. The distinction between application-specific AI (e.g., healthcare workflows, customer support systems) and generic "frontier" models is highlighted, with examples of startups like Bridge and Abridge demonstrating the need for domain-specific solutions. The market is dominated by a "long tail" of models and enterprises, but AI-native startups are growing rapidly, pushing infrastructure providers to balance serving both startups and enterprise-scale needs.

Key challenges include compute constraints, supplier difficulties, and the need for strategic infrastructure investments, such as deploying 90 global compute clusters and developing a unified runtime fabric. The discussion underscores the importance of inference and post-training capabilities as core competencies, with post-training optimization and feedback loops critical for refining models. Geopolitical concerns around open-source models, particularly Chinese models, are acknowledged, but the focus remains on leveraging cost-effective options like DeepSeek while prioritizing model performance. The podcast also touches on the evolving role of compute as a strategic asset, the diversification of chip technology, and the need for runtime innovations to address scaling, security, and performance bottlenecks in large language models. Future trends point to a shift toward AI as a service, with AI-driven "units of cognition" reshaping industries and consumer expectations.

Recent Episodes of No Priors - AI, Technology, Startups

21 May 2026 The Story Behind Cerebras $63 Billion IPO with Founder and CEO Andrew Feldman

The text examines how AI's acceleration mirrors past tech revolutions like streaming, highlighting Cerebros' AI-optimized hardware, challenges in scaling, strategies for overcoming skepticism, and the transformative potential of AI to redefine productivity and business models through radical innovation and open-source collaboration.

More No Priors - AI, Technology, Startups episodes