More Latent Space episodes

[LIVE] Anthropic Distillation & How Models Cheat (SWE-Bench Dead) | Nathan Lambert & Sebastian Raschka thumbnail

[LIVE] Anthropic Distillation & How Models Cheat (SWE-Bench Dead) | Nathan Lambert & Sebastian Raschka

Published 26 Feb 2026

Duration: 3137

A podcast discusses the application of distillation in machine learning, particularly in training smaller language models, and raises concerns about the potential for malicious use and the need for improved detection and regulation.

Episode Description

Swyx joined SAIL! Thank you SAIL Media, Prof. Tom Yeh, 8Lee, Hamid Bagheri, c9n, and many others for tuning into SAIL Live #6 with Nathan Lambert and...

Overview

The podcast covers the concept of distillation in machine learning, focusing on its application in training smaller language models using the outputs of larger ones. This process can involve using logits from traditional models or synthetic data generated by large language models (LLMs), enabling the creation of more efficient, deployable models. Major companies like DeepSeq and Google are highlighted as using distillation to reduce the size and computational demands of their models while maintaining performance.

The discussion also addresses ethical and legal challenges, particularly around "distributed distillation attacks," where third-party AI labs may use outputs from competing models to train their own systems. This raises concerns about AI geopolitics and the enforcement of terms of service. The podcast explores detection methods for such attacks, such as identifying unusual usage patterns, but notes the difficulty in distinguishing between legitimate training practices and malicious intent. Additionally, the episode touches on the broader implications of distillation for AI innovation and regulation, while examining the role of benchmarks like Sweet Bench in evaluating model performance, despite their known limitations in accurately assessing LLM capabilities across varied domains.

Recent Episodes of Latent Space

20 Mar 2026 Dreamer: the Personal Agent OS David Singleton

Dreamer is an AI platform democratizing access to agentic tools for non-technical users via customizable AI assistants, community-built apps, cross-device integration, and privacy-focused features, with a beta emphasis on accessibility, real-world productivity use cases, and third-party developer opportunities.

More Latent Space episodes