More Latent Space episodes

[LIVE] Anthropic Distillation & How Models Cheat (SWE-Bench Dead) | Nathan Lambert & Sebastian Raschka thumbnail

[LIVE] Anthropic Distillation & How Models Cheat (SWE-Bench Dead) | Nathan Lambert & Sebastian Raschka

Published 26 Feb 2026

Duration: 3137

A podcast discusses the application of distillation in machine learning, particularly in training smaller language models, and raises concerns about the potential for malicious use and the need for improved detection and regulation.

Episode Description

Swyx joined SAIL! Thank you SAIL Media, Prof. Tom Yeh, 8Lee, Hamid Bagheri, c9n, and many others for tuning into SAIL Live #6 with Nathan Lambert and...

Overview

The podcast covers the concept of distillation in machine learning, focusing on its application in training smaller language models using the outputs of larger ones. This process can involve using logits from traditional models or synthetic data generated by large language models (LLMs), enabling the creation of more efficient, deployable models. Major companies like DeepSeq and Google are highlighted as using distillation to reduce the size and computational demands of their models while maintaining performance.

The discussion also addresses ethical and legal challenges, particularly around "distributed distillation attacks," where third-party AI labs may use outputs from competing models to train their own systems. This raises concerns about AI geopolitics and the enforcement of terms of service. The podcast explores detection methods for such attacks, such as identifying unusual usage patterns, but notes the difficulty in distinguishing between legitimate training practices and malicious intent. Additionally, the episode touches on the broader implications of distillation for AI innovation and regulation, while examining the role of benchmarks like Sweet Bench in evaluating model performance, despite their known limitations in accurately assessing LLM capabilities across varied domains.

Recent Episodes of Latent Space

22 Jun 2026 Red-Teaming after Mythos Zico Kolter & Matt Fredrikson, Gray Swan

AI security challenges in large language models, such as data leakage and prompt injection, require adversarial testing, red teaming, tools like *Shade* and *Signal*, and structured frameworks to address integration risks, robustness gaps, and enterprise-specific security demands.

3 Jun 2026 Scaling Past Informal AI - Carina Hong, Axiom Math

Formal verification is positioned as a critical tool for advancing AI by ensuring system correctness through mathematical rigor, exemplified by Axiom Math's achievements, tools like Lean, challenges in AI generalization, and the vision of AI as a "superhuman mathematician" through verified reasoning.

3 Jun 2026 Satya Nadella: No Priors x Latent Space Crossover Special at Microsoft Build

Strategic AI development shifts to ecosystem-driven frameworks prioritizing value creation, covering Microsoft's rigorous model training, agent-driven workflow management, real-world impact challenges, innovative business models, inclusive AI participation, and redefining work through agentic systems.

2 Jun 2026 GitHub's plan for Agents Kyle Daigle, GitHub

Advanced AI integration in developer workflows leverages tools like GitHub Copilot and agentic systems to automate tasks and boost productivity, while addressing challenges like skill bloat, security, open-source trust issues, and the shift to modular AI capabilities in enterprise and collaborative environments.

1 Jun 2026 Why Video Agent models are next Ethan He, xAI Grok Imagine

Advancements in AI research through community-driven knowledge sharing, challenges in scaling video models, technical innovations like vision transformers and diffusion models, and the integration of language models in generative media, alongside hurdles in training efficiency and sustainable development.

More Latent Space episodes