
The Bitter Lesson: The history of reinforcement learning
13 Jun 2026
The discussion critiques reward-driven models of intelligence, contrasting their behaviorist roots with modern AI advances such as neural networks and self-play. It examines historical cases including TD-Gammon and AlphaGo, and highlights both the limits of reward frameworks in capturing autonomy and the shift from human-encoded rules toward data-driven learning.