Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore Yi Tay 2

Published 23 Jan 2026

Duration: 5524

Researchers discuss AI advancements in areas such as spreadsheet automation, reinforcement learning, and large language models.

Episode Description

From shipping Gemini Deep Think and IMO Gold to launching the Reasoning and AGI team in Singapore, Yi Tay has spent the last 18 months living through...

Overview

The podcast explores the integration of AI models in handling spreadsheet tasks, highlighting their ability to generate summaries and reduce the need for manual effort. It also mentions the "Nano Banana" project, which successfully creates calming images and helps maintain system stability. The conversation reflects on returning to a familiar research environment at Google after a long absence, emphasizing the comfort of established infrastructure.

A major focus is placed on reinforcement learning (RL) as a critical research area, particularly in decision-making and model training. The discussion covers the evolution of AI models, the importance of self-awareness in learning algorithms, and the challenges of moving from imitation-based learning to independent decision-making. The potential of large language models (LLMs) in tackling complex problems, such as those found in the International Mathematical Olympiad, is examined, along with the development of systems like Gemini. Other topics include data efficiency, the difficulties of training models on large-scale data, and the future potential of artificial general intelligence (AGI).

Recent Episodes of Latent Space

22 Jun 2026 Red-Teaming after Mythos Zico Kolter & Matt Fredrikson, Gray Swan

AI security challenges in large language models, such as data leakage and prompt injection, require adversarial testing, red teaming, tools like *Shade* and *Signal*, and structured frameworks to address integration risks, robustness gaps, and enterprise-specific security demands.

3 Jun 2026 Scaling Past Informal AI - Carina Hong, Axiom Math

Formal verification is positioned as a critical tool for advancing AI by ensuring system correctness through mathematical rigor, exemplified by Axiom Math's achievements, tools like Lean, challenges in AI generalization, and the vision of AI as a "superhuman mathematician" through verified reasoning.

3 Jun 2026 Satya Nadella: No Priors x Latent Space Crossover Special at Microsoft Build

Strategic AI development shifts to ecosystem-driven frameworks prioritizing value creation, covering Microsoft's rigorous model training, agent-driven workflow management, real-world impact challenges, innovative business models, inclusive AI participation, and redefining work through agentic systems.

2 Jun 2026 GitHub's plan for Agents Kyle Daigle, GitHub

Advanced AI integration in developer workflows leverages tools like GitHub Copilot and agentic systems to automate tasks and boost productivity, while addressing challenges like skill bloat, security, open-source trust issues, and the shift to modular AI capabilities in enterprise and collaborative environments.

1 Jun 2026 Why Video Agent models are next Ethan He, xAI Grok Imagine

Advancements in AI research through community-driven knowledge sharing, challenges in scaling video models, technical innovations like vision transformers and diffusion models, and the integration of language models in generative media, alongside hurdles in training efficiency and sustainable development.

More Latent Space episodes