More Practical AI episodes

Controlling AI Models from the Inside thumbnail

Controlling AI Models from the Inside

Published 20 Jan 2026

Duration: 2635

The podcast delves into the AI safety crisis, discussing ongoing struggles with AI-generated harm, the limitations of current security measures, and emerging solutions for real-time monitoring and more sophisticated safety protocols.

Episode Description

As generative AI moves into production, traditional guardrails and input/output filters can prove too slow, too expensive, and/or too limited. In this...

Overview

The podcast explores the ongoing challenges in AI safety, particularly the risks of AI systems generating harmful or unintended content such as violence, pornography, or dangerous advice. It differentiates between using AI for security and ensuring AI systems themselves are secure, stressing the importance of proactive safety measures beyond basic input and output filtering. Current approaches are criticized for being reactive, often relying on post-hoc analysis of outputs and struggling with detecting harmful content in complex media like video and audio.

The discussion highlights emerging solutions that use internal model instrumentation to identify unsafe behavior in real-time, offering a more efficient and scalable alternative. It also addresses the value of interpretability in AI, the need for layered defense strategies, and the potential of edge devices to support safety mechanisms with lower computational requirements. The conversation touches on economic and practical barriers to implementing strong safety measures and the difficulty of tailoring these systems to industry-specific needs, while envisioning a future of more adaptable and context-aware AI security frameworks.

Recent Episodes of Practical AI

4 Jun 2026 Breaking down the 2026 Stanford AI Index Report

Recent advancements in AI, highlighted by the Stanford AI Index Report's findings on accelerating capabilities, human-level performance in specialized tasks, impacts on education and work, challenges like flawed benchmarks and the "jagged frontier," robotics limitations, U.S.-China leadership dynamics, governance gaps, and broader implications for labor, creativity, and policy.

28 May 2026 Rebooting Enterprise AI with MCP and Kubernetes

The Multi-Cloud Protocol (MCP) bridges AI systems with enterprise infrastructure, enabling secure, scalable interactions between LLMs and traditional tools via standardized, governance-focused operational frameworks.

21 May 2026 Hermes Agent: Agents that grow with you

Noose Research's mission to democratize AI through open-source tools like the Hermes Agent emphasizes efficiency, distributed training, ethical alignment, and agentic systems, while navigating challenges like monopolization, geopolitical competition, and the balance between open-source ideals and commercial interests, alongside debates on AI's creative limits and societal impact.

14 May 2026 U.S. Congressman Beyer on AI challenges facing America and the World

AI policy debates, cybersecurity vulnerabilities, economic disruptions, ethical risks, international collaboration, and philosophical questions on AI consciousness and human alignment dominate discussions on balancing innovation with governance and societal impact.

7 May 2026 The Myth of Model Wars: Open vs Closed AI in 2026

AI integration into physical systems via embedded tech in retail, manufacturing, and logistics is driven by microelectronics democratizing access, emphasizing infrastructure and edge applications over model types, while navigating challenges in scalability, tooling, and aligning AI with real-world business needs.

More Practical AI episodes