Google DeepMind's AI Control Roadmap: treating agents as insider threats
DeepMind just published their internal framework for securing AI agents: real-time monitoring, threat modeling borrowed from cybersecurity, and the assumption that alignment might fail.