Can we tell if an AI is loyal by reading its mind? DeepMind's Neel Nanda (part 1) - 80,000 Hours Podcast | Wave AI Podcast Notes