Wave Pod
Discover
Library
Get Wave AI
Sign In
“Fail safe(r) at alignment by channeling reward-hacking into a “spillway” motivation” by Anders Cairns Woodruff, Alex Mallen - LessWrong (30+ Karma) | Wave AI Podcast Notes
← LessWrong (30+ Karma)
LessWrong (30+ Karma)
“Fail safe(r) at alignment by channeling reward-hacking into a “spillway” motivation” by Anders Cairns Woodruff, Alex Mallen
April 27, 2026
·
00:31:30
Send to my inbox
Technology
Society & Culture
Loading summary
Sign in to save
Share
Sign in to transcribe