wave
Pod
Get Wave AI
Sign In
“Fail safe(r) at alignment by channeling reward-hacking into a “spillway” motivation” by Anders Cairns Woodruff, Alex Mallen - Redwood Research Blog | Wave AI Podcast Notes
Back to Redwood Research Blog
“Fail safe(r) at alignment by channeling reward-hacking into a “spillway” motivation” by Anders Cairns Woodruff, Alex Mallen
Redwood Research Blog
Mon Apr 27 2026
Sign in to process episode
Loading summary...
Send to Email