“A Toy Environment For Exploring Reasoning About Reward” by jenny, Bronson Schoen - LessWrong (30+ Karma) | Wave AI Podcast Notes