A Key Concept in AI Alignment: Deep Reinforcement Learning from Human Preferences - Linear Digressions | Wave AI Podcast Notes