From Demonstrations to Rewards: Alignment Without Explicit Human Preference - AI Insiders | Wave AI Podcast Notes