Beyond Semantic Manipulation: Token-Space Attacks on Reward Models - Best AI papers explained | Wave AI Podcast Notes