State of LLMs 2026: RLVR, GRPO, Inference Scaling — Sebastian Raschka - The MAD Podcast with Matt Turck | Wave AI Podcast Notes