Wave Pod
Discover
Library
Get Wave AI
Sign In
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL - Daily Paper Cast | Wave AI Podcast Notes
← Daily Paper Cast
Daily Paper Cast
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
May 7, 2026
·
00:21:27
Send to my inbox
Science
Technology
Loading summary
Sign in to save
Share
Sign in to transcribe