Wave Pod
Discover
Library
Get Wave AI
Sign In
“LLM Misalignment Can be One Gradient Step Away, and Blackbox Evaluation Cannot Detect It.” by Yavuz Bakman - LessWrong (30+ Karma) | Wave AI Podcast Notes
← LessWrong (30+ Karma)
LessWrong (30+ Karma)
“LLM Misalignment Can be One Gradient Step Away, and Blackbox Evaluation Cannot Detect It.” by Yavuz Bakman
March 16, 2026
·
00:07:00
Send to my inbox
Technology
Society & Culture
Loading summary
Sign in to save
Share
Sign in to transcribe