“Contrastive features elicit different perturbation responses than SAE features” by Francisco Ferreira da Silva, StefanHex - LessWrong (30+ Karma) | Wave AI Podcast Notes
March 21, 2026 · 00:14:53
Technology
Society & Culture