Wave Pod
Self-Distillation for Data-Scarce Language Model Pretraining - Best AI papers explained | Wave AI Podcast Notes