Episode 54: Scaling AI: From Colab to Clusters — A Practitioner’s Guide to Distributed Training and Inference - Vanishing Gradients | Wave AI Podcast Notes