AI Testing and Evaluation: Reflections - Microsoft Research Podcast | Wave AI Podcast Notes