AI testing, benchmarks and evals - Thoughtworks Technology Podcast | Wave AI Podcast Notes