Transcript
A (0:00)
Foreign.
B (0:11)
Hello and welcome to the Last Week in AI podcast, where you can hear us chat about what's going on with AI. As usual in this episode we will be summarizing and discussing some of last week's most interesting AI news. You can go to the episode description for all the links to the news we're discussing and the timestamps so you can jump to the discussion if you want to. I am one of your regular hosts, Andre Kankov. I studied AI in grad school and I now work with it at a generative AI startup.
C (0:41)
And I'm as you can hear me typing here just because I'm making final notes on what is an insane week. If we do our jobs right today, this will be a banger of an episode. If we do our jobs wrong, it'll seem like any other week this has been insane. I'm Jeremy, by the way. You guys all know that if you listen, I Gladstone, AI, AI, National Security, all that jazz. This is pretty nuts. Like we were talking about this, I think, last week, where we're catching up on two weeks worth of news and we were talking about how every time we miss a week and it's two weeks, inevitably it's like the worst two weeks to miss. And the AI universe was merciful that time. It was not merciful this time. This was an insane again, banger of a week. Really excited to get into it, but man, is there a lot to cover.
B (1:26)
Exactly. Yeah. There hasn't been a week like this probably in a few months. You know, there was a similar week, I think around February, where a whole bunch of releases and announcements were bunched up from multiple companies. And that's what we're seeing in this one. So just to give you a preview, the main bit that's exciting and very full is announcements concerning tools and sort of consumer products. So Google had their IO 2025 presentation and that's where most of the news has come out of. They really just went on the attack, you could say, with a ton of stuff either coming out of beta and experimentation being announced, being demonstrated, et cetera. And we'll be getting into all of that. And then afterward, Anthropic went and announced Cloud 4 and some additional things in addition to Cloud 4, which was also a big deal. So those two together made for a really, really eventful week. So that'll be a lot of what we were discussing. And then in applications as business, we'll have some stories related to OpenAI, we'll have some interesting research and some policy and safety updates. About safety related to these new models and other recent releases. But yeah, the exciting stuff is definitely going to be first up and we're just going to get into it. First in tools and apps is Claude 4. Maybe because of my own bias about this being exciting. So this is Claude Opus 4 and Claude Sonnet 4. This is the large and medium scale variant of Claude from Entropic. Previously we had Claude 3.7, I think for a few months Cloud 3.7 has been around, but not super lengthy and this is pretty much an equivalent update. They'll be costing the same as the 3.7 variants. And the pitch here is that they are better at coding in particular and better at long workflows, so they are able to maintain focused effort across many steps in a workflow. This is also coming paired with updates to Claude code, so it's now more tightly integrated with development environments. Coming with an SDK now, so you don't have to use it as a command line tool. You can use it programmatically and related to that as well. Both of these models, Opus and Sonnet, are hybrid models, same as 3.7, so you can adjust the reasoning budget for the models. So I guess qualitatively not anything new compared to what Anthropic has been doing, but really doubling down on the agentic direction. The kind of demonstration that people seem to be optimizing these models for the task of give a model some work and let it go off and do it and come back after a little while to see what it built with things like cloud code.
