Transcript
Michelle Lee (0:00)
Foreign.
Andrei Karlenkov (0:11)
Welcome to the Last Week in AI podcast where you can hear us chat about what's going on with AI. As usual in this episode, we will summarize and discuss some of last week's most interesting AI news. You can also go to Last Week in AI for our newsletter with even more articles. I am one of your regular hosts, Andrei Karenkov. I studied AI in grad school and now work at the startup Astrocade. And with me once again, guest co host Michelle Lee.
Michelle Lee (0:42)
Hi everyone, I am Michelle Lee. I did my PhD at Stanford with Andre and I am currently the founder and CEO of Medra. We are building physical AI scientists.
Andrei Karlenkov (0:53)
Yeah, back in the day we both used to do robotics research. I went the easy route to do generative AI producing video games. But you stuck with it and actually doing science. So it's good that you're finding a good fight.
Michelle Lee (1:08)
Absolutely. I mean, I think right now, I mean, even looking at the news today, there are a lot of very exciting AI and robotics news. So Andre, you should get into robotics.
Andrei Karlenkov (1:19)
I really might. We'll see. And speaking of what we'll be discussing, it's going to be a pretty exciting episode actually.
Michelle Lee (1:26)
It's.
Andrei Karlenkov (1:26)
We have had kind of a quiet stretch in this past week. Things popped off in a way they haven't in a little while. So of course we're going to be talking about Gemini Free, going to be talking about Nada banana Pro Opus 4.5. And then beyond that, we do have some exciting news on the robotics front. With a new startup emerging out of stealth and a bunch of funding for one established one, quite a few interesting open source releases. We'll be touching on some research on interoperability and on alignment. Also that we'll discuss from Anthropic. So overall, quite a few interesting things in this one. I would recommend you stick around. Before we dive in, I do want to acknowledge real quick, there's a new review on Apple Podcasts called the Best AI Podcast. The Dash still alive. Question mark. So we did switch to a new hosting provider and I hope that it's still being posted to everyone's feed. If somehow there's some technical glitch that you're not no longer seeing it, please do let me know. And moving into the story, starting with tools and apps, we begin with Gemini Free. So somehow everyone was aware that Gemini Free was going to be released last week and so it was. Gemini 2.5 Pro, I think came out early this year around February, was kind of mind blowing at the time. That was really the first Gemini that people were like, wow, Google is starting to get back into the lead. Gemini 3 kind of as big a release. People were very excited and people are not disappointed with this release. It got like really impressive results on the toughest of benchmarks, including humanity's last exam. Got a record score of 37.4. And this is the benchmark that was like developed to be the hardest thing that we could possibly do so that LLMs couldn't beat it. It's like half a year old. It used to be zero progress on it. Now it's actually increasingly being solved. So overall a lot of just impressive results coming out of it. They also announced Google Antigravity, which is their new coding ide, which I guess is to compete with Cursor coming a couple months after they sort of acquired Windsurf. Yeah, but not really, which some people on Twitter were a little amused by. But yeah, Gemini Free, super exciting. And there's also Gemini Free Deep Think, which is more research focused and you know, can go even deeper.
