
Loading summary
Gavin
Google's big IO event is next week, and rumors are leaking like ooze from the goo.
Kevin
Lovely visual. We've got new video model samples which show deep reasoning and character consistency. We have a new Flash model that's quicker and perhaps up to 20 times cheaper than GPT 5.5. And a new Spark agent that might know everything about your life. Everything.
Gavin
And they're using AI to reinvent something that you use daily and has remained mostly unchanged since 1968.
Kevin
Now, see, that's a juicy tease. Can we get a little hint, Gav?
Gavin
Uh, no spoilers, but it's the mouse cursor.
Kevin
Okay. I don't think you know how. I don't think you know how. Spo.
Gavin
Oh, oh, oh. Kevin, figure 03 is sorting packages longer than you can stay awake.
Kevin
Well, I will be sleeping soundly inside my mecha unit tree as we stomp around and terrorize entire cities. Gavin. So you know
Gavin
all of that and more on this episode of AI for Humans. Stomp, stomp, stomp. Welcome, everybody, to AI for Humans, your twice a week guide to the wonderful world of AI we are awesome manos. Yeah. Yeah. So every once in a while. Yeah, there you go. This week, Kevin, we have some big leakage. There's leakage from the goob. We got some ooze from the goob. We are always talking a little bit ahead of things that are happening here. And one of the fascinating things about the AI space is you do hear bubbles of news coming out of large events. And next week is Google's big IO event. And we know that we have been following this for a couple of years. You might remember a couple of years ago, there was the famous AI AI AI sundar pichai video. AI AI AI AI generative AI generative AI generatory AI. Last year we got a really cool look at VO3 and Kevin. There are some really big things that they are teasing. Well, not officially teasing. That are leaking about next week's event. Yeah.
Kevin
Grab your wooden flutes. Let's go to the Ren faire. Bard is coming back, baby. No, sorry, that's uses their slogan. I know.
Gavin
Grab your wooden flutes and join Bard on the AI march to win a world.
Kevin
Hey, don't. Don't close the browser tab yet. We do have actual news. We'll get to it. Right now, Google Spark is the rumored agent. Gavin, people were seeing pop ups. It might have leaked a little too soon, but Google Spark is their agent that has access to all of the things. And it makes complete sense because every other agent seems to have access to all of your Google. So if you want an agent that can work in the cloud 247 that has access to your Gmail, your calendar, your Google Docs, et cetera, that's what this agent is promising.
Gavin
Yeah. So I think one thing that's interesting about all these leaks is they are coming out of kind of like these people that just suddenly opened their Gemini and it was like, oh, there's a thing there. And we're going to talk about the Omni model coming up as well too. Kevin. To me, Google Spark is a thing that Google should be able to own. Right. So we've been talking for the last couple months about the movement towards personalized AI agents. Obviously OpenClaw kind of beginning began this conversation in a big way. I think a lot of people have been struggling with open Claw, which sometimes can be the case with open source software when especially it's on the bleeding edge. What Gemini has the opportunity to do here is to create something that feels all encompassing of your life. Because you and I have talked about this before. G Docs is how I do my writing and work. I often have sheets up for Google Excel version, version of Excel. Gmail is where all of my. In fact I have like six, six Gmail accounts now because I'm doing a couple other jobs. So if there really is something that can bring all these things together, that feels like it's a pretty significant place for Google to live.
Kevin
Look, a lot of people use the Claude desktop app and even now, you know, Codex or ChatGPT, they have their own connectors or plugins that will let you access a lot of this Google stuff. But you have to download that separate app. You usually have to pay, you have to authenticate, you got to click through, you got to authorize a bunch of stuff. It feels weird. It still requires multiple clicks. This is a chance for Google to provide agents to people where they don't have to know what a command line is, they don't have to know what a plugin or a connector is. They could just click the button and ask and then there is all of their info put in there. I wonder how opt in it will be, how opt out it will be. I wonder if you will have to pay for an ultra plan for it or if they're just gonna, you know, just go for it and subsidize the cost of this thing like so many companies are.
Gavin
I mean for any company that could subsidize cost, Google should be subsidizing costs because Google and Meta are the Two still the two money printers of Internet business. And like Google has the opportunity here to, no pun intended, claw back a lot of actual AI usage. There's a couple other things we should talk about that are kind of leaking out ahead of this event. And I want to apologize ahead of time to the Google IoT because I I I said I could attend Google I o. I will not be attending Google I o as much as I wish.
Kevin
Well, this is the breaking news, Gavin, let's get to it. This is the story people clicked on our YouTube video for. Give them the exclusive. Give them the exclusive.
Gavin
I will be traveling down to Los Angeles during that time, but thank you to Google. But first of all, Kevin, the other big thing that I'm very excited about that is going to come out next week, we think. Now, again, these are just things that have flitted into the world.
Kevin
This is the rumor mill.
Gavin
Yeah. Yes, this is the rumor mill. And people, it's more than rumors because we assume that these are things that are popping up and we've seen some things. There is a new video model coming from Google that we think is going to be Veo Omni. Now, we don't know that's the name of it, but it seems like that's the case. And again, somebody has opened their Gemini and support. Supposedly there's a thing here. And Kevin, what this is is a new video model that does a lot of reasoning behind the scenes. Now, we've talked about reasoning video models before and Obviously Sea Dance 2 is what I still think. I've been kind of toying with it. I'm trying to think of some things that I can do with it, but it is still by far the best at action and other things and physics in the world. Something that Sea Dance 2 fails at all. The time is writing. So if you have a scene where you're doing like a big active scene and you put in the background you're in a town or you say you're in Paris or wherever you are, what you'll see on the signs in the background is gibberish. Like you wouldn't kind of old school AI image gen. What this the leaks of this model are showing is that maybe they have focused on making all of that make sense. And there's a video that I want you to play a little bit of here that we'll listen to. But for the viewers on YouTube, we'll be able to see what's interesting about this. Play this video about the math professor that's doing an equation on the Board.
Kevin
We start with the fundamental identity. Sine squared plus cosine squared equals one. Now, if we divide every term by cosine squared, we arrive at the identity for tangent.
Gavin
Okay. Okay, hold on. So, Kevin, this is what. I understand all this stuff, so I want to make sure that you're understanding what's going on here in this math.
Kevin
I was just gonna say I don't want a weird flex, because people know that, you know, I was a bit of a mathlete in my day.
Gavin
Yes,
Kevin
I was. So I don't want to flex on the math. We don't know if the math on the board is correct. We. We are both too lazy.
Gavin
Oh, we do, because there are. There are people out there who have confirmed the math on the board is correct. So just to explain, just. Yes, yes. I'm not saying I can.
Kevin
We can confirm it's real because some random bot on X said that the
Gavin
math checks out more than random bots.
Kevin
These are.
Gavin
These are. Multiple people have said the math checks out. And here's. Here's what I want to say. The thing that's going on in this video, there are small problems with it. When you watch it, you'll see the chalk at one point kind of jumps to the next thing. But, yes, according to the world at large, and this is not just one random person, multiple people have said it is doing the math right in the board. And you can imagine a world where if it can get the right text and the right math formulas right, it opens the door to a much larger understanding of the world. And, Kevin, we know that with Gemini and Veo and also genie. Don't forget about genie, which is their world model that Google has been thinking a lot about this idea of, like, a unified model that can see the world. So I will say, like, there's another clip which we can show here on video, which is two men kind of sitting down and eating some spaghetti together. Supposedly, this was somebody trying to recreate the Will Smith spaghetti thing, but it didn't allow him to use the word Will Smith. So this is a version of it. I think it's fine. It's definitely better than VO3, but if you just look at the RAW video, I don't think it's as good as Sea Dance 2. Like, there's things in it that don't feel as strong to me.
Kevin
No, Someone in the thread actually posted a version of the same prompt generated by Seed Dance, and it kind of blows it away.
Gavin
Yes.
Kevin
But that said, you know, listen, okay, the whole point of the OMNI model is that you won't be going to a dropdown and selecting generate video. You won't be going to a separate tool to make audio. You won't be going to a separate tool to do the reasoning of the math. This is supposedly an all in one model. And so that in and of itself is impressive. You know, I'm looking at like, again, the goalposts have moved because some people are going hugely, like you said, oh, the chalk jumped a little bit and the, the chalk itself isn't winnowing down properly. When he's using. I'm like, okay, this is where we're at now with these models. Crazy, crazy impressive. And I'm again curious to see if they will subsidize the living heck out of this.
Gavin
Yeah. So I think we have to look at a couple quick things with this world that Google's kind of putting together to the point of their Gemini agent spark, like that idea that like they can bring all this together to the Omni model. Again, they're trying to bring all this together. The other thing that they're trying to do, I think, Kevin, is there's. There are a few rumors floating out there that we may not be the new Gemini that they're going to drop. Gemini probably 3.2 pro at this event may not be like pushing the edge, but I don't think that's what they're trying to do. The more interesting rumor again that's floating out There is Gemini 3.2 flash. And what's interesting about this far, the more interesting rumor. Yes. Yeah. So Gemini 3.2 Flash is their cheaper, faster model. And you and I both know that the Gemma models from Google have also been very good for small models. There are rumors that are floating around about Gemini Flash that it is way lower than GPT 5.5 costs at about 90% of the capacity. And so far, according to the people that have like tried it, quote unquote, it is also much faster. And you and I have been talking about this forever. When we first tried Cerebrus or we tried these fast models. Part of what is the truck, the tricky thing with AI is how long it takes to go back and forth with it. If you get near instant answer, if you get near instant answers in a world where you're getting this sort of thing, that is a huge, huge, huge deal.
Kevin
With good intelligence, you will take 90% of the quality of response or accuracy of the response. You will take that because you can generate multiple responses or have a mixture of experts distill their responses and then take 10 of them and smash them all together in the time that it would normally take for one generation. And when you. Again, rumor mill, but when you cut costs by like 20%, you make it 20% cheaper, suddenly this becomes like, if you don't have enough RAM to run a capable model locally, this could become your daily driver. This could be the model that powers that Spark agent in the cloud.
Gavin
Right.
Kevin
Or maybe this will be the model that powers the new Google Book, which is clearly trying to a rebranding of the Chromebook that is going to run a lot of AI locally, because now the operating system is serving up AI as opposed to just launching apps and managing memory.
Gavin
Did you ever try owning a Chromebook? Did you ever have that experience?
Kevin
I worked on a Chromebook once and that was fine. It was fine for the time. The cloud apps are good, but, you know, when I used it, it was still tough to get like files off of an SD card at one point.
Gavin
Yes, yes, yes. I was going to say it's been a while. And I have to say, like, the Chromebook always felt to me like it didn't feel powerful enough to do the thing I wanted to do. And it kind of felt like, I guess a lot of kids in schools get them right. They were handed them for a long time. Like, that's a cheap computer to kids. Still, this is exciting, right? We're starting to see hardware designed with AI built in. Now a lot of hardware manufacturers are kind of like rushing to throw AI things into their actual devices. And I'm more interested in, like, more powerful onboard processing than I am necessarily, like Gemini Intelligence. But it is cool to see the idea that there will be a computer that's built from the ground up. And Kevin, this also transitions into a kind of weird piece of big news that Google also dropped. And it came from Google DeepMind, which is always interesting to me when a Google DeepMind thing drops. And this is a team that's been working on rethinking how you use your mouse cursor with AI. And you may hear that idea may like be like, get AI out of the dumbest thing in the world. I don't want to hear how to rethink my mouse cursor. I know that my fingers understand how to use a mouse, but really, this is kind of interesting. You want to kind of tell people what the basics of this are.
Kevin
Yeah, it's imagine attaching an AI assistant to your mouse cursor so that in success, it is contextually aware of what it is you're trying to do on the screen at any time and can offer AI powered suggestions. What's interesting is that that DeepMind teaser came out, right? Like, oh, hey, we're experimenting with this thing. And then I think less than 48 hours, it was announced as a feature as part of this new Google Book. So you just kind of shake your mouse, which, if you do that on a Mac, sometimes it makes your cursor giant. If you have that installed, you'll shake your mouse.
Gavin
I did. I've never done that before. Kevin, you've given life to me. I just shook my mouse.
Kevin
Oh, look at that.
Gavin
Yeah.
Kevin
And it will help you find your cursor, but here it will turn your cursor into sort of an AI enhanced cursor. What does that mean? That means you can select things on the screen and it will contextually offer you some things that you can do with it. So if you shake it over an image, it could offer you editing for that image, or you could grab that image and drop it on another image and say, stitch these two things together or select a to do list and say, create a widget to track this and share it with the family. So it's, again, it's putting AI exactly where your cursor is and trying to make it contextually aware. I think it's. I mean, look, the proof will be in the proverbial pudding. I think it's really interesting.
Gavin
The gooding is that. What's Google pudding called? Google? I don't know. I just tried to say something and it just garbled out of my mouth. Did you hear that? Like, I could barely.
Kevin
The Gemini Jello, Gavin. But you can use AI. You can use AI to again, do anything. With that cursor, you can create dynamic widgets which they have examples of. I'm going to keep talking. I'm not going to let you get in with whatever this is.
Gavin
Jello and also Jello with a G. Jello with a G, Kevin. It's just Jello with a G. Let's keep going. He did it.
Kevin
He did it, everyone. And he didn't need AI to get there. Remember that. And also, last thing about these Google Books is that, you know what's interesting is that they're kind of doing the mirroring on an iPhone thing. You can mirror your phone elegantly and quickly and you can run Android apps on it. So I don't know, I think the Google Book is suddenly pretty interesting for that. If they're going after, like, it's a budget Budget notebook, but it has all these AI features in it. I think it's interesting and we've said AI is going to make its way to the OS level. Here it is.
Gavin
Yes, here it is. And also the other thing that AI, I think, speaking of the thinking of the pointer is that you can talk to it. Right. I think the other thing that happened earlier this week that really got quite a bit of coverage, but also was an interesting story, is that Thinking Machines, which is Mira Murati's startup, released basically a series of videos on the new set of voice models that they are working on. And Kevin, the funniest thing about this, I don't know if you caught this or not, the first guy in these videos is actually the original guy from the GPT4O launch of the Voice models, where it's the same guy who had kind of a. It seemed like he was getting kind of attracted to the voice he was talking to. But this guy's working for Miramoratti now. But what's cool about this, Kevin, and I think you and I, with our work with AI voice, really understand this, is that what they're trying to do is make the feeling of an interaction with an AI voice model feel way more natural and normal. So you can interrupt and it's listening and doing stuff in the background.
Kevin
So I went so in the weeds on this model. I went so in the weeds on it. I was fascinated by it. Yeah.
Gavin
So it's not downloadable, was it? You can't use it. I don't.
Kevin
No, no, no. But I read all the papers and I even looked at like the tech that inspired their papers. Like, I went really deep on it because I was. I was fascinated with it. It is truly bi directional. Like it. Most of these models, they take your input. They usually convert your audio into a transcript. They feed that into the machine, you get the output, it then waits and then it's doing input and output simultaneously. Every 200 milliseconds, there's a tick and it's doing.
Gavin
Which is amazing when you're working with AI audio, because that is a thing that really slows you down if that doesn't happen.
Kevin
Yes, yes. And it can interrupt the user. It can talk over you, which is kind of important. It can, it can wait, it can lay back in the cut until it's activated. It is multimodal, so it can process video as well as audio at the same time. It uses like an interactive model as the main model to handle that interaction. But then it also can access a smarter model in the background for deep reasoning and things like that. But the audio, I didn't know this. It makes a heat map of your audio Gav. Oh, wow. Literally. Yeah.
Gavin
That's really interesting.
Kevin
Imagine like up or down is the frequency and left and right is sort of like the, like the. What is it like the, the volume or, or, or like.
Gavin
Sure.
Kevin
So. Oh, I'm sorry. Sorry. I. I did. So time goes left to right and then the frequency is top to bottom, but then there's a brightness or a color, the intensity.
Gavin
Interesting.
Kevin
Interesting is like, is there a lot there or not? And it couples that with the transcript. So it can actually tell the performance and that's why it can emote. It can tell if you're happy or sad. It can shift. It's actually looking at like a heat map of the transcription and the audio. I think it's super cool. This is the small version, the one that they demoed. It's the small version of their model. We don't know how capable the full thing is, but an interesting first shot out of their canon.
Gavin
Yeah, we also don't know when it's going to be available. I looked into that a little bit. You know, Thinking Machines has had some struggles where people have left. In fact, there's a story this week where like a bunch more people just left because their shares vested after a year. So they like jumped over for a bit and then they left. But I am excited to see how this goes because I do think this kind of overall Omni model idea and the Google book and all this stuff, this idea that like you'll interact and be part of your computer in a different way than has happened before is big. And I think, you know something, I just. Right, that's happening right now. I think there may be a Codex update today. That's something to kind of be aware of. There's been teasing a new OpenAI has been teasing a Codex update for today. But Codex is one of those things that I have seen a lot more people starting to dive into and use. And I do want to just quickly mention that like Sam has been talking about, Sam Allman has been talking about this idea that they're trying to bring Codex to a ton more people in the world. And in fact, Kev, I thought this was crazy. And if you're out there and you're an owner of a business or you're a small team, they're giving teams now two months of Codex use for free and all that compute for free to try to onboard more people right now. So this all kind of just fits into this much larger idea that we are kind of entering a stage where like this is just going to be part of our lives. It's just going to be kind of omniversal around us at any given time.
Kevin
Ooh, I like that. Well, Gavin, what if you're one of those people out there that doesn't want to upload all of their data to the old cloud and rely on one of these massive companies that can giveth and then taketh away?
Gavin
Well, you know Kevin, there's a really great series of open source models that if you want to use local AI, you can do that on your own. And I'll tell you how.
Kevin
Right here. Oh, are you setting up a clip? I thought you were. I was just asking you a question. It sounded like you were tossing to a clip.
Gavin
This is a clip. I'm about to do something I would never do with a cloud AI that's feed it my personal data and let it yell at me. And I can do that because I've got this HP ZBook Fury cranking away on my desk. I mean, drinking on my lap. There are powerful laptops and then there's this big thanks to HP and Intel for sponsoring AI for humans this week and hooking us up with this, this massive, amazing machine. Powered by a Core Ultra 9V processor with 256 gigabytes of RAM and an Nvidia RTX Pro 5000 Blackwell GPU. This thing handles AI like almost nothing I've ever seen. Today I'm running Ollama with Quin 3.8B which is one of the best open source local models out there right now, entirely on this machine. No cloud, no API calls, no data leaving my desktop. Here's why that matters. If you want to run a private LLM of your own stuff, your journals, your contacts, your finances, you really don't want any of that sitting on someone else's server. And with the Z Book Fury and local AI models, it doesn't have to. The model runs directly on this computer, everything works locally and no one ever has to phone home. And because I'm me, I did not use this for something sensible. I fine tune Quinn 38B to be the most aggressive, deeply unhelpful motivational coach possible. I call him Big Tony. Big Tony has read my calendar and Big Tony does not approve of my calendar number. Again, the wild part is that none of this actually leaves my desktop. Big Tony lives here. Big Tony cannot be subpoenaed. Big Tony is mine. Local LLMs private data zero cloud. That is the workflow. Check out the link in our description below if you want to build your own ZBook fury. And again, thank you to HP and Intel for sponsoring AI for humans. Thank you everybody. And also go follow us. That's right, like, and subscribe to our YouTube channel. We have a Patreon, we have a newsletter. We have all that stuff. Go find us. Oh, also hype this video. We've been talking about this for a little while and everybody in the past is like, somebody said, don't ask for people to comment. Ask for hypes. Okay. Your job today, AI for Humans Watcher, is go hype this video. I don't know how to do that. You are smart. You can find that out. Go for it.
Kevin
Yeah. And if you have trouble hyping it, just leave a comment and let us know. But I think it involves waving a white towel around your head like a hell.
Gavin
Just shake your mouse cursor really fast and see what happens.
Kevin
That's right, Gavin. Listen, I love running local models. I love running local models. But I do have a complaint, if I may. They don't make my neighbor ill. Oh, well, that's.
Gavin
That's good, isn't it? If they don't make your neighbor ill. Okay, okay.
Kevin
Someone doesn't like data centers. All right.
Gavin
Okay.
Kevin
Teacher, you forgot to assign homework. What a dork.
Gavin
You went deep on this story and I am very interested in this because this is like, it is probably the biggest story outside of the models in the AI right now. I don't know if you caught this, but Bernie Sanders and AOC just introduced a bill to try to ban data center development countrywide. I think this is a bad idea. I'll be very clear about that. I think we need to. Whatever, whatever.
Kevin
I'm going to take the monorail to get around Kevin o' Leary's data center, which got approved in Utah and it's bigger than Manhattan. Look, there's a lot of data center hate going around. A lot of it is justified. Communities don't want it. It can pollute the water. It can drain a lot of energy. There's a lot of, like, council members basically taking bribes and driving around and souped up Humvees in order to get these data centers. And there is. But here's an interesting reason to hate him as well. Data center sickness. And I wanted to just quickly shout out Ben Jordan. I went deep on his YouTube channel. He has some phenomenal videos, like really well put together, stuff about technology. A lot of times Leaning on the darker side of tech. But Ben has one on something called infrasound. And I wasn't even aware of infrasound. I mean, I think I was spiritually, emotionally, subconsciously. But infrasound is. Whenever there's a sound with a frequency lower than 20 Hz, lower than most humans can hear, it's like it can cause rumblings and it can literally shake the walls. You could see in, like, glasses of water. Think Jurassic park stomping around like the dinosaur. But this low frequency sound, though, you can't hear it, you can absolutely feel it. And it can cause cortisol spikes, it can cause restlessness, it can cause anxiety, it can cause a whole lot of stuff. And he's done some amazing work exposing infrasound pollution, because I think that's a real thing. He's measuring it. He's interviewing people like boots on the ground whose lives are getting upended by it.
Gavin
And.
Kevin
And guess what, Gavin, let's play a little guessing game. I'm going to give you one guess. It's got. It's called One and Done. It's a brand new game show. Play the sound effect.
Gavin
Love it.
Kevin
One and done. Here we go, Gavin. One and done. Guess where a lot of infrasound pollution is being generated.
Gavin
Kevin's bathroom.
Kevin
Ooh, I'm sorry, you're not a winner on One and Done today. No, everybody can hear that. It spans the entire audible spectrum, unfortunately, and it does shake the foundation. But we were looking for data centers, Gavin. Of course, unfortunately, data centers. It happens around crypto mining. A lot of, you know, it's the same stuff, right? It's big generators kicking in and, and diesel gen and whatnot to provide enough power. So it's yet another problem. It's one that we probably should be regulating and solving. I don't see a lot of efforts happening there.
Gavin
But please, I have a question about this, because I did not go deep on this story. And obviously the data center story is going to be, I think, the largest public facing AI story of the next couple of years. Yeah, here's my biggest question about this. And I understand, I'm not saying this is fake. And also, obviously, the world at large, we understand that things that don't seem believable on the top of your brain, like, of course these things really happen. My question is, like, when you go back to something like the Industrial revolution, was this the same sort of thing that, like, say, a factory does, or is how is it different from, like a normal factory or a Manufacturing plant. Because When I was 18, I worked in a place called the Longview Fiber Mill. And the Longview Fiber Mill was a paper pulp plan. It was a job I had after school. Very hard job. My grandfather got it for me because he's like, hey, you do this, you'll understand why I want to go to college. And I did. But my question is, how does that. How does. Do data centers differ in this from a normal factory? And does that. Is this. Is it a different sort of thing than. Than what a normal factory would be? That would be my big question. I'm not saying. I also believe that, like, data centers should probably be put in remote places. I mean, we have a lot of land on this world. Like, it makes sense. I know they have to build the facilities up, but, like, it doesn't. It doesn't. Like, I just want to make sure I have a good sense of that, I think, is what I would.
Kevin
I think that's great. Yeah, I'm. I'm. I'm barely qualified to be on YouTube, Gavin, so I don't know that I'm qualified to sincerely, maybe somebody to compare infrasound to current infrastructure. I don't. You know, I don't know. But I mean, if you look back at like, the air and water and the pollution and urban filth of the industrial revolution, and then you. And then you extrapolate, oh, the scale with which people are talking about these projects, right? You look at even the pollution that the Colossus data center had, right? Running all these turbines that were already against what little regulations we have. I'm sure it's. I'm sure there are parallels, but I don't know. It's just the scale and the scope with which we talk about.
Gavin
Kevin, Big Data centers here, they have something to say to me. So give me one second, okay? Just get one second.
Kevin
No, not a problem. And hey, by the way, if Big Data wants to sponsor the podcast, please let them know we will edit this out.
Gavin
Okay? They said. They said that they'll only sponsor the podcast if we stop talking about this right now. So are you ready to stop talking about this? Hey, you know what?
Kevin
Listen, infrasound might be killing you and your pets. Yes, but is it worth it? Did you see the package sorting Stream from Figure 03, Gavin?
Gavin
That's exactly right. What do you know? That's a perfect transition. Kevin.
Kevin
To vibrate me, Daddy.
Gavin
The 24 hour now package stream from figure 03. So figure 03 we have talked about here before is a humanoid robot company that comes out of America, it's an American company and I think it is live still right now. What you are watching is one of their robots sort through packages. And this may look very boring, but according to Figuro3, this is completely autonomous. What's happening. There's been some controversy which we can get into that. But it's autonomous and you're watching it flip packages so that the labels are I think down. Right. Because it's trying to get the packages right.
Kevin
It's going to go over a barcode scanner at some point.
Gavin
And this is a job that like humans have done. It is not a great human job. So it's not a terrible thing to replace this job. But this is the beginning stages of seeing what it looks like to run a robotic factory because not only did they do it for eight hours originally, but then they said speed it up, we're doing it for 24. These things don't get tired, they just do the same thing over and over again. And it is fascinating to watch a live stream of it happening.
Kevin
Yeah, Especially when like the robots as they get closer to like running out of battery, they will like step back and there's an interchange, a supposedly fully autonomous one where the next robot will sort of tap in and go um, I love that they've added name tags. I'm watching Frank right now go through packages. The, the best is, is hanging out with, with a live chat as they lose their minds over a box flip. Because the box flips are the best. Right. It's they, they'll do the kind of a two handed flip to get the label down. Sometimes they drop packages, sometimes they actually don't have the barcode in the right area. So it's not flawless. And watching chat pop off when that happens is just a bizarre, bizarre thing. But they tell the operation real quick. There were rumors that these things were actually being teleoperated because there's a clip where the robot. Yeah, that's it, that's it. It looks like the robot's reaching for its visor, which is one of those things that a teleoperator may do at the end of their shift. Some people were saying it's, it's from the training data. Others are saying it's clearly a lie. Brett Adcock, the CEO of Figure came out and said no, the robot's lifting its arm so it doesn't hit the metal chute that's to the side of it. And, and I don't know if they like I'm watching now, I'm watching the live stream as we're talking and it's several times has made that motion. I don't know if they told the robot to go ahead and do that a little bit more to prove the point. But yeah, like, but oh, we just got a box foot gab and we're 29, 000 packages in. We're seeing so many box.
Gavin
Fantastic. Fantastic. Anyway, so this is a really just a good example of like we are moving quickly into the robot world. There was a conversation earlier online about the idea of like all these manufacturing plants. Speaking of manufacturing for like car plants like that Ford stock just shot up because maybe there's a world that Ford as a, as a company that can manufacture things quickly and has lines to manufacture stuff, might be an AI stock too, which is another whole big AI stock bubble question which is a big thing brewing right now. But Kevin, speaking of AI robots, there is a very cool new robot from Unitree and Unitree has been building forever, but they have finally done it. They have built what is a walkable mecca that you can sit in in a cage. But not only do you sit in it when it's walking, it can transform into a four legged walker. And the funniest thing to me about this is so if you're not watching this on video, it is about probably like 12 to 13ft tall. It is very much like the first stage of like the big mech you would see in anime. But when it transforms, what it does is it kind of makes you go backwards. So you're sitting back and your head's looking up and then you hear it clomp. Maybe you can just. Maybe you can just play the clomps for people. Sure.
Kevin
Yeah.
Gavin
So they can just hear the clomping.
Kevin
Hey, royalty free music Rockheads, get ready.
Gavin
So that clumping you hear is actually the clumping of its feet going up and down as it tries to walk around. Anyway, this is just a crazy cool thing to see Unitry doing. And again, you may have seen the President of the United States is in China right now. There's been a lot of really interesting stories about like where Chinese robotics are. There's a great clip of Fox News. Is Brett Buyer asking for a sausage? Which we can't play here. He's asking for a sausage from a robot.
Kevin
Bummer.
Gavin
We can't play that here because we will get demonetized. But anyway, very cool robot news.
Kevin
You know what we can end with Gavin.
Gavin
What's that?
Kevin
A robot asking Brett Baer for a sausage.
Gavin
Oh, you have that?
Kevin
No, but I mean, we could see. Dance it.
Gavin
Okay, fair enough.
Kevin
I'm out of credits, but we'll see
Gavin
you all next week. Thanks, everybody. Bye. Bye.
Kevin
Bye. Good evening. Excuse me, sir. Do you have a sausage?
Episode Title: Google Is Cooking Again. The I/O Leaks Are Wild.
Date: May 15, 2026
Hosts: Kevin Pereira & Gavin Purcell
This lively episode dives into a wave of leaks and rumors ahead of Google's highly anticipated I/O developer event. Kevin and Gavin unpack the flurry of potential product announcements, including Google's new AI agent “Spark,” advancements in video models, the debut of a powerful new “Flash” AI model, and inventive ways AI is being brought into everyday computing—right down to the humble mouse cursor. They also explore AI’s growing presence in hardware, including the reimagined “Google Book,” AI-enabled voice models, a viral robot package-sorting livestream, factory floor automation, and broader societal trends, such as data center controversies.
Leaked Features & Big Expectations
Spark: Google’s Next-Gen AI Agent
Omni: Unified Video/Reasoning Model
Gemini 3.2 Flash: Speed + Cost Efficiency
Mouse Cursor Reinvention (DeepMind)
Timestamps:
AI Built Into the OS
Timestamps:
Conversational & Emotional AI Speech
Timestamps:
Privacy and Empowerment
Timestamps:
Societal & Regulatory Challenges
Timestamps:
Notable Quote:
Factory Floors, Livestreams, and Mechas
Figure 03 Package Sorting Livestream
Unitree’s Walkable Mecha
Timestamps:
In the run-up to Google I/O, the episode brims with intrigue about new models and features poised to reshape how we work, communicate, and interact with machines—across the cloud, on local hardware, and even at the OS or robot level. The hosts’ signature banter keeps things playful but insightful as they blend cutting-edge analysis with laughs, making this a must-listen for both AI insiders and the curious.