Summary9 min read

Moonshots with Peter Diamandis
EP #256: Google I/O 2026, Karpathy Joins Anthropic, and Cerebras’ $95B IPO
Date: May 21, 2026

Episode Overview

This blockbuster episode of Moonshots tracks the forefront of technology, diving deep into Google's major 2026 I/O announcements, the seismic shifts in AI talent with Andrej Karpathy joining Anthropic, and the record-shattering $95B IPO of Cerebras—a company reshaping the boundaries of AI hardware. Host Peter H. Diamandis, joined by tech industry luminaries Dave, Alex, and Salim, as well as special guest Andrew Feldman (CEO, Cerebras), explores the relentless pace of technological progress and its deep impact on society, business, and the future of humanity.

Key Discussion Points & Insights

1. Google I/O 2026 Recap and The AI Arms Race

The Unprecedented Scale of Google

Sundar Pichai’s Opening ([05:39])
- Google is now processing 3.2 quadrillion tokens per month, a 7x increase over last year.
- 13 Google products have over a billion users; 5 have more than 3 billion.
- AI Overviews exceed 2.5 billion monthly users; Gemini app at 900 million, doubling from last year.
- CapEx has exploded: ~$190B in 2026, up from $31B in 2022—a 6x increase.
- Quote: “In two years, we were processing 9.7 trillion tokens a month… now that number has jumped seven times to 3.2 quadrillion.” — Sundar Pichai ([05:39])
Panel Reaction ([07:21])
- Dave: “If you said five years ago, hey, Google’s going to 6x its capex and the stock will go up. Nobody in their right mind would have said that’s even possible. And there it is.”
- Alex: “It’s perhaps just slightly less remarkable than it might seem, given that Gemini is basically being swapped for Assistant… nice to see Gemini usage taking off. It’s nice...to have something other than a duopoly between OpenAI and Anthropic.” ([08:19])

Google’s Enduring Power & Internal Culture

The gang revisits Larry Page’s sci-fi whiteboard (space elevators, BCIs) and the trajectory towards exponential progress ([10:36]).
Alex: “We’re going to speedrun every science fiction movie ever made in the decade over the next 10 years. That’s the singularity.” ([11:36])
Google’s relentless self-disruption, now 'disrupting the disruptors' and redefining what an AI-native company looks like.

2. Gemini Models: Omni, Flash, and the Multimodal Future

Gemini Omni – Multimodal, Video-Centric Models

Demis Hassabis reveals Gemini Omni: Able to generate realistic videos, images, and simulations from multimodal prompts ([13:33]).
- Demo: Claymation-like explainer of protein folding rendered on command.
- Alex: “Google DeepMind...is the only frontier lab still chasing multimodality. OpenAI and Anthropic have deprioritized video. China’s labs are serious, but in the US, Google’s leading.” ([15:33])
- Dave: “Get in there and build stuff and play with it. Put yourself in a movie as a character, change the backgrounds… just new things every week.” ([15:30])

Gemini 3.5 Flash – Speed vs. Intelligence Tradeoff

New Default Model: 4x faster than predecessors, optimized for high-throughput and agentic coding ([19:14]).
- Alex: “I would call Gemini 3.5 Flash solidly mid. Not talking about throughput or cost, it doesn’t compare favorably with say GPT 5.5 High… It’s a fast model that gives consumers a better experience, but enterprise AI is still dominated by Anthropic and OpenAI.” ([20:08])
- Salim: “You’re seeing this kind of bifurcation now between premium cognition and ultra-cheap but very fast cognition.” ([24:53])

3. Trust and Authentication: SynthID & The Verification Age

SynthID Watermarking Expansion ([25:48]):
- Google has watermarked over 100B images, 60,000 years of audio.
- Now rolling out content credentialing—indicates origin and edits.
- Industry-wide adoption expanding: Nvidia, OpenAI, Kakao, 11 Labs signing on.
- Alex: “We’re going to get cryptographic chains of custody from reality capture to what is ultimately presented to the user... End-to-end authentication of realness, proof of reality, coming from the synthetic end.” ([26:47])
Panel Insight:
- Salim: “Abundance minus trust equals scarcity…We’re moving from the information age to the verification age, and trust is becoming infrastructure.” ([28:59])
- Dave: “Industry will self-regulate faster than Congress can legislate.”

4. Productivity Gains: Anti Gravity 2.0, Gemini Spark, and AI Search Agents

Anti Gravity 2.0

AI-First Code Development Environment ([33:22])
- “It’s a copycat, fast-follow of Cursor, moving to agent-first development. I don’t know anyone serious using Anti Gravity for primary dev work.” — Alex ([34:30])
- Dave: “It’s really obvious where the puck is going… in the future, you’re not even going to look at code.” ([38:10])

Gemini Spark – Always-On Life Agent

An ‘Operating System for your life’ ([40:23])
- Persistent, integrated agent that drafts emails, manages Google services, adapts to user needs.
- Alex: “It’s a lazy copycat [of OpenClaw]. I’d like to see the art of the possible. They delivered a minimum viable strategic response, not a next-gen leap.” ([43:40])
- Salim: “This is boring, but it’s safe. The deep integration will enable the next productivity jump—default behavior is massively powerful.” ([45:15], [47:10])
- Dave: “Nothing can slow down Google because of their massive distribution advantage.”

Intelligent Search and Persistent Agents

AI Search Box: Expands with user curiosity, persistent agent searches ([48:47]).
- Dave: “Google AdWords is now going to gently drift you toward a different question… The revenue power of that is astounding.” ([50:14])
- Alex: “After so many armchair commentators said Google would be disrupted… they self-disrupted and changed the rectangle.” ([51:06])
- “I give them full props for risking disruption of their own business.” — Salim ([53:19])

Universal Cart

Shopping Embedded Everywhere: Unified intelligent cart in Google services ([55:44]).
- Salim: “Tomorrow, we go from intent to agent to transaction. The next trillion-dollar company helps agents buy.”
- Dave: “At this point, the battle on the cloud and backend is so much bigger… this is just the next stage.” ([60:03])

5. NotebookLM, Branding, and The Integration Challenge

Gemini App reaches 900M users; NotebookLM has 1.5B notebooks and docs created ([60:49]).
- Alex: “Why is NotebookLM still branded as such? Unify everything under Gemini.” ([63:33])
- Salim: “You get the peanut butter problem… resource spread too thin, fragmentation.”
- “Apple is alone in relentlessly iterating on a small number of products.” ([66:33])

6. Ambient Computing: Audio Glasses and the Next HCI Frontier

Google Audio Glasses and Samsung XR ([67:17]):
- No display: voice-driven AI assistant persistent in-ear.
- Salim: “Moving to ambient, continuous human-computer interaction, but lack of visual is mid.” ([69:01])
- Alex: “Google should have owned smart glasses… now Meta is running away with this space.”
- Dave: “Society will have rifts—half will embrace AI agents always-on, half will be offended (privacy, always-recording).” ([70:54])
- Peter: “Presence in life is at risk—audio is a clever, more acceptable stopgap.”

7. Gemini for Science: Root Node Problems and Simulations

Demis Hassabis on Science ([74:49]):
- Gemini models accelerating research, scientific simulation, drug discovery.
- “When we look back at this time, I think we will realize we were standing in the foothills of the Singularity.” — Demis ([75:53])
- Alex: “This is DeepMind at its best… tackling root node problems like fusion, protein folding. I’m broadly super supportive.” ([76:08])

8. Gemini X Prize: Open Innovation at Scale

Launch of $2M Build with Gemini X Prize for impactful AI app development ([79:11]).
- Peter: “Teaching people to fish instead of giving them fish. The goal is mass impact.”
- Salim: “History shows open innovation seeds ecosystems… couldn’t be prouder.” ([81:15])
- Dave: “The biggest and highest-calling XPRIZE yet.”

9. Cerebras $95B IPO: Hardware Moonshot

Andrew Feldman (CEO) on the Cerebras Journey ([83:22] onward)

Wafer-Scale Compute: Built the world’s largest chip (dinner plate sized), orders-of-magnitude faster for AI inference.
- “Nobody ever built a chip this big. Not once in the 75-year history of the computer industry.” ([94:34])
- “We solved this problem and were way ahead of the market. Inference demand exploded only after models caught up.”
- Signed $20B+ deal with OpenAI, AWS deal follows, new era of demand ([98:06]).
Innovation vs. Imitation:
- “Money and the acquisition of talent isn’t enough. There’s something else—luck, integrity, relentless grit.” ([129:21])
Future of Compute:
- “Fabs are our pyramids. Building in the US is slow and hard, but critical for sovereignty. We need decades-long commitment.”
- Feldman on orbital data centers: Technical challenges are huge, but Cerebras chips may have unique advantages (“fault tolerant, big chip advantages in space” [123:53]).
Entrepreneurial Journey:
- “It’s a pressure test on your soul. For every entrepreneur, it’s the number of times you get kicked in the gut before lunch and it can still be a good day.” ([117:24])
- “Would you rather be doing anything else? No.” ([117:52])

10. Industry Moves & Power Shifts

Andrej Karpathy Joins Anthropic

Panel Analysis ([84:17]):
- Andrej’s journey: cofounder OpenAI, Tesla FSD, OpenAI, now Anthropic.
- “If you’re not in the Frontier Lab, you’re missing out, things are moving past you.” — Karpathy ([87:28])
- Andrew Feldman: “His point likely applies to hardware too... If you aren’t building for the top labs, your ideas/hardware will drift.”
- Salim: “It’s notable he had his pick, and chose Anthropic.”

Elon Lawsuit News ([89:43])

Jury rules definitively against Elon Musk in the OpenAI lawsuit; panel agrees it’s a waste of energy and a distraction.
- Feldman: “Billionaires in pissing matches interest me not at all… I want them building cool shit not fighting.”

Notable Quotes & Memorable Moments

“We’re going to speedrun every science fiction movie ever made in the decade over the next 10 years. That’s the singularity.” — Alex ([11:36])
“Abundance minus trust equals scarcity…We’re moving from the information age to the verification age.” — Salim ([28:59])
“Every major technology starts out scarce and expensive, then the learning curve kicks in…The cost collapses, and AI compute’s going down the same path.” — Salim ([125:54])
“Money and the acquisition of talent isn’t enough. There’s something else—luck, integrity, relentless grit.” — Andrew Feldman ([129:21])
“Standing in the foothills of the Singularity.” — Demis Hassabis ([75:53])
“Nobody ever built a chip this big. Not once in the 75-year history of the computer industry.” — Andrew Feldman ([94:34])
“If you’re not with the Foundation Labs, your judgment will drift… you’ll be left behind.” — Andrej Karpathy via podcast replay ([85:33])

Timestamps for Key Segments

| Segment | Timestamp | |------------------------------------------------|--------------| | Google I/O: Sundar’s opening | 05:39 | | Gemini Omni unveiling (multimodal, video) | 13:33 | | Synth ID & content authentication | 25:48 | | Anti Gravity 2.0 & agentic coding | 33:22 | | Gemini Spark & consumer agent integration | 40:23 | | Persistent AI Search Agent / Reinvented Search | 48:47 | | Universal Cart & e-commerce | 55:44 | | NotebookLM & app fragmentation | 60:49 | | Google Audio Glasses | 67:17 | | Gemini for Science | 74:49 | | Build with Gemini X Prize | 79:11 | | Cerebras IPO / Andrew Feldman guest | 83:22 | | Karpathy joins Anthropic | 84:17 | | Elon–OpenAI lawsuit resolution | 89:43 |

Conclusion

This episode offers a panoramic view of the state of 2026 tech: Google’s AI supremacy and reinvention, the intense race for AI hardware and compute, the looming impact of ambient AI agents everywhere, and the ever-shifting alliances in AI talent. The optimism for humanity’s future is palpable, grounded by sober reflection on business, ethics, and the sheer intensity of this exponential age.

Main Takeaway:
The unstoppable moonshot mindset is alive—a world where science fiction becomes science fact, trust and authenticity become infrastructure, and the combination of grit and vision still determine who shapes the next decade.

Loading summary

Transcript485 lines

[00:00]
A
If you said five years ago, hey, Google's going to 6x, its capex and the stock will go up. Nobody in their right mind would have said that's even possible.
[00:07]
B
Quadrillions, Billions. Hundreds of billions. Trillions. It gets numbing after a while. There was a lot of conversation that Google was cooked. Google was not going to make it. That their revenue engine was being massively disrupted. And here they are, you know, sort of disrupting the disruptors.
[00:24]
C
All of these numbers, I think were inevitable.
[00:27]
B
Andre Kaparthy joins Anthropic. He was the co founder of OpenAI. He left in 2017 to run full self driving for Elon. And he'll start a new initiative focused on using Claude to accelerate Claude's own pre training research. Cerebrus Record IPO closes up 68% market cap $95 billion Andrew Feldman, the CEO of Cerebrus.
[00:48]
A
It's like a lifetime achievement, like kind of like a Nobel Prize or an Olympic gold medal where you carry it for the rest of your life.
[00:55]
B
All right, I see Andrew Feldman has entered the room. Andrew, a pleasure to have you here.
[01:00]
D
Thank you for having me on your show. Appreciate it.
[01:04]
B
Now that's a moonshot.
[01:05]
A
Ladies and gentlemen,
[01:09]
B
everybody. Welcome to another episode of Moonshot. Today we have our extraordinary group of moonshot mates. DB2, our emperor of AI investing, Saleem, professor of all things exponential organizations, and our very own artificial super intelligence, our moonshot mate, awg. I'm Peter Diamandis, your host. Gentlemen, a pleasure to have you here.
[01:32]
C
And Peter, you're a birthday boy as well. Shall we sing Happy Birthday to you.
[01:38]
B
To you. Everybody's gonna be signing off the pod right here. Right here.
[01:44]
E
That's right. We are not singers. Stick to the verbal.
[01:47]
C
Well, one of us was. Well, my first career was in the New York City Opera Company.
[01:53]
B
Really? You just keep impressing all of us. Well, thank you, gentlemen, for your. For your.
[01:58]
E
We want to hear. I would like to hear an aria sometime.
[02:01]
C
All right, maybe I'll do an outro.
[02:03]
B
Okay. For sure.
[02:07]
E
So we had such a fun surprise birthday party for you.
[02:10]
B
Oh, my God, it was crazy. I have never been so surprised in my life. And honestly, you, Saleem, are the most giving individuals. So for. So Saleem was visiting with Lily and his son Milan. We had a birthday dinner for him Saturday night, but his birthday was Sunday. And he walks me along the beach to a surprise birthday party where there are 50 people. I walk in this room and I've never been more surprised in my life. I mean, I literally dropped to My knees in a level of surprise.
[02:42]
D
All right.
[02:43]
B
But it was your birthday, Saleem, and you were.
[02:46]
E
It was a perfect decoy. It was a great decoy. It was awesome.
[02:50]
B
Oh, my God.
[02:51]
E
Very special. Very special. And Dave, you sent a great message over. It was great. It was wonderful.
[02:57]
B
Thank you all for that. So welcome to Moonshots. Our job here is get you pumped about the future. No politics, no doomerism. Just the science and technology driving us along the singularity. Today we have a special episode, our annual recap of Google's mega event called Google I O. We'll cover news with Andrej Karpathy joining Anthropic and Elon's defeat in the trial against OpenAI. And finally, we'll be joined by. By Andrew Feldman, the CEO of Cerebrus after an epic ipo. So, Dave, it was a blast to be with you at Google I O. Here's an image of us along with Tyler Donahue. What do you think of it?
[03:39]
A
It's great to be where it all started. The vibe on AI is global, but within this epicenter where everything began, it's off the charts. And the first hour, just the AM of stuff to talk about in one hour compared to a year ago, compared to two years ago. I mean, we're taking it all for granted, but it's just crazy, you know, just. Just the number of new Google brands, of new AI products is. It's pretty baffling. So we'll go through it all. You know, we've got it all beautifully cut up today, so you can analyze
[04:11]
B
every piece of it and. And Alex, you were watching online, weren't you?
[04:16]
C
I was. I was watching in real time, dissecting it all for my newsletter, the Innermost Loop. I thought there were some high points, some low points, some mid points, if and eager to dive in, and some
[04:27]
B
probably some no points. And of course, where's Waldo today? Salim, you know you're a probability function on planet earth. So where are you today, Saleem?
[04:39]
E
I just landed in Brazil and I've got a bunch of meetings and presentations here. So I flew from L. A with you, Peter. Directly here, of course.
[04:49]
B
Of course you are. Let it not be said that you're not the traveling dude, Peter.
[04:55]
A
I'm at the Cerebras headquarters today too. The vibe here, you know, third biggest tech IPO in history, have. The vibe here is just. I couldn't resist the opportunity to feel it, you know, right after an ipo, a company, it's a once in a lifetime kind of thing. For most People. So the vibe is just. Just epic.
[05:13]
B
Yeah. A record ipo. Until the next record ipo. Until the next record ipo.
[05:18]
A
I know it's. It's going to be. It's going to be 10x this year. So you got to savor it while, you know, while you have the record.
[05:25]
B
It's insane. All right, let's jump into all things Google. I o. I want to kick it off with the opening summary by Sundar. Quite the year. Let's listen to Sundar and then we'll. We'll continue on and dive.
[05:39]
D
In two years, we were processing 9.7 trillion tokens a month. Across the surface is a huge number. Last year at IO, that grew to about 480 trillion tokens. Tokens. And fast forward to today. That number has jumped seven times to 3.2 quadrillion tokens per month. Over 8.5 million of you are now building new apps and experiences with our models monthly. And our model APIs are now processing around 19 billion tokens per minute. We are, of course, also seeing incredible demand across our products. We now have 13 products with over a billion users each. Five of those have more than 3 billion users. AI overviews now has over 2.5 billion monthly users. And AI mode has been a revelation. Our biggest upgrade to search ever. People love it. In just a year, it's already surpassed 1 billion monthly users. Last year at I O, the Gemini app had 400 million monthly active users. Today, we have surpassed 900 million, more than doubling in a year. And Today, more than 50 billion images have been generated with our nano banana models. In 2022, we were spending $31 billion annually in capex. This year, we expect that number to be about six times that, approximately 180 to $190 billion. Instead, we can now seamlessly distribute training across multiple sites, scaling across more than 1 million TPUs globally. This gives us the ability to create the largest training cluster in the world. Both chips are more energy efficient, delivering up to two times better performance per watt.
[07:21]
B
Wow. Quadrillions billions. Hundreds of billions, trillions. It gets numbing after a while.
[07:28]
A
It does. But you got to. You got to step back and like 6x ing your capex. If you said five years ago, hey, Google's going to 6x its capex and the stock will go up. Nobody in their right mind would have said that's even possible. And there it is. I mean, those TPUs are. I mean, that's the linchpin. And they're 2x the power efficiency, but they're in the hunt now competing vertically from the transistor all the way through the user experience. And no one else can say that. So, yeah, there's just a lot. Also, if you look at the array of logos that have over a billion users, I mean, you have to go back, rewind the video, go back and look at that again. It's just a litany of Google things now. Yeah, that's relentless.
[08:11]
E
The Gemini having 900 million users is pretty incredible because that's pretty close to ChatGPT. I found that very striking.
[08:20]
B
It is. Alex, what's your take on these numbers here?
[08:23]
C
Predictable thoughts?
[08:24]
B
Easy.
[08:25]
C
Well, inevitable in some sense. I'm reminded about 20 years ago I had a conversation with Larry Page during his interregnum when he wasn't CEO and I was reminded he was asking me for advice on how to get Google interested in spending $100 million to work on AI. If you can believe that. It's unconscionable by today's standards that Google wasn't interested 20 years ago in AI and now here they are, it's become the central focus of the company. Full stack from chips and data centers all the way through Applic, but here we go. All of these numbers, I think, were inevitable. It's been widely remarked that if Google hadn't leaned in at every layer of the stack to trying to own or at least lead an AI, they would have been toast. The original model of Google based on search and ads would have been cooked. So what do you do? You lean into it. Gemini, 900 million plus users. I think that's perhaps just slightly less remarkable than it might seem given that Gemini is basically being swapped for Assistant. Assistant already had quite a bit of traction, but nonetheless, nice to see that Gemini usage is taking off. It's nice, I think, on balance, to have something other than a duopoly between OpenAI and Anthropic, which I think if Google doesn't aggressively lean into consumer and enterprise, Gemini adoption, I think that's the default outcome right now.
[09:53]
B
And Alex, you know, and I do as well, because again, I've known Larry since 2003, 2004. AI was always his focus. It was. He wanted to build an AI company from the very beginning and had a
[10:08]
C
lot of difficulty for a while. I mean, it wasn't obvious to everyone else within Google for the first part of its life that AI was where this was all going.
[10:17]
B
And of course, Eric, the adult supervision in the room came in and built the, you know, the revenue engine. And of course Google's only able to do what it can do today because of the massive revenue. The other thing that people don't realize is that part of Larry's vision early on was also bci. He wanted to connect the brain to
[10:37]
C
AI and space stations. Peter, do you remember that whiteboard that Google used to maintain with their long term tech tree? And they were going to have a Google space station and bcis and all of these things. That's all happening now. Finally, like three decades later, the Google whiteboard vision is finally playing out.
[10:57]
A
The whiteboard actually had the tethered satellite on it. They were going to make an elevator into space.
[11:01]
C
A space elevator.
[11:03]
A
Space elevator, yeah. And they needed a carbon nanotube wire for things to go up on. They manufactured about a meter of it for a lot of money. 20,000 mile cable.
[11:14]
C
We're finally catching up with the whiteboard.
[11:16]
B
I was with Jack Hickory at Sandboxaq yesterday talking about what the large quantitative models the LQMs are going to be able to do. And one of his objectives are new materials that have the tensile strength to give you a space elevator. So, you know, all ofagain. What do we say? What did you say, Alex? We're going to speedrun every science fiction movie ever made in the decade over
[11:37]
C
the next 10 years. Like every sci fi trope everywhere, all at once over the next 10 years. That's the singularity.
[11:43]
B
Just a big shout out and congrats to Sundar and Sergey and Josh and the team there. I mean, hitting their numbers again. You have to remember that a year and a half ago there was a lot of conversation that Google was cooked. Google was not going to make it, that their revenue engine was being massively disrupted. And here they are, you know, sort of disrupting the disruptors.
[12:07]
E
I mean, this is an AI native operating system company because they're now constantly continuous sensing execution, adaptation, and they've kind of hitting an inner loop there.
[12:18]
C
This was the company that birthed the transformer. This was the company that my friend John Smart, I think, perhaps insufficiently famously pointed out that if you looked at the number of words or tokens in an average Google query over a period of 15 years, and this is before everything hit its inflection point in 2017 or so with the transformer. But if you looked at the number of words per Google query, it was following an exponential curve that was was inevitably going to end up in people having full conversations with AI. So Google had the exponential trajectory of user interaction. They had the transformer, they had the compute. It was just a matter of Putting all of the institutional pieces together and it seems like they're finally coming together.
[13:01]
B
Yeah. Amazing. Moving us along after that epic intro by Sundar, the company Google is launching an entirely new family of AI models called Gemini Omni. It's capable of generating the video clips from prompts that include a variety of inputs, including text, photos, videos and audio. You know, Google says Omni will be the create anything from any input product. So let's take a look at the video here being introduced by Demis. And Demis was a rock star on stage. It was so much fun to see him there.
[13:34]
C
I'm excited to announce Gemini Omni models like Veo, Nanobanana and Genie are able to create extremely realistic videos, images and interactive simulations. It's a step change in simulating things like kinetic energy and gravity. Gemini's world knowledge and reasoning really shine in Omni. It can translate complex ideas into highly accurate videos. So for example, you can give it a simple prompt like make a claymation explainer of protein folding. And get this.
[14:05]
D
Proteins start as chains of amino acids. They fold into patterns like the alpha helix and flat sections called beta sheets, forming a perfect three dimensional shape.
[14:15]
C
Omni gives you a more natural way
[14:16]
D
to edit video with conversational language.
[14:20]
E
Wow.
[14:20]
C
What's really cool is you can give it your own videos, for example, this
[14:24]
D
selfie, and change reality in a really fun way.
[14:29]
B
I hope people are watching this on YouTube because the video clips are extraordinary. Dave, you were going to say.
[14:34]
A
Yeah, the crowd reaction, actually Demis had by far the best visuals. And the crowd reaction on first of all, the crowd is huge. Hugely into medicine and science as the use case that everybody cares about. And Demis is the spokesperson for that. But then his visuals on the video stuff were the best too. And you should go watch the original YouTube recording and see the full video of him morphing himself through different places and outfits and everything in real time. It's incredible. And I think in the worry about AI that's going on globally, we lose the fantasy and the cool factor that you could look forward to this your whole life and now you can suddenly play with. And I really encourage everybody, just get in there and build some stuff and play with it. Put yourself in a movie as a character, change the backgrounds. Once you've experienced that, you really get a sense of the amazing things that are possible. You know, starting now and for the next few years, just new things every week.
[15:30]
B
Alex, reality is cooked, isn't it?
[15:33]
C
I think reality is getting enhanced for sure. But I also want to applaud demis and Google DeepMind for being the only arguably remaining American frontier lab to still be chasing multimodality.
[15:47]
B
Even though they're in the uk, right?
[15:50]
C
Well, they're really American. I mean they may have a lot of personnel in uk, but it's an American frontier lab. They're the only frontier lab still chasing multimodality. So OpenAI Katsura and arguably De emphasized video Anthropic has never arguably been chasing multimodality. They've been squarely focused on cogen. That just leaves Google with the only credible frontier American video models, since video is arguably the hardest modality combined with consumer demand. And then you have all of the Chinese frontier labs. China is taking video as a modality far more seriously. My sense from some of GDM's earliest announcements with multimodality is they have a grand vision of modality scaling. Even though video presents as the most consumer friendly, the most impressive demo. Actually at the back end they're probably treating biological sequences like DNA or protein sequences as another modality. They probably have dozens of other modalities that they're trying to fold into this omnimodal model looking for modality scaling in a way that the other American frontier labs just aren't. So it's a bet, it's at this point almost an idiosyncratic bet that they're going to get to some form of superintelligence that's distinguishable because it handles all these different modalities. Text, audio, video, maybe biological sequence data, maybe other crazier modalities, all in some meta uniform way that the other labs aren't achieving. But it's a bet nonetheless.
[17:24]
B
Interesting, Salim. I can't imagine a better kind of technology for education and teaching people. I mean, I so wish this existed when I was doing organic chemistry and studying medicine.
[17:36]
E
I mean this should clear away so much of the cruft of trying to figure out how to present things and how different ways of showing things, biological models, et cetera. I mean this could all become real time and full 3D. I mean it's incredible to see what's going to come from this. Very exciting.
[17:56]
A
The real time part of it is huge too because in a call like this or a podcast like this, you can create real time graphics and visuals to fit the dialogue just purely with your voice. They can do it at Google. We can't do it because we don't have the token speed to keep up, so we have to wait a minute. So if you said something really cool right now, slim hey, Brazil, let me tell you about data center explosions in Brazil. The graphic that backs that up would take a minute to come back. And so you can't do it in real time, but they can. And it's just purely who has access to the computer.
[18:29]
B
Yeah, but it's going to be coming where AI is going to be, just creating a soundtrack and a visual track for your life. Always present when you want. Amazing. Let's dive into their new Gemini 3.5 flash model. So they just launched Gemini 3.5 flash. It's the new default for Gemini app in AI search mode. And as you're about to hear, compared to 3.1 Pro, it's better across all the benchmarks. And importantly, Google says this new model is significantly faster in a league of its own in terms of intelligence versus output speed. It's better handling agentic tasks, offering improved agentic coding, richer and more interactive graphics. Let's take a look.
[19:15]
D
Today I'm excited to introduce Gemini 3.5 Flash, our first in a series of models. When compared to 3.1 Pro, Flash is better across the board, almost all benchmarks. It's made huge progress in coding and look at that extraordinary jump in GDP val, a benchmark that captures many real world economically valuable tasks. Second, 3.5 Flash is a very capable model at the frontier and comparable to the best models, but much, much faster. Which is why when you look at the intelligence versus output speed, it's in a whole league of its own. In the top right quadrant, when looking at output tokens per second, it's four times faster than other frontier models and it's incredible delight to use.
[20:06]
B
Alex, impressive. What do you think?
[20:09]
C
Well, remember when I opened saying there were highlights, low lights, and then to use the colloquialism, mid, I would call Gemini 3.5 flash solidly mid. If you look at its, if you look at its capabilities and others have pointed this out as well, just from a raw capability standpoint, not talking about throughput or cost, it doesn't compare favorably with say GPT 5.5 high or x high or Pro. On the other hand, this is a Flash series model, so it's not Pro yet. Sundar sort of infamously at this point has said Gemini 3.5 Pro, that's coming out in another month. There were groans in the audience at the time, so this isn't intended to be top of range. I think the strategy, if I were to play Kremlin Ologist here, the strategy is, I think Google is sort of solidly tier 1 1/2 at this point in the race to RAW frontier capabilities, 3.5 flash represents perhaps pushing the optimal frontier in terms of throughput versus performance, that optimal frontier. But I think it's also very telling that Sundar is highlighting throughput versus performance instead of like number of tokens on the x axis, input tokens versus performance, or rather output tokens on the X axis versus performance on the Y axis or some other metric. He picked the most flattering possible metric. And if you actually.
[21:37]
B
And they all do,
[21:40]
C
everyone picks flattering metrics. But some flattering metrics are also more sort of truthful than others in some global sense. And, and if you look at the metrics that Google's been highlighting, a note, it's very telling. 3.5 Flash is being compared primarily with 3.1 Pro, less with Frontier other models. But secondly, the areas, the benchmarks where it's really excelling are benchmarks where tool use in particular really aggressive tool use is needed. So if I had to squint at this, I would say the emphasis in Google wasn't necessarily beating the frontier with 3.5 flash. It was probably, I would say, some combination of throughput maxing and tool use maxing. Not pushing the boundaries of the frontier, but solid release nonetheless. It's nice that Google's still in the game.
[22:33]
B
And Dave, I'm imagining we've talked about the labs pulling their punches. I imagine that releasing there'll be the next version of GPT will come out and then of course Pro will come out right after that.
[22:48]
A
Well, the scuttlebutt here in Silicon Valley is that it's a two horse race between OpenAI and Anthropic for the best AI in the world. And the talent is flooding into those two buildings in sf and nothing in that demo or in the vibe on campus at Google contradicts that. So here you pointed out earlier that Alex pointed out that Google is the one remaining horse in the race to the consumer. And this is a very, very fast model that gives the consumer a much better experience. But the other labs have already pivoted to the enterprise and said look, we're giving up on that. We're going totally after these massive enterprise budgets and if you try and build something sophisticated with AI, you want the smartest AI that solves the problem and you're not going to back off to a faster model that's not quite as intelligent. You just can't. And so they're going full bore after self improvement at the other labs. The other difference from a year ago is Google's unstoppable war chest. 180 billion in CAPEX per year and rising. But the other guys in the interim raised OpenAI, raised 120 billion in cash and they'll burn that pretty quickly. And Anthropic is on a similar trajectory now. The war chests are actually not as different as they were a year ago. I think that's the only disappointing thing in the whole show, is the best of the best. Gemini is not up there with Mythos, as far as we know. Now, I will say that Google doesn't. They do soft sell. They don't announce what's coming in four months and promote and trumpet it because they don't need to if something really, really big is cooking and coming soon. They didn't roll it out, but they don't need to roll it out. They'll wait until it's proven.
[24:38]
B
I love the naming nomenclature here. Of course, we got GPT models 5.4, 5.5, 5.6, and Gemini jumps from 3.1 to 3.5. It's fascinating. Salim, what do you make of this?
[24:54]
E
I thought one thing that's clear is you're seeing this kind of bifurcation now between premium cognition and ultra cheap but very fast cognition. And I think that's going to continue. I think Alex makes a great point about throughput. This will allow a lot of throughput. Right. And there's this continuous march for marginal intelligence. Cost trends towards zero.
[25:17]
B
Yeah, Here was the next segment that I pulled out and again, I want to just a shout out to Gianluca who clipped all these beautifully and provided them in record time for us for the show. But there was, yeah, a conversation about Synth ID and content credentialing. You know, really important, especially as we start to encroach on reality. How do you know if something is or is not AI generated? It's going to become more and more important than ever before. Let's take a look here at Sundar talking about Synth ID.
[25:49]
D
Since launch, Synth ID has now watermarked over 100 billion images and videos along with 60,000 years of audio assets. We are now going a step further and adding content credentials verification across products. This will show you if the origin of the content was AI or a camera and if it's been edited with generative AI tools. In this example, Gemini can tell this photo was captured with the Pixel camera and then edited with Google Photos. Of course, this only works at scale. If more partners decide to watermark their own AI generated content. Nvidia signed on to Synth ID last year and today I'm thrilled to announce that OpenAI, Kakao and 11 Labs are adopting Synth ID 2.
[26:39]
B
I love the fact that we're getting to standards and everybody's picking the best. Alex, how important is this?
[26:47]
C
You know, the irony is so many people were hand wringing over the past few years that we won't know what's real and what isn't. And my response was always, we're going to get cryptographic, eventually cryptographic chains of custody from reality capture to what is ultimately presented to the user. In the same sense that when you use a browser you can maybe see a little lock icon to indicate end to end SSL encryption. We're going to get the same thing for reality. And I view Synth ID, which by the way was also just adopted by OpenAI. It was created by Google, now also adopted by OpenAI in the same breath. I think it's sort of ironic that we're going to get, it seems, end to end authentication of realness, proof of reality, if you will, not coming from the camera end, not coming from the reality capture end, coming from the synthetic end, coming from all vendors that want to claim credit in some sense that they were the ones who generated the reality. And then the cameras, the camera makers and all of the recording device manufacturers, they're going to be downstream and the ones who adopt the same protocol. But either way we're getting our end to end proof of reality one way or another.
[28:03]
A
I think there's a much broader story. I'd love Salim to get your thoughts on the bigger, bigger societal implications because a big topic at Stanford last night, you know, with Erik Brynjolfsson and his entire team there. The rate that AI is innovating can't be kept up with by Congress and there's going to be no regulation of any value coming out of Washington. So the industry is starting to self regulate and that's the only AI can keep up with AI. And so we may look back on this moment as one of the first moves by the self regulation community where okay, now we're going to start watermarking images. Hey everybody in the community, please adopt our standard for watermarking images. And then there'll be something else a week later, something else a week later, something else a week later. And that'll become the way that we govern ourselves in the future. Much more so than any law coming out of Washington, purely because the pace can't keep up. Salim, you think that's basically this is the first move in that direction.
[29:00]
E
I think that's exactly right. When you have intelligence becoming abundant, then the scarcity goes towards trust and scarcity creates value. So we may end up at a point where authenticity is more valuable than creativity. And that line between, between something being created and knowing how it was created etc is now merging because of the systems that are being created now. And I think, Dave, the point you make is really, really important. Once you have that trust layer right now, you can scale. I go back to Jerry Mikulski, my community member, who said scarcity equals abundance minus trust. And so if you can solve for trust, you solve for abundance. And one of the biggest challenges today, I thought this was really a big deal because we're moving from the information age to the verification age, and trust is becoming infrastructure. And I think that's a very powerful, valuable pillar for the world going forward.
[29:58]
C
Playing arithmetic, does that mean that abundance equals scarcity plus trust?
[30:04]
E
It does, it does. Scarcity plus trust gets you to abundance.
[30:08]
C
How about that?
[30:11]
B
All right, thank you for. Thank you for.
[30:13]
E
That's like grade five math. Even I can do that. All right, so, but, you know, because the cost of generating content is collapsing, right? Then the, the value shifts from signal filtering authenticity. We, we saw this with photography, where the big problem in photography is how do I take the best photograph? Because each dollar, each click costs you a dollar. And you had a bunch of business models crop up around like the selling expensive cameras or offering courses on photography and, and publishing books on composition. Then we moved to digital photography. The cost of creating a photograph went to zero. And now the big problem everybody has in photography is I have six copies of my photographs on seven different online services, and you can't find anything. And the value then comes in that filtering system, right? And
[31:03]
A
Salim, you know, we're moving into this intentional world that we design and everyone's like, what will happen next? What will happen next? Well, whatever we design is what's going to happen next. But Dario and Demis are two probably dominant architects in the future of how we live. And it's just great to hear both guys go back and forth. But if you do a raw word count from Dario, Dario Amadeus, CEO of Anthropic, and you go back, the words were all about transformer architecture, speeds, intelligence, benchmarks. And then they transitioned to UBI Ethics, and now they're talking about the way the world should be governed going forward and writing papers on it. And so if you just track the word count, it's also on this exponential change rate. Same with Demis. Demis has to Be a little more cautious because technically he's an employee of Google, even though he acts very independently. But those are the two guys just very much determining the future of all humanity. Right now, watermarking is just like move one, act one of the whole future
[32:06]
E
way we live and look at what's happening, right? Our trust in legacy institutions is collapsing and at the same time AI is building up the capability and the infrastructure and the foundation for delivering trust. So hopefully if that happens elegantly, we'll have an elegant shift from scarcity to abundance, rather than a messy one.
[32:29]
B
Everybody, you may not know this, but I've done an incredible research team and every week myself, my research team study the meta trends that are impacting the world. Topics like computation, sensors, networks, AI, robotics, 3D printing, synthetic biology. And these metatrend reports I put out once a week enable you to see the future 10 years ahead of anybody else. If you'd like to get access to the Metatrends newsletter every week, go to diamandis.cometatrends that's diamandis.commetatrends the next product that they dove into in a central part of Google's plans is Anti Gravity. They released Anti Gravity 2.0, a standalone desktop app built to orchestrate multiple agents to execute tasks in parallel. Let's take a listen and then Alex, I'm coming to you for your evaluation. Mid tier, high tier, no tier. Let's go on.
[33:22]
D
At the core is Anti Gravity 2.0, a new standalone desktop application that delivers fully on that original glimpse of a truly agent optimized experience. The new Anti Gravity is unabashedly agent
[33:37]
C
first, focusing on the core agent conversations, agent produced artifacts and multi agent orchestration.
[33:44]
D
Like I said unabashedly, agent first. As Sundar mentioned, this is the exact experience teams here at Google have been using to drive massive value. Let's take this live and actually show
[33:55]
C
this operating system in action.
[33:56]
A
Try running Doom right now.
[33:59]
D
It just doesn't work. Turns out that the OS is currently
[34:02]
C
missing some necessary video and keyboard drivers.
[34:05]
D
So let's just try and fix it
[34:06]
C
in the new Anti Gravity. I have a prompt prepared, I'm going
[34:10]
D
to paste it in.
[34:13]
C
Anti Gravity ended up doing a whole
[34:14]
A
host of research, ended up writing over
[34:17]
C
100 lines of code, and then finally built the operating system.
[34:20]
A
Let's take a peek and see if it works.
[34:22]
D
Amazing.
[34:26]
B
So first off, Alex, what is anti gravity 2.0 and what are you thinking about it?
[34:31]
C
Yeah, so let's remember where antigravity came from. Do you remember Google's acquisition of Windsurf during that debacle. So Anti Gravity is basically Windsurf rebranded from the Windsurf team that was Hacwa hired by Google. And then antigravity 1.0 versus 2.0. I view you were asking earlier, Peter, is this high, mid, low? This is sort of mid in my mind if you look at what Cursor has been doing. By contrast, Cursor was much more aggressively leaning from their old interface, which was sort of a reskinned Visual Studio code code centric editor, from their 1.0 oriented user interface to their more recent interface, which is Agent First. I view this almost as like a copycat. Fast follow or slow follow or somewhere in between from the Windsurf team within gdm, basically following the same metaphor of saying, no, no longer about direct code editing access. Now the primary metaphor is orchestrating fleets of code agents that are doing all of the hard work. So I would say Google's almost hamstringing themselves a little bit by announcing this now as part of I O versus, say, in a more timely Fast follow or even lead. When Cursor was doing this months ago, I've used Anti Gravity quite a bit. Certainly was using it even more when Google first announced it after the Windsurf acquisition. And I would say not super impressive. It was very buggy, I think. 2.0. I haven't had a chance to use 2.0 yet, but hopefully it's a good deal stronger. But really, I don't know anyone who's doing their primary development work with Anti Gravity at this point. The development is happening either with Claude code or with Codex or maybe with Cursor. Anti Gravity don't know anyone who's using it.
[36:26]
B
You know, everybody's trying to.
[36:27]
A
Except every engineer at Google.
[36:28]
C
Leapfrog except for Google, maybe.
[36:30]
B
So, Dave, you were trying to get.
[36:31]
C
Except actually even that's not true though. So. I mean, it's been publicly reported that within Google DeepMind, they're all desperate to get Claude code access for everything.
[36:40]
B
Yeah, we talked about that a couple of pods ago. Dave, you were playing with this actually during the. During Google I O yesterday.
[36:46]
A
Yeah, Tyler and I both installed it in real time as they were rolling it out, which maybe not the smartest move in the world because there's 16,000 people behind us in the crowd, probably all trying to do the same thing. So that was not a great first experience, but I don't blame Google for that. But this morning it worked fine and installed great. And I completely agree with Alex's Assessment. It looks almost identical to the new Cursor Agent First Windows. I almost can't tell where I am. Am I m cursor or am I in Anti Gravity? What they did do, which is a little more extreme than cursor, is it completely replaces anti gravity 1.0. You can't even see the code anymore and you have to go and launch the old thing if you want to actually edit code. So Cursor didn't go quite that far, but it's really obvious where the puck is going. If you want to build things in the future, you're not even going to look at code. You're going to describe what you want and you're going to debug at this much higher level of evals and functional comparisons. And I don't like where that button is. Move it. I think in the future nobody's going to want to go back to autocomplete code editor view. So they leapt ahead and said, we're just going to eliminate that entirely. And if you really want to hack, we'll give you a way to get back to it. But we're going where the puck is going and not where it was. But it really is exactly catch up. Like Alex said, everybody's got the same,
[38:11]
B
you know, it's Codex and it's Claude code and Anti Gravity and they're just going to be leapfrogging each other. I'm just curious if there's going to be some new sort of breakout approach to this that's going to materialize. Alex, do you think there's anything in the future?
[38:28]
C
Well, code is clearly going away as a human endeavor. It's all being abstracted away by code agents that handle all code. And humans aren't maybe in the near term future trustworthy enough to even be allowed to write their own code. So I think that's one obvious arrow of time in this space. I think recursive self improvement is another arrow of time. So not even old generation models are trustworthy enough. Maybe older models are trustworthy enough to rewrite themselves and generate newer, better models, but code's going away. I think that that's the obvious trend here.
[39:00]
A
Well, I think the. Also, Peter, to answer your question on the next paradigm, we've only had this paradigm for a couple months, so let's let it settle in for a little. But no, clearly the next paradigm is exactly the Star Trek holodeck, which Alex has been saying for a while. So right now you're talking to it. It's building things for you. It's incredible, but it's not natively graphical and visual and you're not moving things with your hands. So if you say, I want to move that button, I want to change this, I want to connect this to my email, you're not actually seeing the button move in real time, it's regenerating and then you see a new rendering. And in the future it'll be a real time graphical experience that's interacting in your comfortable physical space, kind of native human environment. And that next iteration is certainly within this calendar year.
[39:51]
B
All right, next up is Gemini Spark. It's Google's take on Open Claw. I think is the most obvious thing to say. It's a new always on AI agent that can write emails, create study guides, keep an eye out for, you know, financial fees that you're being charged. It's Google's we have OpenClaw at home moment. It's powered by Gemini 3.5 Flash and it offers you a 24. 7 operation. Let's take a listen to the conversation
[40:24]
D
about Gemini Spark introducing Gemini Spark taking action on your behalf and under your direction. It runs on dedicated virtual machines on Google cloud and it's 24.7A task right off the bat. This is a pretty straightforward example, but it's so useful. Help me draft an email to the team. Compile everything about our recent Gemini live launches and wins from the last week. And what's amazing here is Spark will go through step by step, look at all these steps all the time it saves you going through and again work across the various skills and apps that you have. And what's really amazing is it'll break it down and also be able to generate files for you. So the first one here, this is a live RSVP tracker, right in Google sheets you can see that it shows who's confirmed and who hasn't. What's amazing about this is it'll actually update because it's connected to Gmail. So when L. Thompson row 8 RSVPs, it'll update, which is pretty amazing.
[41:29]
B
I mean, one of the things I find fascinating is the integration across all of the Google products is very powerful. Right? There's a point at which there's such a cost for not being inside the Google ecosystem that everybody defaults to it.
[41:46]
A
Isn't that this is the most ironic thing I've ever heard because. Because people who are younger don't remember that Google only exists because the FTC stopped Microsoft from killing it. So Microsoft had just killed Netscape, taken total control of the browser and Integrated it with the operating system and made it impossible to do anything on the Internet unless you went through Microsoft. And that triggered the ftc. Mike Hirschland came in, the whole lawsuit stopped Microsoft cold in its tracks and they paid a $1 fine. I don't know if everyone remembers that hilarious outcome, but they had to unbundle. And that opened the door for Google to come into existence. Microsoft pseudo competed with Bing, but they were prevented from competing aggressively and tying it back to the operating system. So here we are all these years later and Google is coming out with this series of kind of exact copy of, cursor, exact copy of, exact copy of, exact copy of. But it's perfectly integrated with these other. You saw in the other slide what a dozen Google products that have over a billion users. A billion users out of this world population is a massive installed base. So if you want openclaw, yeah, you can be over there. But if you want openclaw equivalent that works natively with Google Docs, Gmail, everything else, Android, everything else, your Google Pixel camera, then you have to use this. It's exactly history replaying itself as so ironic because they were so anti Microsoft back then. The whole don't be evil motto was a direct attack on Microsoft, implying they were the big guys that were evil. So here we are years later. I'm not saying Google's evil in any way. I'm saying they're tying as their competitive advantage in the exact same way that Microsoft used to.
[43:38]
B
Alex, what do you think about this compared to OpenCloud?
[43:40]
C
I think it's a lazy copycat product. I think it's obviously Google trying to take advantage of the resources that they have. So note it's hosted in a GCP vm, not necessarily pushed all the way to the edge, although they have aspirations psyched to Gemini and Android Halo for that. But if you're Google and you see openclaw and you see Jensen out there saying openclaw is the next big big chatgpt. Really? What's the smallest, what's the minimum viable response that you could take? It's okay, we're going to host GCP VMs with Gemini Flash that integrate together all of our products that run headlessly. That's the sort of minimum viable strategic response. What I would have liked to see from Google, DeepMind here was the maximum response. Show us the art of the possible. Show us what a next generation openclaw or Hermes competitor actually looks like. Create the benchmarks, show us next generation capabilities. And they didn't deliver that.
[44:41]
B
Here well, but Alex, I mean just to be fair, they're delivering on a lot. It's not just one piece. Right. It's a lot that is being deployed on Google I o day. But having said that, I still love scpi, which is an open claw on top of my Mac studio. I love it because it's got a personality versus being sort of a generic agent that's ever present. I don't know if you can do that with Gemini Spark, but I think the personality side of these are critically important. Salim, any thoughts for you?
[45:15]
E
Two thoughts. One is, I agree with Alex, they really could have gone for a little bit more bite here. But on the other hand, when you can make agents generally available to the average Google user, the the there'll be hundreds of millions more people training up agents and I think that's generally just good. OpenClost has lots of room to be experimental, power user oriented, very opinionated, doing the weird things like the NAT camera stuff. But I think this is a very solid entry into that world to give people a taste of what an ancient world could look like. But it is boring.
[45:59]
C
It's safe. It's playing it safe. Like is it, is it a safe entry? Yes, it's a safe entry. Will a lot of people maybe use this to clean up their Gmail inbox? Yeah, probably. But it's not pushing the frontier, which is really what I would have loved to have seen here.
[46:13]
B
Yeah, I think that's your recurring theme on a lot of these. Alex, is that true?
[46:17]
C
I think, I mean look as accelerationist. Yes. I'd love to see Frontier Labs pushing the frontier here. And to the extent that this is an avatar of Google DeepMind and not just Google Corporate, I would love to see more frontier coming out of the frontier.
[46:31]
B
Dave, you want to close this out this well.
[46:34]
A
Two quick.
[46:34]
E
Two quick.
[46:35]
B
Go ahead, Slim.
[46:36]
E
Sorry, yeah, two quick thoughts. Like there's something very powerful happening here because this is giving everybody an operating system for their lives because of the deep integration with all the other Google stuff. So I think the next productivity jump is going to come from persistence and this will create a massive enabler across the board. And it's going to go back to the earlier comment. I just want to double down on that, which is trained a lot of people on how to build agents and run agents and I think that's going to then enable another class of things to come forward from that.
[47:10]
A
Yeah. So there's no doubt that this is all fast follower. Exactly the way you're characterizing it. On the other Hand. You know, Peter, you love your scpi, I love my agents that I set up too. But when you talk to somebody on the street and you say, hey, have you set up an openclaw or a Hermes? Overwhelmingly across the world people say, no, I haven't done that. And that install and onboarding experience is just too much friction. So I wouldn't underestimate the power of default behavior. Now, over half the world uses Google and if Google says, okay, Gemini Spark is going to be one click away from a Google search, it's completely integrated, massive fraction of the world is just going to click the button and then their first experience with a personalized agent will be via that click. And so I don't think that anything can slow down Google because of their massive distribution advantage. And so they don't have to push the outer boundary. They can afford to be fast follower. And I am equally disappointed. Alex, I'm not saying it, but from a strategy point of view, they don't need those risks. They just need to be as good as one day later and integrated with Chrome, integrated with Google Search, integrated with Android, and they will win.
[48:18]
B
I think Google's magic potion is making it user friendly, making it easy, making it intuitive, and I think they're going to deliver with Gemini Spark on that particular promise. All right, here's another part of Google's resurrection and dominance. It's agentic powering of AI search mode. AI everywhere. And remember the conversation we had, Search is dead. Well, search is not dead, it's just been reinvented. Let's take a listen.
[48:47]
D
I'm excited to announce we're launching a brand new intelligent search box. Before, the search box was a contained space, but now it's totally reimagined with AI. It expands with your curiosity and as you ask, search helps you formulate your question with AI powered suggestions. This goes beyond autocomplete. It offers nuances that you might not
[49:12]
C
have even thought to add.
[49:14]
D
Now we're taking an exciting step toward this vision where you'll be able to create and manage multiple AI agents for your many tasks. Right in search.
[49:23]
A
Now, let's say you're apartment hunting.
[49:24]
D
You can do a total brain dump of what you're looking for with all your criteria like location and natural light and availability. And your agent will continuously scan the entire web across sites, social and forums.
[49:38]
B
Persistent search here. Right. So this is your agent. You know, whenever you've asked a question, it is going to persistently be looking for the latest and greatest. Yes, this new apartment just became available this product just got cheaper. You know, your wife loves this topic and you know, here's a new product delivered to her. The other side though is that autocomplete function. I wonder where it's going to take us. Right, you're going in asking or thinking about asking one question and then of course Google can sort of drift you into asking a different question you didn't intend to actually ask. A lot of interesting perturbations here.
[50:14]
A
Oh my God.
[50:15]
D
Yeah.
[50:15]
A
Well, think about a vacation plan where you're like, you know, I really think I should go to Barbados and it autocompletes to Bermuda and you're rerouted to a different hotel. The revenue power of that is astounding really. You know, OpenAI rolled out their first ads and a lot of companies I know have adopted it, but it's very ham fisted. You know, it's like, here's some ads on the side, they're obviously ads, but the Google version of it has to preserve $200 billion of existing 90% margin revenue. So they haven't quite figured out how they're going to surf that. But their power of the user decision making is like nothing we've ever seen before. So I'm sure they'll find a way.
[50:54]
B
Google AdWords is now going to, is going to now gently sort of drift you towards a different question that you weren't there asking.
[51:04]
A
Absolutely.
[51:05]
B
That would be amazing.
[51:07]
C
I mean, remember Google Instant as well, which also offered relatively fast suggestions. I don't think Google ended up directly monetizing that, but Google does, to Dave's point, have a long history of steering users toward more profitable queries. So I think that's probably quite likely. What I probably underline here is the shape of the rectangle changed after decades. How big a deal is that after so many armchair commentators saying that Google was about to be disrupted by ChatGPT with web search? Turns out Google is able to self disrupt and able to change the shape of that, that multi decade old rectangle. And it changed the shape in the direction of building AI modalities, AI search natively into their search experience, which I think many people were scared wasn't going to happen. They did it in the end.
[51:59]
A
You know guys, my very first venture investment ever was TripAdvisor back when it was first starting. And the big quandary at TripAdvisor was how are we going to have completely unbiased accurate reviews and still get paid by the hotels? We need to make money somehow. How's this going to work? And it turned out that just by sorting the list the human default behavior is so dominant that they'll go overwhelming.
[52:26]
B
Laziness. You mean laziness?
[52:28]
A
Yeah, laziness. But we're buried in decisions now. So many things coming at us from so many different directions that we have to be lazy. Only Alex could actually study every single pathway and make an optimal choice. Everyone else, you just have to fall into the default buckets once in a while. And so 80% of people will click on one of the first two or three hotels so you can have perfectly accurate reviews and just resort the list so the ones that are paying you are at the top and then you have your cake and eat it too. So I think that default behavior will hugely benefit Google because they will steer the users, but they don't have to be super overt and they're not going to misguide you into some fraudulent product. They're allergic to that like crazy. But people will still follow the default suggestions from Gemini and then Google will collect the revenue from whoever is willing to pay.
[53:17]
B
Salim.
[53:20]
E
I just love the fact that they have the courage to risk disrupting their own business. I think that's it's such a hard thing organizationally to do and I gotta give them full props for going after it.
[53:32]
B
Yeah. And for everybody, you know, everybody listening to this, you know, our goal here is to give you an overview of what, what Google has just done. It's so dominant in the planet, it does steer a lot of humanity's sort of abilities. So I hope folks are enjoying this summary. Please dive into these. Your mindset of curiosity is your single greatest tool. So go and play with these things. When you finish listening to this podcast, go and jump onto Google and play with the new AI search or its capabilities.
[54:03]
C
One more note, Peter, if I may just on Google's self disruption via search, I think there's this misconception out there that the main obstacle to Google self disrupting their search with so called modern AI was somehow on the business side or the business risk or the advertising side. I think actually the main obstacle was more technical that Google engineers for a couple of years there were concerned that there wasn't a cost effective way or a time effective way to squeeze generative models into the the very narrow and latency sensitive and cost sensitive parameters of just powering a search, that the models were too expensive and too slow to yield search results that would be competitive. And this is I suspect one of the reasons why you see going back to Gemini 3.5 flash emphasizing throughput so much, it's reflecting Google's own internal dogfooding needs of having ultra high throughput models that they could use to power search and some of their own internal applications that maybe OpenAI and Anthropic aren't feeling that demand function as much.
[55:07]
B
Makes sense. Makes sense. All right, next subject is Google is launching a universal cart that users can add products to from YouTube, from search, from Gemini, from Gmail. Google says this intelligent shopping cart works across a multitude of different merchants from Nike and Target to Walmart to Shopify. So you could literally add a product when you're searching on a Nike site or a Target site and then have it monitored and bought at the same time. Let's take a look again. This is part of Google's incredible revenue engine
[55:44]
E
journey.
[55:45]
C
I am excited to announce the universal cart.
[55:48]
D
A truly intelligent shopping cart.
[55:50]
E
It works across merchants and across services.
[55:53]
D
Services. You'll be able to add things to your cart when you're browsing search, chatting
[55:58]
E
with Gemini, watching YouTube or even reading your Gmail.
[56:03]
D
The moment you add a product, your cart goes to work for you in the background. It finds deals, looks at price drops, gives you insights on the price history
[56:13]
E
and alerts you when something comes back in stock.
[56:17]
B
So, reinventing the shopping experience. I've got some comments, but to hear from you guys first.
[56:23]
C
Elephant in the room. Elephant in the room, yes. The elephant is Amazon. So I look at every announcement relating to shopping, quote unquote from Google through the lens of how are they going to compete with Amazon for retail, e shopping and whether it's trying to commodify, create sort of virtual storefronts for individual retail vendors, whether it's crawling third party e commerce websites and assembling virtual pages and now universal carts, this is all through the lens of how they're going to compete with Amazon. So I think the elephant in the room here is, is Amazon even going to contemplate going anywhere near complying or adopting Google standards? My guess is not.
[57:06]
B
Well, you know the follow on here, we're going to be seeing in a few moments the reinvention of Google Glass where you've got imaging capability and we're going to see probably the next invention of shopping, where shopping is always on wherever you're looking and you see something and your AI agent realizes, oh, I'm focusing on, on Alex's beautiful orchid in the background, which is, I mean that. Is that orchid real? Alex, I just need to ask.
[57:32]
C
I thought Peter, you said reality was cooked. So you tell me.
[57:35]
B
Okay, literally when I look at something, my AI agent will say, oh, you're focusing on that. Do you want to purchase it? Or as you're walking through the day, right. Instead of shopping becoming sort of a something you do for an instant of time, it's a continuous function and universal cart is aggregating all the things and then probably at the end of the day saying hey, do you want to purchase that? Just say yes, we're going to be seeing this is an early step, but not the full instantiation of reinventing shopping Salim, you're going to say, yeah, so
[58:04]
E
today we go from human to website to shopping cart to check out. Right. And tomorrow we're going to go from intent to agent to transaction.
[58:12]
B
Yes.
[58:13]
E
And I think there's a. Every CMO in the world in going forward is going to be asking how do I convince 100 million agents to choose my product? They're going to have to market to the agents. Right. And so, and I think this is powerful. I mean look, Google helped created a trillion dollar company by helping people do search. The somebody's going to create a trillion dollar company by helping agents buy.
[58:38]
B
But are you ever gonna market to an agent? I mean my agent knows what I want, knows my genetics, knows my taste from everything else. It may just be buying stuff for me all the time that could be returnable sort of surprise and delight. Something shows up on the front doorstep. Oh, I thought you'd like this. Here it is. If you don't want, I'll have it picked up and returned and you're not doing it.
[58:58]
E
We'll definitely go to that. The disruption to E commerce is not the better shopping. It's getting rid of shopping.
[59:03]
B
Yes.
[59:03]
E
There's this ambient experience where hey, your shoes look a bit dirty. From the camera I looked at from your doorway camera. I'm sipping you new shoes. It'll be that kind of thing.
[59:15]
B
Dave.
[59:16]
A
I think it's just amazingly shocking the degree to which the big guys don't care anymore about consumer shopping. Because Amazon was nothing but consumer shopping originally and built their entire empire on consumer shopping and then added aws. AWS is so much bigger at Amazon now than the entire Amazon. We know all of the shopping. And so Google was already competing with that side of Amazon with GCP versus aws. And so they're already in this battle royale over compute and data centers and enterprise use and everything. And so Google had already tried to compete in retail with Frugal. Remember Frugal?
[59:58]
B
Course, long time ago.
[59:59]
A
Yeah, Frugal.
[60:00]
C
Right before shopping, yet another rebrand.
[60:04]
A
Yeah. So I think that this is another attempt to take Amazon head on on the shopping side. But I think AI is a big game changer. But it doesn't matter too much whether Amazon defends its turf or whether Google encroaches and wins. At the end of the day, the battle on the cloud and the back end is so much bigger and it's already raging. So this is kind of cool. I think it's just the next stage of this trillion dollar retail battle that will rage on for a while.
[60:35]
B
All right, let's jump into conversation about gemini app and NotebookLM. Here's Josh Woodward, who heads Gemini. I think we're gonna get Josh on the pod here. We'll talk to him about what he's up to. Let's take a listen.
[60:49]
D
More than 900 million users are coming to the Gemini app app every month just on its own. NoteBookLM has now been used to create more than 1.5 billion notebooks, podcasts, slide decks and more. It's now available in more than 230 countries and over 70 languages. It now opens up immediately and in line. And soon you'll be able to pick a regional dialect that resonates with you. You've got a right good mix of different accents knocking about. Like this one from Liverpool. Gemini Omni is coming right into the Gemini app. Let's look how this plays out in the real world. I want you to meet Sashu. She is working on a new song and she wants to create a quick video teaser. So she shares the raw video. She adds some reference visuals to it. Let's take a look at what it looks like. The third update today is about how agents are coming to Gemini. One of our newest out of the box agents. Called the Daily Brief, it's a personalized digest that's designed to be your first stop every morning. Here's how it works. You can see here that it's synthesizing information from across my inbox. And with this travel info, I can just take the next step right in line.
[62:18]
B
So this is an integration story. This is Google integrating across all of its capabilities and making it so magical that you can't afford not to be in the Google ecosystem. Dave, what do you think about this?
[62:29]
A
Yeah, it's amazing. When you got one guy on stage demonstrating here, we can build an entire operating system in real time. Let's go. I'm consuming a trillion tokens right now building, and nobody, nobody gets it right. Nobody can relate to building an operating system. Then Josh gets up there and says, here's a real human Musician. And here's her trying to portray herself to her fan base. The crowd goes crazy and it just shows you the human aspect of this is so dominant even in Google, even within the Empire. The human aspect of it is so dominant in people's minds. And it doesn't come through on the video cast. When 20,000 people see Josh present something and they go, wow, oh my God, I can totally. And the vibe is contagious across the whole crowd. And it just is hard to capture that in a video clip, but it's. Yeah, it's just remarkable. And it's going to unleash so much creativity and it's such an exciting time. And I wish, you know, I wish that was the only vibe, you know, that everyone could just capture and hold and bottle that in. But this was just a great moment.
[63:32]
B
Alex, your take.
[63:34]
C
Why is Google still branding NotebookLM as NotebookLM? It should have been folded directly into Gemini or maybe I agree with Workspace or something else. Why does this still have an independent brand? I don't understand it.
[63:46]
A
Well, look at all the other. It's like Spark and Flash and anti gravity and all this is really fragmented all over the map, like divisional kind of branding.
[63:55]
C
Google has this reputation, for better or for worse, of launching lots of products and having a culture where product managers get promotions for launching but not maintaining products. I'd really love, under the spirit of more wood behind fewer arrows, to see all of this functionality just unified in a way that gets sustained.
[64:13]
B
Yeah, yeah.
[64:14]
A
Especially because AI is such a unifying force. You can put one voice and interface on top of all this mess. And Google's branding originally was so good, you know, all the other searching, it
[64:26]
B
was a white box speaking directly to
[64:29]
A
colorful and humanistic and friendly and all
[64:33]
C
of that speaking directly to Josh and the Google team. Please just unify all of these offerings and maintain them. Don't keep launching 10 different products and product names that we'll forget about a few months from now. Just please unify all of them and maintain them.
[64:50]
B
You know, the idea of a daily brief, I love it. Skippy gives me a daily brief. This is a beautiful integration here, but being able to know what you're doing, when you're doing it, what your intention is, and giving you updates all the time on your flight. You know, there's a new flight that's available now, it's five times cheaper or whatever the case might be. And the weather is going to be hotter than expected. So make sure to, you know, to pack different that level of overlay, intelligence is going to be magical.
[65:20]
C
It is. But at the same time, remember Google Reader? Google's RSS Reader that Google abandoned despite having a rabid user base, myself included. I know Google really wants to own the newsfeed, that much is obvious, but please just maintain it.
[65:36]
E
Okay, Salim, So two thoughts. One is, I've always thought about Notebook LM for Alex's earlier comments as this weird thing sitting out there because it incorporates presentation of learning and interaction that kind of does. Like, you know, what's the difference between doing this and the Gemini app and doing it in Spark? I mean, people are going to create a lot of confusion around this. The, the, the, the point that Alex made, I just want to double down on, which is when in any big company we had this at Yahoo as well, you're rewarded for getting something out there. But then you've got a strategic project manager looking across resource allocation about 10 different projects. And so you get a peanut butter problem where you're very thin across all the different projects. You don't iterate very well. One of the few companies that iterates very well is Apple. They will relentlessly iterate on their products and most other companies on a small number.
[66:31]
B
On a small number of products.
[66:33]
E
Yeah. And they, they go very. There's a whole thing written by Brad Garlinghouse called the Peanut Butter Manifesto when I was at Yahoo that mirrors this whole challenge of how do you navigate this in a limited thing? Because they're not run as each individual team doing startups with KPIs of their own and targets, et cetera, they run across in hierarchical structures in many cases and you suffer a lot from that.
[66:59]
B
All right, Google's new product called Audio Glasses. We heard earlier about their partnership with xreal. Here's a partnership with Samsung and a couple of different glass manufacturers. Let's take a look. And how is this going to impact our lives? Two videos to show, then we'll discuss them.
[67:18]
D
The next big milestone for Android XR is intelligent eyewear.
[67:22]
A
Today I'm excited to announce that our
[67:25]
D
first audio glasses will arrive this fall.
[67:29]
A
They are designed to give you all
[67:30]
D
day help with Gemini that is spoken into your ear privately rather than shown on a display. And these glasses let you stay hands
[67:40]
A
free and heads up for things like
[67:42]
D
listening to music, taking photos, making calls, or tapping into your phone apps, all
[67:49]
E
without reaching for your pocket.
[67:51]
B
All right, video number two from Samsung.
[67:54]
D
Samsung, our vision is to enrich people's lives and help shape how we live tomorrow. In close partnership with Google, we're introducing Intelligent Eyewear that empowers you to connect to the world with confidence built with Samsung's precise engineering and craftsmanship. We're merging form, function and helpful intelligence to create something you'll want to wear in eyewear.
[68:19]
B
Every millimeter counts.
[68:21]
D
Today we're thrilled to share a first look at the upcoming styles co created with our eyewear partners or before Parker and Gentle Muster.
[68:33]
B
All right, the elephant in the room here, it's got forward looking cameras, but you're not seeing words or images on the screen. You're being spoken to by your AI. Interestingly enough, I think that being present in life has just been cooked as well. I mean imagine you're walking around and you're just having, you're not talking to your wife, your girlfriend, your kids, kids. You're just having the agent whispering to you all the time. Salim, what do you make of this?
[69:02]
E
So a bunch of things I was really disappointed in that there's no visual on the, on the screen. I mean doing the audio is, might be in Alex's words, very mid. But we're moving to that point where human computer interaction becomes continuous and becomes an ambient layer that just ongoing. And I think that is the bigger story here because that will kind of just continue to play out as we merge with technology. Already we pick up our phones 80,000 times a day. This just continues that in a very kind of unnoticeable way with the form factor is very workable.
[69:40]
C
Google should have owned smart glasses. Instead Meta is running away with this space. Apple is also playing catch up. Where was Google? Google had. I was one of the earliest users of Google Glass. Remember that?
[69:53]
B
Yes, the Glass.
[69:54]
A
Did you get punched?
[69:56]
C
I did not fortunately. But they had a battery life of like five minutes and they self bricked through operating system updates. It was, I think even Google would recognize it was prematurely released. Google could have kept iterating to this earlier point about doubling down. Google could have kept iterating on smart glasses from the Google Glass era and they didn't. They basically abandoned Google Glasses to enterprise and then abandoned them completely. And now this is I think represents a complete reset, except without all of the conveniences that Google Glass had. Meanwhile, Meta was iterating away, spending billions of dollars, sure, but iterating away at smart glasses. And now Meta has the lead in the space, not Apple and not Google. So I would love to see a very competitive smart glass market between Meta, Google and Apple. But I really would like to see Google XR in particular Android XR stepping it up a notch and shipping much More quickly. Right now, Meta's running away with the space.
[70:53]
B
Dave, what do you think, pal?
[70:55]
A
Well, I mean, this is where society is going to have a huge rift because the punching in the face was a very real thing last time Google went down this path. And they are trying to own the consumer and be a consumer friendly brand, but if they roll out a product where half of society is walking around recording everything all day long and the other half is offended by that, then that's going to be a major, major problem and they're stuck. They want to own this space, they have the technology to do it. People are going to want to talk to their Skippy, their agent all day long. They're not going to want to lose touch with it. So this is a great way to stay in touch with your agentic world that's working for you behind the scenes. So I'm very eager to use it. I'm also not super eager to get punched in the face. There were three commencement addresses this week, including Eric Schmitz in Arizona, where as soon as the commencement speakers said AI, the whole crowd went boo. And so, I mean, if you're not aware that that's what's going on and it's very easy to live in your echo chamber, especially here in Silicon Valley or in Cambridge, but you got to walk around Mississippi or walk around Nebraska to really understand how big a deal this is going to be. So I think the glasses are going to be a huge forcing function in this, in this inevitable time point.
[72:10]
E
Audio glasses. Hello, Oxymoron.
[72:14]
C
Well, audio, visual oxymoron. Now, they go together very well, but
[72:19]
B
the audio feedback layer, I think it was a, you know, they want to provide a product that actually works consistently. And I think probably the image, you know, the imaging up on AR glasses is still kind of weak. You know, the version of the Meta glasses I've tried are okay, but it's still a far way off from being able to turn on an ambient AR layer that's convincing and compelling. The brightness isn't there, you gotta look at it in the right, you know, but audio, having your AI being able to say, oh, okay, I know you were shopping for that. Do you like that one? And I'll order it for you right now. Or importantly, I don't need to, you know, when I see Alex approaching me and I've forgotten his name and he's coming at me and the glasses can say, oh, you know, that's Alex Wiesner, gross. You know, he's got an IQ of 100,000, you know, that should be enough.
[73:10]
C
I'll never forget your name, Peter, for the record.
[73:13]
B
Thank you, I appreciate that. Nor I yours, my friend. But I think the audio interface is a smart move to make it clean and compelling and consistent and something that, that you can interface with on a regular, regular basis. I am concerned about this issue of, you know, you guys all have this, right? You're with your family or with friends and something pops up on your phone and you focus your attention to your phone. You know, the loss of presence in life can be really costful, really costly.
[73:49]
A
Yeah, it really can. And also I think the, the cameras are always on, they're very low battery consumption so you can run the cameras continuously and it's just seeing everything you see and then talking to you about what you're looking at. So that's Alex, that's Celine. But if you really want a display, it can talk to your phone and you can look on your phone to see anything it wants to tell you there. So it's all integrated through Bluetooth anyway. But I think the consumer would rather have the longer battery life and not have to worry about it dying every hour like Alex was saying. So this is a good, this is a good temporary, you know, stepping stone and you know, like you said Peter, putting, putting the display in front of your eyes. If you think being not present in the moment is bad, with this in your ear, imagine when it's flashing like between you and your wife flashing images on your. It's, this is a good, it's a, it's a good product design.
[74:38]
B
Yeah, I know that. Alex, we're going to lose you in a moment but I wanted to have the last two segments with you.
[74:44]
D
Still with us.
[74:46]
B
Let's jump into Demis presentation on Gemini for Science.
[74:50]
C
I'm excited to announce Gemini for Science which brings together powerful AI tools to help accelerate research. Gemini can already assist in solving complex problems.
[75:01]
D
But our new labs prototypes streamline daily scientific tasks.
[75:05]
C
Whether it's staying on top of newly
[75:07]
D
published papers, transforming research goals into usable code or generating new hypotheses.
[75:14]
C
Another powerful tool for science is simulation.
[75:17]
D
AI simulations are going to be critical
[75:19]
C
for understanding and predicting dynamic systems that
[75:22]
D
are simply too complex to model directly.
[75:25]
C
Today, our state of the art weather
[75:27]
D
next models can predict hurricane paths faster and more accurately than traditional systems. At Isomorphic Labs we're modeling molecular interactions to massively accelerate the development of new methods medicines supported by leading industry partners.
[75:41]
C
We're now in pre clinical stage with
[75:44]
D
multiple projects including potential treatments for immune disorders and cancer when we look back at this time, I think we will
[75:52]
C
realize that we were standing in the
[75:53]
D
foothills of the Singularity.
[75:55]
B
Standing in the foothills of singularity. I love that line.
[75:58]
C
I wonder where Demis got that line. That's such a nice line.
[76:01]
E
Yeah, you've said that. You've said that before, Alex.
[76:04]
C
That's a nice line. Thanks, Dennis.
[76:05]
B
Alex, what's your take on this?
[76:08]
C
I think it's wonderful. I'm broadly supportive of what Demis. Sir Demis, excuse me, is doing for DeepMind in science. I think it represents DeepMind at its best when it's challenging what Demis calls root node problems like fusion or protein folding. I think it's wonderful. I don't think. I think Google as a business has deeply vested interest in monetizing this. I think this is more for the public benefit from Google's perspective. But I have portfolio companies, companies that I founded that work very closely with Google on issues and technologies relating to this. And I'm broadly super supportive of DeepMind pushing these out to the public. I think it's very important, and they've made so many interesting, I would call them innovations relating to metascience. How do you produce more science at the algorithmic level and hope to see much more from them in the future on this front?
[77:08]
B
Amazing. Alex, listen, thank you for joining us on this segment. I know you need to jump. Love you as always. Thank you for your brilliance. Appreciate you, pal.
[77:17]
C
Thank you, Peter.
[77:17]
A
Alex.
[77:18]
B
Thanks, everybody. Welcome to the health section of Moonshots, brought to you by Fountain Lockdown. You know, AI is impacting every aspect of our lives. How we teach our kids, how we do our business. But one of the most important things that AI can deliver to us is health. And one of the things I think about when shooting for 100, 120 is am I going to have the cognitive health to be able to think clearly and keep my wits about me for the next 50 years? I'm joined here today by Dr. Dawn Musaylem, the chief medical officer of Fountain Life and a member of my Fountain Life medical team. Dawn, a pleasure. So, dawn, talk to me about brain health.
[77:52]
E
Brain health. You know, you're right.
[77:54]
D
This is the number one concern people coming into Fountain Life have is will.
[77:58]
A
I remember the name of my child
[78:00]
D
and the face of my loved one. 45% of dementia cases are entirely preventable with lifestyle.
[78:06]
A
And what was really intriguing to me,
[78:08]
D
Peter, is that a quarter of our
[78:10]
A
members had advanced brain age, but over
[78:13]
D
13 months of us really helping them
[78:16]
A
live healthier lifestyles and eating Healthier, moving their body regularly and optimizing sleep.
[78:21]
D
People overlook that so often, but that
[78:24]
A
sleep optimization is critical for our brain health. What we showed is that we were able to improve the brain age in
[78:30]
D
46% of those individuals. That's a powerful number.
[78:34]
B
That's amazing. You know, one of the things I love about Fountain is we're constantly searching the world for the most advanced therapeutics and bringing them to our members. So for me and all of you, I hope that you appreciate the fact that you can become the CEO of your own health. You can make sure that you've got the cognitive clarity for the next 50 years. Come and check it out. Fountainlife.com Peter to learn more and become the CEO of your health. Now back to the episode. All right, one more story from Google I O. This one is very personal. We just announced the $2 million build with Gemini Xprize. Let's take a listen and then I'll provide some detail about it.
[79:12]
D
The ultimate platform to make an impact. We are officially launching the build with Gemini X Prize Hackathon. This global hackathon is going to offer up $2 million in prizes for builders who create apps that solve actual real world challenges. And the premise is simple. Pick a problem worth solving filled with Gemini, and let's all try to positively impact the lives of a billion people. To build at that scale, you're going to need some serious power.
[79:44]
B
So a call out to all hackers, all builders out there. You know, we've launched 2x prizes in the last couple of months. The future Vision X Prize. Asking people to create a film trailer and a film treatment for the movie you'd love to see that shows a hopeful, compelling, abundant vision of the future. That XPRIZE is meant to help shape people's view of the future and actually to help shape agents view of the future so they are positive and supportive and aligned with humans. This is one, you know, we've talked about on this pod all the time, that the future is one of entrepreneurship. Instead of getting a degree to go get a job instead find a passion, find a problem and build on it. So I want to say thank you to Google for funding this. They put up $3 million, two for the prize purse, a million for operations. And again, if you want to compete, go to gemini xprize.com you're going to look for a problem that impacts 100,000 people or more and then you've got basically 90 days to build your product using AI. And again, very famously, you're going to describe what you want the Product to do the market how you want to market it in English. The agent's going to build your website, your, your interfaces and the team that's able to build it, market it and scale revenue the most in 90 days wins this X Prize. Teaching people to fish instead of giving them fish, that's the goal here. Salim, any thoughts?
[81:15]
E
Very exciting. It reminds me of the. There was a problem, there's a medical problem called lazy eye and there was no cure for it. And then a team built an app which had a gamification system that solved lazy eye. So I think there's all sorts of things as we take obscure problems that hit a reasonable number of people and really go after it. Very, very excited. We've seen the incredible outcome when you put open innovation that seeds ecosystems like this throughout the history. X Prize. So couldn't be prouder.
[81:48]
B
Yeah, super, super pumped. And you know, full disclosure, both Salim and Dave are on my board at XPRIZE Foundation.
[81:55]
A
Dave, you know, I think I'm excited. I think this is the highest calling for XPRIZE yet. You know, being involved with X Prize, I've been incredibly proud of it for a decade now. But now with Google behind it also OpenAI has a $100 billion charity now and they need to put that money to work as well. And so, you know, all these mega funded companies are suddenly very, very interested in turning AI towards are good, which is not an easy problem. And XPRIZE is really, really good at taking very hard problems and making them actionable and so unleash that capital in ways that actually benefit humanity. You look at the oil cleanup, XPRIZE and the impact that's had and kicking off of all of the space activity that we have via xprize. And so this is just the next chapter, but it's the biggest chapter by far.
[82:40]
B
Love it. And a reminder, everybody watching and listening, if you want to be there at our MoonShot gathering on September 25th, go to moonshots.com we're going to be awarding both the Gemini X Prize as well as the future Vision X Prize. On that day, we're going to the five finalists, the five creators with the five top film trailers, and the five finalists with the Gemini X Prize who've created the most revenue. They'll be there pitching and if you're in the room, you're going to be helping to vote on who the winner is. So again, September 25th, go to moonshots.com to join the moonshot mates and be there with us at the moonshot gathering. All right, I see Andrew Feldman has entered the room. Andrew, a pleasure to have you here.
[83:22]
D
Thank you for having me on your show. Appreciate it.
[83:25]
B
Of course. By way of introduction, Andrew is the co founder and CEO of Cerebrus, a pioneering wafer scale computing dedicated to accelerating AI training and inference. Cerebrus just raised 5.5 billion. I should say you just raised 5.5 billion. The biggest US IPO since Uber in 2019, you know, up 68% and market cap of 95 billion. Quite the coming out party. Andrew felt good.
[83:56]
A
You had to work for it too.
[83:57]
D
Super exciting for the team. We were able to bring a portion of the organization and their families and to share it. I mean my parents were there and my wife and, and my stepdaughter and we made it into a family event and it was really something special.
[84:17]
B
Amazing generational wealth for everybody in your organization. Extraordinary. And we'll come to that story in just a moment. A little bit more in AI news before we turn to Cerberus and chips. A big story. Andrew Kaparthy joins Anthropic. Andre is an extraordinary individual. You know, his work in and he's joined the pre training team at Anthropic and he'll start a new initiative focused on using Claude to accelerate Claude's own pre training research. You know, his announcement, he said the next few years at the frontier of LLM will be especially formative and that's where he wants to be. Karthy was, you know, has a stellar resume in AI. He was the co founder of OpenAI. He left in 2017 to run full self driving for Elon Musk at Tesla. He returned to OpenAI in 2023 and 24 and then he founded Eureka Labs. Let's take a quick listen to Andre on a podcast called no Priors. This is a conversation he had before the announcement. So listen up. When he's doing a shout out and a call out to which Frontier Lab wants to hire me and they're working
[85:25]
C
on what's coming down the line. And I think if you're outside of that Frontier Lab, your judgment fundamentally will start to drift because you're not part of the, you know what's coming down the line.
[85:33]
A
Right.
[85:34]
C
And so I feel like my judgment will inevitably start to drift as well. And I won't actually have an understanding of how these systems actually work under the hood that's an opaque system. I won't have a good understanding of how it's going to develop and et cetera. And so I do think that in that sense I agree and something I'm nervous about. I think it's worth basically being in touch with what's actually happening and actually
[85:49]
E
being in the Frontier Lab.
[85:50]
C
And if some of the Frontier Labs would have me come for, you know,
[85:52]
E
some amount of time and do really
[85:53]
C
good work for them and then maybe
[85:54]
A
come in looking for a job, this is super exciting.
[85:57]
C
Then I think that's maybe a good setup because I kind of feel like
[85:59]
E
it's kind of, you know, maybe that's
[86:01]
C
like one way to actually be connected to what's actually happening, but also not feel like you're necessarily fully controlled by those entities.
[86:07]
A
So I think honestly in my mind,
[86:09]
C
like Noam can probably do extremely good
[86:11]
A
work at oai, but also I think
[86:13]
C
his most impactful work could very well be outside of OpenAI.
[86:15]
D
Noam, that's a call to be an independent researcher.
[86:18]
B
That was Andre on five cups of Coffee, his quote that he put out on X. I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and remain deeply passionate about education. All right, Dave, your take.
[86:37]
A
Andrew, he was calling to you. That was the moment he was saying, cerebras, I need your compute. Please call me. Nice. He does. Everybody does, actually. So andrej came out with that auto research repo a couple months ago and I installed it. It's been auto researching on my cloud for a while now. But it desperately wants much, much more compute. And it doesn't need mega models. You know, it can run on lean, highly focused models. But I think he finally realized that every other OpenAI co founder has either raised billions and billions of dollars or is at a foundation lab and he's the only guy who doesn't have access to the big machine. And it's just, he can't just sit there posting on git anymore. He has to be part of one of these people, big machines, because we're on the cusp of the singularity and you can't miss it by being outside of the big game.
[87:27]
B
Salim, any thoughts?
[87:28]
E
I thought his point was well taken that he. Where he said that if you're not in the foundation, in the labs, you're missing out, things are moving past you. And so he's trying to beat the cutting and knowledge. I just, I. I think there's. It's also notable he would have had his choice as to where to go. And I think it's very interesting that he joined anthropology.
[87:48]
B
Really interesting. And it's interesting that this was announced on the same day as Google. I o a little bit of marketing strategy, perhaps. Andrew, what do you take about. What's your take on this?
[87:59]
A
Look,
[88:02]
D
he is sort of one of the most important and prolific thinkers in the space, right? And not just, I think, as reflected in what he's built, but in. Reflected in sort of how he's taught the community. And I think that's interesting. I think his point that there are a small number of frontier labs that are sufficiently far ahead that if you're not with them, you're not on the frontier, probably applies to hardware too, right? That if you're not building hardware for or engaged with at a fundamental level, one of the three labs, the three most important labs, Google, Anthropic, OpenAI. You are not seeing what they're thinking. And just like your ideas will drift, if you're a model maker, your hardware will drift from what they need as well. And I thought that that was really sort of. I applied that to our domain. And I think, I think he has just an extraordinary track record for useful stuff. He's built extraordinary.
[89:17]
A
He's also just a super, super ethical guy and known for it. So he's a talent attractor. A lot of people here in the Valley want to work for the most ethical organization that they can navigate to. And he's got that aura just all over. As do you, Andrew, actually, as does
[89:32]
B
cerebras, where we are right now, speaking about ethical organizations. Let's jump into the conversation. Let's jump into the conversation about OpenAI versus Elon.
[89:44]
D
Okay.
[89:44]
B
All right. How's that for a transition, guys?
[89:47]
A
That's a stretch.
[89:48]
E
Yeah. You had to work for that one.
[89:50]
B
I did. I took advantage of Dave's point. So the jury rules against Elon musk in the OpenAI lawsuit. So federal jury unanimously rejected. Rejected Elon's lawsuit in just two hours of deliberation. Wow. Jurors ruled that Musk waited too long to sue outside the statute of limitations for his claims. Elon's legal team, of course, is going to appeal. I don't know what to say other than I would love this story to sort of not cloud the entire AI data center conversations that we're having.
[90:31]
A
Very curious, Andrew. You know, OpenAI, you have a very close relationship with OpenAI. Did you have a lot of skin in the game in this outcome? It's only going to be good news for Cerebras. But was it relevant?
[90:42]
D
I think this was a giant distraction. And billionaires in pissing matches interest me not at all. I think these are two of the most important thinkers of our generation. I think what Elon has built is breathtaking. I've met him at dinner together. He's a polymath. He's a brilliant thinker. What Sam has built one of the fastest growing companies in the history capitalism. But some of his ideas also in the invention of the safe at Y Combinator, these were, were enormously meaningful in the structure of Silicon Valley. I think everybody loses when they battle. I think I want Elon building cool shit. I want Sam building cool shit. And I don't want to waste time or read about disagreements. I just want these guys doing what they're the best in the world at, which is building stuff.
[91:46]
B
Well said, Andrew. Well said.
[91:49]
E
I'm just really angry it got to this point. Surely they could have looked at it and gotten the statute of limitations have expired. You just don't even bother. I mean, why? This was very upsetting how much angst and time to Andrew Swin has been spent on this. And they could have been building. How much more could they have done?
[92:09]
B
Really good point. I mean, the ruling they made was obvious from the beginning. If they were ruling based on the, on the timing. But this is the reason Elon's going to appeal, because he says that's not the point.
[92:21]
E
I looked into this challenge. I looked into this and the appeal will very, very likely fail because it's a factual based decision and the courts rarely overturn those.
[92:34]
A
Well, I think to Andrew's point, though, Dennis said earlier in this podcast that we're standing in the foothills of the singularity. Like an appeal is a slow, long. It's going to be irrelevant in the timeline that really matters, I think.
[92:48]
E
Yeah.
[92:49]
B
All right, let's jump into part of the innermost loop chips. And here we are, Andrew. Cerberus record IPO closes up 68% market cap $95 billion. And Dave, you're in the Cerebras offices this morning, aren't you?
[93:04]
A
I am, I am. I love the energy. You know, it's such a rare. I think all of America is driven by this moment, people building toward this event. And Andrew's journey was longer and harder than a lot. And so to actually get there, you know, it's so hard because I pushed that button about, you know, 1% of the size of Cerebrus, but I still push that same button on the Nasdaq. And I didn't realize till that picture's been hanging on my wall for years now. And it's like a Lifetime achievement, like kind of like a Nobel Prize or an Olympic gold medal where you carry it for the rest of your life. And so so few people get to experience that. So I couldn't resist the opportunity. I was only a few miles away anyway. I couldn't resist the opportunity to come here and just feel the aura of it's still settling in here. Clearly. Andrew, is that the way it feels?
[93:51]
D
Yeah. And you're always welcome to come on by.
[93:55]
A
Thanks.
[93:56]
D
And enjoy the mojo that we've got going here. I mean, it's. We try and create an environment where exceptional people can do extraordinary work.
[94:08]
B
Andrew, you and I met at the Citibank event six months ago. I was there giving the dinner keynote on longevity. And I wish I had only had a chance to go on your friends and family round before you had this epic, epic release. If you don't mind, tell all of our listeners and viewers here a little bit of the backstory, the founding story behind Cerebrus. What was the moment where you said this needs to exist and you took a very different route than Nvidia and other chip manufacturers?
[94:35]
D
We did. The founders had all worked together at my previous startup and AMD bought that in 2012. By 2015, we'd wandered off a little bit and we started meeting and
[94:53]
E
we
[94:53]
D
saw AI on the horizon. And what we knew was that this new workload would eat through extraordinary amounts of compute. And we made two really big bets. We made a bet that said, like graphics produced the GPU and like mobile compute, supported the development of the ARM processor. That this technology, this work, would be big enough to require dedicated silicon. And the second bet we made was that the right strategy wasn't to build a derivative of the GPU that you needed to start with a clean sheet of paper and you needed to do something fundamentally different. And these were enormously contrarian bets at the time. And both proved to be dead right from that foundation. We continued the innovative thinking and we supported said. What AI is going to need is memory bandwidth. That's the speed with which you can move data from memory to compute. The way to innovate on that dimension is to use a different type of memory than everybody else uses. We have two types of memory. We have memory that can store a lot that's slow. We call that DRAM or HBM ram. And we have memory that is fast, but can't store very much per square millimeter. And so what we hypothesized was that if we could build a chip the size of a dinner plate, a chip sort of 58 times larger than any chip ever built before. We could stuff it to the gills with sram, therefore, thereby overcoming its weakness in not being able to store very much per square millimeter and benefit from its strength. And that proved to be a very difficult problem to tackle. But when we got it proved to be right. We are somewhere between 15 and 20 times faster than the GPU on any inference problem. And so the challenge along the way was that nobody ever built a chip this big, not once in the 75 year history of the computer industry.
[97:13]
B
They kept on slicing them thinner and thinner and smaller and smaller.
[97:16]
D
That's right. And that even sort of those on the sort of Mount Rushmore of our industry, people like Gene Amdahl had failed, crashed and burned. And interestingly, even after we solved this problem, we had people come and visit our labs and then try and build it, and they also failed. And so what it took was years of perseverance and innovation. And all the credit goes to the engineering team, Gary and Sean and Michael and JP and the team we had. We failed for years. And In August of 2019, we announced we'd solved this problem that had been unsolved forever. And we thought everybody would rush to our door and the world didn't care one bit.
[98:06]
A
Why?
[98:08]
D
The world was utterly indifferent. And over the next, I mean, in the first generation, I think we sold 12, 12 systems. And in the second generation we sold 300, 350. And in the third generation we're selling many, many, many thousands. And so what happened was we solved this problem and we're way ahead of the market. And it wasn't until 2024, late 24, early 25, that the models got fast enough and the models got smart enough that people wanted to use inference everywhere. And that that's what happened. And sort of there we were with the fastest inference machines on earth by orders of magnitude, and suddenly people wanted to use AI. And the way we use AI is with inference. And we were just crushed with demand. And then In December of 2025, we signed a deal with OpenAI, north of $20 billion over several years, one of the largest deals ever signed in Silicon Valley. In March, we signed a term sheet with AWS for deployment in their data centers and business has been pretty good since.
[99:21]
B
Well, congratulations. Let me just take a second and welcome Alex back. Alex. Hey, Alex, good to have you back.
[99:26]
C
Good to be back. Amazing to meet you, Andrew.
[99:29]
D
How are you going? Alex?
[99:31]
A
Doing well. I wanted to ask you, Andrew, I was talking to Valavan yesterday Your chief product architect, brilliant guy in Toronto. Awesome, awesome friend. But I didn't realize that the company had a whole history as a training side company. You know, inference is now what, 80, 90% of the market? And moving a huge amount of data from SRAM through the processing massively benefits on the inference side. But did you see that coming in the initial design? Because I don't think a lot of the research people even knew that inference would be so dominant.
[100:04]
D
I think that we got many, many bets wrong. I think any CEO who looks back over a decade that moved is quite quickly as ours and says they got it all right is probably not a guy you want at your birthday party. We got an enormous amount wrong. But one of the things we got right was an understanding that we make AI with training and we use it with inference. If AI is going to be smart and if it's going to be useful, you need to have an inference business. That bit we saw earlier and the real problem between 2020 and 2024, 25 was that it wasn't smart enough to be useful. And so everybody was focused and all the labs were talking about number parameters and now people don't care. The only question is, does it write good code? Does it give me good answers? Can it do things that I want done? And that's because we've moved into a regime, into a world in which, which it's useful and that's how it's measured. And so, Dave, we did recognize this, we are really good at training. But right now there's such demand for fast inference, such overwhelming demand that we're allocating a lot of our attention to it.
[101:24]
C
I'm curious, Andrew Srem, you mentioned SRAM earlier. The largest models, really the standard models at the this point, that offer frontier capabilities and in some cases up to 10 trillion parameters. How do you think about SRAM when, correct me if I'm wrong, the wafer scale Engine version 3, I think has maybe in the tens of gigabytes of SRAM. 40, 50, 40, 50, something like that. But the largest models are in the trillions of parameters. How do you think about the future of SRAM given that, as you said, you're stuffing it to the gills, right?
[101:59]
D
I think the following. I think that models that size have to be divided up. Whether you're using GPUs or GPUs or us, they have to be cut up and they have to be spread over multiple chips, right? Remember models that size, Alex, they have a very large matrix. Multiply in the attention head and that doesn't fit on a gpu, you have to cut it up and you have to go tensor model parallel. And you don't have to do that with us, but what you do have to do is spread that over four, six or eight chips. And what you do is you divide the model very carefully and you divide it such that no layer runs over two chips. And so what, you're moving results from one layer to the next and you can move it because that's a very small vector, that's a results vector. You can move it over 100 gig ethernet and it is slower. That little hop is slower. But the calculations that take up such a big portion of the time are so much faster that you pay a very small penalty for breaking IT up into 4, 8, 16, 20 chips on the order of 2%. Now, other SRAM solutions that are small, like for example, Grok that Nvidia acquired, they have to break it up because they have only 800 square millimeters to use. They have to break a big model up over two or three thousand chips. And each of those hops hurts their performance. While we have to do a few hops, they have to do thousands. And so there's no way ever to fit everything on any size right amount of memory. But there is a very nice and simple way for us to cleave models to spread it over multiple chips. And yesterday we announced and posted numbers on Kimike 2, which is a trillion parameter model in the open source community. We were of course an order of magnitude faster than anybody else.
[103:58]
A
Oh really? How many wafers on that?
[104:01]
D
I forget I've been busy,
[104:06]
A
but it
[104:06]
D
was about a thousand tokens per second where A really good GPU shop like Fireworks is running at 70.
[104:13]
B
Yeah, all right.
[104:14]
D
And so, and they're a really good shop. And so, you know, 15x, that's pretty good.
[104:23]
A
Yeah. Peter and I were at Google I O yesterday and they showed a whole rack of TPUs operating together, generating 1400 tokens per second, writing code. And you see that and you're like, I need that, I need that tomorrow.
[104:35]
D
Well, the trick there, Dave, sometimes is, and Nvidia has been masterful at this, sleight of hand is not telling you whether they mean tokens per second per user or aggregate throughput. The GPU is an extraordinarily good machine at generating slow tokens. You can generate an NVL72 at 35 tokens per second, which is painfully slow, can generate millions of tokens. On the other hand, if you ask it to generate tokens at 200 tokens per second per user, it can support one or two users. That's a $4 million solution working on one user.
[105:20]
A
Right.
[105:20]
D
And so it's really important when you sort of dig into these, are they telling you gross throughput? Is this a lot of customers who are unhappy with their performance or is this the. Are they able to serve that to individual customers? And how many of them? Andrew, you said go ahead.
[105:40]
B
Yeah. You were being complimentary of Elon as an extraordinary builder, entrepreneur, sort of polymath
[105:46]
D
earlier, one of the best in history for sure.
[105:49]
B
I'm curious. He steps up and announces Terrafab produced 50 times the amount of chips on the planet that exist today, outstripping TSMC and everybody else. What do you think of, of Terrafab? I'm super curious.
[106:04]
D
Look, I think Elon has proven himself on multiple dimensions. He's proven himself to be a visionary. Right. The number of people said you're an idiot to try and build cars in Fremont. Right? I mean, we've got the highest labor rates in the country, maybe among the highest in the world. We've got a regulatory regime that is unfriendly to business. The number of people who said you shouldn't build a rocket company. The number of people who didn't understand that he was building a rocket company because he wanted a satellite company needed. I mean, he has been ahead of everybody for a very long time. Okay, that's the vision side. That's the vision side. He's also been able to execute on some of them, not all. And that's what's cool, is he is trying to do things that other people can't do. Now, this particular problem I know a little bit about and building fabs is very hard. And it is hard in a different way than some of the other problems he's attacked. I'm not saying he can't do it. I'm saying it will always take longer than he says. It will cost vastly more money. That's the challenge of building extraordinary things. It is not a five or ten year project in my humble view. I've been wrong before, but I put this at a 15 or 20 year project. I think it's interesting. It's probably good for the US that we have domestic fabs. But I think that there is a reason why even with the exact same equipment from asml, right, Samsung and TSMC aren't at the same node. TSMC is ahead and they're extraordinary. And the amount of received wisdom and learning from the fabs, they've built over generations cannot be underestimated. But if anybody can do it, Elon can do.
[108:09]
A
What does that mean for Cerebras? Because obviously US manufacturing and like you said a lot about this topic, but US manufacturing of chips is critical. I mean critical for everything. For national defense, for global security, everything. Yeah. And so if it's going to take 15, 20 years, that's just Terrafab. And the TSMC migration to the US is going very slowly, way behind schedule. So what does that mean just in terms of. Well, first of all, supply, demand, just the raw ability to get things made. You must deal with this every day.
[108:46]
D
These things are hard to build, right? I mean fabs are pyramids. They are our pyramids. And TSMC is the greatest manufacturing company on earth. And the challenge is These things take five years to build, six years to build, and $50 billion, $40 billion from the people who built the last one. That's true. Whether it's TSMC or Samsung or, or any of the great builders here, these are unbelievably difficult to build. And that's why they've been behind schedule in the U.S. i think they encountered some challenges that were unforeseen. I think we have political challenges in that these things take a long enough time that they cross administration boundaries. When your projects are more or can't be done in four years and have to cut across multiple administrations, maybe two or three different administrations. Right. Over a period of time when you have local ordinances that get in the way of building, as happened with Samsung's FAB in Texas, they redesigned the FAB because of a local fire ordinance that made no sense. These are painful problems that our system hasn't found a way to overcome. Come. And so I think that we have to find sort of a way to do better. Because I think the reshoring of fab and not just the fab, FAB gets all the glory, but the packaging business every bit as important and something we lost entirely when the fabs left.
[110:21]
A
Yeah, yeah. Actually good question. You know, by the time you get something ready to put into one of your data centers or a third party data center. How many different manufacturing partners has that wafer been through?
[110:32]
D
A fair number.
[110:34]
A
A fair number.
[110:34]
D
It goes to SAP and it goes to someone who ase, who deposits RDL on the backside. It's diced, it's cleaned, it comes to us for a step. I mean it is a long process. I think when we stopped caring about fabs in the 90s and, and IBM sort of left and globalfoundries as fabs, sort of. We didn't do anything to keep them. We lost this collection of surrounding expertise. Right. When a chip comes off a fab, right, it's a dead piece of silicon. The package is how you breathe power and life into it. You get IO in, tune it and how you get power into it. And that's also an enormously challenging technology. It takes material scientists, it takes manufacturing engineering, process engineers, deposition engineering. And we punted all of that by not caring about this industry. And it's all sitting in Taipei and in Korea. The materials are manufactured in Japan. Kyocera is one of the leaders years there and we got to get it all back and we got to make a decades long commitment to this industry.
[111:59]
A
Well, if you said that the tariff lab is 10 plus years out, if I look five years out, do you think that your, you know, Cerebras is able to manufacture on Intel, Samsung and TSMC or are there any other choices or.
[112:14]
D
I, you know, we, we've committed our, our 3 nanometer design to TSMC so that will take us out a little bit. We do manufacture some components at Samsung and have a great deal of respect for Samsung's fab capabilities. We have never used Intel. Lipboo is an extraordinary leader and a longtime advocate for hardware in Silicon Valley. As you know Dave, there was a period between about 2007, 2006 and 2015 or 16 where every VC firm was filled with somebody from VMware who didn't know anything about hardware, who thought compute was made by flea feces in the cloud and that there was some sort of thing in the ether that somehow was generating compute. And we tried to explain for a long time that the way you make more virtual compute is to begin with real compute.
[113:14]
A
Well, I got to tell you, you've inspired so many people that are in campuses right now that are eager to be part of your mission to get that back. So the more I'm going to route as many as I can through this building, please do.
[113:25]
D
I mean guys like Andy Bechtelsheim and Lippboo and a few others were continuing to put money into hardware, continuing to. Pierre Le Mans continued to do it and support us as it was tough going to raise money over that time period. I think while I know lippboot, they've got a lot of work to do and he's done great things so far, but they've got some work to do before we could move to Intel.
[113:59]
B
Alex or Salim, do you have a question?
[114:02]
E
I have a quick One, Andrew, you've gone from raw invention, solving fundamental big problems, to going to now production. When you want to scale these things, can you say how long it takes to create one of those chips? And over time, as you get better and more efficient in the manufacturing process, what do you hope it shrinks to?
[114:23]
D
Well, I say that the first one took four years and maybe half a billion dollars. Somewhere between 400 and 500 million dollars. That's why I take it to dinner when I go with my wife like a 10 year old with his first dirt bike. I mean, it's coming to bed, it's in the bedroom, it's not outside in the garage. It is being carried around. Everywhere I go. I got a wafer. I think the inventions cut across lithography, chip architecture, packaging, cooling, cooling, power delivery and cooling. They included compiler inventions, algorithmic inventions. In fact, some of the hardest problems that we encountered were packaging. We solved them seven or eight years before others encountered them. The B100 was, or the B200 was 18 months late. It was late because they had a problem with their coas.
[115:32]
B
What's that mean?
[115:33]
D
COAS is a process step where TSMC uses a 65 nanometer chunk of silicon as a motherboard. They put on that, they put on Nvidia's chips in the memory and instead of putting it on a green board, a traditional motherboard, they put it on a piece of silicon. And the wires are more efficient in silicon, they can be narrower. This was a big invention, but we knew that there would be a problem with the coefficient of thermal expansion. We knew that because we solved that problem in 2018. There they were in 20, 24, 25, struggling with a problem that we'd solved seven years earlier. That's what happens when you do pioneering work is you encounter problems, you have a chance to solve them long before the rest of the industry, industry even encounters them and knows they're a problem. And so that is one of the joys obviously in everything we do in engineering. There's a trade off. The downside is there's some low days, there are some days you go home and some of these days stack up. And we had about 18 months where we're spending 8 million a month and we couldn't solve the problem. And when you have board meetings every six weeks and you come in and you still can solve it, you still can't solve it, and you're $100 million more in the hole and then you're $120 million in the hole, then you're $140 million on the hole and you still can't solve it. This is some low days and then
[117:12]
B
you have the IPO of the year and it's a high day.
[117:15]
D
That's right. I think, Peter, it's such an archetypal
[117:21]
E
story of an entrepreneur.
[117:22]
B
It's the entrepreneurial journey.
[117:24]
D
It is, I think, Salim, one of the things I've learned along the way, this is my fifth startup, is that this shit will kill you if you can't modulate the highs and the lows. And for every entrepreneur, every CEO, and I tell them first, that this is a pressure test on your soul and second, the number of times you can get kicked in the gut before lunchtime and have it still be a good day as a CEO of a startup is amazing.
[117:52]
B
Would you rather be doing anything else?
[117:54]
D
No. This is all I know how to do. I'm a professional, David, in the battle with Goliath. This is what I know. I have no interest in doing other things and I have no interest in working with people who are other than those who want to attack the hardest problems.
[118:10]
B
Yeah, amazing. Alex, please.
[118:12]
C
Speaking of the hardest problems, and it's almost in the name Cerebras, you have, I think, a 4 trillion transistor budget with your third generation wafer scale engine. I'd love to talk maybe a little bit about what's at the end of the rainbow projecting out, say 10 years when you're on your nth generation model. What does the future look like? Does it look like brain uploads running on WSE8? What's the killer app? What does this look like in 10 years?
[118:43]
D
Alex? I think one of the fun things about being an infrastructure builder is you don't have to have those ideas. No, really, I was with the team and many of them are here in the mid-90s. That helped drive down the cost of networking. We built some of the first and fastest ethernet switches. And we had no idea that WhatsApp would arrive and that it would make possible, even for the poorest members of our society, to communicate home. And when I grew up in the 70s, the only thing I heard my grandmother say on the phone was, put your brother on, it's expensive. It was $4aminute for my mother to call Australia, where her mother was. They spoke for six minutes a week. And the only thing I heard my grandmother say was, I'd say, hello, Boba. She'd say, put your brother on, it's expensive. We put in a company called Yago, along with many others with Juniper and others. We put a small brick in the wall that made the cost of IP transport so low that somebody else could invent a technology that made it such that that every person can talk to their grandparents, no matter how poor they are anywhere in the world. And that's something that we didn't know. That's not the problem I set out to solve and our company set out to solve. We set out to solve a problem as an infrastructure builder that we build roads and what you drive over those roads and how far you take them, that's other people's work. What we're trying to do is allow people to to do extraordinary things on our infrastructure. And so when I think about what we're enabling, that's work for Sam, that's work for Ilya, that's work for others. What we're trying to do is make a compute platform on which their ideas can take flight. And what we know is you need faster calculations.
[120:44]
C
So what I think I heard you say is that, that you're very deliberately not having opinions as to the future shape of the workloads that will run on your infrastructure. And you're primarily at this point deferring to the Frontier Labs to steer the future architecture of workloads.
[121:00]
D
Today's Frontier Labs or New Frontier Labs. Right. We are making bets that the world will continue to depend on sparse linear algebra or underpinning for all these calculations.
[121:13]
A
This episode is pretty brought to you
[121:14]
D
by Blitzi Autonomous software development with infinite
[121:18]
A
code context Blitzi uses thousands of specialized AI agents that think for hours to understand enterprise scale code bases with millions of lines of code. Engineers start every development Sprint with the Blitzi platform bringing in their development requirements. The Blitzi platform provides a plan then generates and precompiles code for each task. Blitzi delivers 80% or more of the
[121:45]
D
development work autonomously while providing a guide
[121:48]
A
for the final 20% of human development work required to complete the Sprint Enterprises are achieving a 5x engineering velocity increase when incorporating Blitzi as their pre IDE development tool, pairing it with their coding copilot of choice to bring an AI native SDA DLC into their org. Ready to 5x your engineering velocity. Visit blitzi.com to schedule a demo and
[122:12]
D
start building with Blitzi today.
[122:17]
B
One final question. Orbital data centers Fiction real must have. Are you going to put your chips up there?
[122:25]
D
I think first we have serious advantage in space. In space, some of the most expensive work is the chip to chip communications, right? We've had chips in space for a long time. That's What a satellite is. A satellite looks like a PC motherboard with a big camera stuck on it. A big telescope. Right. If you unpack what's in one of these small satellites, every computer hobbyist will say, holy cow, that looks like a server motherboard with a business big with the telescope stuck to it. And then it's hardened. Communicating in building a cluster is actually much more complicated because you have to do a lot of communication in this work. Moving the data from the land to the cluster is a problem that we've solved a long time ago. They will continue to improve it. So being a big chip and having to move things off chip less often is a huge advantage. I think this is an exciting domain. I think like many hard problems, the last 10% don't take 10%, they take 90%. Right? Self driving is one of those categories. Right. The last 10% we've been sitting at for eight or 10 years and we're just now sort of getting over the hump of the last 10% because it isn't really 10%. I've got it in the seven to 10 year category.
[123:53]
C
So it's interesting, Andrew, just to pull on that very briefly. What I would have expected you to say to that would have been something like with the wafer scale engine, you had to design around faults. You had to be incredibly fault tolerant at wafer scale and in a space environment with lots of ionizing radiation. You also need to be fault tolerant. And that Cerebras, with its experience with fault tolerance at wafer scale is like the perfect computing platform for highly ionizing environments.
[124:20]
D
There are, you know, I think we have lots of advantage, Alex, and you put your finger on one of them, that you have to try and sort of shield your silicon very differently in space and you will get more flaws
[124:40]
A
and
[124:43]
D
they're single bit errors, their hard errors, their whole collection of error errors that you have to contemplate. Our ability to shut down a core and route around it is an enormous advantage in that environment. I think we've got as a community some work to do over the next four or five years before we have sort of the, the truly hard part of getting them in space, orchestrating the software, getting them to communicate. So I've got it sort of out the better part of a decade before we have sort of production in space. I think it's a very sort of a worthwhile project to pursue, but it's out of little ways.
[125:29]
B
All right, Andrew. We close out these segments with an AMA with our incredible subscriber base and Would love to have you join us. So we've chosen eight eight questions from our comments which we all love and read and we'll be peppering them along. So I'll put them up here. Saleem, I'm going to give you first shot. Andrew, you can look at the others and see get ready for one of them. We have a second page we'll go to. So Saleem, why don't you pick one of these.
[125:55]
E
I'll go with the first one. If the world becomes compute constrained, does the 10 cent lawyer for everyone thesis still hold? Does AI become a luxury only the rich can afford? And this comes from eoright bust1. So this is a fairly. If you've been listening to the podcast, there should be a fairly clear path here, right? Because every major technology starts with scarce and very expensive. And then you saw this with computing bandwidth, DNA sequencing, solar energy, they all look very constrained and very expensive initially. But then the learning curve kicks in, infrastructure kicks in, Javon paradox kicks in, rights law arrives, competition arrives and the cost collapse. AI computers going exactly down the same path. You may have some bottlenecks like chips and power generation data centers, but those become investment honeymoon pots and capital floods to the towards those bottlenecks. Right. But the bigger insight is that as not consuming compute, it's helping design chips and optimize infrastructure. What Alex calls the inner loop. It's improving, it's compressing the models, etce for that example and the system becomes recursive. And as you have intelligence, building more intelligence. This is why we get so excited by this future. It's going to drive the cost of of everything down to that. Maybe it's a $2 lawyer first, maybe it's 50 cents, but over time it's going to get to 10 cents and
[127:25]
D
it's down from a thousand an hour.
[127:27]
B
Yes, that's number one.
[127:28]
D
Number two, I think the problem with lawyers and accountants is the structure of their business. Wrong.
[127:37]
E
Business model for the future is exactly wrong.
[127:43]
D
Their business is to stand between ordinary people and obscure knowledge. That's what your accountant does. You don't want to figure out what the tax rules are with related to depreciation on on a property you bought or was gifted to you in 2020, right?
[128:02]
B
Who.
[128:02]
D
Who wants to know that? And so what their business is is sort of the acquisition of obscure knowledge and the application of that ac. Of that knowledge to particular problems. That's exactly what don't you think, Andrew?
[128:17]
C
That generalizes though, like what are you other than a gatekeeper of obscure knowledge? Regarding the high God,
[128:31]
D
Engineers are at their best. They are actually, you know the reason they're called council is when they're giving advice not on legal matters, when they're giving good business counsel, when it, when common sense is challenging in a confusing environment. Those are when they're at their best. I think when you're drafting all the documents you need for most things have already been drafted.
[128:54]
E
Right.
[128:55]
D
We don't need another lawyer reviewing another NDA.
[128:57]
E
We don't.
[128:58]
B
Andrew, let's give you, let's give you next shot. Which question? Two, three or four? You want to choose?
[129:04]
D
I think number three is interesting. I think there is a profound misunderstanding about how.
[129:12]
B
Let's read the question why can't money by Elon or Zuck a lead in AI, can't they just buy the best talent? And that's from novarift.
[129:21]
A
Great question.
[129:22]
D
No. The answer is no. And I think why couldn't intel build a cell phone processor? At the time they had the best fabs at the time they had the best computer architects and they destroyed tens of billions of shareholder dollars failing. Same with amd. It turns out in our industry that money and the acquisition of talent isn't enough.
[129:49]
B
What is?
[129:50]
D
There's something else.
[129:51]
A
Mtp. What massively transformative purpose is incredibly important.
[129:57]
C
Intel could have said yes to Apple though.
[129:59]
D
No, no, but they could have. But the truth is what led them to believe that they were in a position to say no to Apple.
[130:11]
C
Intel was chasing margins. Intel was infatuated with its own profits. They had an arm division that they sold off. Intel could have sold off.
[130:19]
D
Why? That's the thing we're trying to understand
[130:21]
C
is the innovators dilemma. They were fat and happy and lazy.
[130:26]
D
Maybe, maybe. Or maybe that there's something in your DNA that makes luck big mutations. Well look, I'm sitting here saying all day long we'll take luck over school but I'll also say that all day long that extremely hardworking people with tremendous grit end up more lucky and that both of those are true. That is life is really hard working people over long periods of time who have integrity and ethics, they get lucky more often. And luck is not equally distributed to those who work hard and those who don't.
[131:06]
B
Go ahead, go ahead, go ahead.
[131:09]
E
Throw in a quick plug here. We're, we're launching this service next week. It's shaping luck.com because one of the things is that luck is non linear and in a world that's going exponential you want non linear outcomes. So if you're interested in go Go join us at a webinar.
[131:27]
B
What's it called again?
[131:27]
E
What's the shaping luck.com is the URL okay.
[131:31]
B
Awesome.
[131:31]
E
Free webinar.
[131:34]
D
But I think, Alex, the question isn't. Of course they could. Why didn't they. Why did they miss it? Why did AMD miss it? Why did. Right. Why did. For example, why did Nvidia fail for decades at everything that wasn't a gpu? They failed to build an ARM processor that worked. I think it was called Snapdragon. I think they failed at a Northbridge Southbridge part and they succeeded beyond anybody's expectations at a gpu. I think that. The same question I think holds true for the Yankees. Right? No, no, no. Why doesn't the team with the biggest money win every year in the NFL? Why? I mean there is something that we have, as in thinking about organizations and talent that we don't do a very good job of describing that says there is something that is very hard to buy and that has to be made and that we don't seem to be able to articulate it well. And buying the most talent doesn't seem to be sufficient. You have to have a lot of talent. It's necessary, but it's clearly not sufficient.
[132:53]
B
Alex, let's go to you next for a question. Want to get through our lightning round here?
[132:57]
C
Yes, of course. Suffice it to say I have some pretty different answers to some of these other questions, but I think I have to answer question number two, which it looks like might have been a response to a comment that I made in a previous POD episode. So the question is why wouldn't Sam, I think referring to Sam Altman, cut a deal with Bezos and Blue Origin to become the other counterweight to Elon. And this is for from Scott Ray Broomfield. So I think the answer is that's probably on the table. If I were Sam, I would be exploring a variety of potential heavy launch partnerships to become a counterweight To Elon and SpaceX AI's Dyson Swarm, I think heavy launch is going to become is already arguably part of a critical element of the stack now for the future of compute space.
[133:47]
B
As you know, right. New Glenn is more akin to Falcon 9 and doesn't hold a candle to Starship, which by the way, we'll be making a launch attempt probably by the time this is out. Good luck to Elon on that launch attempt. Super excited. But Starship is coming in at a factor probably 100 times cheaper.
[134:05]
C
I'm not sure that matters.
[134:08]
D
Oh, go ahead, Andrew.
[134:09]
A
Andrew, go ahead.
[134:10]
D
No, I think, Alex, all your points are right and I think that you underestimate Sam at your tremendous cost. I think what Sam has sort of done again and again in our industry is see around corners that other people missed. He was trying to lock down data center capacity in space last year and the year before when all the other foundation labs didn't see it really, they were trying to lock down memory. Oh yeah, for sure.
[134:41]
A
I didn't know that.
[134:43]
D
His ability to look at an exponential and not be afraid of what it says in two or three years while everybody else is afraid, saying, oh, we're not going to need that much, is extraordinary. And his reach is extraordinary. And so with 100% certainty, I will tell you that he is exploring deals with every possible way to get access to compute and data center capacity. And I can say that having watched from a distance, I have no inside information, but I've been dazzled by that ability of his. I think you underestimate that guy. You would think it would be enough to build the fastest growing company in the history of capitalism to get a lot of respect, right? You think that might be a sufficient feather in your cap? But I think he will certainly
[135:41]
E
be
[135:41]
D
in conversations to get compute, whether it's in space or whether it's under the sea or whether it's on, you know, using falling water, using geothermal. He will be in those conversations and his team will be there every single day. Single day.
[135:59]
B
Alex.
[135:59]
A
So cool to have another friend who's on the big, big stage. That inside perspective is awesome.
[136:06]
B
Alex, one point here is again, if you know Elon's Dyson Swarm is 500,000 satellites to a million satellites. It was like a launch every couple of minutes of a starship. You don't get that when you. Glenn. So that vehicle isn't designed for the frequency of launch that we're talking about here. So. So could Sam put up a mini constellation with Bezos? Sure. Could he put something up to really compete with what Elon's proposed? Not without new launch capability. That capability is unique on the planet. If it pans out as expected.
[136:42]
C
Yeah, a few thoughts. There are lots of options. Yes, I agree with the contention that SpaceX is completely dominating mass to orbit. No question about it, including dominating historical work mass to orbit. So if I'm Sam, I would be exploring probably a multi channel strategy. A, I'd be exploring a deal with Elon and SpaceX to leverage SpaceX launch for my own Dyson Swarm. I would be exploring alternative launch capabilities and then if you really believe, again, I think this is the elephant in this space room. If you really believe that we're on this singularity esque exponential, then the fabs don't need to be on the ground. We can build fabs in space and we may not be addicted to heavy launch five to ten years from now. And if you're Sam and you're playing the long game, then you're right. Then you're looking for ways to build fabs on the moon and in Leo that don't need the SpaceX near monopoly.
[137:44]
B
Amazing Dave, do you want to take us to question four?
[137:48]
A
Andrew, did you have another thought?
[137:51]
D
No. I think, I think building a fab on land is hard enough for me. 40 or 50 billion in five years doing something nobody else, I mean that one or two other companies in the world have ever successfully done. I mean I can't really think hard about building fabs in space knowing how
[138:17]
A
great segue to question four, which is directly related. China is building massive compute capacity. Could they sell tokens to US users at very low prices and disrupt providers? It's the same answer you just gave Andrew. Like no, China is not building massive compute capacity. When you're looking at tokens per second, the driver of AI they have. That's why they're so desperately want to import from the us so they're building as quickly as they possibly can, but it's not 5 nanometer, 4 nanometer, 3 nanometer technology. And so it's all bottlenecked at ASML machines, fab construction, everything Andrew's been talking about. So if they had the ability, they would love to do that, but they just don't have the compute.
[139:01]
D
Yeah, the dimension in which they've chosen to invest so far is in power infrastructure. And at that they're just playing better than us right now. They have upgraded their grid, they have tremendous power infrastructure and we've made bad decisions there. We are stuck with a grid that's built in the 50s, that's designed not for what we'd like it for today. And we have trouble politically at the local, at the municipal, at the state, at the federal level doing projects like infrastructure. And so what they have done is a tremendous amount of investment there. They are obviously starved of compute,
[139:42]
A
but
[139:42]
D
they're going to try and build on what they have, which is an absurd amount of power infrastructure.
[139:48]
A
Yeah.
[139:49]
E
When enterprises are going to be doing most of the token purchasing, you're not just buying the token, you're buying trust, you're buying governance, you're buying Reliability, you're buying compliance, etc. Etc.
[139:59]
C
Country, I would perhaps just add to this. It's worth noting that in the past week or so there's been quite a bit of public reporting about how China is operating proxy services that are selling American tokens to Chinese users at incredibly low prices like 10x 10x discounts in order to siphon the reasoning phrases for training their own models. And that is quite disruptive and anthropic is pursuing that.
[140:25]
B
Amazing. Andrew, we want to thank you for being on. We close out every episode with user generated content. This is sort of our. Our outro music and so let's enjoy this one. It's called We Are as Gods. A I guess comment to my new book. It's from today.
[140:46]
E
Andrew is as a. So we have to like bow down to.
[140:50]
C
I'm.
[140:51]
D
I think being a CEO is sufficient man. I don't want the responsibility.
[140:56]
B
Congratulations.
[140:57]
D
I don't need any of that at all.
[140:58]
B
Congratulations on an epic ipo. Amazing. Amazing. All right, let's listen to We Are as Gods by Mosad Zamani. All right, enjoy.
[141:16]
D
Land. Everyone's a founder now.
[141:21]
A
Abundance is the land.
[141:25]
D
AI is the le.
[141:28]
B
The future is forever.
[141:32]
A
The mission is your soul.
[141:36]
B
Take the full control.
[141:44]
D
Oh.
[141:45]
B
Peter sees the vision and Alex writes the code while David finds the logic.
[141:50]
A
Insoleem leads the road.
[141:52]
D
The agency of 1.
[141:54]
B
See the power rise, the exponential life
[141:57]
A
beneath the digital skies.
[142:02]
B
All right, that was a good one.
[142:05]
D
Very cool.
[142:06]
B
Again, thank you for joining us. Gentlemen, always a pleasure. I think we could have kept those conversations going for a couple more hours.
[142:14]
A
Easily.
[142:14]
E
Easily, yeah.
[142:16]
D
Thank you for having me. Really appreciate it. Be well now.
[142:19]
A
Thank you.
[142:19]
C
Thanks Andrew.
[142:20]
B
If you made it to the end of this episode, which you obviously did, I consider you a moonshot mate. Every week my moonshot mates and I spend a lot of energy and time to really deliver you the news that matters. If you're a subscriber. Thank you. If you're not a subscriber yet yet, please consider subscribing so you get the news as it comes out. I also want to invite you to join me on my weekly newsletter called Metatrends. I have a research team. You may not know this, but we spend the entire week looking at the meta trends that are impacting your family, your company, your industry, your nation. And I put this into a two minute read every week. If you'd like to get access to the Metatrends newsletter every week, go to diamandis.com metatrends that's diamandis.com metatrenDS thank you again for joining us today. It's a blast for us to put this together every week.