Summary8 min read

Y Combinator Startup Podcast: "How to Build the Future with Demis Hassabis"

Date: April 29, 2026
Guest: Demis Hassabis, CEO of Google DeepMind
Host: Gary (Y Combinator)

Episode Overview

This episode features a thought-provoking conversation with Demis Hassabis, co-founder and CEO of Google DeepMind, Nobel laureate, and one of the pioneering minds behind transformative AI projects like AlphaGo, AlphaFold, and Gemini. The discussion delves into the current and future state of artificial general intelligence (AGI), key unsolved problems in the field, the evolution from deep reinforcement learning to multimodal models, and the implications of these advances for science, startups, and society at large. Hassabis shares candid insights from the frontlines of AI research, practical reflections for aspiring founders, and a glimpse into how science and foundational technology will be transformed by rapidly advancing AI.

Key Discussion Points and Insights

1. The Road to AGI: What’s Missing?

Current Progress: Hassabis affirms that components like large-scale pre-training, RLHF (Reinforcement Learning from Human Feedback), and chain-of-thought reasoning are foundational and likely to be integral to AGI (02:08).
Unsolved Challenges:
- Continual learning, long-term reasoning, memory, and consistency remain unsolved, though they may only require "one or two big ideas" to crack (00:00–02:08).
- "You have to have an active system that can actively solve problems for you to get to AGI. So agents are that path, and I think we're just getting going." (Demis Hassabis, 00:00 & 15:40)

2. Memory, Context, and Learning Paradigms

Context Window Limitations: Current models brute-force memory by storing "everything," leading to inefficiency. Human brains are much more selective, integrating new information into existing knowledge via processes like REM sleep and episodic memory consolidation (03:46–06:07).
Room for Innovation: True context-aware, continual learning is seen as necessary for adaptable agentic systems.
- "There's still a cost to looking [memory] up and finding the right thing that's actually relevant for the specific decision you've got to make right now." (Demis Hassabis, 03:46)

3. Reinforcement Learning and the DeepMind Philosophy

RL’s Lasting Relevance: DeepMind's foundational focus on agentic systems (e.g., AlphaGo, AlphaStar) continues to inform Gemini's development; old ideas—like Monte Carlo tree search and world modeling—are being revisited at scale for modern foundation models (06:07–07:59).
- "Really you can think of a lot of the things we're doing today, all the leading models with thinking modes and chain of thought reasoning as aspects of what was sort of pioneered with AlphaGo coming back now." (Demis Hassabis, 06:23)

4. Model Scaling and Distillation

Balancing Scale and Efficiency:
- DeepMind emphasizes building the largest models ("frontier") but quickly distilling them into smaller, efficient ones for practical deployment in global products (07:59–09:34).
- "One of our biggest strengths has been distilling and packing that power into smaller and smaller models very quickly." (Demis Hassabis, 08:22)
Edge Use Cases:
- Smaller models enable fast iteration, privacy, and running on devices like phones and robots (11:03).

5. Continual Learning and the Developer Experience

Barriers to Agency:
- The absence of continual learning still limits agents from autonomously adapting to user-specific contexts. Adapting on the fly is seen as crucial for true utility (12:33–12:46).
- "I think that's one of not having continual learning currently is one of the things holding back agents from doing full tasks." (Demis Hassabis, 12:46)

6. Reasoning and Introspection

Imperfections in Reasoning:
- Foundation models perform "jagged intelligence"—they can solve extremely hard problems but still make basic errors (e.g., in math or games) due to incomplete introspection and insufficient chain-of-thought monitoring (13:26–15:26).
- "There's just something to me about almost an introspection about its own thought process that I feel like there's, there's something maybe missing there." (Demis Hassabis, 15:01)

7. Agentic Systems and Hype vs. Reality

State of Play:
- While agents are "just getting started," real breakthroughs—in terms of truly autonomous, creative outputs—are still emerging. So far, efficiency gains seem to benefit human operators first, with full autonomy yet to be realized (15:26–18:25).

8. Creativity and AI

Beyond Mastery to Invention:
- AI’s next leap is moving from brilliant play (like "move 37" in AlphaGo) to inventing new structures altogether ("Can it invent Go?"). Hassabis notes we haven't yet seen a system create like this but doesn't dismiss the possibility—perhaps the right creative human using existing tools could achieve it (18:31–20:19).

9. Open Source and Democratizing AI

Gemma Models:
- DeepMind's open-weight models aim to empower users by making high-quality models available directly for local or edge use—critical for privacy and enabling diverse builders globally (20:19–22:19).
- "We're huge proponents of in general of open source and open science. [...] We want as many people as possible build on it and of course, we'll be building on that too." (Demis Hassabis, 20:42)

10. Multimodality as an Advantage

Gemini’s Multimodality:
- Gemini was designed from the outset to synthesize multiple data types. This has proven crucial for robotics, world modeling, and interactive assistants and is now a key competitive advantage (22:45–24:01).

11. Science, AI, and the Next 10 Years

From Proteins to Virtual Cells:
- After AlphaFold’s breakthrough in protein structure, DeepMind and spinoff Isomorphic Labs are pushing toward simulating cellular systems, with a "virtual cell" possibly a decade away—pending advances in imaging and data capture (25:38–28:17).
- Choosing scientific domains for AI: Look for problems with massive search spaces, a clear objective function, and accessible data or good simulators (33:34–35:13).

12. Science Startups: Real Innovation vs. Derivative Products

Advice to Founders:
- Seek intersections between AI and deep, hard technological challenges in the physical world—biotech, material science, etc.—as these are less susceptible to being made obsolete by rapid foundation model progress (30:49–32:34).

13. Meta-Science and “AI Scientists”

Next Horizon for Discovery:
- AI systems are getting closer to true scientific reasoning, not just pattern matching, but haven’t yet made paradigm-shifting discoveries; Hassabis outlines a tantalizing “Einstein test” for eventual success (35:32–37:52).
- "But even harder than that would be to come up with a new set of Millennium Prize problems that were regarded by top mathematicians to be as deep and meaningful and worthy of lifetime of study and effort to solve. I think that's another level harder. And we don't have [that] yet." (Demis Hassabis, 36:32)

14. Reflections for Founders: Working on the Frontier

Choosing Challenges Wisely:
- Working on deep, consequential problems isn’t inherently harder than chasing superficial ones—it’s just “differently difficult,” and likely more worthwhile (38:20).
- Founders should anticipate AGI’s arrival potentially halfway through multi-year deep tech projects and consider what that means for their company’s strategy.
- "You might as well put your life force into something that will really make a difference if you hadn't done it, if you hadn't been there to push it." (Demis Hassabis, 38:20)

Notable Quotes & Memorable Moments

On what’s missing for AGI:
“Continual learning, long term reasoning, some aspects of memory, these are still unsolved. I think all of these are going to be required for AGI.” (Demis Hassabis, 00:00)
On current memory in AI:
“There's still a cost to looking [memory] up and finding the right thing that's actually relevant for the specific decision you've got to make right now. And that's non-trivial, that cost.” (Demis Hassabis, 03:46)
On foundational ideas from RL:
“Really you can think of a lot of the things we're doing today, all the leading models with thinking modes and chain of thought reasoning as aspects of what was sort of pioneered with AlphaGo coming back now.” (Demis Hassabis, 06:23)
On the limits of model distillation:
“I didn't really see any limit yet in terms of like some kind of theoretical limit. I think we're still pretty far off of that.” (Demis Hassabis, 09:47)
On agents today:
“I still think we're in the experimentation phase. We haven't seen a AAA game that tops the app store charts that was sort of vibe coded yet, right?” (Demis Hassabis, 15:40)
On the next level of creativity:
“Can it invent Go? That's what I want, a system that can invent Go. If you give it a high level description like, ‘a game you can learn the rules of in five minutes, but it takes many lifetimes to master.’” (Demis Hassabis, 18:31)
On open source:
“We’re huge proponents of in general of open source and open science... we publish in, you know, the big journals. We wanted to create world leading models for their sizes and so that's what hopefully we've done with Gemma.” (Demis Hassabis, 20:42)
On scientific AI breakthroughs:
“The problems I like to look for are great if this situation can be described as massive combinatorial search space, the more massive the better in some ways... and then you have a clear objective function.” (Demis Hassabis, 33:34)
On advice to founders:
“Given life's very short and you only have so much time and energy, you might as well put your life force into something that will really make a difference if you hadn't done it, if you hadn't been there to push it." (Demis Hassabis, 38:20)

Important Timestamps and Segments

00:00–02:08 – Demis on What's Missing for AGI
03:46–06:07 – Memory, Context, and Learning in Models
06:07–07:59 – DeepMind’s RL history and influence on Gemini
08:22–09:47 – Model distillation and efficiency
12:46–13:26 – Barriers of continual learning for agents
13:37–15:26 – Shortcomings in model reasoning and introspection
15:40–20:19 – The state (and limits) of current agentic AI, creativity, and bottom-line value
20:42–22:19 – Open source AI: Gemma models, motivations, and implications
22:45–24:01 – Gemini’s multimodality and robotics
25:38–28:17 – From AlphaFold to modeling entire cells, data bottlenecks in science
30:49–32:34 – Building defensible startups at the AI–deep tech intersection
33:34–35:13 – Criteria for AI breakthroughs in scientific domains
35:32–37:52 – The "Einstein test": When will an AI make truly novel discoveries?
38:20–end – Hassabis’ advice for founders and reflections on building at the frontier

Tone and Style

The conversation is deeply technical but also visionary and encouraging to founders and builders. Hassabis maintains an optimistic yet realistic outlook, clearly articulating the state of the art, its limitations, and the strategic nuances of working at the very edge of what's possible in AI.

For Listeners: Key Takeaways

AGI is not just a matter of scale; fundamentally new approaches to memory, learning, and reasoning are needed.
Innovation in scientific domains—especially those intersecting deep tech and real-world data—remains highly defensible for startups, given rapid progress in core AI models.
The ability to combine large foundation models with specialized tools will be central as we approach AGI.
Open, accessible AI (like Gemma) is considered crucial for both privacy-centric and edge applications.
Profound advances in science, creativity, and productivity will come from AI, but these still require human vision, taste, and craft.
True AI creativity—systems that invent rather than extrapolate—remains just out of reach, but possibly not for long.

Loading summary

Transcript52 lines

[00:00]
Demis Hassabis
Continual learning, long term reasoning, some aspects of memory, these are still unsolved. I think all of these are going to be required for AGI depending on what your AGI timeline is. You know, mine's like 2030 or something like this. Then if you start off on a deep tech journey today, you have to just consider AGI appearing in the middle of that journey. It's not bad necessarily, but you have to take that into account. You have to have an active system that can actively solve problems for you to get to AGI. So agents are that path and I think we're just getting going.
[00:40]
Gary
Demis Hassabis has had one of the most unusual careers in tech. He was a chess prodigy as a kid, then designed his first hit video game theme park at 17. He then went back to school, got a PhD in cognitive neuroscience, published foundational work on how memory and imagination work in the brain, and then in 2010 co founded DeepMind with one mission, solve intelligence. And I think they've done it since then. His lab has gone on to do things most people thought were decades away. AlphaGo beat a world champion at Go AlphaFold Cracked Protein Structure Prediction, a 50 year grand challenge in biology and they gave it away for free to every scientist on Earth. That work won him the Nobel Prize in chemistry last year. Today Demis leads Google DeepMind where he's building Gemini and pushing toward the same goal he set when he was a teenager, artificial general intelligence. Please welcome Demis Hassabis. So you've been thinking about AGI longer than almost anyone. When you look at the current paradigm, large scale, pre training, RLHF, chain of thought. How much of the final architecture for AGI do you think we already have and what's fundamentally missing right now?
[02:09]
Demis Hassabis
Well, first of all, thanks Gary for that great introduction and it's great to be here. Thanks for welcoming here. It's an amazing space actually. I'm going to have to come back here often. Very inspiring that you will get to work, work in this space. So the question is, I think the components that you just mentioned, I'm pretty sure will be part of the final architecture for AGI. So I think they've come such a long way now and we've proven out so many things about what they can do. I can't see a world in which we will sort of realize in a couple of years this was a dead end. That doesn't make sense to me. But there still might be one or two things missing on top of of what we already know works. So continual learning Long term reasoning, some aspects of memory, these are still unsolved and how to get the systems to be more consistent across the board. I think all of these are going to be required for AGI. Now it might be that the existing techniques can just scale up to that with some innovation and some incremental innovation, but it could be that there's still one or two big ideas left that need to be cracked. I don't think it's more than one or two if there are out there. And I think, you know, my betting is about 50, 50 if that's the case. So of course at DeepMind, at Google DeepMind, we work on both those things.
[03:29]
Gary
I guess that's, I mean working with a bunch of agentic systems. The wildest thing to me is to what degree it's the same weights over and over. So this idea of continual learning is so interesting because like, you know, right now we're sort of cobbling it together with duct tape. Yes, these dream cycles at night and things like that.
[03:47]
Demis Hassabis
Yeah, it's pretty cool, the dream cycles. And we used to think about this with consolidation with episodic memories. Actually that's what I studied for my PhD is how the hippocampus works and integrates, you know, new knowledge gracefully into the existing knowledge base. So the brain does that amazingly well. It does it, you know, during sleep, especially things like REM sleep, replaying back episodes that are important so that you can learn from it. In fact, our very first Atari program, dqn, one of the ways it was able to master Atari games was by doing experience replay. So we sort of borrowed that from, from neuroscience and replayed successful trajectories many times. You know, that's way back in 2013 now in the, in the dark ages of AI, it was a really important thing. And I agree with you, we're kind of using duct tape right now. So like shove it all in the context, wind this. But this seems a bit unsatisfying, right? And actually even though we're working on machines, not biological brains and so potentially you could have, you know, millions or tens of millions size context window or memory and it can be perfect. There's still a cost to looking it up and finding the right thing that's actually relevant for the specific decision you've got to make right now. And that's non trivial, that cost. Even if you can potentially store it all, I think there's actually a lot of room for innovation in areas like memory.
[05:12]
Gary
Yeah, I mean the wild thing is it feels like a million token context Ones is actually bigger than. I mean, it's plenty big. Honestly, you can do so well.
[05:20]
Demis Hassabis
It's plenty big for most things that it should be used for. I mean, if you think about the context windows sort of equivalent to working memory, you know, humans have. We have like a few digits, you know, it's like a dozen digits, maybe, you know, average of seven. We got million or, you know, 10 million context windows. But the problem is that we're trying to store everything in that, you know, things that aren't not important, things that are wrong. It's pretty brute force currently, and that doesn't seem right. And then the problem is if you're then trying to try and process live video and you're just going to naively record all the tokens, then actually a million tokens isn't that much. It's only like 20 minutes. So actually you need more if you want something that's going to understand your, you know, your. What's going on in your life over maybe a month or two.
[06:08]
Gary
DeepMind has historically leaned into reinforcement learning and search. AlphaGo, AlphaZero and MU0. How much of that philosophy is actually embedded in how you're building Gemini today? Is RL still underrated?
[06:24]
Demis Hassabis
Yeah, I think potentially it is. It sort of goes in ebbs and waves. We know we've worked on agents since the beginning of DeepMind. In fact, that was what we said we were working on. So all of the atari work and AlphaGo, most specifically, they're agent systems. And what we meant by that is systems that are able to, you know, accomplish goals on their own and make active decisions and make plans. And so of course, we were doing it in the domain of games to make it tractable and then doing increasingly complex games, things like Starcraft after AlphaGo, AlphaStar. So we basically did all the games that are out there. And then of course the question is, can you generalize those models to be world models or models of language, not just models of simple games or even complex games. And that's what the last few years has been about. But really you can think of a lot of the things we're doing today, all the leading models with thinking modes and chain of thought reasoning as aspects of what was sort of pioneered with AlphaGo coming back now. And I actually think there's a lot of work we did back then that is relevant today. And we're sort of relooking at some of those old ideas at scale today in a more general way, including things like Monte Carlo tree search and other other ways of doing, augmenting the RL on top of the reinforcement learning we're ready to do today. And I think a lot of those ideas, both from AlphaGo and AlphaZero are really, really relevant to where we are with today's foundation models. And I think a lot of that is what we're going to see of the advances the next few years.
[08:00]
Gary
One question I would have, like, obviously today you need bigger and bigger models to be smarter and smarter, but then we're also seeing distillation working and then smaller models can be like quite a bit faster. I think. You know, you guys have incredible flash models that are like, you're finding that they're 95% as good as the frontier and at like 1/10 the price. Is that right?
[08:23]
Demis Hassabis
I think that's one of our core strengths is, I mean you have to build the biggest models to have the frontier capabilities. But I think one of our biggest strengths has been distilling and packing that power into smaller and smaller models very quickly. Obviously we invented the kind of distillation process and people like Jeff and Oriole and others, and we're still world experts in that. And we also have a huge need to do it because we've got to serve the biggest, probably AI surfaces. There are obviously there's search with AI overviews and AI mode, then there's Gemini app. And now increasingly every single product at Google has, you know, maps and YouTube and so on has some aspect of Gemini or Gemini related technology in it. And so that's billions of users, a dozen, more than a dozen billion user products. And they have to be served extremely fast, extremely efficiently and cheaply and with low latency. So that, that gives us a really important incentive to, to make these flash and even smaller models, flashlight models, extremely efficient. And hopefully that ends up then being really useful for many of the workloads that all of you use for.
[09:35]
Gary
I'm curious about how much smarter these smaller models can actually be. Like, are there limits to the distillation process? Like could a 50B or 400B model be as smart as like a Mythos for today?
[09:48]
Demis Hassabis
Yeah, I don't see any. I don't think we've got to any kind of, or at least none of us know yet if we've got to any kind of information or limit. I mean, maybe at some point that will be the case where there's just an information density that can't, we can't get beyond. But I think for now there's that the assumption we make is that, you know, A year later, after one of our leading, you know, pro models or frontier models goes out, half a year later, a year later you'll have them in the really tiny, almost edge models. And you also see some of that goodness in our Gemma models which hopefully you're all enjoying. Our Gemma 4 models, which I think are really amazing power for their sizes. So again that uses a lot of this, these distillation techniques and, and the idea of how to make things really efficient in these very small models. So I didn't really see any limit yet in terms of like some kind of theoretical limit. I think we're still pretty far off of that.
[10:39]
Gary
That's amazing. I mean that is really good.
[10:41]
Demis Hassabis
Yes.
[10:43]
Gary
You know, one of the weirder things that we're seeing right now is like engineers can do like 500 to 1000 times the amount of work that they were doing like six months ago, I guess. I mean the people in this room there are people who are doing about like a thousand X the work that like Steve Yegi talks about this, it's like a thousand X the work that a Google engineer from the 2000s was doing.
[11:04]
Demis Hassabis
I think it's very exciting. I mean, I think the small models have many uses. One is obviously cost, but the speed can allow, you know, if you think about coding even or other things, you can iterate a lot faster also, especially if there's, if you're collaborating with the system. I think there's a, there's a lot of need for having fast systems, systems that maybe are not quite frontier like you said, like 95%. 90%, but that's plenty good enough. And actually you gain back more than the 10% on the iteration speed. So. And then the other big thing I think is running these things on the edge again for efficiency reasons, but also for privacy and security reasons too. If you think about different devices that you might run these systems on that, you know, process very personal information. You can also think about robotics as well, you know, robots in your house. I think you're going to want very efficient, very powerful local models which may be orchestrated, you know, with some bigger models, frontier models that are in the, in the cloud. But you only delegate to that in certain circumstances. And perhaps you, you know, you process all of the audio visual feed, let's say locally and that stays local. I could imagine that would be a very good sort of end state.
[12:19]
YC Startup School Announcer
YC startup school is back. We're hand selecting the most promising builders in the world and flying them out to San Francisco for July 25th and 26th to discuss the cutting edge of tech, apply now for a spot. Okay, back to the video.
[12:33]
Gary
Going back to context and memory models currently stateless but you know, continue. Like what would the developer experience even be like for someone who's using a continual learning model? Like, you know, any idea, like how you'd steer it?
[12:47]
Demis Hassabis
I think it's really interesting. I think that's one of not having continual learning currently is one of the things holding back agents from doing full tasks. I think they're really useful for aspects of tasks right now and you can patch them together and do some really cool things, but they don't adapt well with the context that you're in. And I think that's the missing piece for them being really kind of fire and forget and they'll figure it out themselves. I think they need to be able to learn about the specific context that you're going to put them in. So I think we have to crack that to get full general intelligence.
[13:26]
Gary
Where are we on reasoning? So models can do really impressive chain of thought now, but they still fail on things smart undergrad wouldn't. What specifically needs to change and what progress do you expect in reasoning?
[13:38]
Demis Hassabis
There's a lot of innovation left in the thinking paradigms, I would say. Again, I think we're fairly, we're doing fairly simplistic things, fairly brute force, one could imagine. I think there's a lot of scope, for example, in monitoring the chain of thought, maybe interjecting midway through a thought process. I often get the impression with our systems and our competitor systems that they're almost overthinking, they're almost getting into sort of loops of things. Like one thing I sometimes like to do is play chess against Gemini. And you know, it's that all the leading foundation models are pretty poor at games, which is quite interesting. It's very cool to kind of look at the thinking traces because obviously these are, can be a well understood. You know, I can tell quite quickly if it's going off on a tangent and it's very sort of provable what the, what the thinking is doing, whether it's useful or not. And so what we see is that, you know, sometimes it will, it will, it will consider a move, it will realize it's a blunder, but it can't find anything better. So it kind of goes back to that move and does it anyway. So, you know, you just shouldn't be seeing that happening in a, in a very precise reasoning system. So there's just sort of huge gaps I think still. But it may only be one or two tweaks that are required to fix those kind of gaps, just to be clear. But I think that's pretty, pretty obvious there are there and that's why you get this kind of jagged intelligence. You know, on the one hand it can solve gold medal problems in imo, which is super hard. But on the other hand, as we've all seen, it can still make basic elementary maths errors if you pose the question in a certain way. Right. So. Or elementary reasoning errors. So there's just something to me about the almost an introspection about its own thought process that I feel like there's, there's something maybe missing there.
[15:27]
Gary
Agents are really big. Some would say they're hyped. I personally think they're just getting started. It's totally insane. What does DeepMind's internal research tell you about where agent capabilities actually are right now versus you know, sort of the hype out there?
[15:40]
Demis Hassabis
I think we are, I agree with you. I think we're just at the beginning. You have to have an active system that can actively solve problems for you to get to AGI. That was always clear to us. So agents are that path. And I think we're just getting going. I think all of us are getting used to how do we best work. And you're leading the way in a lot of this in your own personal experiments. I'm sure many of you are doing that. I think how do you incorporate it into your workflow in a way that isn't just sort of a nice to have but actually starting to do fundamental things. My impression is at the moment we're all experimenting on lots of things, but we're only in maybe the last couple of months starting to find the really valuable places and the technology is probably only getting good enough for that to be the case. Right. That it's not a kind of toy. Nice demonstration. But actually really adding value to your time and efficiency. I often wonder. I see a lot of people working on like setting off, you know, dozens of agents for like 40 hours but I'm not sure I've seen the output that yet of that quite justify that level of input going in. But I think it will come. So I still think we're in the experimentation phase. We haven't seen a AAA game that tops the app store charts that was sort of vive coded yet. Right. I've seen and I've programmed and I'm sure many. We've all done little nice demonstrations and it's like amazing. I can do a protot type of theme park in half an hour now, which took me six months back when I was 17. It's kind of mind blowing and I, and I wish I got this feeling if I spent the whole summer working on it, you could make something really incredible. But it still needs craft and you know, human sort of soul into it. And taste, I think that's, that's something that can, that's. You have to make sure you still bring that to, to whatever it is you're building. And I think it still shows like it's not quite there yet because why haven't we seen a kid making a hit game that's, that sells 10 million copies? Right? That should be possible given the effort that's gone in. So something's still somehow missing. Maybe it's to do with the process or maybe it's to do with the tools. I'm not quite sure. You all probably know better than me because I'm sure you're all experimenting on that. But I haven't seen the result yet, which I would expect once this is really delivering that full value which I think will come in the next six to 12 months.
[17:58]
Gary
Some of it is like how much of it will be autonomous versus I mean I don't think we'd see autonomous first. We would actually probably see people in this room operating at a thousand X
[18:07]
Demis Hassabis
and then that's what you should see first. And then many of you, you know, there'll be like games companies or you know, other types of companies that have built some kind of best selling app, best selling game using these tools. That's what you should see first. And then more of that will get automated.
[18:25]
Gary
I mean some of it is like there's a human in there. And then the human doesn't want to say that the agents did it yet.
[18:32]
Demis Hassabis
I think part of it might be though that we want to discuss creativity. What I often say about that is if we look at the things we've done like AlphaGo so obviously very famously, you'll all know about the move 37 in game two. And for me I was waiting for a moment like that to start the science projects like AlphaFold. So we started AlphaFold like the day we got back from Seoul, which is 10 years ago now. I'm going to Korea after this to celebrate the 10 year anniversary of AlphaGo. But it's not enough to come up with MU37 like that's pretty cool, very useful. But can it invent go? That's what I want, a system that can invent Go. If you give It a high level description, like a game you can learn the rules of in five minutes, but it takes many lifetimes to master. It's beautiful aesthetically, but you can play it in a few hours, in an afternoon. So maybe you could imagine that would be the high level description I would give and then I'd want the thing I get back is go right? And clearly today's systems I think can't do that. So the question is why? And I think there's something still missing there.
[19:43]
Gary
Well, someone in this room might make it.
[19:45]
Demis Hassabis
Then the answer would be there's nothing missing. It just was the way we were using the systems. And that might actually be the answer. It might be that today systems are capable of that. With a brilliant enough creative person using it and providing that impetus that the soul of the project and being able to probably being au fait enough with the tools to like almost be at one with the tools. I could imagine that would be happening if you experimented with the tools all day and all night, like probably many of you are doing that and you combine that with proper deep creativity, something more incredible could be done.
[20:20]
Gary
Switching gears to open source or open weights. I mean the recent release of Gemma, you're making highly capable open and accessible ones that can actually run locally. What do you think that means for will AI be something that is in the hands of the users instead of primarily in the cloud? And does that change who gets to, you know, build with these models?
[20:43]
Demis Hassabis
We're huge proponents of in general of open source and open science. And you mentioned Alphafold at the beginning. You know, we put that all out there for free and all of our science work, even still today we publish in, you know, the big journals. We wanted to create world leading models for their, their sizes. Right. And so that's what hopefully we've done with Gemma. And we're, you know, very committed to that path. And hopefully you will experiment and build and enjoy using Gemma. I think it's been like 40 million downloads now and just in, you know, two and a half weeks. So we're really excited about that. And I also think it's important for there to be western stacks on open source. You know, obviously a lot of the Chinese models are excellent and they're currently we're leading in open source and we think Gemma is very competitive for its sizes in all those respects. And for us, I mean there is a question of resources, talent and computer. Like nobody has enough spare compute to just make two frontier models at maximum size. Right. With different attributes. So that's pretty Difficult. But also for now, what we've decided is that our edge models, the things we want to use for Android and glasses and robotics, it's best that they're open models because they're vulnerable anyway once you put them out on the surfaces. So they might as well be actually fully open. Right. So we've sort of made a decision to kind of unify that at the, at the kind of, we call it nano size level. So that actually works for us strategically as well. And, you know, we hope as many people as possible build on it and of course, we'll be building on that too.
[22:19]
Gary
Earlier, before we came on, I got to show you a demo of my version of Samantha from her, which is harrowing for me to try to demo something to you. And it worked, which is amazing. Gemini was built multimodal and I spent a lot of time with a bunch of the models and I mean the depth of the context and the tool use with speech directly to model, there's nothing like, bar none, like the best one, actually.
[22:45]
Demis Hassabis
Yeah, I think that's still a slightly underappreciated aspect of the Gemini series is we started it being multimodal from the start. That made it a little bit more difficult actually to begin with, because than just focusing on text, for example. But we believe we're going to gain from that in the long run and I think we're seeing that now for things like world model building. So stuff like GENIE that we build on top of Gemini, I think it's going to be really important for things like robotics. So this is why Gemini Robotics, as many of you probably played around with, I think it's going to be built on multimodal foundation models, the robotics models, and we think we have a sort of competitive advantage with, with Gemini being so strong at multimodal, we're using it increasingly in things like Waymo. But also if you imagine devices and assistants that digital assistants that come with you into the real world, maybe on your phone or glasses or some other device, it needs to understand the physical world around you and intuitive physics and the physical context you're in. And that's what our systems are extremely good at. And I think you found that's why you've enjoyed using it in your setup. We're planning to continue on that and I think we're far and away the strongest models on those types of problems.
[24:02]
Gary
So the cost of inference is dropping fast. What becomes possible when inference is essentially free and how does that change what your team is actually optimizing for?
[24:11]
Demis Hassabis
Yeah, I'M not sure inference will ever be essentially free. I mean, there's sort of Jevons paradox and other things about, like, I think we'll just end up using. All of us will end up using whatever we can get our hands on. And you can imagine millions of agents, swarms of agents working together on things. That's one way to use the inference. Or you could imagine single agents or groups, smaller groups of agents, thinking in multiple directions and then ensembling that. So we're experimenting with all these things. Probably many of you are. All of that will use up any inference, I think, that's available. I mean, one day maybe it can be almost cost zero. Certainly the energy. If we solve fusion or superconductors or optimal batteries or some set of those things, which I think we will do with material science, energy costs will be essentially zero, but they'll still be the physical creation of the chips and other things. There'll be some bottleneck, at least for the next few decades, I think. And so if that's the case, there'll still be rationing on the inference side. You'll still have to use it, I think, efficiently.
[25:18]
Gary
Yeah. Well, luckily the smaller models are getting smarter and smarter, which is fantastic. We got a lot of bio and biotech founders in the audience. I can see a few. AlphaFold3 took us beyond proteins to a broad spectrum of biomolecules. How close are we to modeling full cellular systems? Or is that still a fundamentally harder problem in a class of its own?
[25:39]
Demis Hassabis
Well, isomorphic labs, which we spun out from DeepMind after we did AlphaFold 2, which is going amazingly. It's trying to build out not just AlphaFold it's just one piece of the drug discovery process, as many of you know. But we're trying to do the adjacent biochemistry and chemistry to design the right compounds with the right properties and so on. We'll have some big announcements very soon to talk about on that front. I think that's going really well. Eventually you want a whole virtual cell. So I've talked about this in many of my science talks about a full working simulation of a cell that you can perturb. And then the, you know, the, the outputs of that would be close enough to experimental that it's useful. Right. You could skip out a lot of the search steps and generate lots of synthetic data to train other models that then would predict things about, you know, real cells. And I think we're about 10 years away, probably from something like a virtual cell, like a full virtual cell. You know, we're starting out this is we're working on the DeepMind side, science side on a virtual nucleus cell nucleus first because relatively self contained. The trick with all of these things is can you pick a slice of the complexity? Eventually you want to model a human body, but can you model it down to the right level of detail and what slice can you take out of it that will be self contained enough? You can kind of model and approximate the inputs and outputs into that self contained system and then just focus on the self contained system. So a nucleus is quite interesting from that perspective. Then the other issue is just there's not enough data yet. So you need data and I talk to various, you know, top scientists about who work on electron microscopes and other imaging things. If we could image a live cell without killing the cell, that would be game changing obviously because then you could convert it into a vision problem which we would know how to solve. Right. But at the moment there are at least I'm not aware of any techniques that can give you a kind of, you know, nanometer resolution but without destroying. But in, you know, in a live dynamic cell so you can see all the interactions, right. You can take static images at that resolution obviously really detailed now and that's quite exciting. But it's not enough to turn it just into, just into a complex vision problem. So that's one way it could be solved. So it could be a hardware driven, data driven solution or it could be that we build better learned simulators of these dynamical systems. So that's the more modeling way of solving it.
[28:18]
Gary
You've been looking at all kinds of science, not just bio. There's material science, drug discovery, climate modeling, mathematics. If you had to rank which scientific domain will transform the most dramatically the next five years, what's in your list?
[28:31]
Demis Hassabis
Well, they're also exciting and that's why, I mean that for me has been my main passion and always the reason why I've worked on AI for my whole career for 30 plus years now is to use AI as the ultimate tool. I always thought AI would be the ultimate tool for science and to advance scientific understanding, scientific discovery and things like medicine and just our understanding of the universe around us. So actually when you mentioned our original way we used to articulate our mission statement, which is still the way we think about it is there was two steps to it. Step one was solve intelligence, I. E. Build AGI and then step two was use it to solve everything else. We had to change that a bit over time because people were like do you really mean Solve everything else. And we did mean that. And I think people are sort of understanding what that means today. But specifically I was meaning solve other, what I call root node problems in science. So areas of science that would unlock whole new branches or avenues of discovery. And AlphaFold is the prototypical example of what we want to do. So over 3 million researchers around the world, pretty much every biology researcher in the world, uses AlphaFold now. And I was told by some of my pharma executive friends that almost every drug discovered from now on will have used AlphaFold at some point in the drug discovery process. So that's something we're very proud of and it's the sort of impact that we hope to have with, with AI. But I do think it's just the beginning. I don't really see any area of science or engineering that this won't be able to help be helpful with. And the ones you mentioned, I think we're almost like an alpha fold one moment. So we've got very promising results, but it's not quite solved the grand challenge yet in that domain. But I think we're going to have a lot to talk about in the next couple of years on all those areas. You mentioned materials, which I think is very exciting all the way to mathematics in science.
[30:19]
Gary
I mean, it feels Promethean. It's like here is this capability and
[30:23]
Demis Hassabis
you know, I think so. I mean, of course, along with that, including what the parable of Prometheus, we have to also be careful with how we use that and what we use it for. And also the misuse that can happen with those same tools.
[30:37]
Gary
A lot of people in this room are trying to build companies applying AI to science for them. What's the difference between a startup that actually advances the frontier in your view, versus one that's just wrapping an API around a foundation model and calling it AI for science?
[30:50]
Demis Hassabis
Well, look, I think there's one of the things I would recommend I'm trying to think about, and I think you mentioned this to me before, what would I do today myself if I was sitting in your place in Y Combinator? You know, looking at things, one thing you have to do is obviously intercept where the AI tech is going. So that's one hard part of it. But I do think there's huge scope for combining where AI is going with some other deep technology area. I just think that that sweet spot is whether it's materials or medicine or other really hard areas of science, I think those kinds of interdisciplinary teams, especially if it involves the world of atoms, as well. There's not going to be a shortcut to that, at least in the foreseeable future. Those are areas that are pretty safe from just getting swarmed by whatever the next update is to the foundation models. So I think if you're looking for things like that, that's one of the more defensible areas, I would say. And I've always loved deep tech, so I'm kind of biased towards deep tech things. I think nothing that's really long lasting and worthwhile is easy and so I'm always being drawn to deep technologies. Obviously, AI was like that back in 2010 when we started out, right? It was, it was thought to just we know, we know it doesn't work kind of thing is what I was told by investors. And even in academia it was considered to be a very niche subject that we sort of tried in the 90s and we know doesn't work. But if you, you know, if you have belief and conviction in your idea, why it's different this time or what special combination from your background that you had, ideally your expert in both those areas, both the machine learning and the other area you're applying it to, or you can create a founding team with that expertise. I think there's huge impact to be made there and huge value to be built there.
[32:35]
Gary
That's a really important message. I mean even, I mean it's easy to forget. Like basically once you've done it, you've done it, but before you've done it, people are arrayed against you.
[32:44]
Demis Hassabis
Oh sure. I mean, no one believes in it, which is why I think you've got to, you've also got to work in things that you're genuinely passionate about. Like for me, I would have worked on AI no matter what happened. I just decided from a very young age it was the thing that could be the most consequential thing I could think of. It's turned out that way, but it might not. Maybe we would have been 50 years too early. And it was also the most interesting thing I could think of working on. And so I would still be working on AI today, even if we were still, you know, in a little garage somewhere and it still wasn't quite working. I would have still been trying to find, maybe I'd have been back in academia or something, but I would have found some way of continuing to work on it.
[33:24]
Gary
So, I mean, Alphold was like an example of a spike that you pursued and it worked. You know, what makes the scientific domain ripe for an alphafold style breakthrough and is there a pattern, a certain objective
[33:35]
Demis Hassabis
function like think the way I, I should write this up at some point when I have five minutes spare. But the lesson I've learned from all the alpha projects we've done, specifically AlphaGo and AlphaFold is I think the techniques we have and the problems I look like to look for are great in if this, if the situation can be described as massive combinatorial search space, the more massive the better in some ways. So no brute force or special case algorithm will, will solve it. And that's true of GO moves and of different configurations of proteins, far more than the atoms in the universe, both of those and then you have a clear objective function. So you could think of it as minimizing the free energy in the proteins or winning the game of go. So you need to better specify your objective function clearly so you can hill climb and then enough data and or simulator that can generate you lots of in distribution, synthetic data. If those things are true, then I think with today's methods you can go a long way into tackling and finding the kind of needle in the haystack that you need for the solution that you're trying to look for. And I think of just drug discovery by the way, in the same way, right, There is a compound out there that would solve this disease if one could find it. If one could only find it, right, and that wouldn't have any side effects and so on. And as long as the laws of physics allows it, then the only question is how do you find it in an efficient way, in a tractable way. I think we showed for the first time actually with AlphaGo that these systems could find those kinds of needles in a haystack. In that case, you know, the perfect
[35:13]
Gary
go move I guess to get a little meta. I mean we're talking about humans using these methods to create AlphaFold. But then there's a meta level which is humans using AI to explore the space of possible hypotheses. How close are we to AI systems that can do genuine scientific reasoning, not just pattern matching on data?
[35:32]
Demis Hassabis
I think we're close. We're working on these general systems. We have this system called co scientist and we have other algorithms like AlphaVolve that can go a little bit beyond what the basic Gemini will do. And obviously all the frontier labs are experimenting in this way. I've yet to seen anything. So we all tinker with same things, you know, some math problems that are a little bit harder than IMO and so on. I haven't seen anything yet. That is a true, genuine, you know, massive discovery. That's my personal opinion. I think it's coming. I think it may be related to this earlier, this thing we discussed about creativity and actually going on beyond the bounds of what's known. So clearly that's just not pattern matching at that point because there is no pattern to match to. And it's a bit more than extrapolation. It's some kind of analogical reasoning. And I don't think these systems have that, or at least we're not using them in the, in the right way to do that. So the way I often say that in science is can it come up with a hypothesis that's really interesting, not just solve one? When I say just we're now talking about just like solving the Riemann Hypothesis or something, this would be obviously amazing, or one of the Millennium Prize problems, and maybe we're a couple of years out from doing that. But I'd like to solve P, np, that's my favorite one. But even harder than that would be to come up with a new set of Millennium Prize problems that were regarded by top mathematicians to be as deep and meaningful and worthy of lifetime of study and effort to solve. I think that's another level harder. And we don't have. I still don't think we know how to do that. I don't think it's magical, though I do think these systems will eventually be able to do that. Maybe we're missing one or two things. And then the way we would test that is I sometimes call it my Einstein test, which is, can you train a system with the knowledge of cutoff of 1901, and then will it come up with what Einstein did in 1905, including special relativity, his annus mirabilis? Can it do that? Right? And then I think we could run that test. Maybe we should just run that test and keep seeing if that's possible. And once that is, then I think we're on the verge of these systems being able to invent something new, truly novel.
[37:53]
Gary
So last, last question for the people who are deeply technical in this room, who want to work on something, you know, even close to the scale that what you have created with, you know, it's one of the largest AI efforts in the world, and you've been a pioneer for all these years. So for that, I think everyone in this room thanks you and the folks at DeepMind very, very deeply from the bottom of our hearts. Thank you. What's the thing that you know now about building at the frontier that you wish you'd known at 25.
[38:20]
Demis Hassabis
I think we covered some of it in terms of actually you work out that going off to hard problems and deep problems is no more difficult in some ways than going off to a shallower, simpler, more superficial problem. They're just differently difficult. There's different things that are hard about each of those things. But I think given life's very short and you only have so much time and energy, you might as well put your life force into something that will really make a difference if you hadn't done it, if you hadn't been there to push it. So I would just think of it through that lens. And then the other thing is if you are, and we talked about deep tech and I love interdisciplinary work and I think that's going to be even more prevalent in the next few years in combinations of fields and finding the connections between those fields and it's going to be even easier to do that with AI. And then the only other thing I would say is if you know, if you have your depending on what your AGI timeline is, mine's like 2030 or something like this, then if you start off on a deep tech journey today, usually that you're Talking about a 10 year journey for true deep tech, in my opinion. So then now you have to just consider AGI appearing in the middle of that journey. So what does that mean? It's not bad necessarily, but you have to take that into account, right to will it be able to leverage it? What will the AGI system do with it? And it goes a little bit back to what you said earlier about AlphaFold and general AI systems. So one thing I can think see happening is Gemini Claude or one of these general systems making use of AlphaFold like specialized systems as tools. I don't think we're going to have it just in one giant brain because it will have too much regression. If I put all the proteins into Gemini, that wouldn't make sense. We don't need Gemini to do protein folding. Going back to your information efficiency, it will definitely affect its language skills or something like that in a bad way. Much better, I think is to have really good general purpose tool usage models that will then maybe they could even train those specific tools, but they would be in a separate system. So I think that's kind of interesting to think through the implications of that. And then what you might build today, also physical things too, like what kinds of factories would you build, what sorts of finance systems and so on. So I just think you need to really take that seriously. On the one hand is like and imagine what that world would look like. And then build something that would be useful if that comes in halfway through Demis this office.