#330 Sebastian Risi: Why AI Should Be Grown, Not Trained

Eye On A.I.

Mon Apr 06 2026

Summary

Eye On A.I. – Episode #330: Sebastian Risi – Why AI Should Be Grown, Not Trained
April 6, 2026
Host: Craig S. Smith
Guest: Sebastian Risi

Episode Overview

This episode explores the concept of "growing" artificial intelligence, in contrast to the conventional idea of "training" via gradient descent. Host Craig S. Smith interviews Sebastian Risi, a prominent AI researcher, about the field of neuroevolution—a biologically inspired approach to creating adaptive, robust, and continually learning AI systems. The conversation covers how drawing from nature's methods—evolution, growth, plasticity, and self-organization—can overcome current limitations in AI and potentially lead to more flexible, resilient, and creative artificial agents.

Key Discussion Points & Insights

1. Gradient Descent vs. Neuroevolution

Explanation of Gradient Descent vs. Evolutionary Approaches
- [01:30] Host uses a vivid analogy:
  - Gradient Descent: Like a blindfolded person on a mountain, always stepping downhill until reaching a low point (a local minimum), but possibly missing the very lowest valley (the global minimum).
  - Neuroevolution: Like dropping explorers all over the mountain range with different strategies, keeping the best ones, and evolving variations, which makes finding the global minimum more likely.
- “With neuroevolution, you improve by variation and selection… No one has to know which way is downhill to begin.” — Craig S. Smith [02:24]
Non-differentiable Problem Spaces
- [04:16] Sebastian explains that evolutionary methods don't require smooth, differentiable landscapes like gradient descent does, making them suitable for more complex or discrete optimization problems.
- “Evolution doesn’t really care if anything about it is differentiable or not.” — Sebastian Risi [04:54]

2. What is Neuroevolution?

Combines evolutionary algorithms with neural network design.
Can optimize not just weight values but architecture, learning rules, hyperparameters, and more.
Not restricted by the need for differentiable components.
- [03:28] "The nice thing is it doesn't only have to be the weights... it doesn't have to be differentiable. So it's quite versatile how you can apply it." — Sebastian Risi

3. Plasticity and Continual Learning

[09:13] Hebbian Learning and Adaptivity:
- Networks with evolving (plastic) weights can adapt to changes—even drastic ones never seen during training (like losing a "leg" in a robot).
- “These Hebbian networks... you can cut off a leg and oftentimes it will still be able to function, even though it has never seen this kind of variation during training.” — Sebastian Risi [00:10], [10:39]
[12:06] Neuromodulation:
- Inspired by biology, neuromodulatory neurons act as on/off switches for local learning, reducing catastrophic forgetting.
- "One functionality is that it tells parts when should they switch learning on and when should they switch it off." — Sebastian Risi [12:17]

4. Growing Networks and Developmental Programs

[13:40] Neurogenesis & Morphogenesis:
- Networks can “grow” by adding new nodes and connections—reflecting brain development in biology.
- “We call this a neural developmental program... that can then decide when should another node be created.” — Sebastian Risi [13:40]
[15:44] Adaptive Capacity & Fitness:
- Growth is moderated by selective pressures (comparable to energy constraints in biology), integrated via multi-objective optimization (performance + resource efficiency).
- Scaling up is possible, but balancing growth and plasticity is still a challenge.

5. Continual Learning and Model Merging

[27:54] Incremental Learning without Overwriting:
- The goal is networks that, instead of overwriting old knowledge, grow new nodes to store new learning acquired in operation.
- “That will be the ultimate goal…” — Sebastian Risi [27:55]
[28:20] Evolutionary Model Merging (Sakana):
- Merging already-trained models (e.g., one good at Japanese, another at math) to combine capabilities using evolutionary search.
- “You let evolution figure out how to combine them together and have a model that's good at Japanese and math.” — Sebastian Risi [28:50]

6. Artificial Life and Resilience

[30:52] Artificial Life Research:
- Exploring “life as it could be,” including growth, self-organization, and self-replication (e.g., neural cellular automata, virtual salamanders).
- “The nice thing is you can train those with supervised learning... If you don't know [the target]... you can use evolution.” — Sebastian Risi [32:04]
Robustness and Adaptation:
- Damaged agents can recover lost capabilities (e.g., a robot regrowing after being cut).
- Biological inspirations can increase the resilience of AI systems compared to brittle deep learning models.

7. Applications to Science and Creative Search

[35:07] AI Scientists and Sakana’s “ShinkaEvolve”:
- Combining LLMs (as “mutation operators”) with evolution to generate, mutate, and evaluate hypotheses or designs.
- “The only thing you need is to be able to somehow score it based on some fitness function.” — Sebastian Risi [36:30]
- Demonstrated by generating academic papers, deriving code solutions, and even proposing new scientific ideas.

8. Limits of LLM Creativity and the Human Knowledge Boundary

[42:55] Are LLMs Fundamentally Constrained?
- Risi acknowledges that language models are still generally limited by their training data, but can sometimes recombine ideas to reach new territory.
- “For me, it’s less clear how far is that outside of what it has seen.” — Sebastian Risi [44:00]
Need for Real-World Experimentation:
- True scientific creativity may require AI to not only generate hypotheses but also run (automated) experiments and act in the world.

9. Open-Endedness and Environment Co-Evolution

[47:58] Evolving Agents and Environments (“POET,” “Omni”):
- Gradually increasing task complexity (via environment evolution) allows agents to develop stepping stones and solve harder challenges.
- “You need to go through these stages for it to discover these kind of stepping stones in the behavior to be able to do the final thing.” — Sebastian Risi [49:36]

10. World Models and Continuous Thought Machines

[51:48] World Modeling:
- Internal world models let agents “dream” or simulate interactions, supporting safe, imaginative, or counterfactual reasoning.
[54:05] Renaissance for Evolutionary AI:
- LLMs and evolutionary methods pair well: LLMs can propose rich, varied representations; evolution can select and combine at a scale and flexibility not possible before.
[56:23] Continuous Thought Machine (Sakana):
- An architecture where “neurons” are more complex, and computation time is allocated based on “internal confidence”—blending dynamic, biologically inspired processing.
- “The network itself can decide to think about a problem for longer time periods… not just input-output.” — Sebastian Risi [56:44]

Notable Quotes

On Resilience:
“Biological systems are incredibly resilient… deep learning, often you find these weird examples and it completely fails. So I think there’s a lot of promise in using these systems that can self-organize… they have inbuilt resilience that I think we could exploit…”
— Sebastian Risi [33:20]
On Model Merging:
“You can take a model that is good at Japanese, you take a model that’s good at math, and you let evolution figure out how to combine them together.”
— Sebastian Risi [28:50]
On Collaborative Intelligence:
“How do we best kind of combine it with what humans are good at and what machines are good at? … How can we combine the best of both worlds?”
— Sebastian Risi [41:40]
Limits of LLMs:
“Its creativity is constrained by the training data… Is it really going to come up with ideas that aren’t in some way embedded in the training data?”
— Interviewer [42:55]

Suggested Timestamps for Deep Dives

Gradient Descent vs. Neuroevolution Explanation: [01:30]
What Is Neuroevolution?: [03:28]
Plasticity, Hebbian Learning, and Lifelong Adaptation: [09:13]
Neuromodulation and Preventing Catastrophic Forgetting: [12:06]
Growing Network Architectures (Neurogenesis): [13:40]
Model Merging and Evolutionary Search: [28:20]
Artificial Life and Robustness: [30:52]
AI Scientists, Creativity, and Scientific Discovery: [35:07], [42:55]
Continuous Thought Machine and Future AI Architectures: [56:23]

Tone and Style

The tone of the episode is intellectually adventurous and technically in-depth but remains accessible and engaging. Both host and guest are enthusiastic about the future of AI that draws more deeply from the lessons of biology—evolution, growth, continual adaptation, resilience, and creativity.

Memorable Moments

Host’s mountain analogy for optimization methods ([01:30]).
Detailed discussion of robotic quadrupeds adapting to amputated limbs ([10:10]).
The concept of artificial salamanders in Minecraft regrowing after being cut ([32:00]).
Candid reflections on the limitations of current LLM-based creativity ([42:55]).
The emerging vision that “growing” networks—combined with evolutionary and language model techniques—could define the next major phase in AI.

This episode is a comprehensive dive into why tomorrow’s AI may look less like a rigidly trained machine and more like a living, evolving, and endlessly adapting organism.

Transcript

Sebastian (0:00)

We can take an example of how nature evolved intelligence and use evolution instead. When you use a static fixed network that is not changing the weights during its lifetime, if you cut off a leg, it will probably fail because it can't adapt. But these Hebbian networks, they change the weights all the time. It's basically like a continually learning, updating system where you can cut off a leg and oftentimes it will still be able to function, even though it has never seen this kind of variation during training.

Host (0:27)

Let me jump in with a little explanation before we get started. This is a very technical podcast, but one of the more interesting ones that I've recorded in a while, and I want as many people as possible to benefit from it. I'll begin by explaining in simple terms what gradient descent is, which is used in most neural networks today, as opposed to neuroevolution, which is what this podcast is about. An illustration of gradient descent is standing blindfolded on a mountainside with the goal of finding the lowest point in the landscape. That lowest point is the solution. Your distance from it is the error or loss. To get to the solution, you try to reduce that error step by step. So you feel around with your foot to find which direction the ground slopes downward. You then take a step in that direction. You repeat that process until every other direction feels uphill. At that point, you've reached a low point called a minima, though not necessarily the lowest point in the whole mountain range, which would be called the global minima. With neuroevolution, imagine a plane flies over the whole mountain range and drops many people with different search strategies in many different places. One wanders to his left, another to his right. One walks in widening circles, another takes big jumps, another small ones. After a while, you see who ended up at the lowest point. You keep the best strategies, make variations of them, combine some of them, maybe the jumping and the walking in a widening circle. And then you send out a new group of people with those strategies. Over time, your people get better and better at finding the lowest point, even though none of them ever knew which way was downhill. That is the difference. You have a better chance of finding the global minima, the lowest point in the entire mountain range. With gradient descent, you improve by following the slope. With neuroevolution, you improve by variation and selection. 
You try many candidates, score the results, keep the better ones, and make new variants from them. No one has to know which way is downhill to begin.
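The mountain analogy can be made concrete with a small sketch. Everything in it is an illustrative assumption rather than something from the episode: the one-dimensional landscape `height`, the starting point, the step size, and the population settings. It only shows the qualitative point the host makes, that a single downhill walker can settle in a shallow valley while a population improved by variation and selection tends to find the deeper one.

```python
import random

def height(x):
    # Hypothetical 1-D "mountain range": a shallow valley near x = 1.35
    # and a deeper one near x = -1.47.
    return x**4 - 4 * x**2 + x

def slope(x):
    # Derivative of height(x); this is the information gradient descent needs.
    return 4 * x**3 - 8 * x + 1

def gradient_descent(x, step=0.01, iters=2000):
    # Repeatedly take a small step downhill from a single starting point.
    for _ in range(iters):
        x -= step * slope(x)
    return x

def population_search(rng, pop_size=30, generations=40, sigma=0.3):
    # Drop many explorers at random places, keep the lowest fifth,
    # and refill the population with mutated copies of the survivors.
    population = [rng.uniform(-3.0, 3.0) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=height)
        parents = population[: pop_size // 5]
        children = [rng.choice(parents) + rng.gauss(0.0, sigma)
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    return min(population, key=height)

walker = gradient_descent(2.0)
explorer = population_search(random.Random(0))
print(f"gradient descent:  x={walker:.3f}  height={height(walker):.3f}")
print(f"population search: x={explorer:.3f}  height={height(explorer):.3f}")
```

Note that `population_search` never calls `slope`: it only scores candidates, which is the sense in which "no one has to know which way is downhill".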

Interviewer (3:19)

You have this new book out, Neuroevolution, so maybe you can start by explaining what neuroevolution is in AI.

Sebastian (3:28)

Sebastian (5:53)

If you have a function, you can take the derivative of that function, and that's basically what you're doing. When you train a neural network, you view the whole neural network as a big function. And if you take the derivative of it, like from high school math, then it tells you the slope. It tells you which direction you have to push the weights for the error to get lower. So that's all it does. It gives you the slope of the function, and that tells you whether to take this weight in this direction or the other direction: in this direction the error increases, and in this direction the error goes down. So let's say you have a network with three weights. Depending on how you vary those weights, you get an error surface, and if you get the slope, it tells you which way you should go down. Right. And if you have a million-parameter network, it's a million-dimensional space, and backpropagation is very good. If you can do that, then it's great at finding that point, the minimum or the maximum. But if you can't do it, then it's very difficult to navigate that space, and that's what you can use evolution for. So basically the difference if you use evolution is that you don't need the gradient, because you have a population that is distributed on this landscape, right? And you don't need to have this error signal. You can just sample. With an evolution strategy, for example, you are somewhere on that surface, and what you do is you slightly change the weights, right? You create like a hundred different mutations that are around you, and then you go in the direction that looks better. So you sample locally, but you have a population, so you're not only sampling here but in many places at the same time. And that can give you a direction to go. 
And then the nice thing is also that you don't only need to do mutations, like slightly going in one direction, right? You can also do big jumps by doing crossover. So the idea of crossover is you take the genes of one parent and the genes of another parent, and by combining them maybe you get the best of both worlds, because each one has good building blocks, and by combining them you get something even better. But if you want to do that with evolution, you need to take care. You can't just randomly take half of a network and half of another network and assume that it's working well. So neuroevolution researchers have been developing algorithms that allow a sensible crossover. You know, you don't want to suddenly have two left hands; you want to have a left and a right hand. And the same applies to neural networks.
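The sampling idea described here can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the speaker's implementation: the three-weight `error` function and its target are made up, and the update is one simple way to turn scored perturbations into a search direction. The `uniform_crossover` helper shows the naive gene-mixing the speaker warns about; it is fine for this toy vector but is not one of the network-aware crossover algorithms he refers to.

```python
import random

def error(weights):
    # Hypothetical "network" with three weights: the error is the squared
    # distance to an arbitrary target vector chosen for this sketch.
    target = [0.5, -1.0, 2.0]
    return sum((w - t) ** 2 for w, t in zip(weights, target))

def es_step(weights, rng, samples=100, sigma=0.1, lr=0.05):
    # Create many small mutations around the current point, score each one,
    # and move toward the mutations that lowered the error.
    # No derivative of error() is ever taken.
    base = error(weights)
    direction = [0.0] * len(weights)
    for _ in range(samples):
        noise = [rng.gauss(0.0, 1.0) for _ in weights]
        candidate = [w + sigma * n for w, n in zip(weights, noise)]
        gain = base - error(candidate)  # positive if the mutation helped
        for i, n in enumerate(noise):
            direction[i] += gain * n
    return [w + lr * d / (samples * sigma) for w, d in zip(weights, direction)]

def uniform_crossover(parent_a, parent_b, rng):
    # Naive crossover: each gene comes from one parent or the other.
    return [a if rng.random() < 0.5 else b for a, b in zip(parent_a, parent_b)]

rng = random.Random(0)
weights = [0.0, 0.0, 0.0]
for _ in range(200):
    weights = es_step(weights, rng)
print("error after 200 steps:", round(error(weights), 4))
```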

Sebastian (9:13)

Yeah, exactly. So we're basically trying to see what are potential building blocks from nature that we don't have in our current systems that might hopefully work a lot better. And one of those things is this plasticity that you mentioned. One of the mechanisms by which our brains learn is that if two neurons always fire together, then the connection between them gets stronger. So it's a local learning rule, instead of this backpropagation, which is like this outside thing that changes everything about the network. And so what we have been working on is: what if we only train those learning rules? For each synapse, we train this local learning rule instead of having a global signal, and we train it through evolution. So we did experiments where we trained those Hebbian learning rules, and they take into account how much the presynaptic neuron fires, like the source neuron, and how much the postsynaptic neuron fires. And then depending on how much they fire together, we have a learning rule that says, oh, if this one fires often, or this one, or both together, then maybe make the connection stronger or make it weaker. And for every connection in the network we evolve its own rule. And then we showed that if you do that, starting from when the agent is born, we can start from a completely random network. The only thing it has is the learning rules, but otherwise the weights are completely random. And in a few steps the network can self-organize, because the learning rules are trained by evolution to self-organize into a network that can, for example, control a car driving around or control a quadrupedal robot. And the interesting part is that with this quadruped, when you use a static fixed network that is not changing the weights during its lifetime, if you cut off a leg, it will probably fail because it can't adapt. 
But these Hebbian networks, they change the weights all the time. It's basically like a continually learning, updating system where you can cut off a leg and oftentimes it will still be able to function, even though it has never seen this kind of variation during training. And so now we're trying to extend those to more complicated tasks, more like continual learning tasks. But the main idea is that the weights never stop changing. You know, your brain is not frozen at some point; it keeps changing. And so I think this is a very promising direction towards continually learning agents that are based on their own evolved learning rule, which could be, for example, optimized to facilitate continual learning through some kind of meta-learning.
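A per-connection learning rule of this kind might look like the sketch below. The exact parametrisation is an assumption: the five coefficients `(eta, a, b, c, d)` follow a commonly used generalised Hebbian form with a correlation term, a presynaptic term, a postsynaptic term, and a constant, and the transcript does not state which form the speaker's experiments use. In the setup he describes, evolution would search over `rules`; here they are fixed by hand just to show weights changing during a "lifetime" that starts from random values.

```python
import math
import random

def hebbian_update(w, pre, post, rule):
    # rule = (eta, a, b, c, d): one set of coefficients per connection.
    # a scales "fire together", b and c scale each neuron alone, d is a drift.
    eta, a, b, c, d = rule
    return w + eta * (a * pre * post + b * pre + c * post + d)

def run_lifetime(rules, inputs, rng):
    # rules[j][i] is the rule for the connection from input i to output j.
    n_out, n_in = len(rules), len(rules[0])
    # Start from random weights; only the rules are "inherited".
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    for x in inputs:
        y = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in weights]
        # Every weight keeps changing after every step of the lifetime.
        for j in range(n_out):
            for i in range(n_in):
                weights[j][i] = hebbian_update(weights[j][i], x[i], y[j], rules[j][i])
    return weights

rng = random.Random(0)
# Hand-picked, hypothetical rules: strengthen co-active pairs, slight decay.
rules = [[(0.05, 1.0, 0.0, 0.0, -0.1) for _ in range(3)] for _ in range(2)]
inputs = [[1.0, 0.0, 1.0]] * 20
final_weights = run_lifetime(rules, inputs, rng)
print(final_weights)
```

An outer evolutionary loop would score each candidate `rules` table by how well the resulting agent behaves and keep the better ones, so the selection pressure acts on the learning rules rather than on the weights.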

Sebastian (12:06)

Yeah, so that can still happen with those networks. So one thing that people have been experimenting with: in the brain we have this Hebbian learning, but we also have this thing called neuromodulation. So neuromodulation is another type of system in the brain that tells some parts of the brain when they should learn. That's one of the things it does. It does many other things, but one functionality is that it tells parts when they should switch learning on and when they should switch it off. So we and others have been experimenting with adding another type of neuron to a neural network that can then tell other parts of the network when learning should be switched on and off. And so that's one way towards a more continual learning system, where the system itself senses: okay, maybe I should overwrite this part; these weights are fine; the other parts maybe shouldn't be changed. And the other thing we have been working on, as part of this EU project GROW-AI, is that we are also trying to learn not just in a fixed network, but actually learning to grow a network, taking more inspiration from neurogenesis and morphogenesis in nature. We're not given this brain; this brain has been growing. And that's one thing that we skip in machine learning. We just say, this is the neural network you have. But in nature things are grown, starting from a single cell. So we're trying to replicate that process, starting with one neuron and growing. That's the hope, the idea. So we have a system that can do that, and currently it works for simple tasks. But ultimately the idea is to also take the environment into account during the growth process, to take advantage of it. There's already some information in the environment, so why not take it into account when the network is created and developed? And so that's something we're working on. 
And also we have been doing a combination, and we call this a neural developmental program. So it's basically like you're learning another small neural network, and a copy of it runs in every neuron of a normal neural network. And that small network can then decide when another node should be created, or how the connection between two nodes should change based on the activation. So it can learn, in principle, any type of learning rule, which also makes it harder to optimize. But it's basically like a graph neural network type of system, but one that is dynamic, that can change while the agent is born and interacting with its environment.

Sebastian (15:44)

Yeah, except that it would be up to the... So there are two things. One is it could just grow without getting any information at the start, you know, like before our cells might get sensory information from the outside. You could just have a fixed program where the nodes communicate with each other and exchange information, and they figure out, okay, you should grow five times and I grow two times. So this is without any outside activation; that's the first process that can run. But then it could be that they figure out, okay, I have too much. So the system itself learns to do this. We're not telling the system what to do if it has too much information. Because in each node you have another recurrent network basically running, this genetic program, this developmental program, and that could figure out, okay, I'm getting all this information, so maybe I should split the cell so that you have more capacity. But that is something that we don't program in; the algorithm would have to figure that out by itself. And how it would figure it out is that genetic programs that do that would get a higher fitness than genetic programs that don't. And so the other ones would be selected out, and some that do this a little bit would initially get better fitness, and then they would be selected. And so that's one thing that is a little bit of a challenge, because the space of what you could learn is so large. Right. You could learn any very weird developmental program. So to evolve really complicated things, it would probably require having a good curriculum of tasks. First you have to do some small task, and then we make the task more and more complicated. There's also some research that we talk about in the 
Neuroevolution book, like one system called POET, where you're evolving the environment and the agent together so that both can scaffold off of each other. And so something like this is probably required to get that approach to work for really complicated problems.

Sebastian (23:51)

So you can also... We did some work where you can also train it in a more supervised way or through reinforcement learning. It just seems to be easier to train it with evolution. But the issue with this approach is also that in machine learning and evolutionary computation in general there's this issue of deception: it's easy to get a decent score, but if you want to get all the way to the goal you might first have to go another way that decreases your performance before it becomes better. There's this classical example of a maze where you can get very close to the goal, but to actually get to the goal you would have to go all the way around the maze. So getting a decent score is okay, but if you want the really good score you have to get worse first. And so these kinds of problems likely require approaches that can deal with this kind of deception. And that's why in neuroevolution people have also been developing more open-ended search methods, methods that don't just go for one target. It's under this umbrella term, quality diversity. You want to have an approach that explores much more of the space but also takes the quality of the solution into account. So for this kind of growing approach to work really well, we have to combine it with these kinds of quality diversity approaches, because all of these things work together; otherwise it's quite difficult to explore the space. And we also did some work back in the day that shows just the difficulty of learning to learn and plasticity. There's a typical experiment people use in biology, this T-maze. 
So you have a maze that looks like a T, and the mouse goes to one part of the maze and has to learn to remember, oh, was there a big reward here or was it here? When they have collected it, you put them back at the start of the maze. And we did experiments where we used Hebbian learning for that. Imagine you learned to always go to the small reward. You go to the small reward, then you get put back, and then the high reward is here, so you learn to go to the small reward. This is the worst thing you can do. It's worse than always going to one side of the maze, because then you would at least get 50% right. But in terms of how close that network is to actually learning, it's closer than the network that is just not reacting and always stupidly going to the right side. Right. So if you use a traditional approach, this will be the worst; this will directly be sorted out. So you need different methods for evolving these more cognitive skills than just saying, you know, this is the fitness. Because otherwise you will get stuck at that "go stupidly to one arm of the maze, 50%" solution. So everything has to work together. And that's kind of the challenge in this. You want to learn to learn, you want to learn to grow, you want to do everything at the same time. And that's kind of the challenge and
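One way to picture a quality diversity search is an archive that keeps the best solution found for each kind of behaviour, instead of a single overall best. The sketch below follows the style of MAP-Elites, a well-known quality diversity algorithm; the transcript does not say which algorithm the speaker has in mind, and the toy `evaluate` function, the 5x5 grid of behaviour cells, and all parameters are assumptions made for illustration.

```python
import random

def evaluate(genome):
    # Hypothetical task. "fitness" is the quality we want to maximise;
    # "cell" describes what kind of solution this is (its behaviour niche).
    x, y = genome
    fitness = -((x - 0.8) ** 2) - (y - 0.8) ** 2
    cell = (min(int(x * 5), 4), min(int(y * 5), 4))
    return fitness, cell

def map_elites(rng, iterations=2000, sigma=0.1):
    archive = {}  # cell -> (fitness, genome) of the best solution seen there
    for _ in range(iterations):
        if archive and rng.random() < 0.9:
            # Mutate a randomly chosen elite, whatever its fitness.
            parent = rng.choice(list(archive.values()))[1]
            genome = tuple(min(1.0, max(0.0, g + rng.gauss(0.0, sigma)))
                           for g in parent)
        else:
            # Occasionally inject a completely random solution.
            genome = (rng.random(), rng.random())
        fitness, cell = evaluate(genome)
        # Keep the newcomer only if it is the best in its own niche.
        if cell not in archive or fitness > archive[cell][0]:
            archive[cell] = (fitness, genome)
    return archive

archive = map_elites(random.Random(0))
best_fitness = max(fitness for fitness, _ in archive.values())
print(len(archive), "niches filled; best fitness", round(best_fitness, 4))
```

Because a low-scoring solution survives as long as it is the best of its niche, behaviours like the "always go to the small reward" network are not immediately sorted out and can serve as stepping stones.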

Sebastian (31:03)

No, no, that is very related. So in artificial life, the idea is that life, the instance we know, is one example of life, but artificial life is life as it could be. And people simulate things that have lifelike properties. And one lifelike property is growth. So self-organization and growth and self-replication are very essential to life. And those are also things we explore with these growing networks. But we also explore them with what are called neural cellular automata. It's also basically neural networks, copies of them. And imagine just replacing the traditional rules of cellular automata, like the Game of Life, which has these fixed rules: if you have three neighbors, you create a new cell; if you have four, the cell dies. You can replace that with a neural network. So instead, each cell asks the neural network, what should I do? What state should I become next? And we are able to scale that to 3D. We have a paper where we're growing Minecraft structures with this. And the fun thing is, you grow a salamander in Minecraft, you cut it in half, and then it grows into two salamanders. And the nice thing is you can train those with supervised learning. So if you have a target, you know, you want to grow a house or a tree, then you can teach it to grow that. If you don't know it, for example, we also use it to train soft robots, squishy robots, where we don't know what a good morphology for locomotion is, there you can use evolution: you tell it, you know, grow a structure, put it in an environment, see how well it works. If it doesn't work, then we throw it out. And through this process we can grow structures that are able to locomote, and then we're also able to damage those. You can cut off parts of the structure. 
And if it's trained to recover from it, then it can regrow based only on local information. So it doesn't need any other information. It just needs to sense the local part, like a salamander that can regrow its tail. And these methods can be used to do that. And there's this community, artificial life, and a few people at Sakana are also working on these kinds of artificial life ideas. And yeah, it's an interesting direction. It's a little bit outside mainstream machine learning, but I think there's a lot of promise in taking some of these properties from biological systems and putting them there. One is being resilient. So biological systems are incredibly resilient, and still, in deep learning, you often find these weird examples where it completely fails. So I think there's a lot of promise in using these systems that can self-organize, and based on local communication, they have an inbuilt resilience that I think we could exploit to make these deep learning systems more robust, and also adaptive.
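The step from a fixed rule table to a learned rule can be sketched with a cellular automaton whose update rule is a plug-in function. Here `life_rule` is the standard Game of Life rule (a dead cell with exactly three live neighbours is born, a live cell with two or three survives), and `make_parametric_rule` is a deliberately tiny, hypothetical stand-in for the neural network: a single threshold unit over hand-chosen features. Its weights are set by hand so that it reproduces Life; in a neural cellular automaton they would be trained or evolved, and the rule would typically see richer local state.

```python
def life_rule(alive, neighbours):
    # Fixed, hand-written rule table of Conway's Game of Life.
    return 1 if neighbours == 3 or (alive and neighbours == 2) else 0

def make_parametric_rule(w_alive, w_n, w_n_sq, bias):
    # Stand-in for "ask the network what state to become next":
    # one threshold unit over (alive, neighbours, neighbours squared).
    def rule(alive, neighbours):
        z = w_alive * alive + w_n * neighbours + w_n_sq * neighbours**2 + bias
        return 1 if z > 0 else 0
    return rule

def step(grid, rule):
    # Apply the same local rule to every cell; edges wrap around.
    rows, cols = len(grid), len(grid[0])
    new = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            neighbours = sum(
                grid[(r + dr) % rows][(c + dc) % cols]
                for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                if (dr, dc) != (0, 0)
            )
            new[r][c] = rule(grid[r][c], neighbours)
    return new

# Hand-picked weights (an assumption of this sketch) that mimic life_rule.
learned_like_rule = make_parametric_rule(0.5, 5.5, -1.0, -7.2625)

blinker = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
for row in step(blinker, learned_like_rule):
    print(row)
```

Because each cell only reads its immediate neighbourhood, damage to one region does not stop the rule from running elsewhere, which is the property the regrowth experiments rely on.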

Sebastian (35:07)

Yeah, that's also something we're exploring at Sakana. It's this idea of an AI scientist, for example with the AI Scientist, but also this thing we call ShinkaEvolve, which is kind of like AlphaEvolve. And that's a combination of evolution and large language models. So large language models are good at, for example, generating code and generating ideas. But to explore that space, it can be really useful to use evolution. So basically you can use a language model as a mutation operator. One example is circle packing: you have a space and a number of circles, and you want to find the maximum number of circles you can put into this space. And so what you can do is ask a language model to give you a new solution, or multiple solutions. And then you evaluate those solutions based on fitness: what's the score it gets for packing those circles? And then you do this again from the best one, or from a number of the best individuals. You again ask the language model, give me variations of the solution, and you do it over and over again until you find a good solution that packs the most circles into this space. So you use evolution to navigate the space, but you're using the language model as a mutation operator. And you can do this for, you know, scientific ideas, for example. You can start with one idea and let the model generate variations of it. And then the only thing you need is to be able to somehow score it based on some fitness function. That is a little easier if you have the circle packing. It's a little harder if you have, you know, some scientific idea, where it's a little bit more complicated to say if this is a good idea or a bad idea. 
But this is a direction that a lot of Sakana is pursuing, and other companies are pursuing, where you have this combination of evolution, because it's creative in what it can discover, but you have it a little bit more grounded because you have a language model as the mutation operator. And people in evolution have for a long time done things like evolving programs with genetic programming. But those were always very hand-tailored to the problem at hand. But now that you can use a language model, you can let it output code, and you can ask it to modify the code and navigate in this space, applying all these lessons that we have learned from neuroevolution, like more open-ended setups, using things like quality diversity to navigate the space and hopefully not getting stuck in too many of these local optima. And I think it will change how science is made: you have this AI scientist, or co-scientist, that you can exchange ideas with. It's navigating some space, it's giving you some hypothesis to test. This is kind of the direction things are moving towards.
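The loop described here (propose variations, score them, keep the best, repeat) can be sketched with the mutation operator left as a plug-in. Nothing below is the ShinkaEvolve or AlphaEvolve implementation: `evolve`, `score`, and the `propose(parent, n_variants, rng)` signature are hypothetical names for this sketch, the scoring uses total radius of a valid packing in the unit square as one possible fitness, and `random_mutation_operator` is a random-noise stand-in occupying the slot where a language model call would go.

```python
import math
import random

def score(circles):
    # Fitness for a toy circle-packing task: total radius, provided every
    # circle lies inside the unit square and no two circles overlap.
    for i, (x, y, r) in enumerate(circles):
        if r <= 0 or x - r < 0 or x + r > 1 or y - r < 0 or y + r > 1:
            return -1.0
        for x2, y2, r2 in circles[:i]:
            if math.hypot(x - x2, y - y2) < r + r2:
                return -1.0
    return sum(r for _, _, r in circles)

def random_mutation_operator(parent, n_variants, rng):
    # Stand-in for the language model: in the setup described above, this
    # is where a model would be asked for variations of `parent`.
    return [
        [(x + rng.gauss(0, 0.02), y + rng.gauss(0, 0.02), r + rng.gauss(0, 0.01))
         for x, y, r in parent]
        for _ in range(n_variants)
    ]

def evolve(seed_solution, propose, rng, generations=50, n_variants=20, keep=4):
    elites = [seed_solution]
    for _ in range(generations):
        candidates = list(elites)  # elites survive unless beaten
        for parent in elites:
            candidates.extend(propose(parent, n_variants, rng))
        candidates.sort(key=score, reverse=True)
        elites = candidates[:keep]
    return elites[0]

seed = [(0.25, 0.25, 0.1), (0.75, 0.25, 0.1), (0.25, 0.75, 0.1), (0.75, 0.75, 0.1)]
best = evolve(seed, random_mutation_operator, random.Random(0))
print("seed score:", round(score(seed), 3), "best score:", round(score(best), 3))
```

The only task-specific piece is `score`, which mirrors the point that the approach needs nothing more than a way to evaluate candidates; swapping in a different proposer does not change the loop.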

Sebastian (40:29)

Yeah, I wasn't part of that paper. But I think the main thing is that that's probably the worst it will ever be. The idea is that you got this paper with an older model, right? If you replicated it now using Gemini or some other model, it would probably push further. The nice thing about the framework is that you can use the same framework and switch out the language model you're using. So the better the language models become, the better the papers they should be able to write. I think that's the main thing: not necessarily which ideas were generated, but showing that you can automate the whole pipeline and that it will get better and better with better models. But for me, the interesting part is how we can also use this as a kind of co-scientist, because at least for some time there will be humans and AIs working closely together. And I think it's very interesting: how can you make sure that it can take both sets of ideas into account? How can you make sure that AIs and humans talk in the same language? There was an interesting keynote by Melanie Mitchell where she showed that what a model comes up with can look great on the benchmarks, look like the right solution, and get a good score, but the model solved it in a very different way that wasn't intended by the humans. It exploited some feature of the domain that wasn't even built in. So we have to find a language in which we talk the same way. If we want to collaborate, we have to find common ground to be able to do that. And I guess there's already some common ground.
It's natural language: it's trained on text, you can communicate with it, but you might not be sure about its intentions. So I think to collaborate well, we need to do a lot of work that goes beyond it just being able to write its own papers: how do we best combine what humans are good at with what machines are good at? What I've always been interested in is co-intelligence, or hybrid intelligence. How can we combine the best of both worlds? Before, it was a little more clear what humans and computers are each good at. Now it's becoming less clear. So I think that's something we need to figure out.

Sebastian (48:41)

Right. So one of these examples is an algorithm called POET, where the agent might be a bipedal robot and the environment is the terrain. If you have a flat terrain, it's easy for the robot to walk, but then you can introduce gaps or obstacles, and the agent has to learn to deal with those. In this approach you start with a very simple environment, and over time you make it more and more difficult. At the end it's solving crazy environments where it has to go down or jump over things. It's really impressive what it can do in the end. And the interesting part is that if you had started with the really complicated final environment, it wouldn't have been able to solve it. You need to go through these stages for it to discover the stepping stones in behavior that let it do the final thing. People have also extended that, which we also talk about in the book. One is an approach called OMNI, where you can extend this to things like environment generation in Unity, so you have not just this two-dimensional flat landscape. Now that we have language models, you can have the language model produce code that creates an environment. Initially it might create an environment that's very simple, and then it creates more and more complex environments. I think that's really interesting, and it could also allow neuroevolution to scale up to really complicated tasks. And you can even imagine combining this with the neural networks we now have that can simulate whole 3D worlds. If you combine neuroevolution with a controllable world that you can just prompt (how should this world look? should it be a simple world? should it be a very difficult world, with more predators, more of this and that?), then I think we could really get an increase in what the agents we evolve this way can do.
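
The stepping-stone point can be illustrated with a toy example. The sketch below is not POET itself (POET co-evolves a population of environment and agent pairs); it is a heavily simplified curriculum under assumed definitions: the agent is a single number `skill`, the environment is a single `gap` width, and `score` gives partial credit only when the agent is already close to clearing the gap. All names and constants are hypothetical.

```python
import random


def score(skill, gap):
    """Toy reward: 1.0 if the agent clears the gap, partial credit when close, else 0."""
    shortfall = gap - skill
    if shortfall <= 0:
        return 1.0
    return max(0.0, 1.0 - shortfall / 0.5)


def improve(skill, gap, steps, rng):
    """Simple hill climbing: keep a mutation only if it scores strictly better."""
    for _ in range(steps):
        candidate = skill + rng.gauss(0, 0.1)
        if score(candidate, gap) > score(skill, gap):
            skill = candidate
    return skill


def curriculum(target_gap, rng, steps_per_stage=200):
    """Start with an easy gap and widen it each time the agent solves the current one."""
    skill, gap = 0.0, 0.2
    while True:
        skill = improve(skill, gap, steps_per_stage, rng)
        if score(skill, gap) < 1.0 or gap >= target_gap:
            return skill  # either stalled or finished the final environment
        gap = min(target_gap, gap + 0.3)


rng = random.Random(0)
direct = improve(0.0, 3.0, 2000, rng)   # train on the hard environment from scratch
staged = curriculum(3.0, rng)           # go through intermediate environments
print("direct score:", score(direct, 3.0))
print("staged score:", score(staged, 3.0))
```

Trained directly on the widest gap, the agent never receives any reward signal, so no mutation is ever kept. Going through the intermediate gaps gives it a usable signal at every stage.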

Sebastian (54:05)

I think it definitely wasn't the main focus, but I think LLMs with evolutionary algorithms, for example, are just a very good pairing. That's a really good match, and that's why I think more people are now interested in combining generative AI with evolution. Before, in neuroevolution, you always had to think about what your representation looks like, and with a particular representation you might restrict the type of solutions you're getting. Now that you can use a language model, you don't have to worry so much about that: you can have it come up with the representation, or be the representation. In that setting it's difficult to differentiate or use gradient descent. You can use the language model to give you ideas or generate artifacts, and for searching that space it's beneficial to use evolution, because it's hard to backpropagate through that space with gradient descent. So I think that's one reason why it's becoming more popular now. Approaches like AlphaEvolve and model merging, I think, fit really well together. And that's also something Sakana is very much looking into. We want to go beyond the transformer. One of our co-founders, Llion Jones, is one of the authors of the Transformer paper, and we think this is not where we should stop; we should go beyond what the transformer can do. One of our approaches is the Continuous Thought Machine, where the network isn't just input-output, where you get something in and put something out. The network itself can decide to think about a problem for longer time periods, so the external input is actually not as important as the internal thought process itself. And so we made an architecture that allows the model to do that.
And again, it incorporates some more biologically inspired algorithms, like an activation memory of what comes into each neuron, making each neuron more complex. So not just the simple neuron models that are used most of the time, but slightly more complex ones, where each neuron is its own neural network. There's also this idea that biological brains often use synchronization and oscillations for processing. I mean, I'm not a neuroscientist, but they seem to play a really important role in biological brains, how neurons oscillate together or synchronize together. And so that's one of the key components in the Continuous Thought Machine: you see how much the neurons synchronize and you kind of use that as

Sebastian (57:59)

benefit of it is that it can operate on a continuous stream of data, but it can also be used for domains that don't inherently have a sequential dimension. One thing we applied it to is image classification. What happens there is that it gets the image and learns to look around on the image, and based on what it sees, it slowly builds up confidence: oh, I'm seeing an image of a dog, I'm seeing this image. The other domains we used it in are reinforcement learning tasks, but also maze navigation. There it's quite a lot better than a standard LSTM, a long short-term memory network, at learning to navigate out of a maze. So these are the tasks we applied it to, but in the future we also want to apply it to more complicated tasks. One of the very interesting parts for me is that it looks eerily human: sometimes it looks at the eyes, or at certain parts. If you compared it to eye tracking of what humans look at, I think it seems pretty similar. And then you can track how confident it is that it sees a certain image. It's interesting too that some images are more difficult. With, I don't know, a dryer, it might take more time to look around to be sure, and with some other images it looks very briefly and knows what it is. Then you could take that into account and say, okay, when you're confident enough you stop computing, and if not, it can go on for a while. In language models, people added this kind of reasoning, but it's often reasoning that you elicit through reinforcement learning, and you're reasoning in text. You can see the model's output, like, oh, I think this and that, and then.
But this is another type of reasoning that's happening in the neural substrate itself. So there's some interesting difference. And the question now is how we can take that and apply it to language models, and how that would look.
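
The "stop computing when you're confident enough" idea can be sketched in a few lines. This is not the Continuous Thought Machine architecture; it is a toy, assumed setup in which each internal tick adds a fixed amount of evidence to the class logits, and `think_until_confident`, `threshold`, and `max_ticks` are hypothetical names chosen for illustration.

```python
import math


def softmax(logits):
    """Convert logits to probabilities in a numerically stable way."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def think_until_confident(evidence_per_tick, threshold=0.9, max_ticks=50):
    """Accumulate evidence each tick; stop once the top class probability passes threshold."""
    logits = [0.0] * len(evidence_per_tick)
    probs = softmax(logits)
    for tick in range(1, max_ticks + 1):
        logits = [l + e for l, e in zip(logits, evidence_per_tick)]
        probs = softmax(logits)
        if max(probs) >= threshold:
            return probs.index(max(probs)), tick  # confident: stop early
    return probs.index(max(probs)), max_ticks  # ran out of thinking budget


# An "easy" input yields strong evidence per tick, a "hard" one yields weak evidence.
easy_label, easy_ticks = think_until_confident([1.0, 0.0, 0.0])
hard_label, hard_ticks = think_until_confident([0.2, 0.0, 0.0])
print("easy image ticks:", easy_ticks)
print("hard image ticks:", hard_ticks)
```

The easy input crosses the confidence threshold after a few ticks, while the hard one keeps computing for longer, which mirrors the behavior described for images that take more time to look around.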