
Loading summary
A
The scaling laws have been remarkably robust. There's a lot we don't know yet in terms of the vulnerability of these systems.
B
If you don't need to buy the Galacticos, why do you have like an Andrew Tullock, a Daniel Gross and Alex Wang and the Galacticos assembling? If I gave you $10 billion, what would you spend it on? First knowing what you know, what do you not let your children do?
C
This is 20 VC with me, Harry Stabbings and today we have one of the leading minds in AI, Joelle Pinault, on the show. Joelle is the Chief Scientist at Cohere, where she leads research on advancing large language models and practical AI systems. Before joining Cohere, she was VP of AI research at Meta, where she founded and led Meta's AI Montreal lab. Joelle is also a professor at McGill University and renowned for her pioneering work in reinforcement learning, robotics and responsible AI development. But but before we dive into the show today, are you drowning in AI tools? ChatGPT for writing, notion for docs, Gmail for email, Slack for comms and you're constantly copy pasting between them all, losing context and losing time. This is the AI productivity tax and it's killing your output. At 20 VC we're all about speed of execution and Superhuman is the AI productivity suite that gives you superpowers everywhere you work. With the intelligence of Grammarly, mail and coda built in, you can get things done faster and collaborate seamlessly. Finally, AI that works where you work, however you work. Superhuman gets you from day one with zero learning curve and it's personalized to sound like you at your best, not like everyone else using generic AI. Get AI that works where you work. Unlock your Superhuman potential. Learn more@superhuman.com podcast that's superhuman.com podcast and once you're moving faster with Superhuman, make sure you you're moving safely with Vanta. Customer trust can make or break your business and the more your business grows, the more complex your security and compliance tools get. It can turn into chaos and chaos isn't a security strategy. That's where Vanta comes in. Think of Vanta as your always on AI powered security expert who scales with you. Vanta automates compliance, continuously monitors your controls and gives you a single source of truth for compliance and risk. So whether you're a fast growing startup like Cursor or an enterprise like Snowflake, Vanta fits easily into your existing workflows so you can keep growing a company your customers can trust. My listeners can get $1,000 off Vanta by going to vanta.com 20vc that's v a n t a.com 20vc 20vc for $1,000 off Vanta.
B
You have now arrived at your destination.
C
Joel, it is so great to have.
B
You in the studio.
C
I've heard many great things from Nick, Aidan Schrepp.
B
So thank you so much for joining me.
A
Thank you. Happy to be here.
C
Now you spent over six years at.
B
Meta and I want to start there because it's a very transformative time and place. What are the biggest takeaways for you from that time and how did that shape your mindset to how you think today?
A
Well, I was there from 2017 to 2025 and you have to see just how much AI changed over that period of time. What we were really focused on is fundamental AI research. One thing that I've learned is just sometimes how long it takes to prove out a hypothes. We feel like AI is moving at the speed of lightning. But in fact there's some things that it just takes a few years to mature, to get the right optimizer, the right compute, the right data for that to really make a difference.
B
I look at where we are today and everyone kind of goes, it's here, it's here, it's here. And then you actually look at what a lot of the leaders have been saying recently, where it's like actually Andre was saying it's not the year of the agents, it's the decade of agents. Sam's kind of pulling back too. Have we got over our skis and we're actually kind of all pulling back, realizing that time is the factor we need to rely on.
A
Well, I'll give you an example. You know, I've been in research for a couple decades now. I've been working on reinforcement learning for over 20 years and suddenly everyone's talking about reinforcement learning since the advent of reasoning models, agents and so on. So you know, sometimes you have to be a little patient with these ideas and the right algorithmic tweak, the right context, the right problem domains just opens up the magic.
B
I was listening to Andre yesterday and said in this show that reinforcement learning is terrible.
A
Less terrible than 20 years ago.
B
Have we over invested in RL based methods at the expense of maybe like more scalable alternatives?
A
Oh, I'm still super bullish on RL and that like the concept itself is so fundamental. You know, this idea of training through a system of rewards, of indicating what's valuable and what's not valuable through numerical values like that is so fundamental. It's not going away now, you know, where we're maybe getting a little bit ahead is thinking that just R.L. out of the box is going to give us AGI that part a lot less. So, you know, if you look at the curve of progress, RL is terribly inefficient. And so the amount of signal you need to get in order to really shape the behavior of a model is far from where we are today. And so we'll need to figure out how to really deal with this. With this learning efficiency problem, you're probably.
B
Thinking, what did I get myself in for? And I don't blame you. I ask questions that I think everyone else thinks, but I'm not afraid to say, I don't know. Why is RL so inefficient?
A
There's a few reasons. You're going to get me on like a.
B
This is great.
A
I really love this. There's a few reasons. One is the fact that RL is about sequential decision making. So, you know, think about, you're starting at a point, you need to figure out what you're going to do next. And you might pick the right side of the branch or the wrong side of the branch, and then like, the road keeps on splitting. So every time you make a mistake, it sort of compounds through the length of the series of actions you're making. So that means, like, the amount of error you can make can be very, very large. And to get it right is quite difficult. Sometimes people compare it to like a needle in a haystack is like finding the right solution in rl. So there's that part. The other part that's hard is the fact that to train the system, to train the models, you have to essentially take actions to learn. You can't learn from static data. You can learn some things from static data, but actually to get the right policy, you need to test it out. And so that means you need a simulator, you need to get the synthetic data. All of that can be really expensive also. And so we have difficulty getting like just a variety of environments in simulation to test RL.
B
When we look at the cost curve for RL, you said you've been working on it for 20 years. Have we seen that dramatically come down? Will we see it continue to dramatically come down? Or is it a case of it is just a fundamentally expensive method of training?
A
It's coming down especially in domains where we have good reward functions. So the place where most people started hearing about RL is around the AlphaGo time. You know, the game of go, which was sort of one of the goals for AI. Many people thought we were at that time we were still a decade away from being able to have machines play GO at the level of humans. And out comes a team from DeepMind goes off, plays against the world champion and shows that RL can basically do it. And so I would say in cases where we clearly know what's the goal, we can write down precisely the reward function. We're good, we can make a ton of progress. So that's why you're seeing progress in mathematics, very well defined reasoning tasks, games, these kinds of things. RL to shape the behavior of models to get them to be social creatures that we have no idea how to do. I mean, I don't know if you have children but like shaping their behaviors, you know, the number of times you can repeat the same thing and still they do something else. And so there's something there. You don't know how to write that out mathematically. And that's where I think we're still in for some hard work.
B
So when we look at the training versus the inference market today, we've had so much weighed on training so far and it's been incredibly costly and expensive. And then I hear everyone say, well actually inference is 95% of the market and that's where it's all going and that's where Nvidia will make most of their money. How do we think about the cost curve applied to training versus inference and where it sits today?
A
I think there's a lot of different variants and if you'll allow, maybe I'll pivot to pivot. Anyway, where I'm going with Cohere now just. I joined Cohere less a month ago. Super exciting company. I think one of the things that Cohere is doing is actually to develop AI models that run on premise. So that means enterprise bring it in, they run it locally. So the company has to worry about the training of the models. Obviously we want world class models for the needs of enterprise. Doesn't have to worry about the inference, doesn't have to worry about the inference costs. The client's customers have to figure out what's the right way for them to digest the AI. That means there's a lot of motivation to have very efficient models so that they can run really efficiently on premise. So you know, we get caught up into one paradigm for their other paradigms as well.
B
If they're the ones paying for the inference, is there not less incentive to make them efficient? Because sod it, you're not the one paying for it. If you're the one paying for the inference costs. God, I want it to be as efficient as possible because it's my dollar going to that. But if it's IBM's dollar, yeah, I would love it to be efficient, but we're not paying for it.
A
We're still in the early days of AI adoption and enterprise, so what's good for the client is good for us.
C
Totally get you.
B
What's the biggest challenge about capital efficient AI today? I know that sounds strange when you look at the economics, so to speak. What's the biggest challenge?
A
There's a lot of challenge today, I think, in terms of the economics of AI. I think one of the biggest challenges, the fact that it's very hard to have predictability. Everyone wants to know, when are we going to hit the breakthrough? Everyone wants to know, how many GPUs do I actually need? Everyone wants to know, like, what's the return I can expect? There's just a lot of uncertainty built into the system. A lot of that is because there's a lot we don't know about this technology. And so that means we have to take in quite a bit of risk. When you're building out, whether you're building out your data center, whether you're building out your workforce, whether you're trying to figure out, you know, how much data to, to curate. And so that makes it difficult for a lot of people. People want answers. And this is a world where we don't have that level of predictability compared to other industries.
B
Does progression happen in kind of a linear fashion or does it happen in step functions like AlphaGo, like a deep seq, which, depending on kind of what you believe, suggests a lot of efficiency in terms of model improvement. Is it step functional or is it linear?
A
I tend to decompose different ingredients that lead to progress. You know, people often talk about, like the algorithms, the data, the compute. I think in general, compute and data have a more linear effect on progress. You build more compute, you run bigger models, you can typically get better performance, you feed in more data. It's not just quantity. You need to worry about quality and diversity as well. But roughly it's more linearish with respect to the data. The algorithms are the ones that have the nonlinear effect. And so you can explore lots of ideas and then something like the transformer comes along and just changes the paradigm. And it's not just your transformer. You know, on the optimization side, suddenly we hit upon Adam, which is a technique to do the optimization of your model changes in the paradigm reasoning, suddenly we start thinking about how to put that in the loop reasoning and it changes the paradigm. So those ideas tend to have a nonlinear effect. The challenge with these algorithmic ideas though is that actually it may take a long time to prove themselves out. So like the paper can be sitting out there, there's thousands of papers coming out, the idea is sitting out there and we may not think to try with the right data, at the right scale, with the right combination of hyperparameters and so you don't notice that effect for a while. So it's hard to predict. And it's non linear more on the algorithmic side than I think on whether it's data, compute, even talent or other things.
B
With respect to Google, I mean transformers obviously birthed in Google and sat as papers for many a couple of years. When we look, you mentioned that compute, algorithms, data, if we just kind of go through them to understand. Everyone suggests that it's weird. There's two different worlds. Scaling laws exist. Just throw more compute at it. When you look at data center investment, when you look at all desirability of compute, and then again you have GPT5 seemingly focusing on efficiency and other signals, do scaling laws play out from here and if so, for how long?
A
The scaling laws have been remarkably robust. They don't play exactly as we expect, but still they've been remarkably robust. Lots of people have bet against scaling laws in the past and I would say overall we've seen a pretty robust effect. They don't work alone. We also need these algorithmic innovations, but most of the time I wouldn't bet against it.
B
On the algorithm side, is that the hardest to innovate on? You could think about you can buy more compute. It might be hard, but you can buy more compute and data. There are different ways, whether it's synthetic or human. Is algorithms the hardest to innovate on?
A
It's certainly the most creative work to be done. And the space of ideas is so wide that I would say it's hardest in the sense that you can move in. I'm a researcher at heart. You can move in so many different directions and picking the right one, you don't know till you get there whether it was the right one or not. It's a little bit like reinforcement learning. So in that sense I think it's the most interesting one, it's the most frustrating one and it's the most difficult one. Certainly from an investor's point of view to know where to put Your chips.
B
Speaking of knowing where to put your chips and moving from purely a research lens with, with Meta to now also building product, is there ever this inherent conflict between intellectually interesting research with the need to productize and monetize? And how do you think about that?
A
I mean, one of the reasons I'm really excited to be joining Cohereer actually is because we're at a stage where AI is really starting to be useful. Maybe not as useful as people think it is, but we are there. And by working on AI that's going into enterprise, I feel we're going to get such an interesting signal of what works and what doesn't work. You know, we keep on talking about, you know, AGI and AI for the masses and so on, but actually, like when you need to sell AI to a business, you get a real signal of what works, what doesn't work. And that's what I'm most curious to see. And you know, we've been using these academic benchmarks for many years. You get some signal, but it's not the same as, as getting this to do productive work. I'm curious to, I'm curious to learn out of that. You know, we're going to get new types of data, we're going to get, I think a lot of insights that are then going to drive the reach research ideas. I think that's the other thing to think through. When you have a large space of ideas to explore. Getting that feedback signal from the real world is super useful to guide you through that search of ideas.
B
I just had a great chat to David Kahn at Sequoia who said that he thinks a good barometer for utility value within enterprises is like, does it have the ability to replace the work of your bottom 5% in any category? He says we overestimate a lot. Can it replace the bottom 5% in any function? And if it can, that's a very meaningful improvement. Do you think that's a good barometer? And how would you advise an enterprise on whether something's useful or not?
A
As a yardstick, I prefer in terms of a barometer of productivity, something a little bit different, which is to say, can most of your employees do 10x the amount of work with AI versus on their own? That to me is actually a better barometer. I think human AI have very complementary abilities. So to just like flat out replace a portion of your workforce is actually pretty unrealistic. Some may try and some may be slowing down their hiring. But I actually think, Respectfully, I think.
B
10Xing your work feels more unreal. Is that not a bigger ask? I'm almost more intimidated by 10Xing my work.
A
Oh, I don't think that's unrealistic at all.
B
Wow. In a timeline that is next couple of years.
A
Yes.
B
I'm sorry, how does that actually shape out then?
A
I think you have to identify very concretely the types of work that you are delivering. But I think we're starting to see Hollywood quality productions being made in a matter of hours. We're seeing, to take a super concrete case, machine translation. If humans are doing the machine translation, compared to machines doing it, you go from hours to seconds on long form text, multi page documents. And so for a lot of work, it's not like AI can do all of the work. Humans still need to ask the right question. They need to verify the information, they need to shape the tasks. But once the task is well defined, the product are clear, like all the design considerations are fed into the prompt. You press the button and you've got an answer in seconds. For something that used to take sometimes weeks and months, I'm just kind of.
B
Trying to reevaluate a belief that I had for the last few months, which is like I'm a venture ambassador for a sin. For all of us to make money, we need to see the transition from kind of human labor budgets to AI spend. And it's with that transition where we obviously see the TAM massively increase and we make a lot of money. But when I hear you say that, I suddenly question that assumption. It's like the barometer for whether we make money. Because you're suggesting that actually we don't replace the human labor budget, it just makes us 10x more efficient. Is that correct?
A
Yes. There's a lot of nuance to all of that. And some work will be harder to get that same level of efficiency gain, whereas other work you'll see 100x in terms of efficiency gain. But I do think that for a lot of the work that's happening right now, that's absolutely feasible.
B
Where is the efficiency gains most tangible?
A
It goes back a little bit to this notion of what are the tasks that we can specify. Well, in any case, where we can be very precise about what a great result looks like, we'll be able to make that task automatic much more easily than tasks that are much more nuanced and have a lot of complexity. So it's ambiguity, ambiguity in the specification of the task is what's hard for our machines.
B
How have you seen enterprise reaction to this? There's fear from workers, sometimes there's excitement from leaders, there's apathy sometimes. How have you seen and measured enterprise response?
A
A lot of the workforce can be reasonably fearful about job displacement. There's a lot of also individuals who have a bit of an instinctive reaction to change. And change can be hard for a lot of people. And we're seeing a lot of change in a very short time span. And so I think there's also a generational effect. I think for some generations that change is more jarring. I think for the younger generations, I have teenagers, young adults at home. For them it's just native. They just kind of are going to grow up with that technology in a different way than some of the older generations.
B
It's interesting, Sam, speaking of kind of children there at home and how they engage with it, Sam Altman said that kind of young people engage with it as like an OS to the world and AI is that companion for them. And then older people kind of use it as a next gen Google. Do you agree with that and do you see that in your work?
A
I see a lot of people using it as a tool more than as a companion. People have this Swiss knife in their work life all of a sudden that can be super helpful. But that's really most of what I see.
B
What are Enterprise's biggest challenges with AI adoption at scale?
A
You know, one of the challenges is to make sure that the AI comes in and can be integrated in their workflows, their processes, their information. And so the challenge is to deploy in a way that allows them to exploit all of the information systems that they already have. And some of them have accumulated these over decades. That's, you know, some of the work that remains to be done figuring out.
B
How to do that, existing systems and data flows.
A
And that's, you know, of course that's something we see a lot of cohere because we do on premise deployment. You know, one of the things we focused on the most is data confidentiality and security so that Enterprise can exploit all of that information. So that's top of mind for us. But it's also a huge opportunity. I would say there's a big interest in that. But making sure to get that, that compatibility I think is a challenge in many cases for people. Change is hardest for people and so you have to get them curious about using the technology. Many people feel they have to get it right the first time. And I really think like a spirit of exploration and curiosity is much better suited to the phase of maturity of the technology that we have. Today we don't have all the answers of how it should be used. That's going to come from people on the field.
B
Security is a topic that, that we quite often glaze over, especially when investing in kind of application layer AI tools. What does no one know about AI.
A
Security that people should know with respect to AI security? I think there's a new front that's opening up with the development of agents and frankly there's a lot we don't know yet in terms of the vulnerability of these systems. With LLMs, we're starting to get a better understanding. We've had quite a bit of red teaming exercise and jailbreaking and so on. And so people have identified different risk vectors, prompt injections, things like that, which are vectors for malicious actors to interfere with a system. With AI agents, we haven't seen that. And one of the features of computer security in general is often it's a bit of a cat and mouse game, quite frankly. There's a lot of ingenuity in terms of breaking into systems and then you need a lot of ingenuity in terms of building defenses. And so we just have to stay very active in that sense.
B
What are the potential vulnerabilities though, in an agent world?
A
In terms of agents, you know, we worry a lot about hallucinations in LLMs. The parallel in agents is impersonation. So agents that come along and are essentially impersonating entities which they don't legitimately represent, and in doing so, taking actions on the behalf of these entities where they don't legitimately represent, whether it's infiltrating banking systems and so on. And so I do think we have to be quite lucid about this. Develop standards towards it, develop ways to test for that in a very rigorous way, there's ways to reduce that risk drastically. You run your agent completely cut off from the web, you're reducing your risk exposure significantly. But then you lose access to some information. So depending on, depending on your use case, depending on what you actually need, there's different solutions that may be appropriate.
B
That's a really hard one because then verification becomes the most important thing. But then it's like, who's the arbiter of verification? Is it governments? Is it companies? How does one think about that? Who says you're a valid agent versus an invalid agent?
A
Governments can be good for defining standards on which we all agree. Companies are much better at building the solutions at scale and deploying them.
B
Do you think governments are good at setting the standards? When you look at AI and where we're at. And then when you look at the sophistication levels of government programs or decision makers, with respect, they're just a little bit behind. Do you think they are actually equipped?
A
I don't think you should look at where government are in terms of necessarily AI regulation. That AI as a field is so incredibly young and fast moving and by nature, and there's some good in this, governments are moving a little bit more cautiously and usually need to benefit from our knowledge to make good policies. And so I do think you can look at other fields in terms of regulation. You know, you look at aviation, the security record for aviation today compared to where we were 50 years ago is just incredible. And governments have played a role in defining that in terms of standards and in terms of what are the norms and so on. So I'm quite hopeful. I'm an optimist about this. Maybe it's my Canadian side that governments can play a useful role. In many cases, clear standards actually means reducing uncertainty for a lot of companies in this space. But we shouldn't expect that to be ahead of the technology. I think that would be the wrong order of things. In some sense. We need to develop that technology with enough of a creative space and we need to learn fast and then develop the right guardrails for that technology from. From the real earnings we have.
B
We mentioned that kind of governments and their role. When I had Nick on the show, he was saying actually the benefits of not being an American company given some geopolitical challenges. Sometimes I'm just intrigued. Do you think we will have these sovereign models for each geo? We have Mistral in France, coheres obviously in Canada or founded in Canada, but then you've got global HQs. Do you think we will have these sovereign models and regionalized winners?
A
I do think it's healthy that there are models that are getting built in different places around the world, not just in the US and China right now. I think this is healthy in terms of diversity of thoughts. I think it's healthy in terms of having a greater amount of people with access to technology. I do think for cohere the vision isn't to be a Canadian company. The vision is to be a global AI company. And I think yes, you know, we have a headquarter in Toronto. We have teams that are distributed around the world. We have a great team here in London as well as in the US and France and other places. And so having that ability to deploy models that operate across the world I think is going to be absolutely an important Part of the strategy for cohere, I think there's a great opportunity. What it gives us to be headquartered in Canada is like a sensitivity to the fact that it's not always a one size fits all solution. You know, I go back to the research we've done, we've done leading work in terms of multilingual model. And it turns out it matters. You go to Japan, you go to Korea, and they do want models that work well in their language. People in the workforce are still operating in the language of the country. So having a company that is attuned to that, that values that internationalization of model is actually important on the global market.
B
On the team building side, obviously Canada has great talent. You mentioned obviously some in London as well. What have been your biggest lessons, observations on team building in this talent frenzy that we're in? Also, how do you analyze that?
A
One of the things that's important when you're building a team for AI, I do think you need people who have vision, who have a sense of what can we create just because we're in space, a space where there's so much innovation that is still needed. So you need an ingredient of vision that can be 1, 2, 3 people who bring that, that ingredient of vision. You need people who have amazing execution muscle. They don't care that it's their idea. They care that if the team agrees on an idea, they are just going to push this and get it done. They're going to build a system, they're going to run the experiments, they just have that technical rigor to execute. And then you need people who kind of like keep the team together, who have this sense of who needs what to operate well and who are that social glue. Humans are still social beings and that social glue in a team matters a lot. Where I've seen it fail is to have just sort of one type of person inside the team. So I'm a big believer in building teams with diverse complementary talents.
B
So you can't just buy the Galacticos.
A
I don't think you need to. I think you really have to be thoughtful about putting people in a group. The other thing that helps a lot is for the team to have focus. If it goes in all sorts of different direction, you'll lose that power that you get from people working together. So having a lot of clarity, what's the North Star? What's the goal? Where are we going? Even if over time that needs to change, but that level of clarity is required for everyone to be working in the same direction.
B
If you don't need to buy the Galacticos. Why do you have like an Andrew Tullock, a Daniel Gross and Alex Wang and the Galacticos? Assembling it is wrong.
A
You do need a few of these, like uber talents in the team. There's a relatively short number of people who just understand this technology very deeply. You do need some of this talent, and if you can afford it, you should get some of that talent. But you don't need all of your team. You need a team with complementary skill as well.
B
Does that create a good team? Like if I gave you $10 billion to go build a team and you could buy a couple of these luxury star players? I feel like it's Top Trump's cards for sports teams, but you can buy a couple. Does that create a good team when One is a $3 billion person and then the rest are just, just average $50 million people?
A
Yeah. I wouldn't say no. If someone offers me the opportunity to hire, there's definitely some really talented people in the field and they deserve to be fairly compensated. This technology is going to make a lot of people very rich and have major effects in terms of society. And so we should be rewarding the talent. But I'd be very thoughtful about what are the teams that I put together and, and how do they work together, rather than just like hire a roster of superstars without being thoughtful how they're going to work together.
B
So it's so funny. So because of the impact that you can have in these teams, actually the multi billion dollar price tags that you see can even be justified.
A
Time will tell. I don't think it's necessarily needed to go at that scale, but time will tell.
B
If I gave you $10 billion, what would you spend it on first?
A
One of the things you need is a balance between talent and compute. If you have too much talent and not enough compute, you're wasting your time. Usually like an equilibrium between those two pieces. I think we often underestimate the importance of data, and data is getting more and more expensive, and so I would certainly spend a good chunk of it on data as well.
B
So many things to unpack there. Do you feel you have sufficient compute today?
A
I think we are reasonably well resourced in terms of compute and building the models that we want to build. Yeah.
B
So access is not a massive problem.
A
No.
B
Why is data becoming more expansive?
A
Data comes in different forms. On the one hand, the days of having data labelers who can say this is a cat and this is a dog are somewhat over the easy task. The AI can do. So we're getting in a space where we need more specialized tasks. So imagine you're building AI for enterprise. There's a particular business logic. You need to make sure that you're catching the errors. You're going to need someone with deeper understanding of the tools, so that's more expensive talent to come in and actually prepare the data. There's also a lot of data that's synthetic data. When you're building agents, you need to build environments. And to build environments, you need some pretty creative folks who are going to build. You, like, synthetic simulators. We've seen this on the robot side for many years, people building robot simulators. Now you're building AI for enterprise. So you need to think of, like, how are you going to simulate these work processes in a reasonably realistic way that the AI can train on that? And so that generation of environments and benchmarks and dynamic domains can be pretty expensive too.
B
When you look at the expansive data and then you said, oh, cat, dog, lamppost, got these capers, you know, click. The ones which have like a. I get them wrong. I legitimately get them wrong.
A
I'm like, jesus, they're getting harder.
B
They're getting so hard.
A
They are. They are. It's not just you.
B
The other day I called up my cfo. I'm like, I failed the Revolut. I'm so sorry. I'll try again in half an hour.
A
Please let my AI agent answer that one for me.
B
It was embarrassing. But the question that I have is, you know, when you look at McCaw, when you look at Serge, when you look at Turing, how do you evaluate that market which is providing a lot of that talent? Is that an ongoing, enduring market or is that just, hey, for the next three to five years, we'll need it in the training phase of these models. But I don't know what it looks like beyond that.
A
I don't think it's a phase in the sense that I do think this partnership, we'll call it, between humans and machines, where human provide guidance to machine learning. We are in this for a long time. What will change is the nature of the information that the AI provides versus the information that the humans must provide as a complement. Some of these firms may not be around in five years, but this notion of having humans guide the behavior, guide and train the behavior of AI system, it's super interesting.
B
I kind of, as an investor and one of them see all of them converge around needing to do three things now. They used to just kind of be talent acquisition oh, we'll get you these people. And now they're like, we'll get you these people and we'll get you high quality data that you can really use. And now it's like, oh shit, we need this third pillar, which is we'll also help you implement that data into your models, do training and help you with benchmarking and proving that it's actually valuable. And now they need all three. Are you seeing that third one where it's like implementation of that data as well? They don't just hand it over the fence.
A
There's definitely some of that that's happening. I think for me, the even bigger trend we're seeing is the moving from just labeling data to, to crafting environments to produce new tasks.
B
You said about synthetic data and that also being a very important segment to consider. Do you get model degradation when you get this kind of reinforcing loop of models learning on synthetic data, which creates more data for synthetic and it actually degrades or does it improve?
A
It really depends how you're generating your synthetic data. So in some domains, if you think like images, languages like LLMs talking to each other at some point, you definitely get the degradation. And that degradation is due to essentially like a loss of diversity of your data. So, you know, you can make an analogy. You know, you, you take a bunch of people, put them on an island and let them reproduce. You know, at some point the genetic diversity is going to keep shrinking. And so you get a reasonably similar phenomenon with, with models because you're not injecting diversity into the data. So for their domains where lack of diversity means you get a collapse of distribution, there's other domains where you don't need diversity. If you think of like playing chess, playing go these kinds of games, we know exactly how to generate board configurations. And so we can generate tons of synthetic data. Not endless because it's a closed world, but still tons of synthetic data. And through that, learn for a long time. Then there's domains that are sort of in between. If I think of coding, we can generate synthetic code. You take normal code and we know how to inject diversity into the code. Like I can take a couple repositories, mix and match, apply an LLM to transform it. And so there's a way to generate synthetic data. The language is predictable enough and there's enough structure that I also know how to inject diversity so that you don't get that collapse. So, so the hope is that especially in these domains, we can use a lot more synthetic data. And do it without suffering from the degradation of performance.
B
Do you worry that we are creating a world with just much worse code? A lot of people are concerned by the quality of code that's being outputted and actually how we're just relying on it pretty haphazardly. Do you worry about that?
A
Let me make an analogy in terms of the quality of generation. You ask about code generation, but let me take you back to 2015 and image generation. I don't know if you have it in your mind, but the quality of the images that were generated. We had image generation models in 2015. They were really bad. The resolution was bad, the composition was bad, and so on. And from 2015 to about 2022 or so, we saw huge progress in terms of the quality of the image generation. So you think of code generation like right now we're in the phase we were for image ten years ago. Yes. There's a lot of bad code that's generated. There's a lot of code that will get thrown away, but wait another 10 years. And I think the quality of the code that's produced is going to be excellent.
B
What will the developer world look like in 10 years when that is the case?
A
If I carry my analogy further, I don't know if it's a reassuring scenario, because if we look at where we are today in terms of image generation, there's just like the volume of image getting generated is huge. What matters now is sort of picking the quality out of the volume. And so if I fast forward 10 years on code generation, when we have the ability to generate a ton of code, to do a ton of different things, we're going to need some selection mechanism to decide what code we actually want when there's actually value. And so that's going to come. There's still going to be some sort of editorial design choice. Someone needs to decide, like, of all the code we can generate, what's the code we want to generate? What do we need, need to be running in terms of our digital world?
B
So it's like a chief curation artist.
A
Yes. Within the curation doesn't go away. Curation verification. This is work that doesn't go away.
B
Does the structure of teams fundamentally change then? It's funny kind of playing that back to you and then also playing back to you, what you said earlier about the human. And if that is the case, there's not much of a partnership, is there, between human and AI? It's a chief curation person sitting on top of a huge amount of artificially Created code.
A
Well, that's your 10x productivity improvement there. It is.
B
But you're ticking that box. But it removes the human element there.
A
You still need people with intent. That's one thing that you need to decide what you want to build and what purpose does it serve. And so that intent is still there. That role of critique is still there. So the team composition does change significantly once you suddenly have designers who in their hand have amazing tools to go directly from the ideas in their head to the digital world, maybe eventually to the physical world. That equation definitely changes.
B
Do you think prompts and the way that we interact today with prompts and with chat largely is the enduring interface for human engagement with AI?
A
It's awfully limited and, and prompts can mean a few different things, but the idea of like typing in a box, that to me is very limited. And we're going to break out of that box already we're seeing a lot of cases where voice is a lot more natural as an interface. I do expect we'll see, you know, gesture, eye gaze, these kinds of much more multimodal ways to interact with the AI rather than just stick in that prompt box. But language is incredibly powerful. So if you think of prompt as being more language as a way to express ideas and communicate with a machine, that's a powerful paradigm. I mean, as humans, so much of our communication is based on language. I don't think we're going to move away from that. It encodes information. You know, language, words are symbols that encode so much information so efficiently. And so I don't think we're close to getting away from that.
B
When you think about what you did believe, that you now have changed your mind on what's most prescient.
A
Oh, I'm a scientist that is happy to be proven wrong anytime, as long as there's new evidence. I'm genuinely curious to know. Other scientists are much more like holding on to very, very strong conviction. I have weak conviction, but very strong respect for the scientific method and rigor, experimental rigor, theoretical rigor, as well as I used to be quite skeptical that neural networks were necessarily the ultimate solution to machine learning. I'd seen enough cycles of neural networks kind of peaking and then being less useful. And I used to think every time you change the scale of the data, you know, you go from hundreds of examples to thousand thousands to hundreds of thousands to millions of examples every time you change the size paradigm, that neural networks were the first thing we tried because they're a universal function approximator. And Then something else comes out that was better and that was true for the previous generations. You know, some of you may remember SVMs as like being better than the neural networks in early 2000s seem to be quite wrong on this one. Like neural nets seem to be here to stay and the ability to do backpropagation and gradient descent and all that seems to be a really powerful way to learn.
B
What does everyone else believe quite strongly that you think they are quite wrong on?
A
I don't have a lot of patience as a scientist for people who are predicting sort of the extremist scenarios, whether it's the catastrophic risks of AI or whether it's the winner takes all AI becomes our overlord kind of scenario. I don't have a lot of patience for that. I wouldn't say it's necessarily widespread, but I just think you lack scientific rigor to analyze these kinds of scenarios. I'm much more pragmatic, grounded. I'm pro innovation. I'm excited to see where AI is going and the problems it can solve, but I'm not so interested in just going around and making up science fiction scenarios.
B
You've been on the most incredible, what you said there about image generation 2015 and partly how much it's improved. We're seeing this kind of, of unbelievable capital supply going to the space in a way that we haven't seen obviously for many, many years. Is it a good bubble where we are getting incredible improvements and it's fundamentally advancing technology? Or is it a bad bubble where costs are becoming too exorbitant, teams are too impossible to build computers, too difficult? Is it a good bubble or a bad bubble?
A
I think about it as a bubble with bigger variants. It's like the upswing is going to be bigger and there's going to be big swings as well. And so there's a lot of variance into the system right now. As long as people have a tolerance to risk, then I think AI is a great investment and we should continue to be supporting risk taking new enterprise, new ideas. There's a ton of exciting new startups being created. We should continue to support them. You just have to be tolerant to risk.
B
I've had some people on the show suggest that, that evals are, to put it delicately, bullshit and that they don't actually mean anything anymore. And humanity's last test, what does that really even mean? And we have these new tests that come up, what is this? And leaderboards, what is this? Is that fair or do you think they actually serve a very effective utility to the ecosystem.
A
I do think they are really good indicators. So I think you do need to take evaluation seriously in terms of knowledge, but you shouldn't take them seriously in terms of the ultimate goals. So you know, evaluation and there's lots of different benchmarks and so on. You have to decide like, what type of model are you building, what's the characteristics of your system? And then think of evaluations as like unit test for the performance of your system. Software engineers will know what that is. Right. Like you run through that evaluation and that gives you like a signal of, of how the system is doing in a particular dimension. But as we're building systems that are more and more general, do very specific tasks. You don't optimize for these. We build AI systems that go into enterprise. None of our clients ask about like, are you able to win the Math Olympiad with this model? That's not what they care about. They care about bringing value to their business. Now we're curious to know how well we do on math problems because it can be predictive of behavior on other things. But you don't obsess over specific benchmarks. You kind of look at the ROI in terms of what you're trying to build.
B
We mentioned there about access for enterprises. Enterprises have money and that's a great luxury in a lot of cases. Research institutes, universities often don't. With the kind of bubble like tendencies, people with money are able to afford to compute the talent. Are we seeing this kind of lack of access or democratization for great institutions that are educational maybe who now can't afford to compete in this new world?
A
Certainly a lot of universities have a lot less resources than companies today. That's not completely new. When I joined Meta In 2017, one of the reasons I did that is because I could already see the disparity in terms of access to compute. And I was really curious to see how you could do research with a lot more compute. But there's still amazing research that's being done in universities. You go to the major international conferences, NeurIPS, ICML and others, and often the best paper awards are actually won by researchers out of universities. There's a lot of good ideas that you need to test out at small scale. And in a university you have a lot more freedom to pick pretty risky ideas at a small scale. But still, you know, no one's asking you to justify your research in ways that often happens in companies. So I think they have, they play different roles in the ecosystem and what's actually especially good is talent flows between them. University students come in, do internships, take jobs at companies. We've also seen a movement of people coming out of these large companies, going back to university, teaching, sharing with the next generation what they've learned.
B
How important is it to have seen success and how valuable that makes you. When you look at people like Amira raising 2 billion at a 10 billion, it's like, well, no one's seen the success that she's seen with OpenAI. So it's valid. Help me out as an investor. Is it that valid to place that much of a premium on access to seeing it at that level that we are, or is that slightly overpricing it?
A
I think in many cases, when it comes to deciding where to invest, very early on, when you don't have tangible information, I mean you look at people's track record and there's a part of that that may be like what have they learned in terms of the core recipe. But the other thing is also the achieve of having put together amazing teams who are building world class models. And there's a lot of subtlety to that. And so I think, you know, both of these ingredients are important to consider. I don't.
B
If you were investing today and you were joining my team, which category would you most like to invest in? Be it security, be it generative, AI compliance, you name it.
A
Yeah, there's a lot of verticals, whether healthcare, scientific discovery that I think have incredible promise, where we're going to see real tangible progress within five years that are going to change completely the face of what we can do. That's probably where I'd push.
B
That's very exciting on the healthcare front in particular, when you think about that timeline as well. I'd love to do a quick fire round with you if that's okay. So I say a short statement. What would you most like to do? But because of technical or financial limitations you're not able to.
A
I'm super keen to figure out how we build societies of AI agents. We're doing it implicitly, but how do we look at populations of AI agents interacting together and having like a sandbox for doing that? Maybe something I'll get to do is it lack of time, resources, something else. There's just like a ton of different things to do, but keen to see what happens there.
B
When you think about that ecosystem of agents, you have children and AI changes our relationship with other humans and friendship and social. How does AI impact social, friendship, connection?
A
It definitely does. And you know, there's a sense that we spend a lot of our Time in the digital world. And for some folks, you know, I look two of my children, they spend a lot of time in the digital world playing online games with their friends. It's still very social. It's. There must be some AI, there's the digital platform, but it's still a very social experience. Others have more individual experience. There's definitely a shift of this time we spent towards that platform where we go look for the, that for that social element.
B
Knowing what you know, what do you not let your children do?
A
Eat too much sugar?
B
Totally. So that's that like physical diet. Completely agree with that, yeah. Is there a technical diet?
A
I spend some time discussing like settings. I mean like you get an Instagram account, great. You can have an Instagram account. But like, what are the settings on that account? Making sure they understand. I mean, they'll go and change them if they want.
B
We're going to discuss settings.
A
I know that was not a popular one.
B
Did they listen?
A
The thing with children is you don't know till later.
B
Do you limit screen time?
A
I spent a lot of energy, especially in their younger years, limiting screen time. My kids did not have a cell phone till they were 14, 15.
B
Did you see adolescents?
A
I have not.
B
Okay, watch it. Basically a little boy goes up to his bedroom and gets lost down kind of rabbit holes of TikTok and Reddit and it does not turn out well.
A
I've heard about it, I just haven't had time to sit down and watch it.
B
Do you worry about the loneliness pandemic and then also just the mental health crisis that we have?
A
I do worry a lot in general about making sure that people are healthy mentally. We have to be careful about taking shortcuts and saying because suddenly we have certain platforms, we have AI and so on that is causing that mental illness. There are a number of people who are suffering and they deserve to have good answers to the situation and we deserve to find real solutions to that. And there's a lot of people looking for shortcuts and short answers. But I think more research into that is definitely warranted.
B
What's your biggest lesson from working with Zara?
A
He is incredibly deep into understanding the work. Like he does not coast, you know, when he started getting into AI, just the depth of the question that he'd ask. He just gets really interested in the topic and goes super deep and that then just informs everything he does after. So you can have the most amazing team. But like, as a leader, you need to go deep and understand and the work.
B
Did you see him change as anyone.
A
Gets more knowledgeable about a topic, you get more decisive. There's a phase where you're really learning and trying to understand and there's a phase where you understand a lot of things and then you make your decisions faster. So certainly that shift happened.
B
What one AI buzzword would you ban if you had a magic wand?
A
Existential risk.
B
Why?
A
Because it just makes people afraid. And it's not out of fear that we make our best work and we take good decisions.
B
Do you find the cost of talent exorbitant?
A
Talent is costly. Talented people deserve to be paid well. And again, someone coming in just because of money rarely is going to be the right person. But you do need to compensate people fairly.
B
Final one, what are you most excited for? You don't like the existential risk? I don't like the doomsday planning. When you think about the positivity that can come, what are you most excited for when you look forward to the next three to five years?
A
I do think some of the work in terms of AI for a scientific discovery is going to be pretty fascinating to see. Just in terms of the doors, it's going to open up the ability to explore combinatorial space of solutions. So I'm curious about that. And then I super curious to see how can we actually make our models more efficient. There's larger and larger and larger models. No one wants to run these models. I spent a lot of my career building open source models and I'll give you one example. We were in the frenzy of large language models and I pulled the stats on most downloaded models of last month. We had a model like Roberta from 2019. Small language model was getting 20 million downloads a month. People want efficient models that they can use, that they can run. So I'm also super keen to see what we're going to be able to do at the scale that runs on like one or two GPUs.
B
Final, final one. You said there about kind of open. We seem to like be reverting to a closed world now. Is that the world with which we should predict and plan on?
A
That's a deep mistake. I mean I will continue to believe that especially for research, the ideas need to circulate. And this thought that you can just like close us down is absolutely false. I mean people are circulating.
B
Do you not think we are though moving into that world? Everyone seems to be closing systems, closing access.
A
There are definitely a number of places people that are closing down access. I don't think that is going to be effective. Ideas will CIRCULATE and I also think it's a mistake from a point of view of fostering innovation.
B
This has been such a joy. I've learned so much from this conversation. Thank you so much for putting up with my very basic questions, but I've loved having you on the show.
A
My pleasure. Thank you.
C
But before we leave you today, are you drowning in AI tools? ChatGPT for writing, notion for docs, Gmail for email, Slack for comms and you're constantly copy pasting between them all, losing context and losing time. This is the AI productivity tax and it's killing your output. At 20 VC we're all about speed of execution and Superhuman is the AI productivity suite that gives you superpowers everywhere you work. With the intelligence of Grammarly, mail and coda built in, you can get things done faster and collaborate seamlessly. Finally, AI that works where you work. However you work, Superhuman gets you from day one with zero learning curve and it's personalized to sound like you at your best, not like everyone else using generic AI. Get AI that works where you work. Unlock your Superhuman potential. Learn more@superhuman.com podcast that's superhuman.com podcast and once you're moving faster with Superhuman, make sure you're moving safely with Vanta. Customer trust can make or break your business and the more your business grows, the more complex your security and compliance tools get. It can turn into chaos. And chaos isn't a security strategy. That's where Vanta comes in. Think of Vanta as your always on AI powered security expert who scales with you. Vanta automates compliance, continuously monitors your controls, and gives you a single source of truth for compliance and risk. So whether you're a fast growing startup like Cursor or an enterprise like Snowflake, Vanta fits easily into your existing workflows so you can keep growing a company your customers can trust. My listeners can get $1,000 dollars off Vanta by going to Vanta.com 20VC that's V-A-N-T A.com 20VC 20VC for $1,000 off Vanta.
Episode Title: Cohere's Chief Scientist on Why Scaling Laws Will Continue | Whether You Can Buy Success in AI with Talent Acquisitions | The Future of Synthetic Data & What It Means for Models | Why AI Coding is Akin to Image Generation in 2015
Host: Harry Stebbings
Guest: Joelle Pineau, Chief Scientist at Cohere
Date: November 3, 2025
In this episode, Harry Stebbings interviews Joelle Pineau, the Chief Scientist at Cohere. Joelle, formerly VP of AI Research at Meta, is known for her pioneering work in reinforcement learning, robotics, and responsible AI development. The conversation spans the challenges of scaling in AI, the undervalued importance of algorithms versus data and compute, the role of synthetic data, talent acquisition, the evolving interface between humans and AI, and the societal implications of rapid AI progress and adoption—particularly in enterprise settings.
“Sometimes how long it takes to prove out a hypothesis. We feel like AI is moving at the speed of lightning, but in fact there’s some things that it just takes a few years to mature...” (03:18)
“Sometimes you have to be a little patient with these ideas and the right algorithmic tweak, the right context, the right problem domain just opens up the magic.” (04:06)
“Training through a system of rewards…is so fundamental. It’s not going away.” (04:47)
“Every time you make a mistake, it sort of compounds… And to get it right is quite difficult. Sometimes people compare it to like a needle in a haystack.” (05:45)
“Enterprise…run it locally...motivation to have very efficient models so that they can run really efficiently on premise.” (08:37)
“It’s very hard to have predictability...Everyone wants to know how many GPUs do I actually need?” (10:07)
“Algorithms are the ones that have the nonlinear effect...you can explore lots of ideas and then something like the transformer comes along and just changes the paradigm.” (11:12)
“The scaling laws have been remarkably robust...but most of the time I wouldn’t bet against it.” (13:14)
“When you need to sell AI to a business, you get a real signal of what works, what doesn’t work.” (14:43)
“Can most of your employees do 10x the amount of work with AI versus on their own? That to me is actually a better barometer.” (16:16)
“The challenge is to deploy in a way that allows them to exploit all of the information systems that they already have.” (20:42)
“Agents that come along and are essentially impersonating entities...taking actions on the behalf of these entities where they don’t legitimately represent...” (23:11)
“Clear standards actually means reducing uncertainty for a lot of companies in this space.” (24:50)
“Having a company that is attuned to that, that values that internationalization of model is actually important on the global market.” (27:51)
“You do need a few of these, like uber talents...but you don’t need all of your team.” (29:55)
“You need more specialized tasks...so that’s more expensive talent...also a lot of data that’s synthetic...generation of environments and benchmarks...can be pretty expensive too.” (32:14)
“If you think like images, languages like LLMs talking to each other… you definitely get the degradation… due to a loss of diversity.” (35:41)
“We had image generation models in 2015. They were really bad...wait another 10 years...the quality of the code that’s produced is going to be excellent.” (37:31)
“That’s your 10x productivity improvement… But you still need people with intent.” (39:37)
“The idea of like typing in a box… is very limited. And we're going to break out of that box.” (40:28)
“Neural nets seem to be here to stay and...to do backpropagation and gradient descent...is a really powerful way to learn.” (41:31)
“I don’t have a lot of patience…for people…predicting sort of the extremist scenarios...” (42:49)
“As long as people have a tolerance to risk, then I think AI is a great investment…” (44:04)
“Evaluation...gives you like a signal...But as we're building systems that are more and more general, do very specific tasks. You don't optimize for these.” (45:01)
“That’s a deep mistake. I will continue to believe that especially for research, the ideas need to circulate.” (54:51)
“AI for scientific discovery is going to be pretty fascinating to see…super keen to see what we’re going to be able to do at the scale that runs on like one or two GPUs.” (53:42)
On Scaling Laws:
“The scaling laws have been remarkably robust. Lots of people have bet against scaling laws in the past and...I wouldn't bet against it.” – Joelle Pineau (13:14)
On RL Inefficiency:
“Every time you make a mistake, it sort of compounds...Sometimes people compare it to like a needle in a haystack.” – Joelle Pineau (05:45)
On Data's Growing Importance:
“You need more specialized tasks...that's more expensive talent...also a lot of data that's synthetic...generation of environments and benchmarks...can be pretty expensive too.” – Joelle Pineau (32:14)
On Prompt Interfaces:
“Typing in a box…is very limited. We're going to break out of that box...Language is incredibly powerful, but we're going to see much more multimodal ways to interact.” – Joelle Pineau (40:28)
On Extremist AI Narratives:
“I don’t have a lot of patience…for people…predicting sort of the extremist scenarios...I'm much more pragmatic, grounded. I'm pro innovation.” – Joelle Pineau (42:49)
On the Need for Openness:
“That’s a deep mistake...for research, the ideas need to circulate.” – Joelle Pineau (54:51)
| Timestamp | Topic | |-----------|---------------------| | 03:18 | Meta years: Patience in AI research | | 04:47 | Fundamentals and inefficiencies of RL | | 08:37 | Training vs. inference – enterprise focus | | 10:07 | Economic unpredictability in AI development | | 11:12 | Step changes: Algorithms vs. linear scaling | | 13:14 | Endurance of scaling laws | | 16:16 | Useful enterprise barometers for AI productivity | | 20:42 | Integration and challenges in enterprise AI deployment | | 23:11 | Security risks for agents and LLMs | | 24:50 | Government and standards in AI regulation | | 27:51 | Globalization and multilingual/sovereign model development | | 29:55 | The myth/reality of “buying” superstar teams | | 32:14 | Data’s growing expense and complexity | | 35:41 | Synthetic data’s risks to diversity and performance | | 37:31 | AI code vs. image generation analogy | | 40:28 | Future multimodal interfaces with AI | | 41:31 | Changing beliefs: neural nets’ dominance | | 42:49 | Skepticism of AI “existential risk” narratives | | 44:04 | The AI bubble: high risk, high opportunity | | 45:01 | Benchmarks: useful indicators, not ultimate metrics | | 53:42 | AI for scientific discovery and efficient models | | 54:51 | Open research in an era of closed models |
The dialogue is candid, technical but accessible, with Joelle blending pragmatism and optimism. She continually emphasizes the need for patience, creative risk-taking, openness, and realism in both technological and societal discussions on AI.
For newcomers to the world of AI or veterans seeking up-to-the-minute thinking from top minds, this episode offers a grounded yet ambitious take on where the field is going—and why building for the long term, with an eye on both scaling and creativity, is the smart bet.