
Loading summary
A
What year an AI model can do? Drive any city, drive anywhere from in Europe, Madrid to Moscow and nail it safely 100 out of 100 times, a thousand out of a thousand times.
B
Pick a year to a degree, I think it would actually already work 10,000.
A
Out of 10,000 times perfectly.
B
No, no, probably not.
A
Some folks have moved to hiring experts for 50 bucks an hour asking them questions and then just having proprietary streams of knowledge. We have a company, micro1. How do you think about hiring experts to build out the knowledge base?
B
What we found is that onboarding and having full time employees that actually have the expertise to judge whether we are actually making progress is super important. There's still an amount of information that you're not going to get to whatever you pay for if you're not partnering with a company who actually has the knowledge you need to source people that are experts in their fields, usually PhDs and have an interest for computer science. If you find these two things, certainly you have someone who is interested in driving the competence of an AI model forward in their field.
C
This week in Startups is brought to you by nexos. Stop Shadow AI in its tracks with the unified platform for secure AI adoption and productivity. Try it with a free 14 day trial at Nexus. AI twist LinkedIn ads start converting your B2B audience into high quality leads today. Launch your first campaign and get $250 free when you spend at least two two hundred and fifty. Go to LinkedIn.com thisweekinstartups to claim your credit and Squarespace turn your idea into a beautiful website. Go to squarespace.com twist for a free trial. When you're ready to launch, use offer code Twist to save 10% off your first purchase of a website or domain.
A
Hey everybody, welcome back to this week in startups. We have a great show for you today. On the back half of the show we will have one of the co founders of Mistral, you know, the language model from France, from Europe, their champion. We have a great interview that Alex and I did. But first up some news. What's in the news, Alex?
C
The biggest thing that I saw today that got me the most excited, Jason, is that in the wake of the Gemini 3 launch, Alphabet shares shot up by 5%. Now why do we care about a 5% move for a company? Well, the company's worth 3.5 trillion, which means that 5%, Jason, is worth about $175 billion, which means that I think the market just repaid Alphabet for all of its AI work ever in a single day.
A
I think you've kind of nailed it. If they increased their market cap by 100 billion and they're going to spend 100 billion this year on their build out. If they had done a secondary offering for their shares and raised 100 billion, yeah, they could have put that to work. So in some ways the market is rewarding them for their progress. Now I don't know that they've got $100 billion in market cap. So let's think about that for a second. If they were to get a 10x multiple on revenue, they would need to have 10 billion in AI revenue. Do they have 10 billion in AI revenue or will this drive 10 billion? I don't actually know how to do that because they do charge for their product, right? Yes. But we have no insight into how many paid customers they have. And they do have an API and they do have their cloud computing. So we do know how money their cloud is making and we know its growth rate, which has been very strong.
C
Strong and profitable.
A
Yeah, so that's actually, I think, where it's just an interesting note. They've got over 10 million developers using their models. They've always been very popular with developers. They have a global market and their AI is helping their ad network, helping their search results. And this is why I think a lot of people got it wrong. Everybody thought chatgpt would just run over Google because people would, their behavior would change, they'd stop going to Google, they would start going to this new product. Well, network effects are strong and strong companies realize when they have competition. There's a similar thing going on with DoorDash and Uber and Grab and Didi every saying, oh wait, there's new entrants into the market who are doing self driving, therefore they're going to run over the incumbents with network effects. Well, it's not easy to run over DoorDash because they have all those relationships with the restaurant. So if you are Zipline and you want to compete with DoorDash, which I believe is their business model, is to compete with DoorDash. Well, they have to go and they have to replicate the network of restaurants. That DoorDash has spent over a decade building those relationships and building that fabric. It's not impossible. Like I think Zipline could be attaching themselves. They're building essentially the drive through window for drones. So if you think about a standalone Starbucks, not in the city, but when you live out in the suburbs, a Starbucks is like a standalone island and you drive around it. Right? Because there's a drive through and the drive through becomes a lot of the business people order while they're driving. Maybe hopefully they're in a self driving car. From their app, they order their Starbucks and then boom. Or they order it from the little box there. They use a language model probably, as we saw I showed on a previous episode. Long story short, what you're having happen is the incumbents are now going to have to deal with zipline. So here's Zipline, and that zipline is people having to walk over to that little area. So that's like the one we're seeing on their homepage. The delivery is the same. They drop down a little box on a tether and they fill it in this little area so that the humans are far away from the drones, they don't get clipped by them, which would not be a pleasant experience.
C
No.
A
However, they have a new version that's on the side of a Starbucks. So imagine that same setup, but the person's inside a Starbucks. So if you see, try to find that. I think it might be on YouTube. They've shown it before. But imagine you've got a Starbucks. It's one of those Island Starbucks or a McDonald's. And instead of handing food out the drive thru window to the person in their car, imagine there's a second window and that drone comes down to the roof and they just put into that second window into the drone, the Big Mac, the Frappuccino, and then it leaves from there. Well, they're doing that at the same time that doordash is sending their proprietary robot.
C
There it is on the side of a building and it drops down into the chute there. And then it's available at the bottom for easy pickup, either inside or outside of a building.
A
Yeah. So is this from zipline or is this from.
C
This is from zip. Yeah, this is from four days ago.
A
Look at how interesting this is. I got told about this privately, but you see that zip line being tethered, the little thing going down, the carrier?
C
Yeah. So it's right here.
A
You can see it come down, pause it, you'll see it come down and then inside. So there it is. So the zip line drone is above it. They bolt this onto the side of the McDonald's so they don't have to build a new depot. They just figured out a clever way to do this. So long story short, competitors are coming. Just like ChatGPT was coming for Google. Google realized that Sergey is like, you know what, I need to go back to work. Five days a week, Sergey starts going Back to work. Starts motivating the troops and now they're competitive. Every time the new leaderboard comes out, they're in the top 1, 2 or 3 slot. Congratulations, Grog, Gemini, ChatGPT. Maybe not as much these days, they're all just starting to aggressively go after that. But this idea that Google search would go away and would just get run over and Google would just sit there and take it, or Uber or doordash will sit there and take it when a competitor comes. Oh, contrail. Mon Ferrari show the doordash robot because they built their own purpose built one that drives not on the sidewalk like the slowbots. They built one that drives in the street. Yes. That can go 35 miles an hour, I believe. 25. 35 miles an hour. This is the future. So doordash isn't going to take it sitting down. And they've already got the network effect, but here is their the doordash robot. So instead of doordash partners with folks, they made this. It can drive, as you can see in the street. Look at it. Boom. It can park in a parking spot. So it comes to your, you know, door. It comes to a doordash restaurant, comes to the McDonald's, and the person steps out, drops it in the front of the bay and this thing zips, I believe, 25 or 35 miles per hour on the street. And obviously when it's on the sidewalk, it goes five. So it can do both. This is happening fast and furious.
C
Wow, man. There's another upgrade to this that's basically the size of a stroller. Imagine if you could put your baby in there, right?
A
Yes.
C
And then send it to daycare without having to go yourself. Right. Just put a little window on there.
A
That's going to be safer than dad doing it on their phone, playing chess, or, you know, putting in their prize picks for the Knicks game tonight. And when I say dad, I'm referring to me.
C
I was going to say, which finger are you pointing here?
B
Me.
A
I mean, come on. Dad's going to do dad things.
C
I know, I know. I'll also. You know what's really not safe is having everyone drive up in their 16,000 pound SUV and then crawl ahead right next to a school where they're tiny children.
A
Absolutely. So these things are going to be safer. I'll be totally honest, sending your kids to school in that thing reminds me of Grogu from the Mandalorian. Remember, he's in that protective little floating case. That's where they got the inspiration for that. It's the clamshell from the Mandalorian where Grogu is in a little thing, that's where they got the inspiration from, is from science fiction pop culture.
C
Is this, is this what you're referring to? I found a small frog sitting in a mixing bowl. Is this, is that it?
A
Let's see. Yeah, that's not a frog. That's Grogu from the Mandalorian.
C
It's a frog in a mixing bowl. But we can call it Grogu if we want, baby.
A
Yoda. Squarespace has been a friend and partner of this week in startups for over a decade. And it's because they are one of my favorite startups of all time. You know, Squarespace, the all in one platform that's going to help you make a beautiful website to help establish your brand online and scale your business with you as you grow. Doesn't matter if you're selling content or a course, but what if you want something super customized? Well, you can use their AI powered blueprint feature that works right alongside you as a creative partner. You tell it what vibes you're going for, what the aesthetic is and it makes something totally beautiful and personalized to you and your business. And new features keep coming from our friends at Squarespace. You remember they did SEO, you know, they've been doing that for, for a decade now. They're doing it for AI. Yes, AI search people are starting with AI and then you see all those citations. They lead to Squarespace websites because they do AI search optimization. Go to squarespace.com twist to get 10% off your first website or domain purchase. Anyway, important story there you will face competitors and that's a great way to rally the troops to compete. Uber has done 15 investments in every self driving company other than Zoox and and partnerships with other than Zoox and Tesla. So it's 15 people on one side versus those two Uber plus those 15 versus those two players. DoorDash. They're not taking the zipline or the street robots for granted. They're making their own. And here obviously Google's not taking it sitting down.
C
All right, next up on the show, we are talking to a company called Mistral AI. They are out of Europe. They are Europe's foundation model champion. We have Arthur Minch here on the show. Arthur, I'm so glad that you're here. We talked about your company ad nauseam, but it's good to hear directly from the source. Welcome to Zwist.
B
Thank you for hosting us. Thank you for hosting me.
A
I guess the first question is what's it like? You've got so much competition and this is every week here on the show we have a new model racing up the LM arena, solving humanity. Most difficult questions. What's the pace like at the company when you're up against Gemini Grok OpenAI? It just seems like a bit of a marathon filled with sprints. What's it like running a large language model in 2025?
B
Well, it's been quite an intense race ever since we started. So around two years and a half ago. I think what has been very interesting is that we had simultaneously to build a science team that were able to compete with very large competitors that were actually better funded than we were. So we had to be more efficient, we had to be more careful about the way we spend the money. But we actually managed to create models and to lead on the open source front. What has been a challenge and very interesting as well has been to build a business on top. So building the science and the business on top, able to generate the revenue, I think that has maintained quite a lot of pressure on Mistral but but we learned fast. So we are now in quite a good position.
A
Tell me about the business. Obviously there is a consumer business that's emerging where people pay 20 bucks a month, maybe prosumer they'll pay 30, 40, 50 bucks a month. They have these corporate accounts I know we pay for, I Think Quad and OpenAI. At the current time people on the team will go AWOL and just start subscribing to AI products on their corporate cards because hey listen, they're so affordable. And then of course there's the API space. What's Mistral's business today? And then who are you competing with?
B
Our business is really to serve the enterprise. So we have self served versions of our products. But really what is driving the business forward and the value proposition that we have is to partner with enterprises and to go for their most complicated problems. So we see two things in the enterprises. They either want to automate things to drive efficiency or they want to create models that are driving growth and making their product better. So on the physical space or on the digital space. And so what we have is we got known for releasing open source models and that's the mission of the company is really to bring AI into everyone's hands. But the business we've built on top is an enterprise business where we orchestrate the models, we connect the models to the enterprise data, we deploy the platform on secured spaces. If need be, we can go on private cloud we can go on prem, we can address sovereign needs. And based on that, the value proposition we have is that we actually go all the way to the delivery of the use cases. So we have a forward deployment engineering team and a forward deployment scientist team that are using our orchestration platform to build applications and automation, that are using our training platform to build models that are connected to proprietary data, to proprietary images, to proprietary signals. And so that's how we drive value. So we sign large contracts, we general multi years, we bring the platform to train the platform, to orchestrate the platform to observe, and we bring the experts. Because what we realized in the last two years is that early enterprises are testing out a few things, they're buying out a few things on the AI side, but they are not getting the value. And that's really their pain, is that everybody is telling them that AI is magic. That is going to change entirely the way enterprises are run. But then down the line they've done a few prototypes and nothing is working. And so what we do is really to correct that and to bring them to the value.
A
Yeah, that seems like a really great jumping off point for this debate that's going on. Are enterprises actually deploying AI, getting value from it and able to actually capture that efficiency, or this MIT study that's just constantly being cited? Oh, they're deploying but not getting value. What's the reality as you're seeing it, because you're embedded inside these enterprises and really trying to help them on prem, it seems like close that gap between this is a pilot, a project, a skunk works versus hey, this is fully deployed and providing value that the cfo, the COO can quantify and sort of make into the future roadmap.
B
I think the reality is mixed. There's been a lot of pressure on the C suite of large enterprises to deliver value. And the way they went about it is to do a bunch of prototypes and then they realized that it was actually hard to scale. Because the problem of building with AI is that you're still building software, you still need to iterate, you need to create first version, a second version, but to create a new version, you need to do data science. Because to improve your AI agents, you need to get new data, you need to get feedback on the behavior of the agent on the edge cases it hasn't handled, and you need to fix it. So that iterative mindset of going from a prototype to production by reducing the errors and improving the accuracy through acquisition of data is something that is kind of missing in enterprises. Because it turns out to be the mindset that you would find in AI scientist. And those are pretty rare to find because they started to work five years ago. And so that's really the problem that we solve. So we come, we either go on iconic use cases, so we try to work backward from the outcomes. We, we work with enterprises, tell them that they shouldn't be looking at AI as they should start from a problem, not from a solution. So they should start from a problem, look at how the process is being run, how do you map that onto an AI agent that is going to execute it for you? And if they do so, they should realize that it's going to take some time. So first version going to work 80% of the time you will have edge cases, then you go forward and you make it better and better by getting more data, more feedback from users. You need to manage everything. And that's what we bring, that's the technology we bring. But if you take that iterative mindset and you combine it with ability to change the organization in the company so that you actually save costs, or you actually make your researchers work faster, then you can get to the value. And so that's what we bring. So in general, our ICP has oftentimes tried a few things on its own, failed. And when they start working with us, they realize what they need to do and they get to it.
C
On the going to enterprises, helping them get to 80% use their data to improve the model, get better accuracy and more value out of it. Is that a system in which eventually that they don't need to have Mistral staff on site to help them get this stuff right? Or is there a long term need for Mistral or a similar foundation model company to kind of stay on site to help ensure that these systems stay functional and keep delivering value?
B
I think the end goal is for us to disappear and for the AI products and the AI platforms to be fully usable by the business users. Today this is not the case today. This is still too complex. And I think what is really missing is the mindset of iteration and the methods to follow to use the right tools are still a little bit missing. So we come and we engage our teams on the long term. So like for two, for three years, I think down the line the products are getting more abstract, the models are getting better, the tools are getting better and it's all going to get easier, easier. But we should realize that it's going to take 10 years for enterprises to transform themselves with AI. And to your point, whether it's with people or not. Processes and automation that you deploy with AI systems are inherently, they need to grow with time. When you run a company, your processes are never stable. You change them over time. You reorganize yourself, you change the people, you adapt the software, you change the IT stack, et cetera. The same goes. And the people in the company actually can deal with that because they are humans. They can deal with changes. What's interesting is that the same goes for AI agents. They can deal with changes, they can deal with legacy, they can deal with badly connected systems, they can deal with errors, and they can be adapted. So we should really see, and that's what we tell to our customers, that it's always going to be iterative. It's iterative to build and to improve. But even when it's running, you should always monitor what's happening because the input can change, the situation can change, and you need to then adapt the system.
A
Founders, listen. If you've got a B2B company and you're marketing your product or service to other businesses, you need to use LinkedIn ads. Otherwise, you're wasting your time and you're wasting your money. LinkedIn isn't some random social network where everyone's using a fake name and posting nonsense. It's a network for professionals. LinkedIn knows who the big players are and who the key decision makers are in your vertical based on the data they're already collecting. Everyone's on LinkedIn, me, you, everyone. The platform has more than a billion members, and that includes those 10 million C suite executives. And that's the kind of reach you need to grow your business. LinkedIn's advertising tools aren't just throwing your messages around for anyone who's clicking no B2B marketers report 2.5 times higher returns on ad spend on LinkedIn compared to other social platforms. That's why my firm launch advertises our own projects on LinkedIn. We want to reach people who are actually doing exciting work at real companies. Not randos, not bots, not trolls, real executives. So here's your call to action. We've got a special offer just for twist listeners. Launch your first campaign and get $250 free. When you spend at least $250, that's one for one. Go to LinkedIn.com thisweekinstartups to claim your credit. LinkedIn.com thisweekinstartups.
C
I want to talk about the models you guys make. We keep pretty close tabs here on, you know, market share and who's leading the leaderboards. But one interesting divide out there, Arthur, is just open source versus closed source. And from where we sit here in the US it feels like American companies are doing really well at closed source models. And over in China there is a bevy of high quality open weight models at a minimum. Mr. L has long had an open source, open weights core to it. Is that more of a philosophical approach or is that something that enterprise customers are actually demanding from the company?
B
I think it's both. The reason why we started the company was to disrupt the market with open source models. I was at Google, which wasn't following that strategy, and Guillaume and Timothy were at Meta, which was at the time not following that strategy. And so we started and we showed the way to actually build great models. And actually we kind of triggered, I think the Chinese company to do similar things. They've been competing with one another there to innovate and they go actually quite fast. But for us, because it's philosophical and because we have the funding and the business to do it, we'll be continuing to largely contribute to that. We feel in particular that Europe and the US are really missing a leader in that field and that because of all of the progress we've made, we're able to take the lead. So we'll have very interesting things to announce pretty soon. To your second point of whether this is actually interesting for businesses, it is. It is for two reasons. One of them is sovereignty, which means that when you use an open weight model, you are able to deploy it wherever you want. And when you do so, you can actually get rid of the data dependency and the processes that you need to involve. If you use co source APIs, sure, it's really a problem. When you're running critical workloads or you're running public sector services, or you're running different systems, you can't afford to rely on a closed source API. So that's the first thing. Second thing is that because we've all made similar progresses in compressing the knowledge available onto our models, the next frontier is today to actually use the data that is within enterprises to make models better. So you fine tune them, you do reinforcement learning on them, you use human feedback on them. The only way you can do that is actually to get access to the weights and to change them. If you don't, you don't have enough handles and you can't go deep enough to make your own models. So our customers are actually coming to us and they want to own the weights of the models, they want to start from an open source model, they don't know how to train it because it's actually pretty hard. You have multiple steps, you have 10 steps to actually train a model these days. And we actually help them prove these steps so that they get something that is actually distilling their internal knowledge onto the model itself.
A
Should people trust OpenAI with their data? There's a big debate going on in the U.S. now for startups, let's say, who are starting to embrace open source models. But there's a concern, hey, OpenAI is going into a lot of different businesses. And if you're to your point, taking your data, giving it to another closed system, you're kind of training that system. They may eventually become your competitors. So as part of your positioning in the market, hey, we're not going to compete with you as a startup company. You should use our systems as opposed to say, OpenAI, which is competing on all fronts with all companies.
B
Because we have a cost structure that is fairly different from some of our competitors. We don't need to do everything to actually sustain the business in the long term. So what that means is that there are parts in which we are not going. So like we are not doing consumer seriously, we're exposing our technology to showcase what we can do with enterprises, but we'd rather work with a consumer company than to actually build to compete with them. And then I guess the question that startups need to ask themselves is whether in the long term, can you depend on a company that needs to steal your business to become competitive and to remain in business itself because it's spending a lot of money? So I think that definitely drives people toward us. That aspect around strategic autonomy, it's driving defense companies toward us, but it's also driving AI companies to others, I would say, for similar reasons. And so, yeah, I guess it's not for me to tell. But of course, depending on the supplier that will compete, it's something you can do on the short term. But at some point you need to create leverage. And open source models happen to be the best place in which you can create leverage because you can just take it. And then provided you have the right platform and we can help with that, you can customize it, you can deploy it where you want, and then suddenly you're no longer dependent on someone who can compete with you.
A
Yeah, that is seems to be an important question for startups to think about. If you watch OpenAI that, you know they're launching a copilot for coding, they're going to launch you know Sora and have a social network and an image. They need to make $1 trillion or pay for 1.4 trillion in infrastructure they're going to build out. Which kind of is. My next question is what's your take on this colossal, no pun intended, amount of infrastructure being built? Is it necessary? Are we going to overbuild and are there problems we can only solve if we have massive amounts of compute? Or do you think we're kind of getting to the point where it's not about the compute as much as it is the tuning of the models?
B
I think it has always been the case that you have two bottlenecks in machine learning. One of them is compute, the other one is data. And sometimes we didn't have enough data, sometimes we didn't have enough compute. I would say we didn't have enough computer from 2020 to 2024. No, we are actually running out of data. That's the problem. We're kind of done compressing the world knowledge into pre trained models. We can create reinforcement learning systems that are creating new kind of signal on their own. But then there's a lot of reinforcement learning system and knowledge that sits within people's brain and employees brain and a lot of the knowledge that remains to be used is actually the IP of companies. So I think the race today is really to get the right reinforcement learning environment and to partner with companies that can bring their specific knowledge onto those models. And so for us, what that means that decentralized systems is actually a much better way of doing it. So you bring the model to the company and you help them build something that is really custom and defendable from them. The question around compute, I mean you can always go faster with more compute. The question is at some point you need to pay for it and at some point you need to create the long term value for enterprise enterprises. And that's going to take time. It actually takes some people because the expertise is not there. So we need to, that's why we have really built that full stack offering of going from model to services and then also to the infrastructure when we build things in Europe. And so I think we need to create this long term value in order to pay for the infrastructure that we're building. So it's useful. I think in particular the data centers and the concentration of power is going to be useful. The question is can we actually justify that kind of investment over such a short period of time given the viscosity that we experience in the enterprise Then our point of view is that we'd rather be on the downstream and on the creation of value than on being on the infrastructure itself and only on the infrastructure itself.
A
How do you think about hiring experts to build out the knowledge base if all of the information on the web is being captured? Now you have the open crawl, Reddit's licensing its data, Quora's licensing its data. You know, there's a finite amount of data out there. Some folks have moved to hiring experts for 50 bucks an hour, asking them questions and then just having proprietary streams of knowledge. We have a company, Micro One, in our portfolio that we see invested in, you know, that does this and helps LLMs sort of build up their knowledge base and fill it in. So are we getting to that point where now that we've collected all the information on the open web that's legally able to be created, you know, indexed, indexed and or legally able to be licensed, we now have to say, you know what, we just need to hire 100 lawyers, 100 accountants, 100 experts on construction and ask them very fine tuned questions and show them queries that actual users are doing and making sure that this knowledge gets put in, in a more deliberate way than just, you know, scrape it all off the web. There are so many amazing AI tools out there that can make your team faster and more effective. Any founder would be a fool not to embr these tools, but a large team making use of a diverse array of cutting edge AI apps can also pose a privacy and security nightmare for your organization. But now there's Nexos AI, the workspace that will help your team up their game with greater efficiency without sacrificing your data or your peace of mind. Nexus AI is going to give your teams a simple browser based environment to work in, integrating all the latest and greatest AI models and replacing the various potentially problematic tools and subscriptions your staff can't even keep track of anyway. You're probably paying for two or three things over and over again redundantly. Finally, your admin team gets full visibility and oversight into what's happening, so your team can enjoy all the productivity gains of AI without the data leaks and your platform remains compliant while your data stays safe and secure. But don't take my word for it, try it yourself. Go to Nexos AI twist for a 14 day trial. That's N E X O S AI twist.
B
It is indeed very important and there's really two ways in which you can do it. So either you do it on horizontally and on your own, and that's what we do as the model grow in capacity, the frontier of their capacity become bigger. So you can make the model much better in physics, you can make it much better in mathematics. For that you can either come up with synthetic environments. Those are useful. But at some point you also need to get the right expert. You don't need to hire them necessarily, you can actually subcontract them to some of the startups you mentioned. What we found is that onboarding and having full time employees that actually have the expertise to judge whether we are actually making progress is super important. So that's why we're analyzing that aspect around, evaluating whether we're making progress and setting up the right evaluations, looking at what kind of data sets could be actually useful to move us forward. So that's indeed quite a big focus of the company today, I'd say. The second thing which is important is that there's still an amount of information that you're not going to get to whatever you pay for if you're not partnering with a company who actually has the knowledge. So when we work with companies like isml, ISML has unique knowledge in order to build machines that DO lithography at 8 nanometer precision. And their physicists and their research actually have this kind of knowledge that is nowhere else to be found, which is sufficiently precise that they will never get rid of it, they will never license it, they will want to keep it. It's their defensibility. And so by exporting our training platform to such customers, we actually enable them to do it on their own. So part of our approach is to annotate ourselves, but then also explore the annotation tool to our customers.
A
So there's an annotation tool, you actually hire experts. What is the title for that person and where do you source them from? And they're previously just experts at universities or they were journalists or analysts at banks. Where do you source an expert on one of these topics to to do that horizontal training and do that annotation.
B
Tool, or you can call them AI trainer. And the people you source are. You need to source people that are experts in their fields, usually PhDs and have an interest for computer science. If you find these two things, suddenly you have someone who is interested in driving the competence of an AI model forward in their field. So that's how we look for people.
C
Do you keep them long term though, or do you need your physics team for like two, two years and then you gently let them go?
B
I mean, we're a two years old company, so I don't really have the insight to answer. But no, I mean the evaluators, the people that know and that can judge whether we're doing progress, those needs to be experts in the field and those you keep. Of course, then at some point you have surge in annotation campaigns and then you look for people that can do it faster than what you could do internally. But this is still a part of the company that is important and that the people that are driving and defining the evaluations and verifying whether we're making more models, better at conversation, at mathematics, at medical aspects, at this is critical. Actually.
C
Does that show up in the benchmarks? Jason and I were talking yesterday about the benchmarks for some new AI models and we're trying to figure out how much stock to put in them, Arthur, how much to trust them. Because it sounds like you're doing quite a lot of work to make your models smarter for individual companies in a way that might not show up on Ella Marina or, you know, just a normal kind of like benchmark testing set.
B
So LM arena shows progress on the conversational and ux, which is one of the area where we are investing. And so that's why we are. We like the. When we released, our last release was in August and so that it actually ranks second at the time. So you can. And we have new releases coming. The question is. It's a hard question. Like, evaluations are very critical. The problem with evaluations that are widely used is that they tend to be benchmarked a bit because it's always. You look better if you optimize for it, either consciously or unconsciously. But at some point the benchmarks stop making sense. So they are useful if they remain proxy of the performance, but they always tend to fall. At some point, they cease to be a good proxy for performance. And what we find is that the actual good proxy for performance is usually how enterprises think about it when they run their evaluations themselves. So that's why I wouldn't put too much money on the benchmarks. It's still useful. Certain of them are. The lower you are from, the farther away you are from 100% on maximum, the better the proxy may be.
A
So are people gaming those rankings, do you think? Do engineers and developers look at them and say, oh, I know how to hack this. So we go up the rankings, you know, which would be performative, get you a little bit of press, maybe a social media cycle or two, but don't actually help move them forward because I have heard that like a lot of the questions on them get recycled into the LLMs. And then of course they perform better on them, but they're not performing any better on a unique job given to the LLM. And so maybe it's an incentive that needs to be de incentivized.
B
I mean, the way to look about it is that if you too explicitly set a benchmark as a goal to a team of AI scientists, they will optimize for it and they will put the right guardrails to not overfit. But down the line, if they look at it too much, they will overfit a little bit because they will find data sets that are correlated or they will just hill climb the thing in whatever way possible. They know they can't train on training sets, but they can look at ways of doing selection. We've seen some companies actually do selection on lmoranina, which is if you do it with two models, then that's fine. If you do it with 30 models, then you're definitely overfitting. So there's a very thin line in between. People are not cheating, but people are optimizing a little bit for benchmarks, either explicitly or implicitly. So that's why the best way to think about it is that you train your model, you come up with your own internal proxies, you train for your right proxies and then at the end of it, you look at the LM arena score and then if you know that you're getting high there and that you haven't been optimizing for it, then that means you've actually made progress. If you have been optimizing for it, then maybe you don't know whether you've made progress or not, or whether you've been benchmaxing a little bit, even unconsciously.
A
Benchmaxing. That's the internal. I like it.
C
Arthur, I want to ask about Europe. There is discussion about changes to the AI act, the 28th regime, possibly making it easier for startups to be formed around the eu. Lots of debates about the place of government in technology today. What's the mistral perspective on the European regulatory regime and how to ensure that your company and others that follow it can grow into world size?
B
Well, yeah, I mean Europe is our. It's our Angkor market and we do little more than half of our activity in Europe. The rest is in the US and in Asia. We've just announced actually a partnership with SAP. So really address the enterprise needs and the public states needs in Europe yesterday, with the French President and the German Chancellor, Europe is going into the right direction. I think they made some mistake on the AA act back when we were very early. So 2023, they started to regulate the technology. There was actually quite a lot of lobbying from US companies to make it happen. Because then you get the strategy being you put the right terms in the EU law and then you import it to the Californian law and then suddenly you're starting to do regulatory capture. So the consequence of that is that there's some amount of regulation on the technology itself, on the AI act, which doesn't make sense. We've worked with the enforcer to make it workable and we can actually now ensure that our customers are compliant with the AI act, etc. We're big enough to deal with it. The problem that we have is that this is not sending the right signal to the investors and to the founders when it comes to creating a company in Europe. And that's exactly what we need. We need people to create company in Europe. We need investors and global capital to be invested in Europe. So. So I think we have started, people have started to understand that the problem is that Europe tends to get a little bit into its own way because it's a lot of bureaucracy. But you can see with the recent announcements on the 28th regime, which is going to simplify things, there's a lot of good decisions that are being taken. It's going too slow. The AI act was poorly designed. We can deal with it. I'd say I think we have tailwind in Europe because I think there's a growing realization that strategic autonomy on digital services is important and that has really grown our business quite significantly. Now the regulatory framework could be simpler. We need to make it simpler. We'll continue to say it and I think we'll get there.
A
These language models, coming down to the device level, there's been some discussion of some of these open source models, the smaller ones, last generation, being able to be run on your MacBook Pro, your M4, Mac mini, maybe even your phone. Do you have a take on where we will be in a couple of years in terms of on device language models, AI running locally? Obviously you brought up this incredible example of ASML wanting to not have, you know, all their important, you know, information and innovations on somebody else's LLM. You know, that's going to eventually happen. I think consumers will realize maybe I shouldn't have all my photos educating Sora and then having, you know, my kids pictures informing somebody making photos on some public social network or you know, whether it's metas or sora so how do you think about that? And specifically Apple seems like a really interesting player that you could potentially partner with or others to kind of bring an LLM to the device level.
B
We are, I think we're the only frontier company which is actually working on the edge today. And you mentioned language models, but really the one thing which is interesting on small models that can be deployed on devices are not really the language models, but more the audio models and the models, because everything that sits on the edge typically entails that you don't have a keyboard available. So you want to use different ways of interacting. So that can be audio or that can be video. Now, so that means in general. So those are the two modalities where we have invested on that domain. We've released actually the strongest transcription model on audio called Voxtrail in July. It's open source, can test it, can deploy it on device. You mentioned MacBooks and smartphones as a natural place to deploy them. You can do that, but it's not that interesting because in general when you have a MacBook, you're connected to the web. What is much more interesting is that when you start deploying it on drones, when you start deploying it on embodied AI, when you start deploying it on devices that may go out of network and that can actually take actions, it's important for safety. So for firework drones, for instance, it's important for defense. So we work with Helsing, which is a defense AI company, on drones that are able to detect mines. So quadruped drones detecting mines. And we actually bring the vision model to them and that enable to control the drone in a fully automatic way. So to me, the opportunity of edge AI is multifold. But the biggest one is with robotics, the other one is around portability. So when we work with asml, they have their own customers and when they deploy their technology with their own customers, well, they need to take their software to their own customers and the data is actually not flowing back to them. So they need a portable solution to analyze the data of their customers themselves. So the portability is super important in many B2 when you're doing B2, B2, B2. So when you're serving business to business companies, they have the same problem as you do. They need to handle the IT settings of their customers. And so the portability means that you can actually make them portable and they can address more needs. So portability, which doesn't necessarily need going small because you can actually have servers that gets deployed on prem, is important. I think Device deployment is useful in certain domains. Audio for smartphones, definitely useful for privacy. It's very useful and it's getting better. So the models are getting that can run on device are getting better. So we'll get to a point where some of the workloads do not need to sit on the cloud. But really the opportunity is in robotics because just do so much more things with edge models there.
A
Knowing what you know about those sort of edge models in robotics, what's your take on when a humanoid robotic will be able to meaningfully help somebody as a housekeeper, just to pick one really sought after use case. And obviously if you can do the dishes, make dinner and fold laundry, feed the dogs, take them for a walk, you're going to be able to do a lot of other things in the real world. These are kind of fine motor tasks, right. Like folding laundry, et cetera. When do you think those will. Because obviously they're only going to cost 10 or 20k. So putting cost aside, when do you think they'll actually be ready to make an impact and be as good as the average housekeeper?
B
Well, first disclaimer, the thing we do at Mistral is to build the software for robotics. So like the image to action models. But the one thing I'm not an expert in, everything aptic related and fine motor control related, this is actually a complex problem on the hardware side that requires to build a hand that is working very well. These are problems that are actually not solved yet. My take is that there is a lot of value creation to be done in robotics with businesses before doing it with consumers. Deploying a housekeeper with consumers is going to be extremely painful on the regulatory side for I think obvious reasons of safety. And it's an extremely complex problem on the hardware side. On the other hand, the model we build are actually already able to operate drones that are operating in scenario where you don't want to send a human. So it's actually safer to send a drone than to send a human. So suddenly you don't have the safety problems that you run into.
A
It's quite the opposite actually. Right. Sending a drone into a fire situation is better than a human. Sending a robot into a home is dangerous. Right?
B
Exactly. Suddenly regulation is a tailwind and not a handwritten. And the other thing is that you don't need to have fine motor control to do many things in the fact actually bring a drone to fire, you have different kind of problems, but not that kind of fine motor control problems. And so we're more, I would much more bullish on the B2B opportunity of robotics and on the B2C opportunity down the line, it will work. And the kind of models we build are actually good for that, I think, and they will be useful, but it will take a fair amount of time. So just look at like self driving cars. They're still very far away from general availability. I think it's going to stay the case. And Housekeeping Robotics will be even more the case.
A
What year an AI model can do Drive any city, drive anywhere from in Europe, from Madrid to Moscow. Car can drive from Madrid to Moscow just. And now it's safely 100 out of 100 times. 1,000 out of a thousand times. Pick a year and then Alex and I will gamble on it. You pick the year, we'll do the over under. We need action here.
B
Theoretically, I think it would actually already.
A
Work 10,000 out of 10,000 times perfectly.
B
No, no, probably not. I think it's the same when we deploy like AI agents and enterprises. The problem and what takes time and what prevents from going into production is, are the edge cases and need to hire out the edge cases. So I'm not the expert. I think it's getting there. I have some good, good. There's some good companies out there like Wave AI, which is actually doing this and I think we'll get there on robotics as well. If you ask me as the one who knows about whether the model can do it and whether they can process images and take the right decisions, the answer is yes. If you have the wrong data, then yeah, you can actually drive the accuracy. Right. There's a lot of other problems I.
C
Think you got, you got to pick a year though, man. You can't just give that really polite answer and not answer the question. Come on, give me, give me a year. 2028, 2029.
B
That's right. But I would say like, yeah, four years from now.
A
Four years from now. 2020. So 2029 can go from Madrid to Moscow 10,000 times out of 10,000 times. Yeah. Wow. What would you take, Alex? Oh, the under over, under, under. Yeah, I think we're both going to take the under on that one.
C
Oh, yeah.
A
I think we both, we both have a lot of faith in, well, what you're working on.
C
Yeah, I've also driven in Russia and the roads at times leave something to be desired. So it may be that the last stint to Moscow might take some more time, now that I think about it.
A
Oh, and then some of those streets in Spain are pretty narrow, so.
C
Pretty narrow.
A
Listen, continued success and appreciate you coming on the program and educating us on all the great stuff. Thank you very much going on over there. Appreciate your time.
B
Thank you.
A
Bye.
B
Wow.
A
One of the major large language models coming on the pod giving us all the details.
C
I always love talking to founders that I've never met before. And I also love to figure out what a company is doing past what you and I can kind of see from the surface. Like we knew they were making models and open stuff and so forth, but I didn't know how deep their forward deployed engineer framework was, how deep they were going into enterprises, the on prem stuff. So this is why we talk to founders, folks. This is why we do it. Always good to learn. Next up on the docket, Suno raises $250 million. This values the AI music generation startup at 2.45 billion. Jason and I was kind of surprised by this for a couple reasons. One, we have seen its competitor Udio solve one of its lawsuits with major rights holders. We have not seen Suno yet reach any settlements or any kind of agreement.
A
Suno still in lawsuits?
C
Yeah, I believe they're still being sued by all the big three. So of the two companies and six available lawsuits, one has been kind of sorted out. Now my read is and tell me if I'm wrong here, Jason, but if the investors in this case, Menlo and another, a couple of other venture firms are going to bet $250 million on this company, they must have a line of sight to resolution of those lawsuits.
A
Really depends on if SUNO did the wrong thing or not. Right. So the penalty if they actually pirated music is $250,000 per infraction. It is giant. As I've said on previous episodes, you don't f with the music industry as a startup, you get their explicit permission before you do anything.
B
Yes.
A
And you use royalty free generic stuff with a signed contract if you're going to do anything so that you don't get into problems. The lawsuit claims Suno pirated music by downloading tracks from YouTube and then you streaming ripping mesh methods violating the DMCA. They're contesting it, arguing AI generated music outputs are new independent creations and that use of copyright music for AI training constitute fair use. It does not. They're going to lose 100%. I'm happy to place a bet with anybody who wants to place a bet. They will settle a major settlement with the music industry. It'll wind up costing them a percentage of their company plus hundreds of millions of dollars is my guess. So I think they're going to have to give 10% of their company to the music industry, minimum, maybe 20% to settle this. Or they're going to have to give them, you know, a quarter billion dollars, something gigantic. So this whole amount, if it's true that they trained it. And this is so dumb because when we talk to Mistral on today's episode, they talk about hiring AI training experts. Here's what you could do. You could hire musicians. If you're SUNA with a $2.5 billion valuation. Musicians are not expensive. Go to tonebase.com and you'll see the greatest classical musicians performing and you can hire them to be your tutor for a very modest amount of money. Tonebase, one of our investments. And they do educational training. Really great. Awesome. If you're trying to teach your kids or yourself how to do classical, but they have like the world's greatest flute player, bass player, guitar player, you get the idea. And they're available for work. Let's just leave it at that. Musicians are available for work. So you could have a guitar player come in. You could have an entire band. You could just go here to 6th street, hire a band, and say, we want you to play these classic songs that are in the public domain. We want you to play all of the different chords. We want you to play the chords in this tempo. And they would just sit there all day for 50 bucks an hour and do it till the cows come home. And you would have a proprietary data set. You could videotape them doing it. They. You would have musicians lined up. You could have five different guitarists do every. Not blues songs, but the classic blues riffs that are not owned by anybody. You could literally have five do it. Then have the AI pick which one of the five to use or to make an amalgamation of the five drop the worst. Do the best, whatever you can. Have every possibility. Why would you take the Rolling Stones or Bob Dylan from, you know, their labels and treatment? Why? Because people were lazy at the beginning of this and technologists, they all want to break rules. Please listen to me, folks. Don't break the rules. When it comes to copyright, you will get pinched. Especially when it comes to the music industry, the book industry, and now the news magazine industry. You used to be able to steal from the news and the, you know, industry a little bit because they were dopes and the magazine industry were dopes and they would just let it happen. And they let Google run a moss with their one box because they were getting traffic. They were kind of dopey. New York Times Was dopey in the past. They let people steal from them. They let people index them. They fell for it. Not anymore. They're going to the mat with OpenAI and they're going to get, I predict they'll get at least $1 billion. I'm making two predictions right now. They're going to settle these suits, these collection of suits for a minimum of $250 million, which means they could have hired every musician in Austin and Nashville for 10 years. Literally got hired every single one of them for 10 years. Put that aside, guaranteeing billion dollar settlement with the New York Times and open a minimum 1 billion.
C
All right, next up in the docket, cracking the crypto exchange. Oh yeah, two part announcement here on the 18th they announced they raised $800 million including 200 million from Citadel Jason, that valued them at $20 billion. Then they one up themselves by today announcing they filed privately to go public. Now we've seen a couple of crypto companies list this year, Circle being the best known one, the Stablecoin company and giant on the show. But my question for you, Jason, is very simple. Why would you raise a mezzanine round and then the next day file privately to go public? I always thought that you raised a chunk of money with the banks and other people you want to have on your cap table when you list maybe like 12 months before. So I was just a little surprised by the time.
A
You're just, you're just de risking. It's a straight up de risking move. It would be like, why would I, I don't know, take an incremental speaking gig if I didn't need to? It's like, you got the money, you make hay when the sun shines. So what happens in a situation like this is there's somebody who has a lot of faith in the company. Could be a private equity firm, could be a mezzanine financing company. And they know it's going to do good post ipo. They want to secure a big position. Now. So let's say that company is worth. Did they mention in any way the valuation of what that company was worth or a previous valuation?
C
So 20 billion now.
A
20 billion. Okay, so 2 billion would be 10%. 1 billion would be 5%. So they're buying about 4% here, 4% ownership. Now let's say they're even paying the prospective IPO price. What they're doing is they're saying, I have so much faith in this company, I want to buy my position. Now, we think it could pop in the IPO it's just easier for us to lock it in now. So whoever that person is locks it in now. And on the other side, there might be early investors in Kraken who don't want to wait for the six month lockup. So they're saying, hey, I know this thing could go down and I'm already weighing the money I need to get my LPs, their money back. I'm an individual investor. I could be a founder. I could be the founder's parents. And we had the founder of Crack It On. He's a unique cat. I like him.
C
Jesse Powell, he was on episode 1487 back in June of 2022.
A
I like Jesse because he is like a non consensus thinker, kind of a fun guy. I like the. He's kind of Alex Karp territory, like, you know, non consensus, interesting cat.
C
I like him because Kraken sponsors the Williams F1 team and anyone who's supporting Williams F1. Good egg in my book.
A
I'm going to F1 this weekend, leaving right after the show to go to Vegas, shout out to the Venetian, which gave me a ridiculous suite. It's good to be podcast famous micro celebrity. But the F1 team reached out from McLaren and they offered me to do a dry run. I don't know if this is in the podcast Porsche dry run or in the actual F1 if they can fit two people in it, but I think they have. They're all single seats, but I think they might have a modified one where there's two seats where VIPs get to do a dry lap. So anyway, I've been offered a dry lap. It's either in a Porsche or something because they have like a whole set or it's in like one of these converted F1 cars. And I know this because they just asked me to sign a waiver. I don't know if I'm going to do it or not, but should I do a drive run?
C
If you don't do it, I'm going to scream. I will come down there and give you a noogie.
A
I get motion sickness. Maybe I'll vomit in it.
C
You move to Texas, you bought a weighted vest, you shoot guns, get in the car, go around a couple of times, FaceTime me, FaceTime me while you're doing it and then put it in the front facing camera so I can go with you.
A
I think they do have. I think they do have you crying like a baby on camera. So I guess I'll do it. But you know, I have kids and stuff.
C
Like that.
A
So I gotta just make sure. Has anybody ever died in a dry lap? That's what I need to know. Has anybody been dried or maimed in a dry run that. Please get me the statistics. I want a risk analysis.
C
Wow. I'm not a very jealous person, but I am actually, for the first time.
A
In a while, burning Life of Jake Al is a little bit crazy these days. Let's keep going. Anything else?
C
Let's do a really tiny note on some regulatory stuff. First of all, Meta beat the FTC this week. We're not going to go through the whole thing here, but the gist is that Meta is not going to have to divest WhatsApp or its other acquisitions.
A
Jason, was this done under Lina Khan or was it before? It might have been before Lina Khan.
C
It started during Trump 1, it continued through Biden. And then the FTC under Trump too pursued the case and kept at it. So this was a multi, multi platform, bipartisan, everyone taking part in this, but.
A
They wanted to go backwards and unwind. 10 year old acquisition plus year old acquisition, saying that they should not have approved them. That was the. I remember that was the case.
C
Yeah, that was the case. And they lost. And the judge said, effectively, you didn't prove your case. You didn't prove that they have a monopoly in personal social networking. That was the defined market in question. The FTC said, we are deeply disappointed in this decision. The deck was always stacked against us with Judge Boasberg who is currently facing articles of impeachment. We're reviewing all our options now. Why do we care about this for startups?
A
Oh, I can tell you why this is.
C
I thought you might.
A
Oh, this is a yum yum time. Speaking of racing, this is the Five Lights. This is the start of an amazing M and A race. It's all on the table, folks. Everybody, it's time to activate your M and A teams. Google, Apple, Meta, Microsoft, Xai, ChatGPT. It's on like a Donkey Kong. Lina Khan is busy dismantling the greatest city in the world. She has no job. She's completely ineffective. She's gonna make groceries. She's gonna be delivering groceries for free in socialist New York. Guess what? It's capitalism. All day, every day for the next three years. Buy everything in sight. Go buy doordash. Make an offer for Uber Robinhood. Hostile takeovers. Buy 10% of the stock on the public market. Let's have added, folks, merge these companies. Who cares? Antitrust. All of these shenanigans were wrong. Meta doesn't Have a monopoly. They have 60%. That's not a monopoly. You got TikTok, you got X. Is Meta the lead player? Yes. But the bogus FTC claims are gone and it's on like Donkey Kong.
C
The thing I'm curious about is if it is on to that extent. Jason, does this save the unicorns that have been sitting and dying on the private markets?
A
Sure, sure. Those are all. Now, those are all worth twice as much. Any of those that were like, you know, 400 million in revenue, growing 20% a year, everybody gave up. And their peak valuation was 8 billion and then they went down to 3 billion. Now they're back up to 7 billion. And there's going to be a massive amount of M and A because these were the high profile cases. The Google case wound up being like a whimper. The Meta case has been tossed. I mean, are there any other major antitrust cases now? Apple lost their case with Epic. Right. And they have to like open the App Store.
C
The Google advertising case. So there was the Google search monopoly case and then the Google Search advertising monopoly case. So I think that one's still going on.
A
It's all going to be settled, folks. It's, it's right, there's, there's nothing left.
C
What about companies like Crunchbase that raised.
A
There's no secondary market for those shares. I would think somebody would want them.
C
But Tens of millions of ARR. But it's not, it's, it's like, fine. And I'm curious if the companies that are like. Because I see VCs talk about, oh.
A
That should just get bought by Carta, that should just be bought by Angellist or Angellist Plus, Crunchbase Merge, etc. The hilarious thing is Crunchbase is worth much more than TechCrunch.
C
Oh, that's been true for a long time.
A
Yeah, just think about that. Like, what a crazy turn of events. But you know, a database of, you know, any of these databases of startups and corporate information, they're, they're worth tens of millions to low hundreds of millions because they're not the hardest things in the world to replicate. And the data from the historical data is not actually super valuable and data is not copyrightable. So the data from any chart, like somebody produces data and you see this all the time. Like there was this E marketer website, was that the name of it? That just took everybody else's data and made their own beautiful charts and then they sold a subscription to it and. But I know we would use their charts and I was like, wait a Second, how is this possible? And then I looked it up and there were all kinds of lawsuits around this. But you can produce whatever, any data that's on the Internet, you can just take it and make a chart, put your own logo on it and then just put source of the data and then put your own interpretation of the data. You can't copyright data. You can have a terms of service that says you can't go to Crunchbase and scrape it. You could have that terms of service. Yeah, you, there are other ways to protect it. But a person in Israel or India or Manila could buy a Crunchbase account, I believe. And in those markets where, you know, it's even harder to sue somebody for this kind of stuff, they could just hire people for a dollar an hour to make their own databases and essentially copy the information and there's very little recourse. And LinkedIn lost their case. When they did the LinkedIn case with somebody, there was somebody scraping it. I think Mark and Facebook lost their case. There was somebody in Israel I believe, doing data scraping and made a social database of like profiles from Instagram.
C
And so yeah, but the middle market is kind of what I was more more trying to get at than just the crunch basics. I probably shouldn't have named a company but like just.
A
It's an interesting diversion but the mid market is going to be back big time.
C
Great news for Frickin Venture liquidity for those 2021 funds, Jason, who have been.
A
It should be absolutely great. And I see in the notes here that Tomas Tungu GP at Theory Ventures, he says the same thing. It's going to eliminate the gymnastics that major acquirers are going through and should really open up the door. He's 100% right, I'm with him.
C
The thing that I not to be negative, but the FTC pursued this and kept going with the case. Jason, so do you think that the FTC has changed temperature or do you think that this loss in court is just going to kind of scare them away from being active in the next couple years? Because I hope you're right.
A
What it comes down to is these lawsuits were on the margins political and they were 100% of the time in the review mirror. So you put those two things together, they weren't the strongest cases. The strongest cases would be against Apple for, for the App Store, which they have like, you know, a certain percentage of the revenue versus Google's and a certain percent of the market share in that duopoly. And I guess the next best one would be the Google search one. And those are going to be go out with a whimper.
C
So business wins.
A
We'll see you next time on this week in Startups. Bye Bye.
Episode Title: Why data is the biggest AI bottleneck (feat. Arthur Mensch of Mistral AI)
Release Date: November 20, 2025
Host: Jason Calacanis
Guests: Arthur Mensch (CEO & Co-founder, Mistral AI)
Co-Host: Alex
This episode dives deep into the critical role of data as the primary bottleneck in advancing large language models and AI applications, especially for enterprise and edge use cases. Jason Calacanis and Alex are joined by Arthur Mensch, CEO and Co-founder of Mistral AI—the leading European AI foundation model company—to discuss open source versus closed model philosophies, enterprise AI adoption realities, the nuances of model training and benchmarking, and where AI hardware, data, and regulation are heading. The conversation is candid, technical, and entrepreneurial in tone, giving listeners an inside look at the present and future of AI development.
Pace & Competition:
Dual Challenge: Simultaneously building a scientific, model-creating org and a business that generates revenue is tough but a differentiator.
On the importance of expert data for model advancement:
“Onboarding and having full time employees that actually have the expertise to judge whether we are actually making progress is super important.” — Arthur, 31:21
On the iterative nature of enterprise AI adoption:
“You still need to iterate… That iterative mindset… is kind of missing in enterprises.” — Arthur, 16:34
On benchmarks and real progress:
“If you too explicitly set a benchmark as a goal… at some point the benchmarks stop making sense.” — Arthur, 36:34
On open-source philosophy and business:
“We feel… Europe and the US are really missing a leader in that field and that because of all of the progress we've made, we're able to take the lead.” — Arthur, 22:15
On the timeline for true self-driving cars in Europe:
“Four years from now.” — Arthur, 48:36
| Segment | Timestamp | |-------------------------------------------------|-------------| | Big Tech AI & Competition | 02:06–11:27 | | Mistral AI's Beginnings and Race Perspective | 11:27–12:58 | | Enterprise AI Adoption Reality | 13:32–20:20 | | Open Source vs. Closed Source | 21:42–24:30 | | Trusting OpenAI/Data Sovereignty | 24:30–27:06 | | Compute vs. Data Bottlenecks | 27:06–29:03 | | Hiring Experts & Annotation | 29:03–33:44 | | Model Benchmarks & "Benchmaxing" | 34:26–37:56 | | European Regulation & AI Act | 37:56–40:32 | | Edge AI & B2B Robotics Focus | 40:32–44:31 | | Consumer Robotics, Self-Driving Car Bar | 44:31–48:36 | | Drive-from-Madrid-to-Moscow Over/Under | 47:16–48:52 |
This episode stands out for its nuanced, real-world discussion of what’s actually slowing down AI progress—not just computational power, but the specificity, quality, and ownership of data. Mistral AI’s focus on enterprise, open source, and the deep embedding of AI within real client contexts sharply contrasts with the monolithic, closed approaches in the U.S., offering a vision rooted in sovereignty and long-term value. The interviews are lively, conversational, and peppered with insights useful for founders, investors, and anyone tracking the AI frontier.
Data—especially proprietary, expert, and real-world data, not compute alone—will define the AI leaders of the coming decade. The move toward enterprise value, open source, and edge computing is already underway, and regulatory, business, and technical strategies must keep pace.