![Gavin Baker - Nvidia v. Google, Scaling Laws, and the Economics of AI - [Invest Like the Best, EP.451] — Invest Like the Best with Patrick O'Shaughnessy cover](https://megaphone.imgix.net/podcasts/d97fee14-d4d7-11f0-8951-8324b640c1c1/image/d3a8b5ecbf3957de5b91f278a191c9ff.jpg?ixlib=rails-4.3.1&max-w=3000&max-h=3000&fit=crop&auto=format,compress)
Loading summary
A
Here's an interesting question to think about. If your finance team suddenly had an extra week every month, what would you have them work on? Most CFOs don't know because their finance teams are grinding it out on lost expense reports, invoice coding, and tracking down receipts until the last possible minute. That's exactly the problem that Ramp set out to solve. Looking at the parts of finance everyone quietly hates and asking, why are humans doing any of this? Turns out they don't need to. Ramp's AI handles 85% of expense reviews automatically with 99% accuracy, which means your finance team stops being the department that processes stuff and starts being the team that thinks about stuff. Here's the real shift. Companies using Ramp aren't just saving time, they're reallocating it. While competitors spend two weeks closing their books, you're already planning next quarter. While they're cleaning up spreadsheets, you're thinking about new pricing strategy, new markets, and where the next dollar of ROI comes from. That difference compounds. Go to ramp.com invest to try ramp and see how much leverage your team gains when the work you have to do stops getting in the way of the work that you want to do. To me, Ridgeline isn't just a software provider. It's a true partner in innovation. They're redefining what's possible in asset management technology, helping firms scale faster, operate smarter, and stay ahead of the curve. I want to share a real world example of how they're making a difference. Let me introduce you to Brian.
Brian, please introduce yourself and tell us a bit about your role. My name is Brian Strang. I'm the technical operations lead and I work at Congress Asset Management. How would you describe your experience working with Ridgeline? Ridgeline is a technology partner, not a software vendor, and the people really care. I get sales calls all the time and I ignore them. Ridgeline sold me very quickly. We went from $7 billion to $23 billion and the goal is $50 billion. Ridgeline was the clear frontrunner to help us scale. In your view. What most distinguishes Ridgeline, they reimagined how this industry should work. It was obviously they were operating on another level.
It's worth reaching out to Ridgeline to see what the unlock can be for your firm. Visit ridgelineapps.com to schedule a demo. One of the hardest parts of investing is seeing what's shifting before everyone else does. AlphaSense is helping investors do exactly that. You may already know AlphaSense as the market intelligence platform trusted by 75% of the world's top hedge funds, providing access to over 500 million premium sources from company filings and broker research to news trade journals and over 200,000 expert transcript calls. What you might not know is that they've recently launched something game changing AI powered Channel checks Channel checks give you a real time expert driven perspective on public companies weeks before they show in earnings or consensus revisions. AlphaSense uses an AI interviewer to run thousands of expert calls with real human experts every month, asking consistent questions across experts so the signals are clean, comparable and useful. You get live updates as interviews come in full transcript access and coverage across every major sector. Instantly compare insights across experts and analyze quarter over quarter trends in sentiment and key performance indicators. For investors trying to stay ahead of the fast moving markets, it's already table stakes.
Hello and welcome everyone. I'm Patrick o' Shaughnessy and this is Invest like the Best. This show is an open ended exploration of markets, ideas, stories and strategies that will help you better invest both your time and your money. If you enjoy these conversations and want to go deeper, check out Colossus Review, our quarterly publication with in depth profiles of the people shaping business and investing you. You can find Colossus Review along with all of our podcasts@joincolasis.com.
Patrick O' Shaughnessy.
B
Is the CEO of Positive Sum. All opinions expressed by Patrick and podcast guests are solely their own opinions and do not reflect the opinion of Positive Sum. This podcast is for informational purposes only and should not be relied upon as a basis for investment decisions. Clients of Positive Sum may maintain positions in the securities discussed in this podcast. To learn more, visit Psum vc.
A
I will never forget when I first met Gavin Baker in 2017. I find his interest in markets, his curiosity about the world, to be as infectious as any investor that I've ever come across. He is encyclopedic on what is going on in the world of technology today and I've had the good fortune to host him every year or two on this podcast. In this conversation we talk about everything that interests Gavin. We talk about interest, Nvidia, Google and its TPUs, the changing AI landscape, the math and business models around AI companies, and everything in between. We even discussed the crazy idea of data centers in space which he communicates with his usual passion and logic. In closing at the end of this conversation, because I've asked him my traditional closing question before, I asked him a different question which led to a discussion of his entire investing origin story that I had never heard before. Because Gavin is one of the most passionate thinkers and investors that I know. These conversations are always amongst my most favorite. I hope you enjoy this latest in the series of discussions with Gavin Baker.
I would love to talk about how you like in the nitty gritty process, new things that come out in this AI world because it's happening so constantly. I'm extremely interested in it and I find it very hard to keep up. And I, you know, I have a couple blogs that I go read and friends that I call. But like maybe let's take Gemini 3 as like a recent example. When that comes out, take me into your office. Like, what are you doing? How do you and your team process an update like that, given how often these things are happening?
B
I mean, I think the first thing is you have to use it yourself. And I would just say I'm amazed at how many famous and august investors are reaching really definitive conclusions about AI. Well, no, based on the free tier. The free tier is like you're dealing with a 10 year old and you're making conclusions about the 10 year old's capabilities as an adult and you could just pay. And I do think actually you do need to pay for the highest tier, whether it's Gemini, Ultra, Super Grock, whatever it is, you have to pay the 200 per month tiers, whereas those are like a fully fledged 30, 35 year old. It's really hard to extrapolate from an 8 or a 10 year old to the 35 year old and yet a lot of people are doing that. And the second thing is there was a, you know, an Insider post about OpenAI and they said to a large degree OpenAI runs on Twitter Vox. And I just think AI happens on X. There have been some really memorable moments, like there was a giant fight between the Pytorch team at Meta and the Jax team at Google on X. And the leaders of each lab had to step in publicly, say, no one from my lab is allowed to say bad things about the other lab and I respect them. And that is the end of that. The companies are all commenting on each other's posts. You know, the research papers come out, there's a list of, you know, if on planet Earth there's 500 to a thousand people who really, really understand this and are at the cutting edge of it and a good number of them live in China. I just think you have to follow those people closely and I think there is incredible signal. Everything in AI is just downstream of those people. Yeah, everything. Andrej Karpathy writes, you have to read it three times minimum. Yeah, he's incredible. And then I would say anytime at one of Those labs, the four labs that matter, OpenAI, Gemini, Anthropic and Xai, which are clearly the four leading labs, anytime somebody from one of those labs goes on a podcast, I just think it's so important to listen. For me, one of the best use cases of AI is to keep up with all of this, listen to a podcast, and then if there are parts that I thought were interesting, just talk about it. With AI, I think it's really important to have as little friction as possible. I'll bring it up. I can either press this button and pull up grock or I have this.
A
It's so amazing. Can you believe we have this?
B
I know, it's like somebody said on, on, on X, you know, like we imbued these rocks with crazy spells and now we can summon super intelligent genies on our phones over the air. You know, it's crazy, crazy.
A
Okay, so. So something like Gemini 3 comes out. The public interpretation was, oh, this is interesting. It seems to say something about scaling laws and the pre training stuff. What is your frame on, like the state of general progress in frontier models in general? Like, what are you watching most closely?
B
Yeah, well, I do think Gemini 3 was very important because it showed us that scaling laws for pre training are intact. They stated that unequivocally. And that's important because no one on planet Earth knows how or why scaling laws for pre training work. It's actually not a law. It's an empirical observation. And it's an empirical observation that we've measured extremely precisely and has held for a long time. But our understanding of scaling laws for pre training, and maybe this is a little bit controversial with 20% of researchers, but probably not more than that, is kind of like the ancient British people's understanding of the sun or the ancient Egyptians understanding the sun. They can measure it so precisely that the east west axis of the great pyramids are perfectly aligned with the equinoxes and so are the east axises of stoneage. Perfect measurement. They didn't understand orbital mechanics. They had no idea how or why it rose in the east, set in the west and, you know, moved across the horizon.
A
There's the aliens that don't.
B
Yeah, yeah, our God in a chariot. And so it's really important every time we get a confirmation of that. So Gemini 3 was very important in that way. But I'd say I think there's been a big misunderstanding of maybe in the public equity Investing community or the broader, more generalist community based on the scaling laws of pre training, there really should have been no progress in 24 and 25. And the reason for that is, is after XAI figured out how to get 200,000 hoppers coherent, you had to wait for the next generation of chips because you really can't get more than 200,000 hoppers coherent. And coherent just means you could just think of it as each GPU knows what every other GPU is thinking. They kind of are sharing memory, they're connected, they scale up networks and scale out, and they have to be coherent during the pre training process. The reason we've had all this progress, maybe we could show like the Arc AGI slide where you had 0 to 8 over four years, 0 to 8% intelligence, and then you went from 8% to 95% in three months. When the first reasoning model came out from OpenAI, we have these two new scaling laws of post training, which is just reinforcement learning with verified rewards. Verified is such an important concept in AI. Like one of Karpathy's great things was with software, anything you can specify you can automate. With AI, anything you can verify you can automate. It's such an important concept and I think an important distinction. And then test, time, compute. And so all the progress we've had, and we've had immense progress since October 24th through today was based entirely on these two new scaling laws. And Gemini 3 was arguably the first test since Hopper came out of the scaling law for pre training. And it held. And that's great because all these scaling laws are multiplicative. So now we're going to apply these two new reinforcement learning with verified rewards and test time compute to much better base models. There's a lot of misunderstanding about Gemini 3 that I think is really important. So the most important thing to conceptualize. Everything in AI has a struggle between Google and Nvidia. And Google has a TPU, Nvidia has their GPUs and Google only has a TPU and they use a bunch of other chips for networking. You know, Nvidia has the full stack. Blackwell was delayed. Blackwell was Nvidia's next generation chip. The first iteration of that was the Blackwell 200. A lot of different SKUs were canceled. And the reason for that is it was by far the most complex product transition we've ever gone through in technology. Going from Hopper to Blackwell. First you go from air cooled to liquid cooled. The rack goes from weighing round numbers a thousand pounds to 3,000 pounds, goes from round numbers 30 kilowatts, which is 30American homes, to 130 kilowatts, which is 130American homes. I analogize it to imagine if to get a new iPhone you had to change all the outlets in your house to 220 volt, put in a Tesla power wall, put in a generator, put in solar panels. That's the power. You know, put in a whole home humidification system and then reinforce the floor because the floor can't handle this. So it was a huge product transition and then just the rack was so dense it was really hard for them to get the heat out. So Blackwells have only really started to be deployed and really scaled deployments over the last three or four months.
A
Can you explain why it has been such an important thing that Blackwell was delayed?
B
This black hole is so complicated and it was so hard for everyone to get these exquisitely complex racks working consistently. Had reasoning not come along, there would have been no AI progress from mid 2024 through essentially Gemini 3, there would have been none. Everything would have stalled. And you can imagine what that would have meant to the markets for sure. We would have lived in a very different environment. So reasoning kind of bridged this 18 month gap. Reasoning kind of saved AI because it let AI make progress without Blackwell or the next generation of TPU which were necessary for the scaling laws for pre training to continue. Google came out with the TPU V6 in 2024 and the TPU V7 in 2025. In semiconductor time, imagine like Hopper. It's like a World War II era airplane and it was by far the best World War II era airplane. It's a P51 Mustang with the Merlin engine. And two years later in semiconductor time that's like you're an F4 Phantom. Okay? Because Blackwell was such a complicated product and so hard to ramp. Google was training Gemini 3 on 24 and 25 era TPU's which are like F4 Phantoms. Blackwell, it's like an F35. It just took a really long time to get it going. So I think Google for sure has this temporary advantage right now from a pre training perspective. I think it's also important that they've been the lowest cost producer of tokens. And this is really important because AI is the first time in my career as a tech investor that being the low cost producers ever matter. Apple is not worth trillions because they're low cost producer of phones. Microsoft is not worth trillions because they're low, low cost producer of software. Nvidia is not worth trillions because they're the low cost producer of AI accelerators. It's never mattered. And this is really important because what Google has been doing has the low cost producer is they have been sucking the economic oxygen out of the AI ecosystem, which is an extremely rational strategy for them. And for anyone who's a low cost producer, let's make life really hard for our competitors. So what happens now? I think this has pretty profound implications. One, we'll see the first models trained on Blackwell in early 2026. I think the first Blackwell model will come from Xai. And the reason for that is just according to Jensen, no one builds data centers faster than Elon. Jensen has said this on the record. And even once you have the Blackwells, it takes six to nine months to get them performing at the level of. Because Hopper is finely tuned, everybody knows how to use it, the software is perfect for it, the engineers know all its quirks. Everybody knows how to architect a Hopper data center at this point. And by the way, when Hopper came out, it took six to 12 months for it to really outperform Ampere, which was generation four. So if you're Jensen or Nvidia, you need to get as many GPUs deployed in one data center as fast as possible in a coherent cluster so you can work out the bugs. And so this is what XAI effectively does for Nvidia because they build the data centers the fastest, they can deploy Blackwells at scale the fastest, and they can help work with Nvidia to work out the bugs for everyone else. So because they're the fastest, they'll have the first Blackwell model. We know that scaling laws for pre training are intact and this means that Blackwell models are going to be amazing. Blackwell is, I mean it's not an F35 versus an F4 Phantom, but from my perspective it is a better chip. You know, maybe it's like an F35 versus a Rafael. And so now that we know pre scaling holding, we know that these Blackwell models are going to be really good. Based on the raw specs they should probably be better. Then something even more important happens. So the GB200 was really hard to get a coin. The GB300 is a great chip. It is drop in compatible in every way with those GB200 racks. Now you're not going to replace the GB2 hundreds but just no new powerwalls. Yeah, yeah, just any data center that can handle those. You can slot in the GB three hundreds and now everybody's good at making those Racks and you know how to get the heat out, you know how to cool them. You're going to put those GB3 hundreds in and then the companies that use the GB3 hundreds, they're going to be the low cost producer of tokens, particularly if you're vertically integrated. If you're paying a margin to someone else to make those tokens, you're probably not going to be. I think this has pretty profound implications. I think it has to change Google's strategic calculus. If you have a decisive cost advantage and you're Google and you have Search and all these other businesses, why not run AI at a minus 30% margin? It is by far the rational decision. Take the economic oxygen out of the environment. You eventually make it hard for your competitors who need funding, unlike you, to raise the capital they need and then on the other side of that maybe have an extremely dominant share position. Well, all that calculus changes once Google is no longer the low cost producer, which I think will be the case. The Blackwells are now being used for training. And then when that model is trained, you start shifting Blackwell clusters over to inference and then all these cost calculations and these dynamics change. It's very interesting during the strategic and economic calculations between the players, I've never seen anything like it. Everyone understands their position on the board, what the prize is, what play their opponents are running. And it's really interesting to watch if Google changes its behavior because it's going to be really painful for them as a higher cost producer to run that negative 30% margin, it might start to impact their stock. That has pretty profound implications for the economics of AI. And then when Rubin comes out, the gap is going to expand significantly versus TPUs, versus TPUs and all other ASICs. Now I think Trainium 3 is probably going to be pretty good and Trainium 4 are going to be good.
A
Why is that the case? Why won't TPU V8, V9 be every bit as good as a couple of things.
B
So one, for whatever reason, Google made more conservative design decisions. I think part of that is so Google, let's say the tpu. So there's front end and back end of semiconductor design and then there's dealing with Taiwan semi. And you can make an ASIC in a lot of ways. What Google does is they do mostly the front end for the TPU and then Broadcom does the back end and manages Taiwan semi and everything. It's a crude analogy, but the front end is like the architect of the house. They design the house. The back end is the person who builds the house and the imaging Taiwan semi is stamping out that house like Lennar or Dr. Horton. And for doing those two latter parts, Broadcom earns a 50 to 55% gross margin. We don't know what on TPUs, let's say in 2027 TPU, I think it sets US estimates maybe somewhere around 30 billion. Again, who knows? 30 billion I think is a reasonable estimate. 50 to 55% gross margins. So Google is paying Broadcom $15 billion. That's a lot of money. At a certain point it makes sense to bring a semiconductor program entirely in house. So in other words, Apple does not have an ASIC partner for their chips. They do the front end themselves, the back end, and they manage Taiwan semi. And the reason is they don't want to pay that 50% margin. So at a certain point it becomes rational to renegotiate this. And just as perspective, the entire opex of Broadcom semiconductor division is round numbers $5 billion. So it'd be economically rational now that Google's paying. If it's 30 billion, we're paying them 15. Google can go to every person who works in Broadcom semi, double their comp and make an extra 5 billion in 2028. Let's just say it does 50 billion. Now it's 25 billion. You could triple their comp. And by the way, you don't need them all. Yeah, and of course they're not going to do that because of competitive concerns. But with TPU V8 and V9, all of this is beginning to have an impact because Google is bringing in media tech. This is maybe the first way you send a warning shot to Broadcom. We're really not happy about all this money we're paying. But they did bring media tech in. And the Taiwanese basic companies have much lower gross margins. So this is kind of the first shot against the bow. And then there's all this stuff people say, but Broadcom has the best Serdes. Broadcom has really good Serdes. And Serdes is like an extremely foundational technology because it's how the chips communicate with each other. You have to serialize and do serialize. But there are other good Serdes providers in the world. A really good Serdes is Maybe it's worth 10 or 15 billion a year, but it's probably about worth 25 billion a year. So because of that friction and I think conservative design choices on the part of Google and maybe the reason they made those conservative design choices is because they were going to a bifurcated supply TPU is slowing down, I would say as the GPUs are accelerating. This is the first competitive response of Lisa and Jensen to everybody saying we're going to have our own asic is hey, we're just going to accelerate, we're going to do a GPU every year and you cannot keep up with us. And then I think what everybody is learning is like, oh wow, that's so cool you made your own accelerator has an asic. Wow. What's the NIC going to be? What's the CPU going to be? What's the scale up switch going to be? What's the scale up protocol? What's the scale out switch? What kind of optics are you to use? What's the software that's going to make all this work together? And then it's like, oh shit, I made this tiny little chip. And you know, like, whether it's admitted or not, I'm sure the GPU makers don't love it when their customers make ASICS and try to compete with them. And like, whoops, what did I do? I thought this was easy. You know, it takes at least three generations to make a good chip like the TPU V1. I mean it was an achievement in that they made it. It was really not till TPU v3 or v4 that the TPU started to become like even vaguely competitive.
A
Is that just a classic learning by doing thing?
B
A hundred percent. And even if you've made from my perspective, the best ASIC team at any semiconductor company is actually the Amazon ASIC team. They're the first one to make the Gravitron cpu. They have this nitro, it's called Supernic. They've been extremely innovative, really clever. And like Trainium and Infantry one, maybe they're a little better than the TPUV one, but only a little. Trainium two, you get a little better. Trainium three, it's I think the first time it's like, okay. And then, you know, I think Trainium 4 will probably be good. I will be surprised if there are a lot of ASICs other than Trainium and TPU. And by the way, in Trainium and TPU will both run on customer owned tooling. At some point we can debate when that will happen. But the economics of success that I just described mean it's inevitable. Like no matter what the companies say, just the economics make it and reasoning from first principles make it absolutely inevitable.
A
If I were to zoom all the way out on this stuff, I find these details unbelievably interesting. And it's like the grandest game that's ever been played. It's so crazy and so fun to follow. Sometimes I forget to zoom out and say, well, so what? Like, okay, so project this forward three generations, past Rubin or whatever. What is like the global human dividend of all this crazy development where we keep making the loss lower on these pre training scaling models? Like, who cares? Like, it's been a while since I've asked this thing something that I wasn't kind of blown away by the answer. For me personally, what are the next couple of things that all this crazy infrastructure war allows us to unlock because they're so successful?
B
If I were to posit like an event path, I think the Blackwell models are going to be amazing. The dramatic reduction in per token cost enabled by the GP300 and probably more the Mi450 than the Mi355 will lead to these models being allowed to think for much longer, which means they're going to be able to do new things. I was very impressed. Gemini 3 made me a restaurant reservation. It's the first time it's done something for me. And I mean other than like go research something and teach me stuff. If you can make a restaurant reservation, you're not that far from being able to make a hotel reservation and an airplane reservation and order me an Uber.
A
And all of a sudden you got an assistant.
B
Yeah. And you could just imagine. Everybody talks about that, but you can just imagine it's on your phone. I think that's pretty near term. But some big companies that are very tech forward, 50% plus of customer support is already done by AI and that's a 400 billion industry. And then if you know what AI is great about is persuasion. That's sales and customer support. And so of the functions of a company, if you think about them, they're to make stuff, sell stuff and then support the customers. So right now maybe you're in late 26, you're going to be pretty good at two of them. I do think it's going to have a big impact on media. Like I think robotics, you know, we talked about the last time, are going to finally start to be real. You know, there's an explosion and kind of exciting robotic startups. I do still think that the main battle is going to be between Tesla's Optimus and the Chinese because, you know, it's easy to make prototypes, it's hard to mass produce them. But then it goes back to that. What Andrej Karpathy said about AI can automate anything that can be verified. So any function where there's a right or wrong answer or right or wrong outcome, you can apply reinforcement learning and make the AI really good at that.
A
What are your favorite examples of that so far?
B
Or theoretically, I mean, just does the model balance? They'll be really good at making models. Do all the books globally reconcile? They'll be really good at accounting, double entry bookkeeping. It has to balance. There's a verifiable, you got it right or wrong. Support or sale? Did you make the sale or not? That's just like AlphaGo. Did you win or you lose? Did the guy convert or not? Did the customer ask for an escalation during customer support or not? It's most important. Functions are important because they can be verified. So I think if all of this starts to happen and starts to happen in 26, there'll be an ROI on Blackwell and then all this will continue. And then we'll have Rubin and then that'll be another big quantum spin. Rubin and the Mi 450 and the TPU V9. And then I do think just the most interesting question is what are the economic returns to artificial superintelligence? Because all of these companies in this great game, they've been in a prisoner's dilemma. They're terrified that if they slow down gone forever and their competitors don't, it's an existential risk. And you know, Microsoft blinked for like six weeks earlier this year. Yeah, I think they would say they regret that. But with Blackwell and for sure with Rubin, the economics are going to dominate the Prisoner's Dilemma from a decision making and spending perspective, just because the numbers are so big. And this goes to kind of the ROI on AI question. And the ROI on AI has empirically, factually, unambiguously been positive. I just always find it strange that there's any debate about this because the largest spinners on GPUs are public companies. They report something called audited quarterly financials. And you can use those things to calculate something called a return on invested capital. And if you do that calculation, the ROIC of the big public spenders on GPUs is higher than it was before they ramped spending. And you could say, well, part of that is, you know, OPEX savings. Well, at some level that is part of what you expect the ROI to be from AI. And then you say, well, a lot of it is actually just applying GPUs, moving the big recommender systems that power the advertising and the recommendation systems from CPUs to GPUs and you've had massive efficiency gains. And that's why all the revenue growth at these companies has accelerated. But like, so what? The ROI has been there. And it is interesting, like every big Internet company, the people who are responsible for the revenue are intensely annoyed at the amount of GPUs that are being given to the researchers. It's a very linear equation. If you Give me more GPUs, I will drive more revenue. Give me those GPUs, we'll have more revenue, more gross profit, and then we can spend money. So it's this constant fight at every company. One of the factors in the Prisoner's dilemma is everybody has this like religious belief that we're going to get to asi and at the end of the day, what do they all want? Almost all of them want to live forever and they think that ASI is going to help them with that. Right?
A
Good return.
B
That's a good return. But we don't know. And if as humans we have pushed the boundaries of physics, biology and chemistry, the natural laws that govern the universe, then maybe the economic returns to ASI aren't that high.
A
I'm very curious about your favorite sort of throw cold water on this stuff type takes that you think about sometimes one would be like the things that would cause, I'm curious what you think the things that would cause this demand for compute to change or even the trajectory of it to change.
B
There's one really obvious bear case and it is just edge AI and it's connected to the economic returns to ASI in three years on a bigger and bulkier phone to fit the amount of dram necessary, you know, and the battery won't probably last as long. You will be able to probably run like a pruned down version of something like Gemini 5 or Grok 4, Grok 4.1 or ChatGPT at 30, 60 tokens per second and then that's free. And this is clearly Apple's strategy. It's just we're going to be a distributor of AI and we're going to make it privacy safe and run on the phone and then you can call one of the big models, you know, the, the God models in the cloud, whatever. You have a question. And if that happens, if like 3060 tokens a second at a 115 IQ is good enough, I think that's a bear case. Other than just these scaling laws break. But in terms of if we assume scaling laws continue and we now know they're going to continue for pre training for at least One more generation and we're very early in the two new scaling laws for post training, mid training, rlvr, whatever people want to call it, and then test time computed inference. We're so early in those and we're getting so much better at helping the models hold more and more context in their minds. And as they do this test time, compute and that's really powerful because everybody's like, well how's the model going to know this? Well, eventually, if you can hold enough context, you can just hold every slack message and outlook message and company manual in, in a company in your context and then you can compute the new task and compare it with your knowledge of the world, what you think, what the model thinks, all this context. And it may be that like just really, really long context windows are the solution to a lot of the current limitations and that's enabled by all these cool tricks like kv, cash offload and stuff. But I do think other than scaling loss, slowing down, other than there being low economic returns to ASI Edge, AI is to me by far the most plausible and scariest bear case.
A
I like to visualize like different S curves you invested through the iPhone and I love to like see the visual of the iPhone models as it sort of went from this clunky bricky thing up to the what we have now. We're like, each one's like a little bit, you know, obviously we've sort of petered out on its form factor. If you picture something similar for the frontier models themselves, does it feel like it's at a certain part of that natural technology paradigm progression?
B
If you're paying for Gemini Ultra or Super Grock and you're getting the good AI, it's hard to see differences. Like I have to go really deep on something like do you think PCI Express or Ethernet is a better protocol for scale up networking? And why show me the scientific papers. And if you shift between models and you ask a question like that, where you know it really deeply, then you see differences. I do play fantasy football. Winnings are donated to charity. But it is like, you know, these new models are quite a bit better at helping who should I play? They think in much more sophisticated ways. If you're a historically good fantasy football player and you're having a bad season, this is why, this is why, because you're not using it, you know, and I think we'll see that in more and more domains. But I do think they're already at a level where unless you are a true expert or just have an intellect that is beyond mind. It's hard to see the progress. And that's why I do think we need to shift from getting more intelligent to more useful. Unless more intelligence starts leading to these massive scientific breakthroughs and we're curing cancer in 26 and 27. I don't know that we're going to be curing cancer, but I do think from almost an ROI s curve, we need to kind of hand off from intelligence to usefulness. And then usefulness will then have to hand off to scientific breakthrough. Just that creates whole new industries.
A
What are the building blocks of usefulness in your mind?
B
Just being able to do things consistently and reliably. And a lot of that is keeping all the context. Like there's a lot of context. If someone wants to plan a trip for me, like, you know, I've, I've acquired these strange preferences. Like I follow that guy Andrew Huberman. So I like to have an east facing balcony so I can get morning sun, you know, so the AI has to remember, here's how I like to fly. Here are my preferences for that. Being on a plane with starlink is important to me. Okay. Here are the resorts I've historically liked. Here are the kinds of areas I've liked. Here are the rooms that I would really like at each. That's a lot of, of context. And to keep all of that and kind of weight those, it's a hard problem. So I think context windows are a big part of it. You know, there's this meter, task evaluation.
A
Thing, how long it can work for.
B
How long it can work for. And you could think of that as being related in some way to context. Not precisely, but that just task length needs to keep expanding. Because booking a restaurant and booking is economically useful, but it's not that economically useful. But booking me an entire vacation and knowing the preferences of my parents, my sister, my niece and my nephew, that's a much harder problem. And that's something that like a human might spend three or four hours on optimizing that. And then if you can do that, that's amazing. But then again, I just think it has to be good at sales and customer support relatively soon. And then after that it has to be, and I think it is already here. I do think we're going to see kind of an acceleration in the awesomeness of various products. Engineers are using AI to make products better and faster.
A
We both invested in Fortel, the hearing aid company, which is just absolutely remarkable, like something I never would have thought of.
B
And we're going to See, I think something like that in every vertical. And that's AI being used for the most core function of any company which is designing the product. And then it will be, you know, there's already lots of examples of AI being used to help manufacture the product and distribute it more efficiently. Whether it's optimizing a supply chain, having a vision system, watch a production line, a lot of stuff is happening. The other thing I think is really interesting in this whole ROI part is Fortune 500 companies are always the last to adopt a new technology. They're conservative, they have lots of regulations, lots of lawyers. Startups are always the first. So let's think about the cloud, which was the last truly transformative new technology for enterprises, being able to have all of your compute in the cloud and use SaaS. So it's always upgraded, it's always great, et cetera, et cetera. You can get it on every device. I think the first AWS reinvent I think was in 2013 and by 2014 every startup on planet Earth ran on the cloud. The idea that you would buy your own server and storage box and router was ridiculous. And that probably happened like even earlier. That had probably already happened before the first reinvention, the first big Fortune 500 companies started to standardize on it like maybe five years later. You see that with AI. I'm sure you've seen this in your startups. And I think one reason VCs are more broadly bullish on AI than public market investors is VCs see very real productivity gains. There's all these charts that for a given level of revenue, a company today has significantly lower employees than a company of two years ago. And the reason is the AI is doing a lot of the sales, the support and helping to make the product. I mean there's, you know, iconic, has some charts, a 16Z. By the way, David George is a good friend, great guy. You know, he has this model busters thing. So there's very clear data that this is happening. So people who have a lens into the world of venture see this and I do think it was very important in the third quarter. This is the first quarter where we had Fortune 500 companies outside of the tech industry give specific quantitative examples of AI driven uplifting. So C.H. robinson went up something like 20% on earnings. Should I tell people what C.H. robinson does? Yeah, let's just say a truck goes from Chicago to Denver and then the trucker lives in Chicago. So it's going to go back from Denver to Chicago. There's an empty load. And C.H. robinson has all these relationships with these truckers and trucking companies and they match shippers demand with that empty load supply to make the trucking more efficient. You know, they're a freight forwarder. You know, there's actually lots of companies like this, but they're the biggest and most dominant. So one of the most important things they do is they quote price and availability. So somebody, a customer calls them up and says hey, I urgently need three 18 wheelers from Chicago to Denver. In the past they said it would take them, you know, 15 to 45 minutes and they only quoted 60% of inbound requests with AI, they're quoting 100% and doing it in seconds. And so they printed a great quarter and the stock went up 20% and it was because of AI driven productivity that's impacting the revenue line, the cost line, everything. I was actually very worried about the idea that we might have this Blackwell ROI air gap because we're spending so much money on Blackwell. Those Blackwells are being used for training and there is no ROI on training. Training is you're making the model. The ROI comes from inference. So I was really worried that, you know, we're going to have maybe this three quarter period where the CAPEX is unimaginably high. Those Blackwells are only being used for training. Right.
A
R Staying flat, eyes going up.
B
Yeah, exactly. And so roic goes down and you can see like Meta. Meta they printed, you know, because Meta has not been able to make a frontier model. Meta printed a quarter where ROIC declined and that was not good for the stocks. I was really worried about that. I do think that those data points are important in terms of suggesting that maybe we'll be able to navigate this potential air gap. And roic.
A
Yeah, it makes me wonder about in this market, I'm like everybody else, it's the 10 companies at the top that are all the market cap more than all of the attention. There's 490 other companies, the S&P 500, you studied those too. Like what do you think about that group? Like what is interesting to you about the group that now nobody seems to talk about and no one really seems to care about because they haven't driven returns and they're a smaller percent of the overall index.
B
I think that people are going to start to care if you have more and more companies print these C.H. robinson like quarters. I think the companies that have historically been really well run, the reason they have a long track record of success. You cannot succeed without using technology. Well and so if you have a kind of internal culture of experimentation and innovation, I think you will do well with AI. I would bet on the best investment banks to be earlier and better adopters of AI than maybe some of the trailing banks. Just sometimes past is prologue, and I think it's likely to be in this case. One strong opinion I have, all these VCs are setting up these holding companies and you know, we're going to use AI to make traditional businesses better. And they're really smart VCs and they're great track records, but that's what private equity has been doing for 50 years. You're just not going to be private equity at their game.
A
This is what Vista did in the early days, right?
B
Yeah. And I do think this is actually private equities maybe had a little bit of a tough run. Just multiples have gone up now. Private assets are more expensive, the cost of financing has gone up. It's tough to take a company public because the public valuation is 30% lower than the private valuation. So PE's had a tough run. I actually think these private equity firms are going to be pretty good at systematically applying AI.
A
We haven't spent much time talking about Meta, Anthropic or OpenAI, and I'd love your impression on everything that's going on in this infrastructure side that we talked about. These are three really important players in this grand game. How does all of this development that we've discussed so far impact those players specifically?
B
First thing, let me just say about Frontier models broadly in 20, 23 and 24, I was fond of quoting Eric Fisher and Eric Fisheria's statement, our friend, brilliant man. And Eric would always say, foundation models are the fastest appreciating assets in history. And I would say he was 90% right. I modified the statement. I said foundation models without unique data and Internet scale distribution are the fastest appreciating assets in history. And reasoning fundamentally changed that in a really profound way. So there was a loop, a flywheel, to quote Jeff Bezos, that it was at the heart of every great Internet company. And it was, you made a good product. You got users, those users using the product generated data that could be fed back into the product to make it better. And that flywheel has been spinning at Netflix, at Amazon, at Meta, at Google for over a decade. And that's an incredibly powerful flywheel. And it's why those Internet businesses were so tough to compete with. It's why they're increasing returns to scale. Everybody talks about network effects. They were important for Social networks. I don't know to what extent Meta is a social network anymore. It's more like a content distribution. But they just had increasing returns to scale because of that flywheel. And that dynamic was not present in the pre reasoning world of AI. You pre trained a model, you let it out in the world and it was what it was. And it was actually pretty hard. They would do RLHF reinforcement learning with human feedback and you'd try and make the bot model better and maybe you'd get a sense from Twitter vibes that people didn't like this and so you'd tweak it. There are the little up and down arrows, but it was actually pretty hard to feed that back into the model with reasoning. It's early, but that flywheel has started to spin and that is really profound for these Frontier Labs. So one reasoning fundamentally changed the industry dynamics of Frontier Labs.
A
Just explain why specifically that is like what is going on?
B
Because if a lot of people are asking a similar question, they're consistently either liking or not liking the answer, then you can kind of use that. Like that has a verifiable reward, that's a good outcome and then you can kind of feed those good answers back into the model. And we're very early at this flywheel spinning.
A
Yeah, got it.
B
Like it's hard to do now, but you can see it beginning to spin. So this is important fact number one for all of those dynamics. Second, I think it's really important that meta, you know, Mark Zuckerberg at the beginning of this year in January said, I'm highly confident I'm going to get the quote wrong, that at some point in 2025 we're going to have the best and most performant AI. I don't know if he's in the top hundred, so he was as wrong as it was possible to be. And I think that is a really important fact because it suggests that what these four companies have done is really hard to do. Because Meta threw a lot of money at it and they failed. Yann Lecun had to leave. They had to have the famous billion dollar for AI researchers. By the way, Microsoft also failed. They did not make such an unequivocal prediction, but they bought inflection AI and there were a lot of comments from them that we anticipate our internal models quickly getting better and we're going to run more and more of our AI on our internal models. Nope. Amazon, they bought a company called Adept AI. They have their models called Nova. I don't think they're in the top 20. So clearly it's much harder to do than people thought a year ago. And there's many, many reasons for that. Like, it's actually really hard to keep a big cluster of GPUs coherent. A lot of these companies were used to running their infrastructure to optimize for cost instead of performance.
A
Complexity and performance.
B
Yeah, complexity. And keeping the GPUs running at high utilization rate in a big cluster, it's actually really hard. And there are wild variations in how well companies run GPUs. If the most anybody, because the laws of physics, you know, maybe you can get 2 or 300,000 Blackwells coherent, we'll see. But if you have 30% uptime on that cluster and you're competing with somebody who has 90% uptime, you're not even competing. So, one, there's a huge spectrum in how well people run GPUs. Two, then I think there is, you know, these AI researchers, they like to talk about taste. I find it very funny, you know, why do you make so much money? I have very good taste. You know, what taste means is you have a good intuitive sense for the experiments to perform. This is why you pay people a lot of money. Because it actually turns out that as these models get bigger, you can no longer run an experiment on a thousand GPU cluster and replicate it on a hundred thousand GPUs. You need to run that experiment on fifty thousand GPUs. And maybe it takes, you know, days. And so there's a very high opportunity cost. You have to have a really good team that can make the right decisions about which experiments to run on this. And then you need to do all the reinforcement learning during post training. Well, and the test time, compute. Well, it's really hard to do and everybody thinks it's easy. But all those things, you know, I used to have this saying, like I was a retail analyst long ago. Pick any vertical in America. If you can just run a thousand stores in 50 states and have them clean, well lit, stocked with relevant goods at good prices, and staffed by friendly employees who are not stealing from you, you're going to be a $20 billion company, a $30 billion company. Like 15 companies have been able to do that. It's really hard. And it's the same thing. Doing all of these things well is really hard. And then reasoning with this flywheel, this is beginning to create barriers to entry. And what's even more important, every one of those labs, Xai, Gemini, OpenAI and Anthropic they have a more advanced checkpoint internally of the model. Checkpoint is just you're kind of continuously working on these models and then you release kind of a checkpoint. And then the reason these models get fed.
A
Yeah, the one they're using internally is further.
B
And they're using that model to train the next model. And if you do not have that latest checkpoint, it's getting really hard to catch up. Chinese open source is a gift from God to meta because you can use Chinese open source, that can be your checkpoint, and you can use that as a way to kind of bootstrap this. And that's what I'm sure they're trying to do. And everybody else, the big problem and the big. A giant swing factor. I think China's made a terrible mistake with this rare earth thing. So China, because, you know, they have Huawei Ascent and it's a decent chip versus something like the deprecated hot preserving something. It looks okay. And so they're trying to force Chinese open source to use their Chinese chips, their domestically designed chips. The problem is Blackwell is going to come out now and the gap between these American Frontier Labs and Chinese open source is going to blow out because of Blackwell. And actually Deep Seq, in their most recent technical paper, v3.2, said, One of the reasons we struggle to compete with the American Frontier Labs is we don't have enough compute. That was their very politically correct, still a little bit risky way of saying, because China said, we don't want the Blackwells, and they're saying, won't you please.
A
Give us the black wells?
B
That might be a big mistake. So if you just kind of play this out, these four American labs are going to start to widen their gap versus Chinese open source, which then makes it harder for anyone else to catch up because that gap is growing. So you can't use Chinese open source to bootstrap. And then geopolitically, China thought they had the leverage. They're going to realize, oh, whoopsie daisy, we do need the Blackwells. And unfortunately, they'll probably. For them, they'll probably realize that in late 26. And at that point there's an enormous effort underway. DARPA has, there's all sorts of really cool DARPA and DoD programs to incentivize really clever technological solutions for rare earths. And then there's a lot of rare earth deposits in countries that are very friendly to America, that don't mind actually refining it in the traditional way. So I think rare earths are going to be solved way faster than anyone thinks. You know, they're obviously not that rare. They're just misnamed. They're rare because they're really messy to refine. And so geopolitically I actually think Blackwell is pretty significant and it's going to give America a lot of leverage as this gap widens. And then in the context of all of that going back to the dynamics between these companies, XAI will be out with the first Blackwell model and then they'll be the first ones probably using Blackwell for inference at scale. And I think that's an important moment for them. And by the way, it is funny, like if you go on open router, you can just look, they have dominant share. Now, Open router is whatever it is, it's 1% of API tokens, but it's an indication they process 1.35 trillion tokens. Google did like 8 or 900 billion. This is like whatever it is last seven days or last month anthropic was at 700 billion. Like XAI is doing really, really well and the model is fantastic. I highly recommend it. But you'll see XAI come out with this. OpenAI will come out faster. OpenAI's issue that they're trying to solve with Stargate is because they pay a margin to people for compute and maybe the people who run their compute are not the best at running GPUs. They're a high cost producer of tokens. And I think this kind of explains a lot of their Code Red recently. Yeah, well, the $1.4 trillion in spending commitments and I think that was just like, hey, they know they're going to need to raise a lot of money, particularly if Google keeps its current strategy of sucking the economic oxygen out of the room. And you go from 1.4 trillion rough vibes Code Red pretty fast. And the reason they have Code Red is because of all these dynamics. So then they'll come out with a model, but they will not have fixed their per token cost disadvantage yet relative to both XAI and Google and Anthropic at that point. Anthropic is a good company. You know, they're burning dramatically less cash than OpenAI and growing faster. So I think you have to give Anthropic a lot of credit and a lot of that is their relationship with Google and Amazon for the TPUs and the trainiums. And so Anthropic has been able to benefit from the same dynamics that Google has. I think it's very indicative in this great game of chess. You can look at Dario and Jensen, maybe have Taken a few public comments, you know, that were between them jousting. A little bit of jousting. Well, Anthropic just signed the $5 billion deal with Nvidia. That is because Dario is a smart man and he understands these dynamics about Blackwell and Rubid relative to TPU. So Nvidia now goes from having two fighters, Xai and OpenAI, to three fighters. That helps in this Nvidia versus Google battle. And then if Meta can catch up, that's really important. I am sure Nvidia is doing whatever they can to help Meta. You're running those GPUs this way. Maybe we should twist the screw this way or turn the dial that way. And then it will be also, if Blackwell comes back to China, which it seems like it'll probably happen, that will also be very good because then Chinese open source will be back.
A
I'm always so curious about the polls of things. One poll would be the other breakthroughs that you have your mind on. Things in the data center that aren't chips that we've talked about before. As one example.
B
I think the most important thing that's going to happen in the world, in this world, in the next three to four years is data centers in space. And this has really profound implications for everyone. Building a power plant or a data center on planet Earth. Okay. And there is a giant gold rush into this.
A
I haven't heard anything about this, so please.
B
Yeah, you know, it's like everybody thinks like, hey, AI is risky, but you know what? I'm going to build a data center. I'm going to build a power plant that's going to do a data center. We will need that. But if you think about it from first principles, data centers should be in space. What are the fundamental inputs to running a data center? There are power and they're cooling, and then there are the chips.
A
If you think about it from a total cost perspective.
B
Yeah. And just the inputs to making the tokens come out of the magic machines in space. You can keep a satellite in the sun 24 hours a day, and the sun is 30% more intense. You can have the satellite always kind of catching light. The sun is 30% more intense, and this results in six times more irradiance in outer space than on planet Earth. So you're getting a lot of solar energy. Point number two, because you're in the sun 24 hours a day, you don't need a battery. This is a giant percentage of the cost. So the lowest cost energy available in our solar system is solar Energy and space. Second, for cooling in one of these racks, a majority of the mass and the weight is cooling. The cooling in these data centers is incredibly complicated. The H vac, the CDU's, the liquid.
A
Very cool to see.
B
Yeah, it's amazing to see in space. Cooling is free. You just put a radiator on the dark side of the satellite, it's fucking gold. And it's as close to absolute zero as you can get. So all that goes away. And that is a vast amount of cost. Okay, let's think about how these. Maybe each satellite is kind of a rock. So one way to think of it, maybe some people make bigger satellites that are three racks. Well, how are you going to connect those racks? Well, it's funny. In the data center, the racks are over a certain distance, connected with fiber optics. And that just means a laser going through a cable. The only thing faster than a laser going through a fiber optic cable is a laser going through absolute vacuum. So if you can link these satellites in space together using lasers, you actually have a faster and more coherent network than in any data center on Earth. Okay, for training, that's gonna take a long time.
A
Because it's so big. Yeah.
B
Just cause it's so big. Training will eventually happen. But then for inference, let's think about the user experience. When I asked Grok about you and it gave the nice answer, here's what happened. A radio wave traveled from my cell phone to a cell tower. Then it hit the base station, went into a fiber optic cable, went to some sort of metro aggregation facility in New York, probably within like, you know, 10 blocks of here. There's a small little Metro router that's routed those packets to a big XAI data center somewhere. The computation was done and it came back over the same path. If the satellites can communicate directly with the phone and Starlink has demonstrated direct to cell capability, you just go boom, boom. It's a much better, lower cost user experience. So in every way, data centers in space, from a first principles perspective, are superior to data centers on Earth.
A
If we could teleport that into existence, I understand that portion. Why will that not happen? Is it launch cost? Is it launch availability? Is it capacity?
B
We need a lot of the starships. Like the starships are the only ones that can economically make that happen. We need a lot of those starships. Maybe China or Russia will be able to land a rocket. Blue Origin just landed a booster. This is a big idea. And I do think it's an entirely new and different way to think about SpaceX. And it is interesting that Elon Postjourner said in an interview that Tesla, SpaceX and Xai all kind of converging, were converging and they really are. So XAI will be the intelligence module for Optimus made by Tesla with Tesla Vision, has its perception system and then SpaceX will have the data centers in space that will power a lot of the AI, presumably for XAI and Tesla and the Octopuses and a lot of other companies. And it's just interesting the way that they're converging and each one is kind of creating competitive advantage for the other. If you're xai, it's really nice. Then you have this built in relationship with Optimus. Tesla's a public company, so there's going to be like, I cannot imagine the level of vetting that will go into that intercompany agreement. And then you have a big advantage with these data centers in space. And then it's also nice if you're xai that you have two companies with a lot of customers who you can use to help build your customer support agents, your customer sales agents with kind of built in customers. So they really are all kind of converging in a neat way. And I do think it's going to be a big moment when that first Blackwell model comes out from XAI next year.
A
If I go to the other end of the spectrum and I think about something that seems to have been historically endemic to the human economic experience, that shortages are always followed by gluts in capital cycles. What if in this case the shortages compute? Like Mark Chen now is on the record as saying they would consume 10x as much compute if you gave it to them in like a couple weeks. So like there seems to still be a massive shortage of compute, which is all the stuff we've talked about today. But there also just seems to be this like iron law of history that gluts follow shortages. What do you think about that concept as it relates to this?
B
There will eventually be a glut. AI is fundamentally different than the software, just in that every time you use AI takes compute in a way that traditional software just did not. I mean, it is true. Like I think every one of these companies could consume 10x more compute. Like what would happen is just the $200 tier would get a lot better. The free tier would get like the $200 tier. Google has started to monetize AI mode with ads and I think that will give everyone else permission to introduce ads into the free mode. And then that is Going to be an important source of ROI.
A
OpenAI is tailor made to.
B
Yeah, absolutely, all of them. And actions like, you know, hey, here are your three vacations. Would you like me to book one? And then they're for sure going to collect a commission. Yeah, there's many ways you can make money. I think we went into great detail on maybe a prior podcast about how just inventory dynamics made these inventory cycles inevitable in semis. The iron law of semis is just that customer buffer inventories have to equal lead times and that's why you got these inventory cycles. Historically, we haven't seen a true capacity cycle in semis arguably since the late 90s. And that's because Taiwan semi has been so good at aggregating and smoothing supply. And a big problem in the world right now is that Taiwan semi is not expanding capacity as fast as their customers want. They're in the process of making a mistake just because you do have intel and with these fabs and they're not as good and it's really hard to work with their pdk. But now you have this guy Leep Bu, who's a really good executive and really understands that business. I mean, by the way, Patrick Gelsinger I think was also a good executive and he put intel on the only strategy that could result in the success. And I actually think it's shameful that the intel board fired him when they did. But Leap Bu's a good executive and now he's reaping the benefits of Patrick's strategy. And intel has all these empty fabs and eventually given the shortages we have of compute, those fabs are going to be filled. So I think Tawan Simi's in the process of making a mistake. But they're just so paranoid about an overbuild and they're so skeptical. They're the guys who met with Sam Altman and laughed and said he's a podcast bro, he has no idea what he's talking about. You know, they're terrified of an overbuild. So it may be that Taiwan semi single handedly that their caution is the governor. I think governors are good. It's good that power's a governor. It's good that Taiwan Simi is a governor. If Taiwan Simi opens up at the same time when data centers in space relieve all power constraints. But that's like, I don't know, five, six years away that data centers in space are a majority of deployed megawatts. Yeah, I think you get it overbuilt really fast. But just we have these Two really powerful natural governors and I think that's good. Smoother and longer is good.
A
We haven't talked about the power other than alluding to it through the space thing. Haven't talked about power very much. Power was like the most uninteresting topic. Nothing really changed for like a really, really long time. All of a sudden we're trying to figure out how to get like gigawatts here, there and everywhere. How do you think about. Are you interested in powers?
B
I'm very interested. I do feel lucky. In a prior life I was the sector leader for the telecom and utilities team. So I do have some base level of knowledge. Having watts as a constraint is really good for the most advanced compute players. Because if Watts are the constraint, the price you pay for compute is irrelevant. The TCO of your compute is absolutely irrelevant. Because if you could get 3x or 4x or 5x more tokens per watt, that is literally 3 or 4x or 5x more revenue. If you're going to build an advanced data center costs 50 billion. A data center with the ASIC maybe costs 35 billion. If that $50 billion data center pumps out 25 billion of revenue and your ASIC data center at 35 billion is only pumping out 8 billion, well, like you're pretty bummed. So I do think it's good for all of the most advanced technologies in the data center, which is exciting to me as an investor. So as long as power is a governor, the best products are going to win irrespective of price and have crazy pricing power. That's the first implication that's really important to me. Second, it is in the only solutions to this. We just can't build nuclear fast enough in America, as much as we would love to build nuclear quickly, we just can't. It's just too hard. Nepa, all these rules, like, it's just, it's too hard. Like a rare ant that we could move and it could be in a better environment can totally delay the construction of a nuclear power plant. You know, one ant, you know, that is America. Yeah, it's crazy. Actually, humans need to come first. We need to have a human centric view of the world. But the solutions are natural gas and solar. And the great thing is, the great thing about these AI data centers is apart from the ones that you're going to do inference on, you can locate them anywhere. So I think you are going to see, and this is why you're seeing all this activity in Abilene, because it's in the Middle of a big natural gas basin. And we have a lot of natural gas in America because of fracking. We're going to have a lot of natural gas for a long time. We could ramp production really fast. So I think this is going to be solved. You know, you're going to have power plants fed by gas or solar, and I think that's the solution. And, you know, you're already. All these turbine manufacturers were reluctant to expand capacity, but Caterpillar just said, we're going to increase capacity by 75% over the next few years. So, like, this system on the power side is beginning to respond.
A
One of the reasons that I always so love talking to you is that you do as much in the top 10 companies in the world as you do looking at brand new companies with entrepreneurs that are 25 years old trying to do something amazing. And so you have this very broad sense of what's going on. If I think about that second category of young, enterprising technologists who now are like the first generation of AI native entrepreneurs, what are you seeing in that group that's notable or surprising or interesting?
B
These young CEOs, they're just so impressive in all ways, and they get more polished faster. And I think the reason is, is they're talking to the AI. How should I deal with pitching this investor I'm meeting with Patrick o'. Shaughnessy? What do you think the best ways I should pitch him are?
A
Yeah, and it works.
B
Do a deep research thing. And it's good. You know, hey, I have this difficult HR situation. How would you handle it? And it's good at that. We're struggling to sell our product. What changes would you make? And it's really good at all of that today. And so. And that goes to. These VCs are seeing massive AI productivity in all their companies. It's because their companies are full of these 23, 24 or even younger AI natives. And they're impressive. I've been so impressed with, like, young investment talent. And it's just part of it. Like your podcast is part of that. There's just very specific knowledge has became so accessible through podcasts and the Internet. Like, kids come in and they're just. I feel like they're where I was as an investor, like in my, you know, early 30s. And they're 22, and I'm like, oh, my God.
A
Yeah.
B
Like, I have to run so fast to keep up. These kids who are growing up native in AI, they are just proficient with it in a way that I am trying really hard to become.
A
Can we talk about semis VC specifically? And like what is interesting in that universe?
B
One thing I just think is so cool about it and so underappreciated is your average semiconductor venture founder is like 50 years old. Okay. And Jensen, and what's happened with Nvidia and the market cap of Nvidia has like single handedly ignited, ignited semiconductor venture. But the way it's ignited is ignited in an awesome way that's like really good for actually Nvidia and Google and everyone is like, let's just say you were the best DSP architect in the world. You had made for the last 20 years. Every two years, because that's what you have to do. Semiconductors, it's like every two years you have to run a race and if you won the last race, you start a foot ahead. Over time those compound and make each race easier to win. But maybe that person and his team, maybe he's the head of networking at a big public company and he's making a lot of money and he has a good life and he's 50 years old. And then because he sees these outcomes and the size of the markets in the data century, he's like, wow, why don't I just go start my own company? But the reason that's important is that I forget the number. But I mean there are thousands of parts in a Blackwell rack and there's thousands of parts in a TPU rack and in the Blackwell rack. Maybe Nvidia makes two or three hundred of those parts, same thing in an AMD rack. And they need all of those other parts to accelerate with them. So they couldn't go to this one year cadence if all that stuff didn't keep. Everything was not keeping up with them. So I think it's the fact that semiconductor venture has come back with a vengeance. You know, Silicon Valley stopped being Silicon Valley long ago. My little firm maybe has done more semiconductor deals in the last seven years than the top 10 VCs combined. You know, but that's really, really important because now you have an ecosystem of companies who can keep up. And then that ecosystem of these venture companies is putting pressure on the public companies that are also need to part of this if we're going to go to this annual cadence, which is just so hard. And it's one reason I'm really skeptical of these asics that don't already have some degree of success. So I do think that's a super, super important dynamic and one that's absolutely foundational and necessary for all of this to happen because not even Nvidia can do it alone. AMD can't do it alone. Google can't do it alone. You need the people who make the transceivers, you need the people who make the wires, who make the backplanes, who make the lasers. They all have to accelerate with you. And one thing that I think is very cool about AI as an investor is it's just, it's the first time where every level of the stack that I look at, at least the most important competitors are public and private. Nvidia, they're very important private competitors. Broadcom, important private competitors. Marvell, important private competitors. You know, Lumintum, Coherent, all these companies. There's even like a wave of innovation in memory, which is really exciting to see because memory is such a gating factor. By the way, something that could slow all this down and be a natural governor is if we get our first true DRAM cycle since the late 90s.
A
So see more what that means if.
B
The price of dram, if like a DRAM wafer is valued at like a five carat diamond. In the 90s, when you had these true capacity cycles before Taiwan semi kind of smoothed everything out and DRAM became more of an oligopoly, you would have these crazy shortages where the price would just go 10x unimaginable relative to the last 25 years, where like a giant DRAM cycle, a good DRAM cycle, as the price stops going down, an epic cycle is maybe it goes up, you know, whatever it is, 30, 40, 50%. But I mean, if it starts to go up by Xs instead of percentages, that's a whole different game. By the way, we should talk about SaaS.
A
Yeah, let's talk about it. What do you think's gonna happen?
B
Well, I think that application SaaS companies are making the exact same mistake that brick and mortar retailers did with E Commerce. So brick and mortar retailers, particularly after the telecom bubble crashed, you know, they looked at Amazon and they said, oh, it's losing money. E Commerce is going to be a low margin business. How can it ever be more efficient as a business? Right now our customers pay to transport themselves to the store and then they pay to transport the goods home. How can it ever be more efficient if we're sending shipments out to individual customers? Amazon's vision of course will eventually we're just going to go down a street and drop off a package at every house. And so they did not invest in E Commerce. They clearly saw customer demand for it, but they did not like the margin structure of E Commerce. That is the fundamental reason that essentially every brick and mortar retailer was really slow to invest in E Commerce. And now here we are. And you know, Amazon has higher margins in their North American retail business than a lot of retailers that are mass market retailers. So margins can change. And if there's a fundamental transformative kind of new technology that customers are demanding, it's always a mistake not to embrace it. And that's exactly what the SaaS companies are doing. They have their 70, 80, 90% gross margins and they are reluctant to accept AI gross margins. The very nature of AI is software. You write it once and it's written very efficiently and then you can distribute it broadly at very low cost. And that's why it was a great business. AI is the exact opposite where you have to recompute the answer every time. And So a good AI company might have gross margins of 40%. The crazy thing is because of those efficiency gains, they're generating cash way earlier than SaaS companies did historically. But they're generating cash earlier not because they have high gross margins, but because they have very few human employees. And it's just tragic to watch all of these companies like you want to have an agent. It's never going to succeed if you're not willing to run it at a sub 35% gross margin because that's what the AI natives are running it at. Maybe they're running it at 40. So if you are trying to preserve an 80% gross margin structure, you are guaranteeing that you will not succeed in AI. Absolute guarantee. And this is so crazy to me because one, we have an existence proof for software investors being willing to tolerate gross margin pressure as long as gross profit dollars are okay. It's called the cloud. People don't remember, but when Adobe converted from on premise to a SaaS model, not only did their margins implode, their actually revenues imploded too. Because you went from charging up front to charging over a period of years. Microsoft, it was less dramatic. But you know, Microsoft was a tough stock in the early days of the cloud transition because investors were like, oh my God, you're an 80% gross margin business and the cloud is the 50s. And they're like, well it's going to be gross profit dollar accretive. It probably will improve those margins over time. Microsoft, they bought GitHub and they use GitHub as a distribution channel for copilot, for coding. That's become a giant business. A giant business now for sure. It runs at Much lower gross margins. But there are so many SaaS companies, like I can't think of a single application SaaS company that could not be running a successful agent strategy and they have a giant advantage over these AI natives and that they have a cash generative business. And I think there is room for someone to be a new kind of activist or constructivist and just go to SaaS companies and say stop being so dumb. All you have to do is say, here are my AI revenues and here are my AI gross margins. And you know it's real AI because it's low gross margins. I'm going to show you that. And here's a venture competitor over here that's losing a lot of money. So maybe I'll actually take my gross margins to zero for a while. But I have this business that the venture funded company doesn't have and this is just such a like obvious playbook that you can run. Salesforce, ServiceNow, HubSpot, GitLab, Atlassian, all of them could run this.
A
And the way that those companies could or should think about the way to use agents is just to ask the question, okay, what are the core functions we do for the customer now? Like how can we further automate that.
B
With agents effectively if you're in CRM? Well, what our customers do, they talk to their customers. We're customer relationship management software and we do some customer support too. So make an agent that can do that and sell that at 10 to 20% and let that agent access all the data you have. Because what's happening right now, another agent made by someone else is accessing your systems to do this job, pulling the data into their system and then you'll eventually be turned off. And it's just crazy. And it's just because, oh wow. But we want to preserve our 80% gross margins. This is a life or death decision and essentially everyone except Microsoft is failing it. To quote that memo from that Nokia guy long ago. Like their platforms are burning.
A
Burning platform.
B
Yeah, yeah, there's a really nice platform right over there and you can just hop to it and then you can put out the fire in your platform that's on fire. And now you got two platforms and it's great.
A
You know, your data centers in space thing makes me wonder if there are other kind of less discussed off the wall things that you're thinking about in the markets in general that we haven't talked about.
B
It does feel like since 2020 kicked off and 2022 punctured this kind of a series of rolling bubbles so in 2020 there is a bubble in like EV startup EVs that were not Tesla and that's for sure a bubble. And they all went down 99% and there was kind of a bubble in more speculative stocks. Then we had the meme stocks, Gamestop and now it feels like the rolling bubble is in nuclear and quantum and these are fusion and smr. It would be a transformative technology. It's amazing. But sadly from my perspective, none of the public ways you can invest in this are really good expressions of this theme, are likely to succeed or have any real fundamental support. And same thing with Quantum. I've been looking at Quantum for 10 years. We have a really good understanding of Quantum and the public Quantum companies, again are not the leaders. From my perspective, the leaders in Quantum would be Google, IBM and then the Honeywell Quantum. So the public ways you can invest in this theme, which probably is exciting, are not the best. So you have two really clear bubbles. I also think quantum supremacy is very misunderstood. People hear it and they think that it means that quantum computers are going to be better than classical computers at everything. With Quantum you can do some calculations that classical computers cannot do. That's it. That's going to be really useful and exciting and awesome. But it doesn't mean that quantum takes over the world. I think the thought that I have had this is maybe less related to markets than just AI. I have just been fascinated that for the last two years, whatever AI needs to keep growing and advancing, it gets. Have you ever seen public opinion change so fast in the United States on any issue? Has nuclear power just happened like that? Like that? And like, why did that happen right when AI needed it to happen? Now we're running up on boundaries of power on Earth. All of a sudden data center's in space. It's just a little strange to me that whenever there is something, a bottleneck that might slow it down, everything accelerates. Rubin is going to be such an easy, seamless transition relative to Blackwell and Rubin's a great chip. And then amd getting into the game with the Mi 450. Whatever AI needs, it gets.
A
You're a deep reader of sci fi, so.
B
Yeah, exactly.
A
You're making me think of Kevin Kelly's great book what Technology Wants. He calls it the technium. Like the, like the overall mass of technology that just like is supplied by humans.
B
Absolutely. More powerful. Yes. It just grow more and more powerful. And now we're going into an end state.
A
You founded at Tradies in the pre chatgpt and Pre covered era, the world has changed dramatically. How has Atreides changed the most in that same period?
B
Before Atreides, I had never had a big position where it wasn't my position. I was doing the fundamental work. I was the analyst. I was really a one man show. It was such a crazy feeling the first time I made money on an idea that was not my own. And that was CrowdStrike in 2020. And that had never happened to me, ever. And I'd run funds for 15 years at that point. I think that was a big evolution for me. I've been thinking a lot about actually the NFL in the context of investing. I think it is so interesting. Sam Darnold, Baker Mayfield, Daniel Jones, left for dead, utter failures, embarrassments of draft picks, okay? And now they're some of the best quarterbacks in the league. It just turned out they needed a different system and a different coaching mindset. I have really been trying to work on how I can make sure that if there is someone who is clearly talented and working really hard, how do I make sure that if I have that person, they succeed? And I just think investing is all about finding the right balance between the courage of your convictions and the flexibility to admit when you're wrong. To quote Michael Steinhardt, it's all about finding the right balance between the arrogance to believe that you have a variant view that's going to be right in a game that is being played of tens of millions of people worldwide with the humility to recognize that at any moment there might be a new data point that is outside of your expected probability space that invalidates that variant view and to be really, really open to that. And I think a lot of that comes down to finding a investment philosophy and process that fits your own emotional makeup so you can be rational when you're wrong and you could strike that balance between conviction and flexibility. What every investment firm has to find is the right balance between incentivizing risk taking and accountability for mistakes. If you don't have enough accountability, people take way too much risk. On the other hand, if you have too much accountability, people don't take risk. What I have tried to do and really, really institute is make it really safe for people to change their mind. I want to make it safe for people to say, here's some new data points that have invalidated my investment hypothesis. I'm the child of two attorneys. I've never taken an argument personally. I love to argue, okay? It's like, I love it like, it's just like bloodsport. And I can just do it all day. But I do believe at some level, kind of investing is the search for truth. And if you find truth first and you're right about it being a truth, that's how you generate alpha. And it has to be a truth that other people have not yet seen. You're searching for hidden truths. I think that the best way to arrive at truth is through discourse. And so I try to really incentivize people to tell me that I'm wrong, celebrate it when people tell me I'm wrong. Having what we call bull bear lunches, where the aside analyst presents their case and then the skeptic presents their case. And then there's a very robust discussion that I generally try not to participate in. I would say decision takes an immense amount of work, but decision quality is high after that. I want people to feel safe taking risk. I want people to feel safe changing their mind. And then I want people to feel extremely safe telling me that I'm wrong and telling each other that they're wrong. Because the more we have that, like, definitionally, if you were not wrong, you're not learning. If I'm not wrong about three things in a day, I didn't learn anything. So I want to be told I'm wrong as much as possible. And so I try to incentivize all of that.
A
I have a selfish closing question, speaking of young people. So my kids, who are 12 and 10, but especially my son who's older, is developing an interest in what I do, which I think is quite natural. And I'm going to try to start asking my friends who are the most passionate about entrepreneurship and investing, why they are so passionate about it and what about it is so interesting and life giving to them. How would you pitch what you've done, the career you built, this part of the world, to a young person that's interested in this.
B
The earliest thing I can remember is being interested in history. Looking at books with pictures of the Phoenicians and the Egyptians and the Greeks and the Romans. I loved history. I vividly remember, I think in the second grade as my dad drove me to school every day. We went through the whole history of World War II in one year, and I loved that. And then that translated into a real interest in current events very early. So as a pretty young person, I don't know if it was eighth grade or seventh grade or ninth grade, I was reading the New York Times and the Washington Post, and I would get so excited when the mail came because it meant that Maybe there was an economist or a Newsweek or a Time or a US News. And I was really into current events because current events is kind of like applied history and watching history happen, thinking about what might happen next. I didn't know anything about investing. My parents were both attorneys. Anytime I won an argument, I was super rewarded. Like, you know, if I could make a reasonable argument why I should stay up late. Yeah, my parents would be so proud and they'd let me stay up late, but I had to beat them. You know, that was the way I grew up. I was just kind of going through life and, you know, I really love to ski and I love rock climbing. And I go to college and rock climbing is by far the most important thing in my life. I dedicated myself. I didn't know that completely. I climbed. I did all my homework at the gym. I got to the rock climbing gym at 7am Would skip a lot of classes to stay in the gym. I'd do my homework on like a big bouldering mat. Every weekend I went and climbed somewhere with the Dartmouth Mountaineering Club. It was super important to me. I'm not a good athlete, so I was never a very good climber. But I did dedicate myself entirely to rock climbing. And as part of that, like on climbing trips, the movie Rounders came out while I was in college. So we started playing poker. I liked to play chess. I mean, I was never that good at chess or poker, you know, never really dedicated myself to either. And my plan after two or three years of college was I was going to leave. I was a ski bum at Alta in college. I was a housekeeper. I've cleaned a lot of toilets. It was shocking to me how people treated me. And it is permanently impacted how I treat other people. You'd be cleaning somebody's room and they'd be in it and they'd be reading the same book as you. And you'd say, oh, wow, that's a great book. I'm about where you are. And a. They look at you like you're a space alien. Like you speaks. And then they get even more shocked you read, you know. So it like had a big impact on how I've, like, just treated everyone since then. Like, being nice is free. But anyways, I was going to be a ski bum in the winters. I was going to work on a river in the summers, and that was how I was going to support myself. And then I was going to climb in the shoulder seasons. I was going to try and be a wildlife photographer and write the next Great American.
A
I can't believe I never knew this.
B
That was my plan. This was, like, my plan of record. I was really lucky. My parents were very supportive of everything I wanted to do. My parents had very strict parents. So, of course, they're extremely permissive with me. So, you know, I'll probably end up being a strict parent. You know, just the cycle continues. And my parents were lawyers. You know, they'd done reasonably well. They both grew up in, I'd say, very economically disadvantaged circumstances. Like my dad talks about. Like, he remembers every person who bought him a beer. He couldn't afford a beer. He worked the whole way through college. He was there on a scholarship. He had one pair of shoes all through high school. And so they were super on board with this plan. And I'd been very lucky. They sent me to college, and I didn't have to pay for college. So they said, you know, Gavin, we think this plan of being a ski bum, river rafting guide, wildlife photographer, climbing the shoulder seasons, trying to write a novel, we think it sounds like a great plan. But, you know, we've never asked you for anything. We haven't encouraged you to study anything. We've supported you in everything you've wanted to do. Will you please get one professional internship? Just one, and we don't care what it is. The only internship I could get this was at the end of my sophomore summer at Dartmouth. It was an internship with Donaldson, Lufkin and Jeannette. Dlj. My job was to. Every time DLJ published a research report, it was in, like, the private wealth management division. And I worked for the guy who ran the office. And my job was, whenever they produced a piece of research, I would go through and look at which of his clients owned that stock. Then I would mail it to the clients. So this day we wrote on General Electric. So I need to mail the GE report to these 30 people, mail the Cisco report to these 20 people. And then I started, like, reading the reports, and I was like, oh, my God, this is, like, the most interesting thing imaginable. So investing, I kind of conceptualized it. It's a game of skill and chance, kind of like poker. And, you know, there's obviously chance in investing. Like, if you're an investor in a company and a meteor hits their headquarters, that's bad luck. But you own that outcome. So there is chance that is irreducible, but there's skill, too. So that really appealed to me. And the way you got an edge in this greatest game of skill and chance imaginable was you had the most thorough knowledge possible of history, and you intersected that with the most accurate understanding of current events in the world to form a differential opinion on what was going to happen next in this game of skill and chance. Which stock is mispriced in the Perry Mutual system, that is the stock market. And that was like day three. I went to the bookstore and I bought the books that they had, which were Peter Lynch's books. I read those books in two days. I'm a very fast reader. Then I read all these books about Warren Buffett. Then I read market wizards. Then I read Warren Buffett's letters to his shareholders. This is like during my internship. Then I read Warren Buffett's letters to his shareholders again. Then I taught myself accounting. There's this great book, why Stocks go Up and Down. Then I went back to school. I changed my majors from English and history to history and economics. And I never looked back. And it consumed. I continued to really focus on climbing, but instead of, I would be in the gym and I would print out everything that the people on the Motley fool wrote. They had these fools, they were early to talking about return on invested capital. Incremental ROIC is like a really important indicator. And I would just read it and I would underline it. And I'd read books and then I'd read the Wall Street Journal. And then eventually there was a computer terminal finally set up near the gym. And I'd go to that gym and just read news about stocks. And it was the most important thing in my life. And like, I barely kept my grades up. And, yeah, that's how I got into it. History, current events, skill and chance. And I am a competitive person and I've actually never been good at anything else. Okay. I got picked last for every sports team. I love to ski. I've literally spent a small fortune of private skiing lessons. I'm not that good of a skier. I like to play ping pong. All my friends can beat me. I tried to get really good at chess, and this was before you actually had to play the games. And my goal was to beat one of the people. I'm sure there's a park somewhere. It's literally right there.
A
The most famous one is right there.
B
Okay, well, there's one in Cambridge and I wanted to beat one of them. Never beat one of them. I've never been good at anything. I thought I would be good at this. And the idea of being good at something other than taking a test that was competitive was very appealing to me. And so I think that's been a really important thing too. And to this day, this is the only thing I'm good at. I'd love to be good at something else. I'm just not.
A
I think I'm going to start asking this question of everybody. The ongoing education of Pearson Maeve. Amazing place to close. I love talking about everything.
B
Thank you so much. This is great man. Thank you, thank you, thank you.
A
If you enjoyed this episode, visit joincolas.com where you'll find every episode of this podcast, complete with hand edited transcripts. You can also subscribe to Colossus Review, our quarterly print, digital and private audio publication featuring in depth profiles of the founders, investors and companies that we admire most. Learn more@joincolasis.com subscribe.
Guest: Gavin Baker
Title: Nvidia v. Google, Scaling Laws, and the Economics of AI
Date: December 9, 2025
Host: Patrick O’Shaughnessy
In this episode, Patrick O’Shaughnessy sits down with Gavin Baker—founder of Atreides Management and renowned technology investor—for a characteristically candid and wide-ranging conversation about the accelerating developments in artificial intelligence (AI), semiconductors, and market dynamics. The discussion ranges from the technical arms race between Nvidia and Google, the importance of scaling laws in AI, transformative new business models, and the implications for global economics and geopolitics. Gavin also shares personal stories and insights into cultivating investment talent, closing with a candid account of his own career journey.
New Feedback Loop:
Widening the Gap:
China’s Position:
Edge AI Risk:
On S-Curves and Frontier Model Utility:
Productivity Gains:
SMB and Startup Impact:
On Scaling Laws and the State of AI
On Data Centers in Space
On the Economics of AI
On SaaS and AI Margins
On Investing and the Search for Truth
On Raising Entrepreneurship Talent
| Segment Topic | Start Time | |-------------------------------------------------------|------------| | How Gavin processes new AI developments | 04:58 | | Scaling laws: meaning and importance | 08:04 | | Nvidia Blackwell vs. Google’s TPU | 11:35 | | Temporary advantages and cost of tokens | 13:22 | | Frontier models, checkpoints, and data cycles | 42:49 | | Edge AI bear case and model usefulness | 29:22 | | Power, supply gluts, & natural regulatory governors | 60:13 | | Data centers in space—first principles and economics | 51:49 | | AI productivity and startup/enterprise adoption | 35:03 | | VC, semiconductor renaissance, AI-native founders | 63:18 | | SaaS crossroads—agent strategy & margin contraction | 68:28 | | Niche “bubbles” (nuclear, quantum), market lessons | 74:03 | | Investing philosophy, behavioral culture at Atreides | 77:01 | | Gavin’s origin story in investing | 81:22 |
Gavin reveals the unusual path that led him to investing—via a deep interest in history, a stint as a ski bum, and parental urging to get “just one” professional internship. A serendipitous role reading stock research for a wealth management firm exposed him to the game-like nature of markets, fusing his love for history, competition, and argument.
“The way you got an edge in this greatest game of skill and chance imaginable was you had the most thorough knowledge possible of history, and you intersected that with the most accurate understanding of current events in the world to form a differential opinion on what was going to happen next.” (84:12)
Baker’s enthusiasm for technology, investing, and discovery animates the conversation. The tone is analytical yet irreverent, especially when dissecting technical nuances or calling out strategic blunders in the industry. Baker is equally candid about his own limitations and passions. For investors, entrepreneurs, or anyone interested in technology’s trajectory, this episode delivers a dense but accessible primer on how the hardware, economics, and culture of AI is reshaping markets and careers on a global scale.
For deeper dives and episode archives:
Colossus | JoinColossus.com