![Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442] — Invest Like the Best with Patrick O'Shaughnessy cover](https://megaphone.imgix.net/podcasts/799253cc-9de9-11f0-8661-ab7b8e3cb4c1/image/d41d3a6f422989dc957ef10da7ad4551.jpg?ixlib=rails-4.3.1&max-w=3000&max-h=3000&fit=crop&auto=format,compress)
Loading summary
A
The best operators have a relentless focus on leverage, finding ways to multiply their impact rather than just working harder. But here's what I see happening in finance teams everywhere. Brilliant people getting buried in expense management. Busy work. If you think about it, you become a finance leader because you love strategic work. Modeling scenarios, optimizing capital allocation, finding the insights that actually move the business forward. But instead you're chasing receipts and categorizing transactions. It's the opposite of leverage. This is exactly why I'm so bullish on what the team at Ramp has built. Kareem and Eric understood that every minute spent on manual expense management is a minute stolen from high leverage work. So they automated all of it. Automatic categorization, receipt matching, spending controls that actually work. I love the network effect that this creates. When finance teams at companies like Shopify and Stripe automate the mundane stuff, they free up cycles to think bigger, to ask bigger questions, spot patterns others miss, and make the kind of strategic bets that separate great companies from good ones. The math is simple. Get your time back, focus on what matters. Check out ramp.com invest and see what happens when you eliminate the busy work cards issued by Sutton bank member fdic. Terms and conditions apply. Longtime listeners of this show will know that AlphaSense is the market intelligence platform I've admired for years. It gives institutional investors access to over 500 million premium sources, from company filings and broker research to news trade journals and more. Plus over 200,000 expert calls covering the world's most important companies and industries. All of it in one platform so investment teams can move faster, go deeper and make high conviction decisions with confidence. I'm excited to join AlphaSense at their inaugural Alpha Summit 2025 this October in Brooklyn. I'll be on stage alongside leaders from ubs, Wells Fargo, Accenture, Google Stripes Group, the Carlyle Group and more to talk about how AI is reshaping investment research and decision making. Alpha Summit is about showing the real workflows and strategies that top firms are using today. The event features an incredible lineup of industry leading keynote speakers. Over three days you'll hear from these industry leaders, connect with peers across finance and corporate strategy and be part of the conversations you won't find elsewhere. Join me At Alpha Summit 2025 October 6th through 8th at the Refinery at Domino. To register and to see a complete list of speakers and the full agenda, go to AlphaSense.com invest in asset management Growth often depends on customization. It's the nature of the beast in our industry and I know having experienced the problem firsthand. As an active manager, it's a competitive differentiator to tail products and services to clients preferences. Those of us growing our businesses always want to say yes to customers. It means delivering a tailored portfolio, a tailored report, or a tailored expectation for service. Saying yes leads to growth and it also leads to customization and a big trade off. The more you grow, the more complexity you absorb. The more you say yes, the harder it is to scale efficiently and consistently. That's where Ridgeline comes in. Ridgeline automates customization. It gives asset managers the ability to deliver personalized experiences at scale without adding headcount, manual work or operational risk. Risk Having been an early design partner myself, I saw firsthand the power of taking an entirely clean sheet of paper to building the system. We've all been waiting for a front to back platform that combines all of a firm's core functions on a single data set. It's how leading firms stop choosing between growth and efficiency and start saying yes to both. I believe the best firms will be built on Ridgeline as their operating system. I also believe they'll be a leading case study in combining the power of systems of record and AI. If you haven't spent time with him yet, I urge you to see what Ridgeline might unlock for your business. Hello and welcome everyone. I'm Patrick o' Shaughnessy and this is Invest like the Best, this show is an open ended exploration of markets, ideas, stories and strategies that will help you better invest both your time and your money. If you enjoy these conversations and want to go deeper, check out Colossus Review, our quarterly publication with in depth profiles of the people shaping business and investing. You can find Colossus Review along with all of our podcasts@joincolasis.com.
B
Patrick O' Shaughnessy is the CEO of Positive Sum. All opinions expressed by Patrick and podcast guests are solely their own opinions and do not reflect the opinion of Positive Sum. This podcast is for informational purposes only and should not be relied upon as a basis for investment decisions. Clients of Positive Sum may maintain positions in the securities discussed in this podcast. To learn more, visit psc um VC.
A
My guest today is Dylan Patel. Dylan is the Founder and CEO of Semianalysis at Semianlysis. Dylan tracks the semiconductor supply chain and AI infrastructure build out with unmatched granularity, literally watching data centers get built through satellite imagery and mapping hundreds of billions in capital flows. Our conversation explores the massive industrial build out powering AI from the strategic chess game between OpenAI, Nvidia and Oracle among others to why we're still in the first innings of post training and reinforcement learning. Dylan explains infrastructure realities like electrician wages doubling and companies using diesel truck engines for emergency power, while making a sobering case about US China competition and why America needs AI to succeed. We discussed his framework for where value will accrue in the stack, why traditional SaaS economics are breaking down under AI's high cost of goods sold, and which hardware bottlenecks matter the most. This is one of the most comprehensive views of the physical reality underlying the AI revolution that you'll hear anywhere. Please enjoy my conversation with Dylan Patel. I was going to lay out this idea of going through the past, present and future of COMPUTE as like the big, big idea for our conversation, but since it just happened, I don't think I've heard you talk about it anywhere. I'd love to start by asking about this whole OpenAI Nvidia thing. Sounds exciting, seems vague. Not really sure what's going on. Maybe you can explain it to us as you see it and what the strategic implications are of the big announcement.
B
I think it's very, very simple. You've got OpenAI paying Oracle lots of money. You've got Oracle paying Nvidia lots of money. You've got Nvidia paying OpenAI lots of spider man meme. We've got the infinite money glitch here. No, no, no, that's not actually what's happening. If they pay each other, then their market caps all keep going up. What's really happening is OpenAI has an insatiable demand for compute. The compute precedes the buildup of business. You have to have the cluster before you can run models on it for inference. You have to have the cluster to train the model that's good enough that it unlocks new use cases which then can be adopted. And there's an adoption curve there for any new use case. So you have to have all these things like sequenced. Given this is a game of the richest people in the world, or rather the biggest tech giants in the world, right? It's. It's Zuck. It's all the biggest people in the world. It's Elon, right? Google, Larry and Sergey is like constantly in the business now. Again, there's very much a risk of OpenAI being too small to matter, which is crazy to say because they've got 800 million users. But where is the revenue? Where's the compute? They could easily get swamped in terms of how much compute they have if they don't move fast enough and if they don't have the most compute or among the most compute, they will get beaten. The magic of OpenAI was that they just spent way more compute on a single model run on GPT 3 and 4. And they had the foresight and the vision and the execution, but they made that bet and they were able to secure it. And at the time it was like, meh, right, it was a few hundred million dollars, whatever, right? That's a ton of money. But like now it's sort of like Mark Zuckerberg sees how much compute he's going to have to get, even though he has this insane cash flow that he's like, oh wait, I need to go sign a deal with Apollo for $30 billion on this data center in Louisiana, this mega data center I'm going to build. It's like, wait, why don't you just fund this with cash flows? You have so much cash flow. It's like that's just the physical data center. Now what I'm going to put in it is like so much money. It's the amount of capital that people are going to have and are dumping into this is insane. Google was slow to wake up, slow to pivot. Their data center operations are slow to do everything while they could have way more compute than anyone by a humongous degree. And then they have like how much they allocate to search. And generative search is not really necessarily competing with OpenAI. It's the mega models. So if you have this tremendous vision of what's going to happen with AI, you know that it takes a ton of compute to build them, you know, pretty much the amount of compute you could dedicate to these models as limitless. And they will get better. Now it's a log log scale, right? I. E. You need 10x more compute to get to the next tier of performance. You might think of it as diminishing returns, but what if the next tier of performance is like a 6 year old versus a 16 year old, a 6 year old. You can't get to do much. Right? And this is not exactly the way to think of AI, but this is the conundrum that OpenAI is in. They have to race with the giants. These giants are trillion dollar businesses. So how does OpenAI get there? Well, it's partnering with Microsoft. Well, that soured some, right? It's partnering with Oracle. Well, Oracle can do a lot, but Oracle doesn't even have a balance sheet like Google and Microsoft and Amazon and Meta Elon.
A
Sport of kings.
B
Yeah, this is very much like the Pascalian wager nature of all of this with the tech giants, Oracle can be part of it. But OpenAI needs allies. They need people to effectively spend the capex ahead of the curve and trust that they'll be able to pay the rental income, because that's what it is. At the end of the day. OpenAI is committing to 5 year deals. These 5 year deals cost X amount of money. It's 10 to 15 billion dollars per gigawatt of data center capacity that you pay a year. And then that 10 to 15 billion dollars for a gigawatt of data center capacity, you're paying that for five years, okay, that's 50 to $75 billion of cash that goes out the door to OpenAI for 1 gigawatt of capacity. And you talk about what Sam saying is like, hey, I need 10 gigawatts more than 10 gigawatts. Then you end up with this challenging aspect of like how do you pay for that? And hey, that's only the rental price. If I were to actually do the capex because it's front loaded, right, it becomes who is the balance sheet for this? That's the reason these deals are coming about. Oracle is making a massive bet. Larry, he's getting good margin off of it, but he's making a massive bet that this capex that he's going to pay for OpenAI will actually be paid because he signed a 300 billion deal with OpenAI. It's like your revenue is like 15 billion ARR this month. Maybe on a run rate basis it'll get to 20 by the end of the year pretty clearly. Maybe it's like 16 now, but it's very tough to get to. How do you pay $300 billion of revenue if the bet works out? They've just made 100 billion of profit, pure cash profit. It's crazy, but if it doesn't work out, they've got this huge and they're starting to raise debt. There was a small deal they signed recently, but they're going to start raising more and more debt now. Nvidia's kind of got the same conundrum, right? It's like Google and Amazon are doing these deals whether it's to other vendors for TPUs or for Trainium, whether it's Anthropic or others. They're trying to court OpenAI, they're trying to court other companies. How do I get into this game? Right, okay, fine, I can rely on Microsoft somewhat, I can rely on Oracle somewhat, but at the end of the day If I want GPUs to be king, part of it is just like my chip is the best, but part of it is also who's going to pay the capex upfront. Google and Amazon will pay the capex upfront. If it's for TPUs or Trainium, they won't pay the capex up front necessarily for that same capacity of GPU. So you've got this challenging aspect and so that's where this Nvidia and OpenAI deal comes from.
A
I want to dig into the underlying assumptions driving this on the training and inference side because obviously there's the willingness like Zuckerberg just needs to go down the hall to CFO to get access to all this capital. He doesn't even need to go down the hall. He can just make it so he's.
B
Got the voting shares.
A
Sam's got to fly to Norway and Saudi and other places. But I want to make sure I understand your thinking on the underlying two sides of this one, which is like your view on the diminishing return curve on just.
B
I don't think it's a diminishing return. Right. I think that's important to recognize. Right. Given it's a log log chart, scaling laws are given. There's no model architecture improvements. You just throw more compute data model size at it. It gets better at this pace.
A
But you're confident that that will continue.
B
Everything has shown that it will continue and it's continued over.
A
And that orders of magnitude wasn't some like.
B
Well, GPT 5 is not, not necessarily that much bigger than 4.0. Right. And 4.0 is smaller than 4. What's changing is sort of the paradigm of how you spend the compute. And also like, if they made a bigger model, could they even serve it now? They did 4.5 and it was terrible. No one could serve it. It was actually like quite a bit smarter, but they couldn't actually serve it at any reasonable cost and speed. Anthropic has the same issue. I wouldn't even call it an issue. But like all of their revenue comes from four. Sonic doesn't come from 4.1 opus, which is the better model. It's bigger, but it's slow because the hardware's not caught up in terms of inference speed for that. And so no one wants to use a slow model. Right. The user experience sucks. But as far as like, if the model gets better at each scale of hardware spend, I would say all the tech giants believe it. I believe it. I think a lot of people in the Financial community are like, this is fricking scary. Yeah. Because the moment it stopped, you know, wherever you were on the wrong. Right. If we went from $50 billion spend to $500 billion spend, well that $500 billion spend is never going to have ROI. It was one thing if 50 billion didn't have ROI, but now this 500 doesn't have ROI. It's a big problem. So anyways, one could think of it as diminishing returns because if you, when you go from $50 billion of spend to $500 billion of spending, you only move up one tier of model capabilities. In absence of major algorithmic improvements, I'm holding those sort of off to the side for now. But that iterative performance improvement in the model, it's like a 6 year old versus a 13 year old maybe, right. The amount of work you can get a 13 year old to do is actually quite valuable relative to a six year old. And the same applies to like a college intern versus someone who graduated and has even one year of work experience. Because there's a learning curve for kids coming out of college all the time. While it may be in order magnitude more of compute the amount of value. If you made a company full of high schoolers and you had to refresh them every six months so they didn't learn too much, right. And become really good, it would be really hard to create a valuable company. The most you could do is like dig trenches and do yard work. But then all the time these kids wouldn't even show up. Right. Like as a function of like how valuable the business could you do if you had unlimited high schoolers that refreshed so they didn't build knowledge versus college students versus 25 to 30 year olds. The value of that business that you can build, even though incrementally it's just five years between each of them, it's a drastic value change.
A
Where do you think we are today? Like which level are we at, do you think?
B
Depends on the domain, right? Like for software developers, like I think we're, we're really pretty good. And that's where we're seeing the most value creation happen. Anthropic have gone from like a billion or less of revenue to 7 to 8. It's the fastest revenue ramp we've ever seen for anything of this kind of.
A
Basically all code related, right?
B
I mean like, you know, some of it's their own cloud code product, some of it's, you know, cursor, some of it's GitHub, Copilot, which also offers anthropic models and has since the beginning of the year. It's Windsurf. It's all these different avenues to access the same thing and these companies aren't all doing the same thing. There, there's tweaks and nuances to how they're doing things differently, but it's all code. And you know, in that sense it's like If I had 30 year old senior engineer at Google and if I had infinite of those, all it costed was capex for chips and the operational cost is actually quite low. Then you could build businesses worth insane amounts. You could have a replacement for the $2 trillion of wages that go to all the software developers in the world today. Or rather you could augment them and build, you know, twice as much or 5 times as much or 10 times as much if you could augment them. Because these things don't like just run on their own. Right. It's more of like a force multiplier to the existing person. So the value creation potential is, is there. It's obvious if you've coded it all in your life. I mean it even works for vba. It's not that great for vba. So I know a lot of people in this audience probably know the VBA users.
A
Yes.
B
But like it's, it's not even that terribly bad for making macros. But the value creation potential there is incredibly high. So let's capture it. How do you capture it? This draws back to the OpenAI Nvidia deal because I think most people in the market don't quite get it right. They're like, oh, this is just like round tripping. It is to some extent. They agreed 10 gigawatts of capacity. Nvidia will do $100 billion of equity investment into OpenAI in the form of cash and Nvidia gets return capital. The first chunk of the deal in the press release is 1 gigawatt $10 billion. So pretty straight line, 1 gigawatt to build as we established earlier is like $50 billion. So Nvidia is paying 10 billion. OpenAI still has to come up with other 40 somehow go to the markets, get a loan or get someone else to put a loan. Right. There's these infrastructure funds that are trying to get into this. You know, all these commercial real estate people are trying to get in this. There's some way where they'll be able to figure out other people to front the capital and then come up with like a deal. Much like, like it is Oracle. But OpenAI has to do more of the work in terms of setting up the cluster, the software, the networking, etc. The nice thing for Nvidia is of that 50 billion they capture maybe 35 billion of that is capex that goes directly to Nvidia. So year zero OpenAI slash, its partner spends $50 billion on the data center. The timing is not exactly that, but they spend $50 billion on the data center. 35 goes into Nvidia. Nvidia's gross margin is 75%. So you know, again, I'm going to make it simple numbers. Let's say it's 10 and 40, 10 billion cogs, 40 billion revenue, $30 billion of gross profit. If we fix the numbers, it's effectively like half their gross profit from that deal is going directly to OpenAI in the form of an equity investment. The 25%, that's COGS, Nvidia's paying for that and then they keep the other half of the gross profit on their balance sheet or do buybacks, whatever they want to do with it. So Nvidia is not necessarily like they are like round tripping some of this. What effectively is happening is OpenAI gets the opportunity to pay for a big chunk of it in equity and Nvidia's lowering their prices without lowering their prices effectively and they're getting ownership of a company. But Nvidia comes out great because they're getting the capex dollars up front. So all they're really doing is they're saying half of my money that's in this. Sure it does make its way to me somehow, but in reality I still made half of that gross profit and the other half is equity in a company that may or may not be worth something. A company that may or may not be able to pay hundreds of billions of dollars of compute deals that they've signed, in which case they'd be bankrupt.
A
It's about the highest stakes like capitalism game of all time. And it's so interesting to think about when it might run out. You mentioned like if we hit that final point and we don't see the return, we're kind of toast in a big hole. But I'm also curious about the other side of ability to serve and just demand for like today's models by inference. The stat I last saw is token demands doubling every two months or something crazy. Obviously there's all these reasoning tokens that are really exciting for some of the longer thinking models. How do you think about the growth of the pool of demand for inference tokens themselves? Even in today's models, like even if we just like stop things and fix things and we'll leave that other side of the equation just for a second. What's your model for thinking about that today? What most interests you?
B
So the thing I like to call it is tokenomics. I stumbled upon the word actually. It's like a crypto kill off crypto. Finally, once and for all, I'm trying to make tokenomics SEO direct to us talking about tokenomics and then hopefully you talking about tokenomics.
A
Hopefully everyone using tokenomics 20 more times.
B
Yeah, it's got to be in the title now, right? You've got some powerful SEO. We kill the crypto bros for this. But tokenomics, economics of the tokens, right? How much compute is being spent? How much is the gross profit? What's the value being created by these tokens? That's the end of the day. What's relevant here, right? Nvidia keeps saying AI factory which produces intelligence, that intelligence has value. Let's say you have a gigawatt of capacity. What can I serve? Well, I could serve a thousand times of a model. That's really shitty. I could serve one times of a model. That's good. And I could serve like 0.1 times of a model. That's amazing. Now multiply that by whatever factor, how many users, what's the number of tokens outputted? I could do X number of tokens, x times 100x times a million tokens, right? Depending on the model quality. This is sort of where the whole GPT5 thing comes around. OpenAI had a challenging thing, right? They're like, hey, we have a couple gigawatts of capacity effectively, right? By the end of this year, roughly a couple gigawatts of capacity too. How do they maximize their serving capacity with this? One avenue is we continue to serve big models and we make bigger models and the tokens are more expensive. But this log log scale is really challenging because yes, the value is way more, but the cost is way more. And then the real whammy is the user experience is way worse. If I serve a massive, massive model, it's slow and users are fickle. You need the response to be faster than they can hard recalibrate. Yeah, yeah. So there's this like user experience challenge. But really in the end it's like for a given model level, I think there's a saturation point of how much demand of intelligence there is. You can only have such large child army, right? Of, of like people digging trenches or like Cony 2012, whatever it is, like this is very cancelable, but you know, but you could have a much larger army of or business of like the larger level of intelligence. When you think about what could I have done with GPT3? GPT3, even if we paused there, paused the model capabilities, right? You know, obviously the cost to serve a model quality of GPT3, it's like 2000 times cheaper now. And then GPT4, same thing, right? People were freaking out about deep seq because it was like 5, 600 times cheaper. GPT OSS came out and that's even cheaper than that for roughly the same quality. Actually, I would argue the GPT open source model is actually a little bit better than GPT4 OG because it can do tool calling. And anyways, the cost of these things tanks rapidly with algorithmic improvement. Not necessarily model getting bigger, but at X level of intelligence, you can only serve so much demand. The flip side is it takes time for people to realize how to use it. So when GPT3 launched, no one cared. When GPT 3.5 launched, it was like, still most people didn't care. Chad GPT launched with GPT3.5 people cared a little bit. GPT4 launched on ChatGPT then. People cared a lot. But a model tier of GPT3, 5 or 3 still can be very useful in a lot of the world. Now it's not useful for like a lot of use cases, right? Like for coding, it was terrible. For copywriting, it's okay, there's some level of use case and it happens to four, but it takes time for that adoption to happen. And so you've kind of got this challenge of like if I pause on a model capability, then I end up taking way too long for adoption. And also how can I get people to adopt it if I don't let people use it? So OpenAI had this tremendous problem with GPT 4.0. 4 Turbo was smaller than 4 and 4.0 was smaller than 4 Turbo. OpenAI basically did was they made the model as much smaller as possible while keeping roughly the same quality or slightly better. So four to four Turbo was like the model was less than half the size and 4 turbo to 4.0, like 4.0's cost is way lower than 4 and they just kept shrinking the cost. Now 5, what could they have done? They could have gone, oh, we'll go big step. They actually tried that with 4.5. They screwed up some things because it was really hard to get, you know, a hundred thousand GPUs to work properly. There's challenges there. Also, they hadn't figured out the whole reinforcement learning paradigm at that time. The scaling laws are like, it's a chart of quality versus compute. But that compute breaks down into how much bigger do I make the model, how much more data do I put in the model? And if the Internet only has so many tokens, you're kind of screwed. There was potentially a cliff until reinforcement learning happened, where you can generate data and train the model to be better without the Internet having that data. So they kind of had this problem of, you have X amount of compute, you can service your users. But hey, today if people want to use my API, I rate limit them because I can't actually serve them all. I have to rate limit the people who have ChatGPT, Free Pro and Max. Whatever the $2,200 deal is. There's like different rate limits. You can only do deep research so much. I have multiple ChatGPT accounts because I use deep research. It's like you kick off a bunch, you read it and you're like, wow, I learned a ton. Move on. So you have this challenge of like, you can't actually serve your user base enough, so how are they ever going to move up this adoption curve? So then as OpenAI, what's your choice? Do you go from 4.0 to 5? Do you make the model way bigger and not be able to serve anyone? And plus, because you can't serve anyone and it's slow to serve, the adoption curve doesn't really get going. Or do you make the model the same size, which is what they did for GPT5. It's basically the same size as 4.0 and roughly the same cost. That's actually a little bit cheaper, potentially. And then you just serve way more users and get everyone up the adoption curve more. And then you can, instead of putting them on a bigger model, you put them on models that do thinking. If you've used GPT5 thinking or GPT5 Pro, there's more intelligence there. This is the whole conundrum they have. And this is where the whole tokenomics thing comes into play. The question you had, I wanted to level set it right, which is, how do you serve these users? The demand is growing so much. I'm not doubling my hardware every two months. Yes, this capex is crazy, but I'm not doubling my hardware every two months, but I'm doubling my tokens every two months. So there has to be enough of a cost decrease. And there is. At a given level of intelligence, if.
A
You could snap your fingers and change a dial somehow, that would most unlock and unleash more development. Is it just inference latency? Because then we could do bigger models and serve them much faster in a way that consumers would enjoy. Is that the main, like bottleneck to be attacked?
B
All of these things are curves and it's a trade off. Right? Everything in engineering is a trade off. So you have inference latency versus cost on any given hardware. GPUs can do lower latency to a certain extent, but then the cost is way higher. Or you can do really, really high throughput and the cost is way lower. And the company, they set the dial where they think it makes the most sense. Yeah. And there's other types of hardware which aim for their curve to be at a different spot. Maybe the GPU curve is here, but latency, you know, over here, you know, you're in very diminishing returns. And so actually someone made a little curve right here that's like, okay, maybe that's a useful point, but actually the market cares about this point. So anyways, there's a curve of like, who cares about latency? I think if I could just press a magic button. Is it capacity? Is it latency? That's a tremendous question. I'd probably still say capacity slash cost is more important than latency. Really. I think existing levels of latency are fast enough for a lot. If the latency was 10x lower for GPT5, then they could have made a model that was 10x bigger, served at the speed. But then you would have the same capacity issue. If you could have your cake and eat it, which is all the capacity in the world and the lowest latency in the world, you'd make the models way better. It's the physical realities of if I'm at OpenAI, what do I choose to do? Do I invest more in the model that people can use and that's fast? Do I invest a lot in the model that most people won't use because it's expensive, first of all. And even those that can't afford it will often go back to the regular one. I have access to Claude 4.1 opus. I still use Sonnet way more.
A
Just because it's a better experience.
B
Right. Slow. My time's worth something. If you had the magic button, I think OpenAI wouldn't have been afraid to like make a model way, way, way bigger and a terrible user experience.
A
And as a result we're just going to probably have to wait a little bit longer to see what the bigger models are in practice to see what consumers actually do with them. Because it's just going to be too hard.
B
It's not necessarily even bigger. Right. Like, there's this whole concept of over parameterization that is if you just throw more parameters in a neural network. I'll equate it to humans, right. When you had a vocab test or you had some test you memorized before you understood, and it wasn't until you did multiple repetitions and in different forms that you actually understood the content rather than just memorized. It takes cycles. When you do an LLM, it's the same thing, right. If you throw some data at it, it'll memorize it before it generalizes. It's this concept called grokking. You grok the subject that is, it's.
A
Like the aha moment of understanding it.
B
Yeah. The models do the same thing. They memorize it up until then they understand it at some point. And if you make the model bigger and bigger and bigger without the data changing, you just memorize everything. And actually it starts to get worse again because it never had the opportunity to generalize because the model was so big and there's so many weights and there's so much capacity for information. The challenge today is not necessarily make the model bigger. The challenge is how do I generate and create data in useful domains so that the model gets better at them? Nowhere on the Internet does it show you how to fly through a spreadsheet using only your keyboard and all these, like, functions and all these things. Right? Like that's, that's a repetition, that's bars. But there's no data on the Internet about this. So how do you teach a model that it's not going to learn it from reading the Internet over and over and over again, which you and I could never do. And so it hasn't a level of intelligence that we can't do. We can't read the whole Internet, but it can't do basic stuff, which is like play with a spreadsheet. How do you get it to learn these things? That's where this whole reinforcement learning paradigm.
A
Giving it environments, specific environments to learn it and then fold back in.
B
Right, exactly. That's where there's sort of a challenge in terms of building those environments. So there's like 40 startups now in the Bay doing these environments and, you know, questionable whether or not any of them will make it or what will happen. And then these companies are also making their own environments. But these environments can be anything and everything. So it's like, as simple as, like, here is a fake Amazon. Amazon Terms of Service ban chat models and all these things, but here's a fake Amazon full of items. Figure out how to click around and purchase items. Figure out how to compare the two items and pick. I've generated a list of deodorants. Three of them are fake, one of them's real, one of them's not the one I want. Here's the prompt. And you know, it tries many things and, you know, vary the prompt and all these things, but eventually, you know, it's bought the right deodorant and you've succeeded and you fold it back in. That's a simple thing. Or it could be clean this data, right? Here's this table. Ton of dirty data in there. Oh, there's like colons and stuff. There's an address in one column. How do I separate out the column? So the address is like street address, city, zip code. And I'll try a bunch of stuff, but like, hey, maybe it can't do that yet. You give it an iterative, like, here's addresses, here's different formats, and you slowly, iteratively teach it. So another example is like, you're in a game, whether it's a Tic Tac Toe or Call of Duty or a math puzzle, whatever the game is. And that's what a lot of these environments initially have been is like math puzzles. Do this math puzzle. Oh, well, I can't do this one because it's too hard. Here's an easier one. Okay, I can. I can spin on this one. Okay, I'm better enough. Okay, now I can learn this one iteratively, step through those to where? From Q4 of last year to Q2 of this year. These things Hill climbed up math puzzles like crazy. And a lot of that was not, hey, I just know the math. A lot of that was, here's how I use Python to write something that does the math for me. And now these things are actually quite good at math. The environments can be super varied. And it doesn't need to be something that's like clear cut and dry. It can be. Here's. Here's a medical case. What's wrong with it? And then there's like. And then you have another model say, well, here's instructions on how you would grade the result of a case. What looks like they didn't even try this or didn't even look up this. Okay, you did that wrong. So these environments can be very, very complicated. So building those out is a Challenge, Right. It was one thing to say, I'm taking all the Internet data, I'm going to filter it somewhere, I'm gonna throw it to the model. There's tons of engineering challenges there, for sure. There's a different set of engineering challenges that take time to build out.
A
In those two, like in pure raw Internet pre training world and in this new, like, environments world, what inning are we in in each of those, would you say, like, how far into the potential benefits this is where, like the.
B
Whole, like, oh, well then, Dylan, what you're saying is you never need to make models bigger again, right? Because you've already run out of data. And until you figure out how to generate tons and tons of data, that's great. But actually we haven't, you know, we've seen another angle where it's mostly just been pre training scaling, right. Is VO3 and banana nano, right? These Google image and video models and genie and like all these Google image and video models. And that's, that's purely like scaling on multimodality. Right. The models still aren't that great at video and audio and images. They're fine. They could be a lot better. So there's like angles of scaling there, right? Because when I said we've run out of training scaling, we've run out of the text. Yeah. There's tons of video and image and audio. It's just so expensive. So, you know, like, we, we didn't get to that.
A
Maybe late innings on text, mid innings on pre training.
B
I think we're early on. Non. Yeah, we're quite early on. And then the other angle is. Just because you've used the text doesn't mean you can't learn faster. You know, it's like, take a classroom again. Like, you know, this is like Machiavelli. I don't even if Machiavellian's the right word. But like, you take a class, you give them all a book, you tell them to read it once, and you test them all. It's like one kid's gonna get a hundred and one kid's gonna get a 40, right? It's just the reality of life. And if you read the book out loud to them, the kid who got 100 might get a 30, and the kid who got a 40 might've got a 60, right? So there's like these different parameters, and when we talk about model architecture, the same thing happens there. So it's not like you stop training new models. It's not like you don't have algorithmic Improvements or smarter kids. Right. You know, so it's not like pre training is done. It's the base of everything. So you want to keep having gains because any gains on pre training, I. E. The model learns a little faster or the model's a little bit smaller for the same quality feeds into the next stage, which is this whole post training side which will subsume the majority of the compute at some point.
A
And inning wise, is. Are we in the second inning of that?
B
Like, I think we've like thrown the first ball. Wow. I think my favorite thing, my brother just had a baby. He had his third month birthday like a couple days ago. I'm gonna go see him in another week. But like, this baby will literally stick his hand in his mouth. He's like calibrating the senses on his fingers by sticking his hand in his mouth because his tongue is the most sensitive thing. He doesn't know he's doing it, but like, that's how he's calibrating. He's like, oh, that's me. Oh, I can touch and feel. Right. It's like, how does the model learn these sorts of things? Right. It's like you just have to try stuff and fail. And we're so, so early. Think about how much we see throughout our life and how much of that information we throw away. We throw all of this information away. Do I remember what I had for lunch yesterday? No. But if it was amazing or bad, I would have remembered that. Oh, I don't like this. Or I like this. Right. There's all this information we throw away. And these models, these environments. Yeah, we're generating tons of data and throwing most of it away and training the model, but it's like infinitesimal compared to what humans have done. And so I think there's so many environments you can put the model in. There's people who even think you don't get to the magical AGI until you embody it, I. E. You put the model in something that could interact in the real world. Elon and Xai, they're a bit more along that angle of like, they think embodiment is required to get to artificial general intelligence because you need the model to be able to say, like, pick this up, or like, oh, wow, this is like a rotate Y thingy, which you could never get from like just watching a video about it. You wouldn't get the concepts of it even. I think we're so early in the reinforcement learning because that's what humans are. We're reinforcement learners.
A
Let's Say we fast forward and we're in the seventh inning of that or something like this. What do you think the way that the average person will most feel that difference in terms of the utility of the model?
B
It'll be very different like modus of using it. It's one thing to like ask for information or ask it to organize information versus it just doing things. Those 12 year olds, you need to really direct them how to dig a hole because a lot of them haven't dug a hole.
A
But you're talking about order me this vitamin and just like it's just done, right?
B
And we're actually like not too far away from that. I think if you try and research electric toothbrushes, like this is something. Cause you know your electric toothbrush, I leave it at a hotel all the time. And I've been obsessive about this. Like in 2021 I made a spreadsheet of all the electric toothbrushes because based on how many ICs were in each one of them, right? Like this one has a Bluetooth ic. Why I don't. This one has a display ic. Like it has a color display ic. Like what's going on? Right? Like I made a spreadsheet of all this and so like, I don't know, it's like this weird like little thing that I do. But I've been finding like how I research which toothbrush I want to buy. Now I bought A oral B IO like series 9 or whatever. These models now can figure out exactly what you want. More than 10% of Etsy's traffic is straight from GPT. Wow. Amazon blocks GPT, but otherwise it would be really high. People make purchasing decisions through GPT. They just don't make the purchase. OpenAI's head of applications or CEO of applications was at Shopify and created the shopping agent. This is very clear. This is how they monetize. And this is like something we, we wrote about, like on the newsletter. Once the models are going to purchase for you, they're going to do actions for you. And then the company that does those actions for you, the model that will be able to take some sort of take rate. Even if it's like 0.1%, even if it's 1%, it's 2%. It'll be like a credit card transaction. Visa is the most amazing business in the world because of this, right? And chat could be that too. If I'm making my decisions on purchasing all sorts of things, I mean, I already almost outsourced. Like what am I going to eat to like the front page recommendation of like Uber eats sometimes or Google Maps reviews. It's like, oh, I'm feeling like pizza. It's like, okay, well the first one is what I'm going to go to, right. I already outsource a lot of decisions. It's not too much further till I've like completely outsourced a decision and a purchasing intent. That's what's made Meta and Google such amazing companies is they figured out how to get the thing you want to purchase in front of you. All their work on recommendation systems is figuring out what you like, how to keep you on the platform longer, whether it's YouTube or Instagram or ByteDance, right. With TikTok. Or it's hey, here's the ad of the thing you'll probably click on and buy because that's how I get paid. And everyone likes to claim they don't like pay attention to ads, but you do.
A
Before asking even more holistically your view on where we're going, there's a third category which is the reasoning part of the equation. So we've got pre training, we've got RL and environments, post training. What about raw time spent reasoning and where that going as its own independent part of the overall scaling.
B
So I think the scaling laws, again, like if you zoom out, it's not actually what like original paper is, but in spirit, sure, scaling laws are more compute, better intelligence and that could be bigger and bigger model. Each iterative token is better. But again, when I talk to you and I whatever, like word garbage I spew out, if I went back and I wrote about everything I talked about in this, I could make it way more condensed. It could be way more clear potentially. Right now the benefit of podcasts is a lot of times people are more fun this way. Driving. It's fun. Yeah, exactly right. They're walking their dog and they're listening whatever it is they're working out. But like an important thing here is that by putting in these environments, you're teaching it like humans. If I asked you to go figure something out, you might not necessarily know the answer right away, but I know you could probably figure it out in a given amount of time. That's reasoning. You're spending more brain cycles. The magic again of like intelligence, of humans, of people, is not that they are the best at information retrieval. GPTs are amazing at information retrieval. We're really good at because we've been trained in these environments, which is our world, at figuring out how to do things. Iteratively and so reasoning and these RL environments are linked together. If I'm telling a model, hey, do this math puzzle. It's not just spewing out like, oh, the answer's one. Oh, the answer's two. Oh, the answer's three. Okay. The answer was actually seven. And when I got there, I trained it again. It's like, okay, now it knows. Next time. Oh, the answer is six, seven, or eight. No, it's like seven. Okay, great. It's not like, now it instantly knows the answer. It's actually like, oh, here's this puzzle. Oh, like these numbers. Oh, this line. It's Sudoku. These numbers add up to this. Oh, it has one through nine, but it's missing eight. Okay, it's eight, Right? Like, it's thinking through it. Right? Like you and I would solve a Sudoku. Now, eventually, when you get good enough at Sudoku, probably just like, spit out answer, you could do it in your sleep. This reasoning time is a way of spending more compute, more brain cycles on the task without actually scaling the model. And then the model becomes more versatile. Right? Because humans have a rate. If I just held a match against you and you didn't notice it, you'd immediately jerk. The rate at which you operate is hundreds of hundreds of hertz. Your body can actually take actions at like, hundreds of actions per second. If you look at a fighter pilot's reaction time, what reaction can they do is completely primal, instinctual. Very little thought is put into it. This alien intelligence that we're trying to make, is it immediately going to one shot? The answer? Always no. But at times it needs to. At times it needs to be able to tell me exactly the answer in two seconds or half a second or whatever action it needs to take immediately. But a lot of times also needs to think through the problem, go and do stuff. That's why you hire interns, because you're like, yeah, I know this data exists. Here's the format. I kind of want it on. Go figure it out. And then they. They spend a whole summer doing something you could have done in, like, three days. But, like, great, they learned a shit ton. These models need to go through that progression. And so when I think about reasoning rl, it's a lot about how the human psyche and intelligence works. There's a caution of, like, trying to make it too much like humans, because it's not the fundamental substrate is not like humans. The processing is not like humans. Our brain is very different from how these alu's on a chip works. Like the scaling of these things is very different. The raw speed, the amount of words they can, everything is so different. But at the same time, it's important to like, reckon back to what actually makes people, you know, smart.
A
On the topic of like, embodiment, continuing with the human analogy, how do you think about things like short and long term memory in a human versus raw model capacity or something? What role does that analogy of memory, I don't mean literally like semiconductor memory, but like memory in a model. How do you think about the importance that that will play and where are we in that?
B
The magic of Transformers was attention. I. E. I calculate everything in my context. Length. I calculate the attention to each other. Basically. In a vector space like king, queen, there's these vectors. There's like dozens of vectors for each number. And king and queen are actually exactly the same on a ton of stuff. But then it's the opposite on one number because one's a male, one's a female. And then that will have a lot of other, like, ramifications throughout other literary stuff. Like what adjectives do you put with a male of, you know, this vector? So it's like regal and, you know, like powerful and could be ruthless, whereas a queen could be like dignitary or whatever. When you think about how that applies to humans, we're terrible at exact recall. Absolutely horrible. I could tell you a sentence and tell you to repeat it.
A
Yeah, it's like six numbers the average person can remember or something like that, right?
B
But like, you get the gist of the sentence. If I told you a whole paragraph, you'd get the gist of it. You could repeat the meaning of it to someone. You could translate that meaning. So models, very different, right. Fundamentally transform. Our attention has been calculating the attention to everything to each other and getting the models to actually be able to recall. That's been a training data problem. But like, you can get the model to repeat exactly what you want. Anything in its context. Length. It's like a needle in the haystack is the like. It's a benchmark that people did for a while because models had to get good at that. But now models are just amazing. But what they really suck at is having infinite context. The real word is sparse. You've taken this entire world and you've encoded it in such a small amount of data that lives in your brain and it's so sparse. But you understood how to grab the fundamental reason and put it down there. Whereas models, they haven't been able to create something sparse yet. How do you reason over the context of infinity humans? Maybe we have like a short term memory and a long term memory. I think it's a lot more blurry than that. There's no like clear line, oh, this is in my short term memory. Oh, this is in my long term memory. It's like, it is much more blurry as we go back and back and back, it's more and more sparse, right? If we think about, hey, what do you remember as a kid? The most crazy thing in psychology. I remember when I learned it, I was like, wait, my memory of what I did as a kid with my dad at this like thing, right? Is fake. It's me remembering it and inventing the picture and me remembering that picture like successively. But like the actual memory of what happened is like morphed a little bit over time. The way humans collapse information is super, super dense, but we are able to extract all the relevant information out. Now, models, there's a ton of research going on in this domain of long context. How do I get longer and longer context without blowing up my model cost? This is a big challenge with reasoning. This is why we had this HBM bullish pitch for a while, right? Is like, you need a lot of memory when you extend the context. Simple thesis. The fundamental algorithm needs to change and improve over time iteratively to get to something like this short and long context of memory. That doesn't necessarily mean the model has to work like we do. Why can't the model just reason and have a database that it writes stuff in? Or like a Word document that it writes stuff in and then it takes it out of its context, works some more and like roll calls back. It's like, oh yeah, we don't do that, right? Like you and I refer to our notes, we refer to our calendar, we refer to our text, we refer to anything. All the shopping list, right? Like, great, I know I need food for dinner, I go to the store, I'm like, I need a shopping list because otherwise I'm going to buy like stupid shit. So the model doesn't necessarily have to fundamentally work the same way as humans. But there is that challenge of like, how do I train the model to operate over the context length of a human? How do I train it to interact with these databases and these word documents that it writes to? Because it's never going to learn that from pre training, has to learn that from an environment. But these environments have to be like architected in a way where the model knows it can write stuff down and refer back and so one of the first things OpenAI did was deep research. Everything is not in deep research's context. Deep research is working for like 45 minutes. It's outputting millions and millions of tokens and it's creating this amazing thing that it wrote. And it's pretty good research, I would say. A lot of like, memos that you read from people are like on par with like deep Research, at least like a junior. How they do that was they enabled it to be able to write something down elsewhere and have this recall and, and effectively use language to compress information that it looked at, put that off to the side, use language to compress other information off to the side, use language to compress other information off to the side. And then looking at all this compressed information and writing something, that's sort of what deep research is. So how do models get there? I'm not sure. Right. I think it's a fundamental research challenge. It's why these companies need millions of GPUs to train on. Not for, oh, I'm going to make a million GPU model, but because I need to try a bajillion different things because I don't know what will work. What's going to work for humans is so different from what works with models. And there's any number of parameters or things you could tweak that could end up, like, changing how it develops and how good is it at if I do it this way versus that way? Right. That's the whole point of ML research. Constantly trying stuff out and trying to get better and better.
A
If I add all of this up and hold the mirror up, it seems like I would put you in the category of unbelievably bullish on what these things are going to be able to do in 10 years time or something like, pick your timeframe. Am I calibrated the right way? Like, amongst everyone you talk to, who you respect and think is, I'm much.
B
More bearish than a lot of people, actually, which is the crazy thing.
A
So help me understand that distinction. Like, where are you 1 through 10amongst the people that you respect? 10 being the most bullish? And then if you're not a 10, what's the difference between you and the person who's a 10?
B
I respect you, but I know I'm way more bullish than you. And I respect Mark Zuckerberg, but I know he might be. I don't know if he's more bullish than me, but I know Sam Altman is definitely way more bullish. Than me. He says we have artificial general intelligence in less than a thousand days. Or Dario. Like, I respect him immensely, but he's way more bullish than me. My roommates, one of them is an anthropic ML researcher, and one of them is another podcaster, Dwarkash. They're both way more bullish than I am. Even. They are not as bullish as, like, some researchers in this field. But then if I go talk to someone I respect, like a famous investor, right? Like any of these famous investors. I don't want to name one because I'm scared, but there's all these famous investors, right? They're not more bullish than me. And the stuff I'm saying sounds like crazy shit.
A
Some of it, though, is timeline. I'm actually even more curious about the upper limit.
B
The upper limit. I think I'm among the most bullish.
A
You can get, because that's what I mean.
B
Yeah, like, upper limit of this is that this will just be smarter than humans. I don't think that'll happen, like, anytime soon. Even if that doesn't happen anytime soon, there's so much valuable stuff that can be done with these models that economically we will skyrocket. There's so much value that can be created in the world just by, hey, if the models know how to do COBOL to, like, C and Python. Migration of, like, mainframes.
A
Migrate everything.
B
Migrate everything from mainframes to cloud. The world is how much more efficient, making all these random applications and, like, automated reports and, like, stop using Excel as a database, but instead, like, you can make a real database and manipulate stuff in Excel, but, like, you know, there's all sorts of humongous business efficiencies that could happen or automation that could happen without the model ever being. We could literally just pause it at, like, six months from now timeframe of, like, how good it is at software development. And it would be, like, godsend in terms of, like, how much efficiency and value can be created for the economy. And it doesn't ever have to get to, like, digital God level. Now I do believe we're gonna get the digital God level eventually. Eventually. Now, is that. Is that. Is that. Is that 10 years? Is that five years? Is that a hundred years? Is that a thousand years? I don't know, because there's. There's so many unknown unknowns. I know how many unknowns there are, but, like, there's so many unknown unknowns. And so, like I mentioned, right, like, these babies are putting their freaking hand in their mouth to calibrate and then later they put their foot in their mouth and they're like, oh, that's my foot. Oh, here's the senses on it. And then they can pick up stuff in their hand and they no longer have to put it on their most sensitive part of their body because they know what it is. Or they're like, oh, this is a speck on the ground. What is it? It's not food. But now I know what it feels like inside my hands and I've calibrated, right. It's like the models have not gotten there yet. It has no idea how to do this. Digital God is like, well, one, I like, kind of believe in embodiment and like, you need non digital God. You need a physical and you need the capability of like, having touch and feel and all that to truly have an experience like humans and be smarter than us in every way. But, you know, that's so far away.
A
What do you think about what physical intelligence is doing? Attacking the. Whatever you want to call a large movement model or large robot model or something?
B
Yeah, I mean, like, what are they actually doing today is like, holy shit. It's so simple in terms of, like, to a human, to models, it's like picking this up is fricking hard. Like, how much do I squeeze my pinky vs this finger vs this finger is this finger. I don't know, like, like. But you pick up a glass of water and you tilt it and it's like, this is impossible for a model today. And it's likely like at the level of dexterity, fills a wine glass and I was swishing it. Think about how simple that is. You don't even think about it, but, like, you instinctually pick up a wine glass and you swish it and it lets the aroma out and you smell it, but it's like, oh, that little swish is so much tactile feedback and movement. And it's like these models can't do that shit yet. Like, nowhere close. But it doesn't need to be that good. It doesn't need to be able to swish a wine glass and not break the wine glass and put it back down and tilt it perfectly and not spill it. It doesn't need to be able to do any of that to be tremendously valuable. What it needs to be tremendously valuable is pick this up and put it down here after knowing what it is. So there's so much value that can be created just by being really good.
A
At, like, getting data.
B
Yeah, yeah, yeah. The robotics world is Huge. I think we're warming up. We haven't even left the dugout. Right. Like, we're, we're like, nowhere close to the scaling on. On robotics. There's a ton of, like, the data Flywheel needs to get going there.
A
One of the most interesting subplots of this whole world is the talent wars. And a cool idea is that as these things get better, maybe we begin to automate some of the research function that people formerly would have played. Do you see a world where, like, we're squeezing down the fewer and fewer number of people that really matter. That will have all the impact on where we go in terms of, like, net new research. And that means that all this crazy spending that's happening at Meta or elsewhere makes a lot of sense, that maybe even those numbers should be higher or something like this.
B
I think it's, like, tremendously hilarious that people are like, oh, my God, this person's getting paid a billion dollars. Or like, oh my God, this person's getting paid $100 million. Hilarious to me that it is infeasible. It's like, how could this person possibly be worth that much? Well, they're running the experiment on chips that cost $100 billion. If every wasted experiment they do, if they just used like a third of the compute and their ideas and their impact on it. Wasted the compute. It was an idea that was already done. Or like, there's so much wasted compute. Call it wasted. It's trying stuff and failing. But, like, none of us know what to try and what not to try. And these things are so complicated. There's like a group of people just trying different stuff on the existing data. How do you mix it? What order do you feed it into the model? How do you filter it? There's a different group of people that are doing, what's the architecture? There's different people working on long context. There's different people working on every single aspect of the model that, like, if you just make them a little bit more efficient, that they come up with the idea that's 5% more efficient. Well, fantastic. I just saved not only 5% of my training time. I also say 5% across my entire inference fleet. Because we're so far away from like, these models being anywhere near as efficient as a human brain, and we know it can at least get as efficient as us. Maybe the compute substrate isn't the same, but, like, whatever. Adding more people to the problem doesn't make it faster because there's so many things you're trying learning the stuff from experiments is something that you run these experiments, you learn something and then you implement it. And then you try a bunch of experiments, right? You tweak these knobs these ways in a hundred different ways and then you see the trend line and you're like, oh, so actually I should tweak it this way. Let's implement that. There's so much like gut feel, there's so much reading data, understanding it, re implementing it, learning by doing type stuff that if you add people, you're going to slow it down. And in a sense, a lot of Meta's problems before they did the super intelligence thing was that they just had too many people that weren't led by leadership. That was amazing. And they had a lot of failed experiments and wasted time doing things that didn't matter. There's a tweet from one of my friends at OpenAI. He's pretty famous on Twitter. His name's Rune. He made a tweet about like, I get viscerally angry every time I think about how many H1 hundreds meta's wasting. It's like, it's such a funny tweak because it's like, well, yeah, they're wasting a ton of compute. They were, you know, maybe they still are, but you know, like everyone's wasting compute, right? OpenAI is wasting tons of compute because what's the Pareto optimal model architecture?
A
Another thing I saw Rune say recently, which is so interesting was, why don't we just go make even more ridiculous offers around the people that have process knowledge for things that we want here in the US in other countries? Like, if we're getting pretty good at the Arizona Fab that we've built and we think that we can sort of extract the process knowledge from the people, why don't we like go acqui. Hire like all the best people in Shenzhen or all the best people in other places in the world. Do you think it starts to escalate to that level? Like, so much is dependent on the process knowledge of a relatively small group of people. The talent war should actually be. It shouldn't be Meta and OpenAI. It should be like the US maybe through Meta and OpenAI and people from all over the world. Like, do you think it starts to get that extreme and should it?
B
That's almost a function of why intel is falling off a lot, right? Is like you have all these geniuses in nanochemistry and PhDs and all these like random things, whether it be chemistry, physics, all these like incredibly smart people, but there's a whole Class of incredibly smart people that never went that way because they're like, oh, those guys are making like 200k. Why would I do that? I'm gonna go to Google and make 800k and now I'm gonna go to Meta and go make 100 million. Right. Like any like smart 18 year old is gonna be like that. I'm doing this. Why do the smartest doctors, and I don't mean to say the smartest doctors in a general sense, but their skews really smart population of doctors that want to be dermatologists and anesthesiologists. It's like, is that the most valuable thing for them to do? No, but those are the two professions that give you good working hours and great.
A
Yeah.
B
Not to say that the general doctor is not smart as them, but if you took the population of general like family doctors, the random doctor and you took the population of dermatologists, the newest coming out of school, the ones that are being dermatologist and anesthesiologist are way smarter or at least scored better. Were able to get into the field that was coveted. So yeah, talent wars. We've sort of been through this process of like it's always been human capital and capital goods, sort of those two vying off of each other. And for a long time with mechanization, industrialization, we had the human capital decreasing as the industrial capital increased. That got to a point where especially in the 70s, it really started to tank as the ability to globalize and all these things started to really hit the US and that's why you have a lot of the population level dynamics and income inequality that we have today. That is very bad for the psyche of the US and stability of it. But you now have. We're in such a age of well, actually manufacturing things is pretty commodity. Most of the value doesn't come from the manufacturing of it. It comes from the creation of the idea. One thing Jensen told me, which I thought was like amazing, right? He's like, you know, Dylan, the reason America is rich, like people have it all wrong. The reason we're rich is because we've exported all the labor. What we've kept all the value. And that's what Nvidia does, right? They've exported the labor of making their chips and Apple, right? Everyone, it's. It's done in Asia. Those companies make money.
A
Not as much money as Nvidia and Apple, right?
B
All the gross profits are going to them and then they're either reinvesting it or buying back stock or Whatever. However they allocate the capital. If, like, as you said, the process knowledge is so valuable, why aren't we doing this? That's a great idea.
A
Marin's idea, not mine.
B
Yeah, I mean, I think. I think the challenge is how to choose people is really difficult. Judging someone is smart or not for some roles, someone who can talk the talk, they're great. People just automatically assume they're great because they can talk the talk. But you know how many people suck at talking and are really fricking good at doing? Yeah, yeah, but then you don't know. But then there's people who talk about being able to do better than the person who's doing. And like, these tests are never as good. How do you select? And this was a big challenge for Meta. So some of the criticisms are like, they didn't get all of the best people. They actually got a lot of, like, bad people. It's like the cope from like, OpenAI and anthropic. And, you know, these kinds of companies are like, no, no, no, they didn't get our best people. That's what Sam said, right? He's like, they didn't get our best people. It's okay. Meanwhile, he did have to do counteroffers internally. As far as the process knowledge. I think the ML researchers are an extreme of how much value one can do. But my favorite analogy that I came up with recently is that ML research is the exact same as semiconductor manufacturing in the sense of there's a ton of jobs in semiconductor manufacturing that don't exist in ML research, but it is a ton of tune. A thousand different knobs. Oh, you put the wafer in this tool, you're going to change the pressure of the chamber when you're doing the deposition, or you're going to change the mix of the chemicals flowing in, which chemicals you're putting in, what speed you do it at. Do you do it for 30 minutes? Do you. For 31 minutes? Do you do it for. You know, obviously it splices way down. There's so many knobs on every single tool.
A
And you have a thousand input and process knobs.
B
Right. Process knobs on each tool. Plus it's like the sequence of them all. You frankly cannot test everything. It's impossible. It's too large of a search space. Just like designing a chip is too large a search space. You have 100 trillion transistors. How can possibly try every single thing? Impossible. You just have to have enough intuition, like, pick that point, pick that point, pick that point. See the data. Oh, Okay, I think the answer is here. Obviously, once you think the answer's here, you test here. A different person might have seen these three and then said, okay, the answer's actually here, not here. The data is like fuzzy. It's like somewhere in the center. But like, you know, it's like this, this whole like idea of like ML research. You spend a lot of time on compute training, doing what effectively were useless things besides teaching yourself what's the right thing to do and what's the wrong thing to do. And semiconductor manufacturing is the same way. And actually all process manufacturing is the same way. If you're iterating super fast and you're trying to get better and better and better, or you're optimizing a process on a chemistry or whatever it is, you try, you fail, you learn, you do. In semiconductor manufacturing, maybe it's just running tens of thousands of wafers. And so your R and D cost of your main fab that is running the R and D is very, very high. And it's producing zero economic value besides that, it's teaching you how to do the next node which then you can deploy at volume. And that is what actually makes the money.
A
I want to go back all the way to where we started and ask about what I'll call like the wellspring or the fountain of power in this whole ecosystem. So I want to understand how you think about who has the power and how to keep or generate power as a business. I mean, maybe talent is like the very beginning of the chain and he who has the talent, like on the long enough timeline, has the power or something like that. But also there's structural stuff, like just the industrial scale of some of these things. It just takes forever to build or whatever. How do you think about even smaller zoomed in examples like, okay, cursor is unbelievably popular. The revenue is insane. So much of it goes back to anthropic, like who has the power in that relationships. How does that dynamic change over time? It just seems like the power dynamics are so fascinating in this world. And I'm curious where you think it comes from in the first place, like where it exists today and where it will go in the future.
B
Well, you know, way back when monkeys were, you know, before we became humans, we were territorial. So, you know, and me having two bananas makes me better than you when we think about the power structures. Right. Like you mentioned a really interesting one. Does anthropic hold all the cards in this cursor relationship? Cursor has like nearly a billion dollars of revenue. Now if you do current month times 12, that's a ton. But their margins are what they are and they're sending most of it back to Anthropic. Some people say their margins may be negative. I think they're slightly positive. But regardless, they're sending most of it back to Anthropic.
A
The gross profit dollars are at Anthropic right now.
B
Yeah, and. But then Anthropic is taking all the gross profit dollars and putting them into Compute for training. So then all those gross profit dollars.
A
Are going to like Jensen laughing hilariously.
B
Well, maybe Jensen or maybe like Amazon or Google who's sending it to Broadcom. Right. Like the gross profit dollars are going to the hardware layer from all of this for sure. Does Anthropic have all the power? Like the common view is yes, from a lot of people. But then it's like, well, Anthropic only makes the model that's generating the code. There's a lot more in this system. Cursor gets all of the data they get, all of the users they get. How do they interact with this? Anthropic doesn't get that. They get prompt to send a response. Now they have cloud code which is like taking share and it's very different than Cursor but like they get prompt response and then like Cursor is like, oh well, I'm training embedding models on your code database. There's actually multiple models that I've made. I've made the embedding model, I've made the autocomplete model. Oh, I can switch the anthropic model to open the eye model whenever I want to. I'm only using Anthropic model because it's the best one. Oh, and because I have all this data, maybe I can train a model not for everything better than you, but for the segment better than you. And it's so it's like the power dynamics are, you know, it's weird. It's frenemies. Right? Everyone's a frenemy. Right? Same as OpenAI and Microsoft. The most crazy power dynamic that's going on in the world where they signed a MoU that said they had an understanding of like what the deal would actually be for them converting to for profit. Like what is going on here? Like this sounds like the most non announcement announcement ever. The power dynamics of this all. It's the most fascinating soap opera ever. Right. Like one of my friends was telling me about K Pop Demon Hunters. I don't know if you've heard of this.
A
I have a nine year old daughter, so that's all I hear about.
B
You've seen it a lot. I just heard about it and they're like, oh, let's watch it. I'm like, what? Whatever. But like it's like there's drama, but like this, this real world power drama is way cooler than this.
A
Like, which parts of the drama interest you personally the most? Like, where do you think the stakes are the highest in the various like subplots?
B
There's a few different ones. The Microsoft OpenAI one is absurdly interesting because at one point, right, like 2023, it was like, Microsoft's gonna own the world. 2024, a lot of it too. And then like H2, 2024, Microsoft backed down a lot. They pulled back because Amy Hood and whoever else at Microsoft, Sundar, whoever were like, maybe we don't need to be on the hook for a $300 billion. We're not going to build out $300 billion worth of compute for OpenAI. Like that's. They can't pay for it.
A
Yeah.
B
At least had to go through their head when they cut back and so they paused a bunch of data centers, they said, we don't need to be the exclusive compute provider. You can go to Oracle, it's fine. And they relinquished this power. Now Oracle has that deal. OpenAI sends like 20% of their revenue to Microsoft or API revenue or something like this. And then Microsoft has this like 49% capped profit structure on OpenAI. And then there's like this whole like IP sharing, like this deal. It's like really hard to understand the Mechanics of the OpenAI Microsoft deal. Even. So you have this whole power dynamic and they're trying to renegotiate this. The whole deal is like, oh, one, we have AGI, you no longer have IP rights. And it's like if you ask someone 20 years ago and you put them in front of ChatGPT, it's like this fucking AGI, like it knows everything and it could have a conversation. I can't tell it's not a human. Actually I can tell it's way smarter than a human. But now it's like, ah, whatever, I can't do xyz. So the thing, the bar always moves no matter what the level of intelligence is. And for me it's going to be like one. The thing puts its hand in its mouth and it's like, yeah, this is me, I'm a human. Right. Like, you know, that's Sort of like the sentience, the consciousness of it. All right, that's one power dynamic that's like crazy. Another power dynamic is the one around Nvidia and the hyperscalers. Nvidia is the king. All of the gross profit is going to them today. Pretty much all of it. Sure, TSMC makes some, sure, SK, Hynix makes some, but they have to invest a ton in Capex. Sure, Broadcom makes a bunch and you know, Broadcom makes a ton of gross profit off of these companies. But like Nvidia makes by far the most gross profit in the industry. It's not even close. They're king and they want to continue to be king and they want to make sure GPUs continue to be most used. But also they can't buy anything. They weren't even allowed to buy ARM when they were like a nobody. They were like pretty much a nobody on the grand scheme of things. And they weren't allowed to buy arm. They totally could not buy any major companies. They'll buy startups. Like they bought a startup that I was like a seed investor and like an advisor in. But like they can't buy a real company, so what do they do with all this cash flow? Sorry, but you're a loser if you just do buybacks like that. Just. That's admitting, that's admitting you can't get higher returns on your capital, which is fine. Like Meta, Apple, Google, they were mature companies for a while. Guess what those companies are going to do Buybacks ever fucking again. Right? Or not like ever again, but for a while. They think there's better ROI for their capital now. And Nvidia, if you look at Jensen, he's like, he's always like flirted with buybacks but like mostly he's been like reinvesting in the business but you can't reinvest that much into the business.
A
He's doing demand guarantees. He's doing like all this crazy stuff now.
B
Yeah, right, right. He's using his balance sheet to win. Yeah, Try and win more, which is an interesting dynamic. I don't know if there's ever been anything like this in terms of the anti competitive nature of this where you backstop clusters. Core. We've recently got a deal with Nvidia where it was like they backstopped a cluster. Now core, we would have never built this cluster because it's for like short term demand. And renting GPUs on short term is like a terrible business model. You want to do long term contracts and you want to do long term contracts to people with balance sheets, that's the golden goose. But that doesn't exist so much. So you do long term contracts with people who don't have a balance sheet like OpenAI. And if you can't do that, then you'll do short contracts with people who do have a balance sheet. Right. Like there's this whole matrix of like who you rent GPUs to. But from Nvidia's interest it's like, you know what I really love is when venture capitalists fund a company and then 70% of their round is spent on compute. They fucking love that. Right? And that's what's happening with all these companies. Whether it's physical intelligence, they're spending a lot on like robot arms and shit too. But they're also spending a lot on compute. Or it's like any other startup that's racing cursor, whoever. Right. And even if it's not directly, it's indirectly going to GPUs. They love when people spend their entire round on GPUs. Would be really good is if it wasn't like a two year deal or three year deal for that compute. If it was, you can spend 70% of your round on one training run. Leave a company with these ideas, gather the data, do the training run and then you have a product and you show how good the model is and you try and raise again. That's what would be really great for Nvidia. But no one wants to build a cluster who's predicated on that as a business model. That's crazy. So they have to backstop a cluster, do that. Or hey, OpenAI might go to their own chip. They might go to some ASIC from another company. They might even buy TPUs. They might even like go to Amazon. They don't really care. They're not beholden to Microsoft anymore trying.
A
To serve a product to a customer.
B
Yeah. And they want to build digital God. And they want to serve a product. Right. Make revenue. Right. So they don't have to go to Nvidia. Nvidia is the best option. But you know, be really, really helpful is if I could, going back to the earlier part in this discussion is I get the compute up front and I don't have to pay for the compute for the first year. It'd be really good if I could do that because then for a full year I can do training, I can subsidize inference, I can do all these things that build up a user base and then I can actually pay for it. I have a year of a gigawatt to figure out a business model, whether that is serving free tokens and then implementing this purchasing, purchasing stuff for the free user. A lot of that is like almost no fee initially purchasing and then slowly rising the fee over time. Or it's I have to serve this model at worst gross margins or negative gross margins initially but then eventually I can serve it at positive gross margins because the models keep getting cheaper. Or it's I train the next generation model that's so much better than everyone else and then I'll win all the business for that level of intelligence because I'm the only one with a 18 year old. You guys all have 14 year olds who are working for you. They can do whatever they want with this allocation. It's not an allocation of capital per se, allocation of compute. They get to decide what they allocate that compute to. And Nvidia is helping them by effectively front loading it if they can find a capital. And that company is like oh yeah, Nvidia's backing this too. There's all these other things. It's much more reasonable for someone to say oh yeah, I'll pay the capex because I know the first year is already going to be paid because you've got that investment from Nvidia. What about the next four years?
A
If you ask a bunch of investors who are like students of economic cycles through history, like Carlo Perez type stuff, they'll say that the concern is that every shortage is followed by a glut and we always overbuild on long lead time, big CapEx projects and you've got multi gigawatt power being installed, you've got all this crazy stuff in semiconductors. At some point it just gets overbuilt. All the stuff we talked about earlier feels like we're not really close to that. Like there's so much demand.
B
If the models don't improve, yes, we will overbuild, right? Like it's pretty simple. It's like yes, there will be like supply chain things where switches from one supplier to another and like that's a lot of the stuff we focus on at my company. But at the end of the day, if the models don't improve, we're absolutely screwed. And if this lasts another year and then it happens like the US economy will go into a recession straight up because of this. And probably Taiwan as well, and probably Korea as well because there's so much buildup and revenue flowing through to us for this. But when you look at these other things, like the bubbles of the past. Some of them were just silly nonsense, right? Like tulips. Silly nonsense, right. Crypto. Complete Ponzi scheme. Right. But then there's other stuff that's like this was real, right? Like the UK like spent like some absurd percentage of their GDP on railroads for like a decade.
A
6% or something crazy.
B
Yeah. We're nowhere close to 6% of our GDP. Like holy shit. That was like, okay, there's tangible but it's like, oh well, we over did overbuild because like how many goods are there to transport? You must build these railroads to reduce the cost of transport so much because you have no clue when the demand stops and you've overbuilt. And because there's 10 people trying to do it at once, you're obviously going to overbuild at some point. Same thing with fiber. And like a lot of the argument against this is like, well no, but this time it's the strongest balance sheets in the world. It's the world's most profitable companies. They can all pull the plug at any point. Microsoft pulled the plug at one point before. They're like, oh shit, no, no, plug it back in. They recently plugged it back in. They're like, oh wait, we're starting, we're restarting this. We're going out into the market. We're signing deals with Nebbys for GPUs. Like I don't remember how big the deal was. It's like $19 billion for nebbys. And it's like if they had just not pulled the plug on their data centers, they wouldn't have had to do that and they wouldn't have to pay those gross profit dollars to Nebbys. But Nebbys made the bet that the demand is there. And they were right. When you think about this, it's like what is the level of demand where this stops? Right. If scaling laws continue, of course there's an adoption curve, there's a pace, there's realities with capital, there's realities with supply chains. Things take time. Adoption for businesses takes time. But if you like boil it down to it, it's like your demand for 30 year old senior engineers at Google who know how to make and program anything is effectively like, I don't want to say infinite, but it's $2 trillion of value. If I could have an intelligence as smart as a Google Senior engineer, that's $2 trillion of software value. Because that's how much I pay. The world pays to Software engineers today and you just go down the list of every other use case. If you have just a simple physical intelligence robot that can recognize headphone versus water and versus phone, right, and pick up the right thing and manipulate it properly and put it in the right spot and sort it, that's worth how much to the distributions supply chain, right? I don't know, but a lot. We don't need to get digital God for there to be immense value. But the interesting thing here is that, you know, it's like human capital, capital goods. All of these other revolutions have been capital goods that reduce the amount of human capital you need, whereas this is just creating human capital in a sense. If I like get everyone bowled up, we're on this podcast. I don't know if you've heard the curse, right? It's like if you talk about the stock of this podcast and it goes down, right? It's like we're popping the bubble right now because the limit of AI is infinite.
A
For the record, we went and did the math one time because I was sick of hearing about this curse. And it's just market performance. It's not.
B
Oh really? Does your other podcast, I talked about Applied Materials and the stock was up like 70% the six months after. There you go. Yeah, I broke the curse. I was like, hell yeah.
A
What do you think about all the companies in the middle? We've talked a lot about Nvidia and then like people at the end serving applications. What about these companies like together and base 10 and fireworks and you mentioned Nebius, like all these interesting middle layer players. Are there amazing businesses to be built there, do you think? Are they temporary patchwork to make the system work and serve demand? Like what do you think of that?
B
The cloud business model, right? Like let's, let's say Neo cloud business model. So there's. She's sort of. You mentioned inference providers and Neo clouds. The NEO cloud business model is absolutely amazing or terrible depending on how you do it. It's terrible if you like sign short term contracts and you just hope and pray you have short term contracts forever. And actually initially your short term profits have amazing cash flows. You bought a GPU and you put in a data center and the power and all that. The cost per hour over a six year period for Blackwell is $2. Let's just call it for simplicity's sake, it's $2. It's not exactly that, but. And if I sold it for six months, I could get like three fifty or four dollars. Holy shit. That margin's Insane. But what happens two years from now, three years from now, when I'm still selling six month contracts or one month contracts and the next generation of Nvidia chip is out and it's 10x faster for 3x the cost. Okay, so now naturally the price of this should tank. The other way to do it is, hey, I have a long term contract of the other end of the spectrum is what Nebi has just signed. Just. I'm signing $19 billion to Microsoft. They will pay me no matter whatever. Yeah, they're. The market literally believes Microsoft will pay its obligations before the US Government because it's like literally a cheaper bond rate, which is like insane to me, but whatever. This $19 billion has a huge gross profit and it's not exactly $3 and it's not $2. But like the margins here are really good. Nebius is going to make at least $6 billion of gross profit off of this. I would do that all day. And Core Weave did until Microsoft stopped going to Core Weave. But Core Weaves turned around and they found other customers and all these things selling to Google and selling to OpenAI. But now, oh, OpenAI is. You can't rely on their balance sheet. I still have amazing margins when I sell it to OpenAI, but they don't have a balance sheet. So how can I be sure that they're actually going to pay the thing that they've signed up to? In theory, this contract is worth a ton of money. In Corey's books today are all the contracts they've signed are mostly Microsoft, mostly money in the bank. The opening contracts, like, what if they can't afford to pay for this? Okay, now there's like, there's a bigger risk and there's a longer and longer tail of like these businesses. So like, yeah, you absolutely can make a ton of money. There have been more recent deals with crypto miners Google and Fluidstack. Because Google's really short on data center capacity, people want to use more TPUs. They can't reserve them all themselves, so they're going to sell TPU systems to providers they're backstopping the deals with. Terrawolf is one of the companies. I can't remember the other one, but there's two companies they've signed deals with where they're backstopping the data center, plus like selling the TPUs physically to another company and then they're being deployed and then they're getting rented and Google still makes all the money. But like those companies, you know. Yeah, that's great. As well. But then there's a long tail of like, is the enterprise demand there? Who's taking the risk? And it's like OpenAI is taking the risk because they're betting their entire company could go bankrupt if it doesn't come. Oracle's taking a risk because they're signing up for $300 billion of contract, okay, $200 billion of hardware spend across data centers and chips. And of that they're going to have to go get debt. So they're on the hook and they'll probably be able to pay for it if it happens, but they'll just be like, their EV will tank if OpenAI can't pay for all the hardware that they bought. Luckily for them, it phases in over time. And, and then you go to the inference providers and it's like there's a business to be made here too, right? Like, I'm serving models, I'm serving them efficiently. Maybe Roblox comes to me and they want to put an LLM in their game. Another company like Shopify wants to put an LLM for customer service. And yes, they could do it themselves, but actually inference is a hard thing, especially as you get to larger and larger models and more complicated models and all the things, or all these different use cases where people want to serve models. And maybe it's just open source models and maybe it's fine tuning of those open source models, which those companies can help you do, or you can do and they can serve for you and they have scalable, reliable capacity. There's businesses to be made here, but there's also like, I'm selling tokens to random people who are trying to build SaaS, apps and NSF and maybe they run out of Runway. Okay, that funding doesn't directly go to Nvidia, but you go through some steps and it's going to Nvidia or some value chain. Yeah, And Nvidia is holding no risk. Everyone in the middle's got a lot of risk.
A
Back to the other side of the equation, the app side stuff. We're going to use these models to do at the significance of this switch from like deterministic code to a much different thing. And it seems like what we're doing is the thing we always do. Apple used to call this like the skeuomorphic era where you just basically use the new technology to do the old thing you used to do. So we're making engineers better. That would be like an obvious current example. It seems like we haven't yet gotten into the world where we're going to start using this, this technology to do things that we couldn't do before with deterministic code. I'm curious how you think about that side of like pushing the envelope.
B
Why is that? Like, I feel like that's exactly what we do with it, right? Is the cost to develop things is so high that you can't do it or the cost to like have someone go buy stuff for you. It's like, okay, great, you might have an executive assistant and you can tell them to like do this. But like the vast majority of people don't. And now GPT is on the cusp of doing that. Go buy this for me. And they'll find the best thing and they'll buy it. Right. And you just trust them enough. It takes time to trust them. But like these things, tech is the most deflationary thing in the world ever, right? In terms of quality of life. It's. It gets cheaper way faster than the revenues go up, but the revenue still go up. That's sort of like the fundamental basis of semiconductors, of tech, everything. Are we doing things that we can't do before with tech? With AI? Sure. I mean like the COVID vaccine was created with AI. AI drug discovery. There's like entire briefs about how it was done with AI. And guess what? If like another pandemic happened, I bet it'd be even faster to discover the vaccine if there's a vaccine for it or whatever. There's all these protein folding things, there's all these optimization things. There's AI for material science and AI for all these other aspects of society, there's optimization. Maybe it's not in your face, right? It's like, oh my God, AI just made this drug. No, I mean AI worked with the researchers who made the COVID vaccine and so we didn't have to all be stuck inside forever or whatever. Right.
A
Point being where it's already happening, right.
B
And the whole like use the new thing to make the old thing faster. It's like, sure, but if I go back three years, how many people would it have taken to deploy a image recognition model that looks at every data center in the world and looks at what's the pace of constructions in and what equipment they have.
A
This is something you do, this is something we do.
B
Right? It's like, it's like how many people would that have taken? I don't think it would have been possible. My business model, like, this is the second highest revenue product for us, would not have been possible if it weren't for AI vibe coding, like being able to dig through permits and regulatory filings, being able to run image recognition on satellite photos, like this would not be possible. This business is not possible without AI. Am I using it directly? Like, oh, are you sure? I'm scraping through the regulatory filings and permits through with LLMs and then manually reviewing it with people and doing that. The same with like the images, satellite images, yes. There's a lot of stuff that, you know, the image recognition model does, but we just also look at them a lot and then it's like compiling them and selling a spreadsheet that you get bi weekly reports on like all the data centers or what's changed? Or like, hey, actually this Amazon data center, the fans are starting to spin. So actually there's revenue going on from this Amazon data center. So we can forecast Amazon's revenue. It's like, oh, okay, like this is relevant. I don't think this would have been possible just a few years ago. There's demand for it because everyone wants to track this and it's so important, but it's like it begets each other. And I think like, at least in my daily life, it's like, I don't think I could have taken that step from where I was in a business which was still a research provider. But like, that is a monumental jump. And like being able to do it with three people out of the gate versus like fifty or a hundred. Like I don't know how many people it would've taken, but I don't think it's possible. And it's like mainframe migration is something people have always wanted to do. Amazon leaving Oracle took fucking 20 years and they wanted to do it 20 years ago. And they have their highest revenue products after EC2, the next four were like database products at AWS and yet they still freaking used Oracle's database because it's hard. Mainframe migration can be way faster. Or like migration from one tech stack to another can be way faster. You can make your business more efficient, you add more automation. I think as far as like, yes, the tech exists. Go to all the businesses around the world and it's like they aren't using the leading edge of what they could. They aren't using what a 2020 company could have done without AI. No one is doing that. And if they did, they'd be so much more efficient. All of these things just take too long to build. They're too expensive to build. You have your existing processes, how do you hand them over? How do you switch them over how do you teach people to do this? AIs can help you with all of this. You can take the pessimistic view of like, oh, we're just doing the same things. But it's like the value here is.
A
Humongous if it's tokens on one end. We haven't talked much about, like, watts at the very beginning. And power. What are your thoughts on, like, what is going on here and how humanity is responding to this crazy new demand for just raw power?
B
The first approximation is that it's not that much power yet. Data centers are like 3, 4% of the US power data centers, period. Of that, two is regular data centers and two is like AI data centers. So like, that's nothing, dude. Like, that's literally nothing. It's just we haven't built power in like 40 years or like, we've transitioned from coal to natural gas more and more over 40 years. So mostly we just don't know how to. And there's these regulations and like, there's not enough labor and like, the supply chains for like, ever. Nova and their dual combined cycle gas reactors are not there yet. And same for like, Mitsubishi. And oh, like this random UV curing process for transformer coils is like, there's only this much capacity and it takes two years to build them. It's a supply chain thing. It's a like, lack of labor thing. It's not that it's actually that much yet. At the end of the day, it's like, okay, wait, wait, you're telling me OpenAI is making a data center with 2 gigawatts and that's like the entirety of the power consumption of like, Philadelphia. That's insane, right? But like, in our slack, we used to get like, excited about finding like a couple hundred megawatts new data center. Now it's like, if it's not a gigawatt, the guy who leads that team, he was like, oh, it's just 500 megawatts. I also agreed immediately. Then afterwards, I was like, wait a second, dude. That's like a lot of power. That's like, how much? Wait, 500 megawatts is $25 billion of capex. Like, come on. Like, once you put in the GPUs and everything, right? It's like, that's a ton of money. But like, snore, because there's so many of it happening. And so when you think about what happens to the country, what's happening here is like, we're learning how to build power again. We're getting the supply chains to do it again. We're reshaping the grid. There's all these challenges with these AI data centers with regards to demand response and making grids unstable. AI workloads because they change so much so fast. Especially training, you can just cause like brownouts or blackouts. Especially if the grid doesn't have enough inertia or if you're not putting enough things to dampen it in between the workload and the grid. And even if it's not destroying it, the grid runs at like 59 hertz or whatever, right? If you skew it up and down too much, these transient power responses, your refrigerator will break down sooner. The motor's in it and you might not even know it because the data center's nearby. So like all these things, because maybe it's not like turning off the power, but the hertz is not perfectly 59. It's like oscillating too much and then the motor in it and the coil windings and all this is burning out faster because of this variation. Like a car, right? Like if you were to floor a car on floor instead of just like accelerating slowly. Like, think of it the same way. You're flooring it, letting go, flooring it, letting go. There's so many like third order effects here. The funnest one is just that we're building power, right? And it's like whether it's gas, which is a lot of it, whether it's through efficient dual combined cycle reactors, or it's like random generators that are not nearly as efficient single cycle, or even worse, diesel generators. There's a company that's putting a bunch of truck engines in parallel. Diesel truck engines. Because the industrial capacity for diesel truck engines is huge and no one's tapped it yet. So why don't we just put a ton of them in parallel and create this power generation thing right here, right? And then you're generating power with a bunch of diesel truck engines in parallel. And then you're able to power a data center, right? Like, okay, great, because I can't get turbines. There's all these like crazy things people are doing. Elon buying some power equipment from Poland and shipping it to America because he needed that power equipment. But whatever, couldn't get it here because the supply chains were weird. I'll just get it over there. Any lacks capacity in the supply chain is being eaten up immediately. And then everyone's like, okay, let's invest. So GE's like, I'm going to double my turbine production. It's like, holy crap. Okay, that's awesome. And Mitsubishi is doing the same thing. And, you know, you go down the list, it's like my Transformer supply chain is expanding like crazy and, like, fully sold out. So I'm gonna go to the Korean guys, and that's fully sold out. So I'm gonna figure out how to get the Chinese stuff in, even though it's not exactly, like, what people want to do, right? It's like there's all these weird things, but the funnest one is that, like, electrician wages have, like, doubled for mobile electricians that can work on data center stuff. If you're down to move to West Texas, it's like 2015 again. And like, being a fracking guy, you don't need to be super duper skilled. You can go to West Texas and make a shitload of money off of fracking, go to the Permian. But there's not enough of those people. That's why, right? Like, if there were enough electricians in West Texas, there are enough electricians in America, we could build these data centers faster. All these little supply chain quirks. And everyone's supply chain is different because the way Google makes their data centers different from the way Vantage makes their data center, which is different from the way that Edge Connects makes their data center, which is different from the way QTS makes their data centers, which is different from the way Amazon makes their data centers. The other three companies I mentioned, like, rent data centers to the hyperscalers mostly. All these companies make data centers differently, and so their supply chains are not exactly the same. And so you get all these weirdnesses in all these different supply chains. And it's really fun, but it's also, no one really knows it because everyone who tracked the supply chain or, like, knew it. You go talk to, like, power people. It's like on one end of the spectrum is like Dario. And then you take a few steps and it's like the average ML researcher. Then it's like, me, and then it's like you in terms of how bullish we are on AI. And the guy at the power utility is like, over here, there's like a few more people. There's like the standard New York semis investor, then there's the New York not semis investor. And then there's like the Sequoia guy who thinks that A has been a bubble since 2023. And then there's this, this utility guy. You know, this utility guy is like, I'm not building power. Power doesn't go up, whatever. And then you have, like the regulations around it. How can I build a data center in this density? Because now all of a sudden the grid's like. So this has happened in Texas, ercot and it's happening in pjm Northeast kind of area. Ish. These two grids are putting these rules where, hey, we're gonna actually say, hey, big loads. We can tell you 24 hours or 72 hours beforehand. We're gonna cut off half your power, which is fine. Right? Because like, we need to.
A
Because we need for something else.
B
Yeah, like people. People need to have their homes powered. So anyways, like, it's like in Texas and in pjm, you can cut half the power if you give them a notice. This is SB6 in Texas and PJM is considering it. If you do that, then you need to turn on the generators that are there on the site. It's often diesel generators. Maybe it's gas, maybe it's like hydrogen stuff. There's all sorts of weird stuff people try to do just to ramp up power for that period of time, but then all of a sudden, oh, crap, the density of my generators means that I fail the air permit if I run the generators for more than eight hours. So now what do I do? Right. It's like there's all these, like weird regulations. Even if it's like Texas, there's like so much interesting stuff happening because we've decided to build again. It's really fun that watch it and then watch the supply chain and try and like, from my perspective, provide the data so people can trade on it or provide the data so people can adjust their supply chains industry wise. Right. And, you know, people who go to your audience, they can trade on it.
A
If I were to line up all the stages of this between the US and China, so, you know, power semis, mods, models, applications, etc. Where do you think the most interesting differences are? Like, what are the storylines between US and China at those various layers of, like, the AI stack that are the most interesting to you when you look.
B
At China, they are very formidable competitor. I think if we didn't have the AI boom, the US probably would be behind China and no longer the world hegemon by the end of the decade, if not sooner. I think most people would have agreed with that. And a world where the US is not the hegemon is a bad one for Americans at least. You know, I'm sort of like a bald eagle, like, carrying, like American. I'm like, it's bad for the world. But you know, like, that we're not the world hegemon because then we can't spread freedom without AI. Like we're definitely just going to lose. Our supply chains are slower, they cost too much. Our debt is like unsustainable. Our economy's not growing fast enough to maintain the level of debt we're over consuming relative to what we produce. The financialization. There's all this like dearth in like the US in terms of like social instability partially because of income inequality, but also largely because of the visual nature of income equality and the tendency of people to flaunt their wealth more because of social media and how that hacks people's brains. And then also like, because the algorithm serves people different content. We're drifting further and further apart in culture, right? Monoculture of everyone watching the same movies in the 50s and 40s and 30s versus like now. You and I are pretty similar and our feeds are completely different. So think about someone who's not in this world. Their feed is like insanely different. Our memes are different. I think the US would literally fall apart if we don't do something. And by do something I mean like AI has to dramatically accelerate GDP growth because once you start talking about dividing the pie, you're screwed. It has to be growing the pie. And the US really, really needs AI. China's view is like, I think it's like a little bit different, right? They don't necessarily need AI to win. If AI doesn't take off like a rocket, whatever that is, even if it's not AGI, it's like you can do the software. $2 trillion in value of software developers, right? Whatever it is, China was always played the long game. They always like, yeah, we're going to really screw over consumers by taking all this wealth and dumping it into EVs and losing money on gaining market share. And the EV industry has still not created positive market value for the Chinese economy yet. Absolutely will at some point. I think it hasn't created it yet. Maybe it started to, but like, rather we want to like kick all of the European and American and Japanese and Korean car companies out of the global automobile market. And they're doing that. They've always played this long game. They did it with steel, they've done it with like rare earth minerals, they've done it with solar panels, they've done it for producing phones, they've done it for PCBs, they've done it for so many freaking industries incrementally. They're just going to Continue to do that and then they're going to win because they work harder and they're on average smarter, right? I would say on average in the sense of their government is more intelligent in the way they allocate capital and the way they think about things. We're smarter in the sense that we get to brain train the rest of the world and we get to maybe allocate capital more efficiently because of our more free market capitalism. In some ways, maybe the whole point here is that I don't think most people have as pessimistic of a view as this as I do. I also view like, if we don't have super powerful AI systems, we'll run out of easily accessible like nickel and cobalt and oil and natural gas, and we won't be able to make solar panels efficient and fast enough and everything will just start to get more expensive and the pies will reduce and will also tear each other apart in that way. So I have like a very pessimistic view that if we don't accelerate, we die. If that's your worldview, then like, we really need to win AI. And China's worldview is sort of like, you know, they want to be the world hegemon. Like, who doesn't want to be the world hegemon? But there's only two countries in the world that can legitimately do it and legitimately are trying, right? The US and China. The way like the Chinese AI ecosystem thinks about this is we don't necessarily need to have the biggest compute cluster. They're still not pilled enough that. When OpenAI is trying to make a 2 gigawatt data center full of GB2 hundreds and GB3 hundreds and they'll have another 5 gigawatts of Vera Rubin or whatever, right? Like all these different chips and those chips are way faster than the chips that will sell. China slash the chips China can make themselves and China's deploying less of them. The dearth of compute is huge. We're kind of doing what China's done historically, just dumping tons of capital into something and the market becomes interesting and the beneficiary is like, oh, if OpenAI, you know, they have 800 million users today. When they have 3, 4, 5 billion users across the world, which is possible of ChatGPT and whatever applications they come up with, then they can start to make money, right? It's sort of like YouTube lost money forever, but now it's the platform for watching videos across the world, right? And ChatGPT will be the same Thing. So sort of that like aggregation theory argument. But then there's also the other argument of like, you know, what if we're wasting the capital? China doesn't necessarily think of it the same way, but they are still incredibly pilled on. Like, well, we want to be able to make everything ourselves, right? So make all of the chips ourselves. We don't actually care that much about making all the chips ourselves. Sure, Trump's doing the tariff. Sure, we have the CHIPS Act. Those were drops in the bucket compared to how much money China's releasing into the semiconductor ecosystem has been for the last 10 years. They're made of China plan in 2015, five year plan and then the five year plan in 2020. And then it just continues like they've dumped at least like 4 or $500 billion into this ecosystem through SOEs, through certain tax policies, through certain, like land grants, through provincial governments, through the big funds, which is like government venture funds. They're just called the big fund. There's all these different ways that they've dumped capital and then Huawei, whatever you want to call them, whether they're the government PLA or not, like, and I think there's an argument there, like, so they've dumped so much more capital into semiconductors than we have in an unprofitable way because they want to build that ecosystem. And over time, you know, it's like if you take any country in isolation, China is the one that has everything at the highest level on average. Right? Sure, they're like 30 years behind on jet engines or 20 years or 10 years, whatever it is. But they don't need to go outside of China for any of the materials besides like raw materials. Whereas like the US needs like titanium from here and like, you know, blah, blah, blah from there. Right. And the same applies to their semiconductor ecosystem. Sure, the US and Taiwan and Korea are way ahead, like three, four years. But then they also have the accumulated capital base of all of the existing equipment and all of the existing fabs. But they need to import from all these different places because it's a global supply chain. And so China is like much more concerned today about being insular than being the best in doing this aggregation theory, which is sort of maybe what ChatGPT is kind of doing. I wouldn't say OpenAI is fully pilled on that view. There are also like a lot of people who view the like, oh, we're going to make AI. That makes AI better. That makes AI better. That makes AI better. And is AGI the other view is like, you know, hey, we're going to make AI so much better that it can do software engineering and we have trillions of dollars of value. China doesn't fully believe in those things yet, I think on a total basis. But because they're so talented and they have an insular supply chain. Yes. They purchase some stuff from the foreign world. They rent stuff. They have ByteDance, who's, I think the third largest user of GPUs in the world after OpenAI and probably Meta, although ByteDance may be bigger than Meta, but third largest user of GPUs in the World, or second, maybe even ByteDance is. They have all the other major Chinese tech companies. They have all of these amazing graduates. They don't have a talent. War companies don't poach from each other. Deep SEQ engineers make a lot more than other engineers, but they're not making $10 million, even though it may be worth it. There's this real big perception difference. China could build way faster than us. If they wanted to build a 2 gigawatt or 5 gigawatt data center, they could probably smuggle a lot of chips. If they wanted to build a 10 gigawatt data center, I bet they could build it in like a few years. Whereas the US is not going to build a single 10 gigawatt data center for a while. Right. Like the total capacity of an OpenAI will be like 10 gigawatts in a few years. Optimistically, China's not. They don't have the best chips. Huawei's speed rating, trying to get better and better and faster and faster. They don't have the best memory. They're trying to get better and faster there. They do have the most power. They can build stuff way faster. We're impressed at how fast Elon does stuff. Elon slow compared to China. And I think he knows that, which is why he's maybe like the one who's like actually using the Chinese ecosystem more in terms of the battery facilities making in China. And he probably recognizes it too. There's these major differences in viewpoint and approach. And I think we could dive into any part of the supply chain. But the important thing is, philosophically, we're so different on what we're trying to do here because China wants an insular supply chain. They want to have supply chain security. We talk about wanting that, but we don't actually put the money behind it. The slot machine of where the American capital is being allocated, it's building the biggest data centers, it's training the best models. Whereas in China, the capital is being allocated into growing the EV supply chain, growing the semiconductor supply chain. Catching up in all these areas. The us sure we want to catch.
A
Up, but maybe Jensen was right that what you want to own is the end customer thing. They're doing the same thing they've done forever, which is like prepare at the base level and be behind at the customer side. The value, the value happens close to the customer.
B
But then like you get to the point of like, okay, well what happens in like three, four years? Even if the US AI is amazing, the doomsday scenario of like China decides to blockade Taiwan or even invade it or create some political instability or try and so like people talk about like Cambridge analytics and like Russian trolls, whatever. Like China could do a billion times that into Taiwan, especially with how good AI is now, and somehow subvert it or coup or blockade or whatever and we no longer have Taiwan. U.S. economy kind of free falls because we can't make refrigerators without Taiwanese chips. We can't. I mean, they're American companies chips, but made in Taiwan. We can't make, you know, cars, we can't make AI data centers, we can't grow any of the cloud. That means we can't deploy any more SaaS applications. What the hell can we do?
A
Back to going to acquire all the talent, get them over here, right?
B
I think like that's sort of like the catch 22 of this all is like if you push China too hard, they totally will. Like you back someone into a corner, they're going to start swinging. Whereas if you just bump into them at their shoulder and then keep walking like it's fine. You know, like you pickpocket them, you run away like it's fine. But if you push them into a corner, it will like blow up. But then there's all these like dynamics. Like if you don't push them a little bit away from having the chips because they have all this talent, probably half the AI engineers in the world are Chinese, whether they're immigrated to America or not. You look at the list of meta superintelligence, they poach 80% Chinese people and how many of them are from China versus being ABCs, American born Chinese, you could look it up, but it's push China too hard. They have the talent, they could go crazy if we no longer have Taiwan, actually China could build a way bigger cluster than us. And if compute is all that matters, they could do all of these things and they own the means of production for everything. There's this like challenging aspect of geopolitical risk. That's why people don't want to invest in tsmc. But it's like almost like you can't invest into Amazon or Apple or Google or like Microsoft if you have geopolitical risk, if you believe Taiwan has risk. And so it's like, yolo invest in TSMC. I know a lot of people, PMs are like, oh, you can't invest in TSMC because geopolitical risk. And it's like, no, dude, you can't invest in fucking Apple.
A
Who is your favorite AI bear? Like, someone that is far distant from you on just their perspective on the direction of this whole thing that you nonetheless like and respect the most.
B
I'm having a hard time because everyone I like, think of is just like more bearish than me, but they're not like bears. There's some of the, like AI researcher like gods, Yann Lecan and like these kind of people who are AI bears. I, I respect them. I like their ideas. I think they're completely wrong. But.
A
And their argument is what? Like, if you had to summon, you.
B
Know, the ways we're doing this won't work.
A
LLMs on scale or.
B
Right, but. And it's like, okay, yeah, yeah, auto regressive. Like, it's like, okay, autoregressive pre training on the Internet doesn't work to get you to AGI. Not gonna lie, a lot of people believe that. Yeah, maybe I believed it like 2023. Right. Like, let's be clear, like, you're 2022. Like, or not AGI, but like super powerful AI systems. That's obviously not the case. Just pre training on web scale Internet will not get you there. So he was completely right on that. But then like, he'll turn around and be like, RL systems. And all these things are not the right way either. Right. Like, you know, it's sort of like the. No buts. There's some investors that I know who like, think this is bullshit, but they're just making tons of money on it anyways.
A
Think it's bullshit in what sense?
B
Like, it's like, like it's an overspend and this is a dumb way to do it. And like, yes, I bought Oracle before earnings because we see these deals happening, whether it's through our data or some other means. They saw these deals happening with OpenAI, but we don't think OpenAI can pay for it. But we know the market's perception will be this and therefore the stock will go up and so we'll own it there's investors who like kind of have that viewpoint and they're like kind of ruthless. I guess I would respect them to some sense. But it's more and more the level of evidence that's there that this stuff is going to get super powerful, hard to not right again, like this AI bubble is going to pop. Because this podcast man, I assure you.
A
It'S just the market return.
B
No, no, no, it's a coin talk.
A
What startups interest you the most? Like real startups, relatively young, not to market with their product yet, like that sort of thing. I'm especially interested as potential accelerants to all of this. Like people that are attacking some interesting bottleneck.
B
So one of the startups, it's the most recent investment I've made, it's called periodic labs, mostly OpenAI people, it's a Google guy and you know, the couple material scientists. The area of AI that we've all been talking about is like large scale web training. Rl, all text, all digital. God. But you know what would drive a shitload of value for the economy besides automating programming of everything is like if we just like came up with a battery chemistry that was like 25% more efficient. Like, holy shit. You know, like the main, main cap against like us all having like face glasses and things like that is like batteries are not good enough and the power dissipation. But the batteries like, sick, terrible. So you have to do make all these compromises. But if I could have the processing power of like a laptop on my face, we'd be way further ahead. And like, and then if we all had like these like super powerful machines attached to our face, we could do inference on things and recognize and interact with the AI at much higher speed and velocity and, and that dramatically improve our productivity, right? Like, things like this are like so gated by hard tech, moving faster. And so what Periodica is trying to do is they're taking this RL paradigm, but they're trying to do it with like real world, right? Like test chemistry for something. Here's an optimization, here's something that the model spit out. You can do it in like all the like CAD type programs, not, not exactly cad, but like computer Aided design programs and test it simulators and test it. But then you also want to test it in the real world and then feed that feedback back into the model. And so you do this like chain of instead of purely being digital. Which is why, right? RL is like really hard because you need to generate a bunch of responses Test and then train the model. So the flywheel is so freaking fast. The flywheel in the physical world is so slow. You mean I need to make a chemistry, I need to try this, I need to test the thing, I need to input it back in and keep calibrating and keep doing this. It's so much more expensive, it's so much harder to do. But actually there's a ton of low hanging fruit there. I bet in terms of like large scale experiments, whether it's drug discovery is one angle, right? But design of like one of the most complicated chemistry ecosystem in the world is semiconductors. There are tons of things that we know are the next thing to build and now the next five years of work is actually building it, right? And it's like next next five years is actually getting the chemistry to work, getting it to be a low enough yield, a low enough defectivity or it's. We can't, we don't know what material could do this. Do superconductors exist? Do they work? Right, like these are the sort of things that are like the fly I should be investigating. Yeah, yeah. I think that that's, that's one. I.
A
What about in the hardware world? Like just in the pure hardware space, attacking some other interesting bottleneck.
B
I mean I think, I think like when we talk about like where we are in tech. Semiconductor manufacturing is super space age, right? It's like the most complicated tools we make in the world. Including like tools that cost like half a billion dollars, right? Like and they're super, super amazing feats of engineering. Then the software behind them all is like really shit, right? So it's like, you know, you could, you could accelerate all that in the hardware world. The biggest challenge is that like I'm not really a big bull on the accelerator companies.
A
I've never been defined how accelerator companies, right?
B
Like companies competing with Nvidia. I'm competing with Nvidia, with TPUs, with Trainium, with AMD. Not a big bull on those kinds of companies. There's just too many things to do. It's too capital intensive. There's not enough of a revolutionary leap. There's too many predicated things. I wish it could happen, right, It'd be fun. Maybe it does happen, but it would take hell of a badass thing. But I think there's a lot of individual parts of the supply chain which are not space aged. Nvidia space age, yes, it's the biggest value owner today. But their supply chain has so much old shit. Whether it's their supply Chain or the hyperscaler supply chain transformers have not changed in like fifty hundred years. Right.
A
There's a guy building one a company in that space.
B
Solid state transformers. Right. Like things like this. Yeah. So there's all sorts of interesting things there on the power delivery side, all the way from super high voltage AC all the way converting all the way down to, you know, 0.8 volt or 0.4 volt DC that goes into the chip. There's so many interesting companies in that space because there's so much innovation to be done and there wasn't that much of a need to do innovation before.
A
You don't get high return on it. Now you do.
B
Yeah. Another area is like networking between chips because as we extend context length, the memory requirements become bigger and bigger. And yes, new memory technologies would be awesome, but DRAM is an industry has so much invested capital, goods, so much existing factories, it's really hard to attack. But networking is less so. And there's more breakthroughs that can be done in networking that, okay, maybe you don't have better memory technologies, but you've tied the chips closer together so you can use each other's memory on the problem. So the simplest one is like NVL72 from Nvidia Blackwell is them condensing a lot of networking into one. You know, sort of the chips are networked together, but there's so much more that you can do in terms of the optics space bridging the gap between electrical connectivity and optical connectivity. Because like Nvidia created Blackwell, they had a ton of manufacturing problems and challenges with it for their supply chain. Balance sheets went up for various companies in the supply chain were building servers and stuff because they're trying to figure it out. AI data center deployments were slowed because of these challenges. There's reliability challenges because these things are connecting to each other at absurd bandwidths. Every chip in the rack can connect to every other chip in the rack at 1.8 terabytes a second. Right. If you think about how much data that is, the amount of bandwidth, like, is so high for connecting these chips together. Like, you can't fathom what a terabyte a second is. You can't fathom what a gigabyte a second is. Okay, a gigabyte a second is like a video, right? Like, or like a megabyte a second, but actually that's a million bits of information. Oh, what's, what's a byte a second? Okay, you can understand what a byte a second is because that's eight Bits, Okay, I'm transmitting eight bits to you back and forth every second. That's pretty fast. That's what it used to exist and it's like where we are now. There's still tons of innovation left to be done there. I think in the hardware space I think I'm super bowled up on advancing the techniques. Like I think part of the reason intel is behind is also that data sharing internally was terrible. Just within the fab, the lithography team doesn't want to share their data with the etch team and that data can't leave the FAB and go to an AWS data center to run correlations and all these other things. So you don't learn from the experiments you do fast enough. Now TSMC is not perfect here either. They won't send their data to a cloud either. But like you know this experimentation experiment, analyze the data, figure out the new experiments, cycle is slow and how you break that is actually like changing these companies culture which I think Liputan is trying to do. But also it's a lot of it is like building better simulators and simulating the world more accurately. And so there's all these world model companies and like world models that dives into the software realm again. Right. But actually some of these world model companies are actually just focused on. So world models generally are like hey, I'm going to simulate the world I'm going to walk around in. The common one I think is Genie 3 that Google made, right. Where you can walk around in a Cape State and you walk around the world and you can like see cars driving and like they're talking about like. But it's like actually what a world model could also be is just like simulating molecules but not through classical methods. Right. It's not computational fluid dynamics, it's the model experiencing this enough and then running, training a model on physics and then feeding that back through and doing it through a AI method instead of. And so world models can be doing any sorts of things. You can make a world model to train robots, how to pick up cups, or you can make a world model that is simulating some chemistry in a chemical reaction or a fire. Right. Like you can do all sorts of different things. So there's a lot of world model companies out there. Some of them are really interesting especially when they're targeting the physics and reality of the world. I love robots robotic stuff as well. I think there's so many like venture scale investments. But also I think like most of the cool innovation is just happening at big companies or already existing companies. Right. That's just the nature of it all actually. TSMC is doing the most cool innovation and Nvidia is doing the most cool innovation and like Amphenol is doing cool innovation. It's like all these companies are doing cool innovation.
A
Could we do like a quick speed round where like I say some company and you just give me like, you know, a sentence or two on like your impression of them, how you feel about them in this moment.
B
Yeah.
A
Start with OpenAI.
B
Oh yeah, super. Awesome.
A
That's it.
B
I mean we've talked about them all day. You know, I was worried it was.
A
Going to be like random anthropic.
B
I'm actually more optimistic on anthropic than I am OpenAI.
A
Why?
B
Their revenue is accelerating way faster because what they're focused on is more relevant to that $2 trillion software market versus OpenAI is split between. Yeah, they're going to do that. They're also going to do these other things, but they're also going to do like target AI for science and they're going to also target AI for the consumer app and doing the like take rate thing, which all of these businesses could be amazing and OpenAI maybe executes on all of them. Yeah. But anthropic is definitely executing on the software side better. Amd, I love them, but they're pretty mid.
A
Could they not be mid? Why do you love them if they're.
B
You know, when you grow up like building computers and like liking computers and like AMD's innovating and they're always like fostered this underdog mentality against intel and against Nvidia. Evil intel and evil Nvidia, you know, and like AMD is like, you know, the nice company that's like the underdog and like they're always, they've always got the, oh, they're going to take share from them pieces. It's like, it's like, it's hard not to love them, you know, like, and I know so many people there and I like the, I like all these major hardware companies. Right. There's not one that I don't like as in terms of the people. But like AMD's got a soft spot because like I think that was my first multibagger as well. Like $2, like my first multibagger. I can't own stocks anymore because compliance. But like, you know, like it was my first multibagger. Yeah, right. In terms of like I bought it at like 2 and then I sold it at like 10 and then I bought it at like 9 and I sold it at 15. And then I bought it like 9. I sold it like 14. And I bought it like 10 and sold it like 30. And bought it like 15 and sold it like 80.
A
Like, I just kept doing good memories.
B
It was like. It was like, great. I mean, I was like. I had, like, no money. Yeah, I was like. It was like. It was like high school and college and, like, the initial stages of working, but it's like, you know, sorry for the rent, but I fucking love amd, you know, I also love Nvidia.
A
But Middle xai, they're in a real.
B
Danger of not being able to raise capital. Of course everyone's gonna give Elon capital, but, like, the scale of capital required for him to keep up, he can get the next bet. He can get to Colossus too. This mega data center that he's building. Largest data center in the world. When he builds it. 300,000, 500,000 Blackwells. Right? Like, it's gonna be really great. But if he doesn't figure out a business model besides, like, Porn Bot, which is what Annie is, which also, I think he's monetizing the wrong way. Like, I think you could monetize it so much better. How you've captured the zeitgeist with a cute anime girl that talks to you in a cute voice and, like, will riz you up and. And you've got, like, these users who actually fall for it. And it's, like, not realistic enough yet, but it will slowly get more realistic. You're selling, like, outfits for the same price. You should make it a random. Like, hey, you have the chance to buy the outfit of her, like, looking like this one anime girl from this one anime. Yeah. Hey, you have the chance to buy this outfit. That's her in a nun suit and also will change her personality. And it's like, yeah, instead it's just like, you can buy an outfit. But I think, like, he has to figure out, like, some business model beyond just this. Although I think this could be a big business. Like, he should partner with OnlyFans and make manifestations of the OnlyFans creator that are Annie. And then he subsumes the OnlyFans platform into X, the Everything app, and be like, xxx. Like, you know, it's like, you could. You could just like, trojan horse, like, OnlyFans away. Because, like, the discovery mechanism for OnlyFans is like Instagram and Twitter, as far as I understand. And, like, you own one of them and you could partner with the biggest onlyfans creators to like get them over, right? They don't have to respond. Like they can also just like train a model that like acts and looks like them and talks to them anyways. Like there's all these different monetization methods and I don't think that's what he should only focus on. Right. To be clear, my point being, and sorry, this is way more than two or three sentences. Xai can get to the next stage of compute. They won't have more compute than OpenAI. They don't have more compute than any individual company, Google, Meta, et cetera. But they will have the biggest individual data center. They'll have a very focused team and what they do with that, they have to do something like really big. Otherwise they will fall behind in the race. And Elon will not let that happen. Like he doesn't want that happen. But he can subsidize and fund this round. But like as rich as he is among the richest people in the world, maybe the richest, he can't go to a 3 gigawatt data center unless he gets capital, which he can't do unless he gets revenue and fundraising. Oracle. Oracle's gonna make so much fucking money. If you believe, if you believe OpenAI is successful, then the question is like people who loan money to the neoclouds are idiots. Like why would you ever loan money on a cluster? Either you get paid on your loan or you don't. You should just own equity. Like people who loan money to Core weave only in the early days are so dumb. They should have just bought equity. If they believe the thesis, they should have bought equity. And those who bought equity made tons of money, right? The same applies to owning Oracle. Maybe they're going to make a ton of money if OpenAI is successful, but if you think OpenAI is going to be successful enough to pay $300 billion to them, how many users do they have and what's that IP worth? And also there's reasons you shouldn't own OpenAI. Like the Microsoft stuff and the risks around Anthropic and all these things. But in most worlds where Oracle gets paid $300 billion by Opening Eye, OpenAI is like a $10 trillion or $5 trillion company or something crazy.
A
We'll end with the old last generation, best two business models. First being Meta, I think Meta's got.
B
The cards to potentially like own it all. I don't know if you've seen these new glasses. They came out with the screen. As we go through the history of Computing you have, it was like punch cards programming, then it was like DOS terminals, right? And then it was like, oh, you have gui's and mouses and keyboards, then you had touched. And the next paradigm, a human computer interface is we don't actually have to touch it at all. We tell the AI what we want and the AI will translate that into reality, whether it's hey, send an email to this person, send a text to this person, that's basic stuff, you can already do that with Siri or whatever, right? But like, oh, go buy this. We're so close to all of these things. The input method into a computer changing entirely. And the only company in the world who has the full stack from good hardware, that is what Meta just showed with their glasses, with the screen. That plus the good models, plus the capacity to serve them, plus the knowledge and know how around recommendation systems to know what content to put in front of the user. It's not just generating the content, it's not just interpreting the user's word and taking actions, it's also putting the right content in front of the user. It's all four of these that you need put in front of the user plus the capital. Plus the capital. I think Meta is so close to being the only company that can do that, but there's a lot of risks there too, right? So I like Meta a lot.
A
Google finish it off.
B
It was pretty bearish Google like two years ago, but I'm like super bullish Google.
A
Why would change?
B
They're waking up on every front. They're taking the TPUs, they're selling them externally, they're taking the their models and they're actually like competitive on them and they're training much better and better and better. They're being aggressive on infrastructure investments. There's still a lot of dysfunction throughout the company, you know, but they do have the hardware business that they can pivot into this. They won't be as head as Meta is, they won't be as good as Apple is. But like they do have Android, they do have YouTube, they do have like all these IPs they've searched that can come together when we turn to that next interface of consumer. But also they can also dominate the professional sense too, potentially. Whereas Meta I don't think can dominate that professional sense, only the consumer sense. And I think Google's well positioned to go capture both markets or a meaningful share of both.
A
I feel like we've covered like an incredible amount of ground. Is there anything that we haven't talked about that you feel is like really critical to what happens in the future that we didn't cover.
B
I think the question of everyone that I constantly get asked is like, okay, Dylan, you know, you're lucky. Your obsession is that you loved hardware and you like followed it and you followed the supply chain and you built this business on it. But like you don't follow the software side nearly as much and all the value is going to get created. There's. When is that coin going to flip over? But I think the thing that most people don't realize, I'm not saying hardware is going to dominate forever at all, but like software is not the same as it was five, 10 years ago. You've had dramatic changes in software and the business model is going to change as well. Go back like five years, three years, whatever. When SaaS was the darling, let's call it October 2022 I think was when SaaS peaked.
A
Multiples peaked in April 21st.
B
April 21st. Okay, okay. Multiples, multiples. Okay, okay. I just remember November 22nd was when shit like hit the fan.
A
The nasdaq high was 1231, 21.
B
It was November. Okay, it was November 21st. Sorry, November 21st. I remember SaaS started tanking and at the time it was like mostly like they were over earning and all these other things. Doesn't matter. The interesting thing about the business model is that it is such a good business model. When your R and D is sort of this, it stays flat, right? And you grow a little bit. But really R and D doesn't flex that much. Your cogs are super low. The flip side is in a SaaS business, your customer acquisition cost is quite high. And so when you look at what like certain companies have done when they've acquired a business is they've just crushed the customer acquisition cost or crossed sgna. They made the business amazing. Whether it's like Broadcom with VMware and stuff, it's not really customer acquisition. They just had a bunch of wasted sga. But like this sga, this customer acquisition that was most of your cost, R and D was small but not like crazy. And then once you hit critical mass, you just cash, cash, cash, money, money, money, money, money. But software changes a lot. When the cost to build that software that you have tanks like crazy. You look at non US markets and the prevalence of SaaS and it's very different. I will bring up China as an example. On a counterpoint, China doesn't have that much of a SaaS business. Actually their cloud business is pretty small. Too right. Relatively to the US despite them importing tons of CPUs and storage. Historically most people just did stuff on prem and designed their own software. Because the cost of developing software in China was so much less than America that the SaaS business model didn't work as well. People could just build rather than rent it out and buy. And that creates inefficiency in the market. I'm sure those weren't the best of breed solutions always. Anyways that's what the software development costs may be like. Software developers in 2015 in China were getting paid maybe 1/5 of the US and they were maybe twice as good or something like that. So it was 10x lower cost of software. I'm making up numbers, you know they had 10x lower cost of software. And so SaaS never happened, cloud never happened. And at least as big of a way as it did in the US and around the world for all the companies that have that same economic reality. And that's despite the outsourcing, right. To India and and Eastern Europe and to South America, etc. You changed all of this with AI software development. Right? And AI SaaS products generally, not just AI software development. So there's two sort of coins here. So AI software development tanks the cost of building a competing software stack. Do you now move to a world where actually I can just build instead of buying slash renting. And the cost is very low for most functions. I can just vibe code it out that level of what you can vibe code out at very low cost source. And I have a way more software developers in the software engineering market. Like there's a ton of people out there who don't have jobs. And so like that's one angle building the SaaS product is easier. Two is if you are a SaaS business and your customer acquisition cost remains the same and Most businesses in AI and in SaaS are going to remain having a high customer acquisition cost Sales is hard. Breaking into a cumbency is hard. But now you add this AI part of it. You've now added a humongous cogs. Your cost of goods sold in any AI software is really hard and really big. And this is partially why I think Google also has an advantage. They have the lowest cost of goods sold for any token of any company because they have their own vertical stack on TPUs. But anyways, coming back to this, because you have this high customer acquisition cost and you have this high cogs and then the cost of anyone developing it themselves or competitors in the market means you're going to have a very fragmented SaaS market or they're just going to build it themselves. And therefore you never hit the escape velocity where your customer acquisition costs and your R and D get amortized. And because you have such a high cogs, your amortization point means your net profitability is actually much worse. And so I think the era of like software only businesses is really, really tough. In the age of AI now already scaled businesses can do great. I think YouTube is going to have its glory days and I'm sure it'll always be amazing. But with the cost of generation of content falling and falling and creating content, he who controls the platform is going to win and win and win and win. The functionality you build within Salesforce is actually going to be like way less like what you can build on your own or whatever it is. I'm not saying it's a take on Salesforce itself specifically, but I think many software businesses will have a reckoning with the fact that their cogs is going to soar, their customer acquisition cost isn't going to fall and they have a lot more competitors and so then they don't hit that escape velocity. And I think that's the thing that maybe software. It's something I've thought about. There's a couple of people in my company, Douglas o', Laughlin, he's the one whose idea this actually is, but he pilled me on this along with some of the ideas I've had on the cogs of tokens and tokenomics and whey. There's different people on my team whose idea it is, but I get to steal it and say it. I think this is something most people don't recognize.
A
Yeah, yeah. The fundamental economics of software has to change.
B
Yes.
A
And it's changing. This has been incredibly fun. I love, love learning from you and listening to you and reading what you put out. I think you're just one of the most energetic and awesome thinkers in this whole space right now. So thank you for all the work you've done. When I do these, I ask the same traditional closing question. What's the kindest thing that anyone's ever done for you?
B
It'd have to be my brother. Everything he's done in my life, I've been a shithead my whole life and I still am a shithead. Every time he like pulls me back on path, he corrects me. He loves me unconditionally. I think my brother is probably the most. He's done the kindest things for me. Right. And I've Been an like so much of my life. Right. Like unconsiderate and like everything. Right. He's just always been there for me and always been like, why are you guys. Why?
A
Yeah, if you're aware of it, it makes it interesting.
B
No, no, it's terrible. Yeah. No, but what it is is like, I don't know, like the. And maybe this is like the mo of like who I am and maybe that's why I'm like a good thinker. But like I. I like vibe really hard and I'm in the moment really hard and I digest tons of information. But I'm very like bad at like task orientation, remembering to do specific things. Like, I'm very bad at those things. And thankfully I've like been able to surround myself in my life, whether it's through birth or not, with people who help me with the things I'm bad at. Because I'm very bad at a lot of things. When I don't like, call people or like, be considerate of what they're thinking because I'm just vibing and I'm doing whatever, you know, I'm like focused in on like this path. That path ends up hurting someone else. Right. Whether it's like, hey, I didn't call someone or I didn't like think about their feelings when I did an action or when I said something, but that makes me an asshole. And yes, I should be more conscious of this and I try to be, but it's like, it's just one of the things I'm going to wrestle with in my life forever. And a lot of times I don't even realize I'm being a freaking idiot until my brother's like, you're a freaking idiot. That's the kindest thing anyone's ever done for me is like, my brother through my whole life. I love it.
A
I love it. Wonderful place to close. Thanks so much for your time.
B
Thank you so much. Yeah.
A
If you enjoyed this episode, visit joincolas.com where you'll find every episode of this podcast, complete with hand edited transcripts. You can also subscribe to Colossus Review, our quarterly print, digital and private audio publication featuring in depth profiles of the founders, investors and companies that we admire most. Learn more@joincolasis.com subscribe.
In this episode, Patrick O’Shaughnessy interviews Dylan Patel, Founder and CEO of Semianalysis. Dylan is renowned for tracking the semiconductor supply chain and the AI infrastructure buildout with a level of detail that encompasses satellite imagery of data centers and mapping out billions in capital flows. Their expansive conversation explores the ongoing physical and economic revolution powering AI. They dig into the gigantic capital commitments from tech titans, the OpenAI–Nvidia–Oracle “infinite money glitch,” the evolving “tokenomics” of AI, reinforcement learning, power grid bottlenecks, US–China competition, talent wars, software economic models, and where real value might ultimately accrue in the AI stack.
[05:40–10:43]
[10:43–14:52]
[17:01–23:28]
[23:28–32:56]
[29:09–32:30]
[32:30–38:47]
[43:32–46:39]
[47:49–54:58]
[56:12–65:12]
[65:12–69:05]
[69:05–72:53]
[72:53–77:15]
[77:15–83:47]
[83:47–94:57]
[96:42–104:34]
[104:44–111:58]
[112:08–117:30]
On the AI Buildout Stakes
“This is about the highest stakes capitalism game of all time.”
—Patrick, [17:01]
On Diminishing Returns
"What if the next tier of performance is like a 6-year-old vs a 16-year-old?"
—Dylan, [08:07]
On Tokenomics
"Hopefully everyone using tokenomics 20 more times. It’s got to be in the title now."
—Dylan, [17:53]
On Power Infrastructure
"Electrician wages have, like, doubled for mobile electricians that can work on data center stuff. If you're down to move to West Texas, it's like 2015 again."
—Dylan, [78:53]
On the Talent War
"It is infeasible. How could this person possibly be worth that much? Well, they're running the experiment on chips that cost $100 billion."
—Dylan, [48:23]
On US–China AI Geopolitics
"Without AI, we're definitely just going to lose... The US would literally fall apart if we don’t do something—and by do something, I mean AI has to dramatically accelerate GDP growth."
—Dylan, [84:14]
On SaaS and AI
"I think the era of software-only businesses is really, really tough in the age of AI now... Many software businesses will have a reckoning."
—Dylan, [115:21]
Dylan Patel’s realism and granular market perspective expose the thrilling, high-wire act that is AI’s industrial buildout. While bullish long-term on the technology, he is sober about the capex risks, bottlenecks, and software model disruptions ahead. The US faces a “must-win” race against China, not just for global leadership, but for economic stability. In the meantime, the fortunes and failures of the world’s largest companies are being shaped by trillion-dollar bets on semiconductors, power, and AI’s rapidly mutating stack.
For more in-depth discussions and profiles of leaders shaping business and investing, visit joincolossus.com.