Summary7 min read

Odd Lots – Why Cerebras CEO Andrew Feldman Built The World's Largest Computer Chip

Podcast: Odd Lots (Bloomberg)
Host(s): Joe Weisenthal, Tracy Alloway
Guest: Andrew Feldman (CEO, Cerebras)
Date: May 21, 2026

Episode Overview

This episode dives deep into the story behind Cerebras’ revolutionary AI chip, the largest ever built—roughly “the size of a dinner plate.” Hosts Joe Weisenthal and Tracy Alloway speak with Cerebras CEO Andrew Feldman to understand the motivations, technical breakthroughs, economics, and market impacts of pursuing wafer-scale chips. They also explore the broader AI infrastructure ecosystem, the business realities behind Cerebras’ recent blockbuster IPO, competition with Nvidia, the economics of inference, and shifting dynamics between closed and open-source AI.

Key Discussion Points & Insights

1. The Rationale Behind the Giant Chip

Technical Edge: Cerebras’ chips are 58 times larger than any typical chip, making them “about the size of a dinner plate” (05:06).
Why Go Big? “Larger chips process more information in less time...that produces faster results.” – Andrew Feldman (06:29)
Memory Innovation: Traditional GPUs use memory that stores a lot but is slow; Cerebras’ massive chips use far more fast memory—even though it doesn’t store as densely per mm<sup>2</sup>, the enormous surface area enables much greater capacity and speed.
- “By building this big chip, we were able to stuff it to the gills with this fast memory. And that's why we're 15 times faster than the fastest GPU.” (07:17)
Result: Dramatic performance gains: 15x–1,000x faster than GPUs in certain tasks.

2. Overcoming Historical Barriers to Wafer-Scale Chips

Prevailing Wisdom: Previous attempts (notably, Gene Amdahl’s Trilogy in the 1980s) failed due to technological barriers.
Breakthroughs Achieved:
- Collaborated with TSMC on novel lithography
- Invented new packaging, power delivery, cooling solutions, and wrote custom software—“All of these had never been done before.” (08:46)
Cost & Timeline: First successful chip took 5 years and $500M; a decade-long journey (08:46).
Major Wins: $20B+ contract with OpenAI and close partnership with AWS.

3. The AI Supply Chain and Industry Dynamics

Inference vs. Training:
- Inference (using models) now dominates AI compute needs—“an explosion of demand on inference” (11:09).
- Cerebras’ architecture can handle both training and inference, though present focus is on inference.
Market Shifts:
- Demand for real-time, fast inference is skyrocketing as AI models move into the real economy.

4. The Economics of Speed and Tokens

Premium on Speed:
- Even incremental speed ups are valued, e.g., Anthropic charged a 6x premium for double-speed tokens (15:34).
- Traditional GPUs make slow tokens cheaply but become dramatically less efficient and costlier as speed increases—“like as you go faster in your car, your miles per gallon decrease” (16:25).
- Cerebras can offer fast tokens at vastly lower price and power usage than GPUs.

5. Supply Chain and Scaling Constraints

Supply Advantages:
- Cerebras avoids some current bottlenecks by not requiring scarce HBM memory, COAS packaging at TSMC, or latest 3nm processes—uses 5nm (20:12).
Biggest Bottleneck:
- The main growth constraint now: “powered buildings”—available data center space and power, not the chips themselves (21:40).

6. Cloud Services, Open vs. Closed Source AI

Cerebras Cloud:
- Operates own cloud, serving open-source models, offering rapid inference (23:35).
Open-Source Models:
- Typically 3–5% lower in quality than closed, but far cheaper per “unit” of intelligence (24:52).
- There’s a “battle...underway between closed source and open source” for dominance; winners not yet clear (25:38).
Future Prognosis:
- Predicts a diverse landscape—no single dominant model, but several major players, both open and closed (26:42).

7. Competing with Nvidia – The Software Moat Myth

CUDA's Diminishing Role:
- Nvidia’s CUDA was crucial historically, but “not important now” for inference; can port models from GPU to Cerebras in “10 keystrokes” (28:01).
- Two of the three top frontier models (Gemini, Claude, GPT) now bypass CUDA (28:17).
Industry Acknowledgement:
- Feldman recognizes Nvidia’s achievements but views their “moat” as less relevant for the current era.

8. Financialization & IPO Backstory

Financial Instruments for Compute:
- Data center/compute capacity is being financialized (e.g., spot/hedging markets)—“as this market...matures, there'll be people making bets on either side and financial instruments will be created to do it” (31:30).
Major Revenue Sources:
- Significant revenue from Abu Dhabi’s G42, the UAE’s national AI champion, which uses Cerebras for both training and cloud inference across industry and academia (33:04).
IPO Journey & Influence:
- Fielded scrutiny over national security (CFIUS review) and investor politics (G42, 1789 Capital/Donald Trump Jr.), but Feldman insists these played no role in IPO approval (47:09).

9. Strategic & Political Economy Issues

Data Sovereignty for Enterprises:
- Feldman foresees companies increasingly separating model providers from inference providers to maintain data confidentiality, especially for sensitive/proprietary data (37:49).
Chips Act and US Fab Building:
- Building fabs in the US is hard due to cost, complexity, regulation, and the multi-decade sustained commitment required—“strategic assets” worth prioritizing but difficult to realize (40:38).
Export Controls:
- Increasingly important; Feldman supports cautious restriction of advanced US technology from China, even at the cost of market access (43:19).

10. Life After the IPO & Organizational Culture

Becoming a Billionaire:
- “It was a big nothing for me...Far more important...we made more than 800 millionaires.” (48:16)
Balancing R&D and Public Markets:
- Cerebras plans to keep investing heavily in hardware innovation, seeing “the best work is still ahead of us.” Emphasizes love for building difficult, high-impact physical technologies (49:50).
The Spirit of Engineering:
- Feldman extols the ethos of measured, large-scale innovation—“You have to like that mistakes...are really expensive...You have to like the fact that you breathe life into a chunk of silicon” (51:10).

Notable Quotes & Memorable Moments

On why wafer-scale works:
- “By building this big chip, we were able to stuff it to the gills with this fast memory. And that's why we're 15 times faster than the fastest GPU.” – Andrew Feldman (07:17)
On overcoming engineering odds:
- “Every previous effort in the 75 year history of our industry had failed...It was a decade long process. It took us five years and about $500 million to deliver the first one.” – Andrew Feldman (08:46)
On speed as a differentiator:
- “Nobody wants to wait...The market for slow search? Zero. Dial up Internet? Zero.” – Andrew Feldman (13:26)
On transition from CUDA:
- “Two of the three leading models today use no CUDA. That's a hemorrhaging of share.” – Andrew Feldman (28:17)
On forming a company culture:
- “We made more than 800 millionaires. That's something I’m proud of every minute of every day.” – Andrew Feldman (48:16)
On political economy and export controls:
- “We have to do it thoughtfully. And we have to recognize that means some markets will be foreclosed to us. And I’m okay with that.” – Andrew Feldman (44:58)
On the beauty of physical innovation:
- “You have to like being engineers. Not because it’s a path to money, but because you like building things and you like building hard things.” – Andrew Feldman (51:45)
Philosophical Moment:
- “Carbon and silicon are right next to each other on the periodic table...If we can make artificial life, we need silicon.” – Joe Weisenthal & Andrew Feldman (52:03)

Timestamps for Key Segments

The “AI psychosis” era: host banter on AI obsession – 02:28–04:08
Introducing Cerebras and the giant chip – 04:29–06:09
Technical rationale for wafer-scale – 06:09–07:55
History and challenges of building wafer-scale – 08:31–10:42
Inference vs. training, where Cerebras fits – 10:42–12:47
Speed, cost, and token economics – 14:57–17:45
Supply chain shifts and bottlenecks – 19:40–22:38
Open vs. closed models: business realities – 23:35–26:42
CUDA’s demise and Nvidia’s moat – 27:28–29:47
Financialization of compute – 29:47–32:38
G42’s relationship with Cerebras – 32:38–34:36
Why inference might be separated from model makers – 37:07–40:03
The challenge of building US fabs – 40:03–42:36
Export controls and strategic tech rivalry – 42:36–45:08
IPO backstory, scrutiny, and investor politics – 46:08–48:08
The reality of “overnight” billionaire status – 48:08–49:50
Managing R&D and public company demands – 49:50–51:45
Philosophical closing: silicon, carbon, and artificial life – 51:45–52:28

Takeaways for Listeners

Cerebras’ “giant wafer” is not a gimmick but an engineering breakthrough that radically alters how AI chips are built and deployed, unlocking new economies of speed, scale, and efficiency.
Current industry constraints are shifting away from chip manufacturing toward physical data center space and energy.
The open vs. closed source AI battle, and the ability to port software flexibly, will reshape the competitive landscape.
Political and industrial policy—export controls, domestic chipmaking—are rising factors in business strategy, not just technical innovation.
The future of AI hardware is still very much in flux, and the battle for inference dominance is only just beginning.

This summary strives to capture the spirit, technical detail, and key arguments of the conversation, serving as a useful guide for both industry professionals and curious listeners eager to understand how the world’s largest computer chip just might reshape the economics of AI.

Loading summary

Transcript135 lines

[00:00]
Tracy Alloway
Oddlauts is brought to you by Vaneck. For years, investors basically forgot about real assets, energy, gold and infrastructure. But look at what's driving markets now. Central banks loading up on gold, massive capex cycles, currencies doing weird things. These assets are at the center of it. RACS the VanEck Real Assets ETF is an actively managed one stop shop for real assets spanning gold, commodities, natural resource equities and more. Go to vaneck.com raaxpod to learn more fun disclosures later in this episode.
[00:33]
Advertisement/Commercial Voice
If you follow markets, you know the value of long term thinking. You plan, you diversify, you prepare for volatility. But in life, even the best strategies can't prevent every bad day a fire, a loss, a disruption that demands immediate attention. When that happens, what matters isn't just what you planned, but it's who shows up. That's where Cincinnati Insurance comes in. For more than 75 years, they've helped individuals and businesses navigate life's toughest moments with care, expertise and personal attention. Together with independent agents, Cincinnati Insurance focuses on relationships, not transactions. Their approach is grounded in experience, follow through and trust built over time. Bad days happen, and when they do, you deserve an insurance partner who understands risk, repeated respects what you've built and is ready to help you move forward. The Cincinnati Insurance companies Let them make your bad day better. Find an independent agent@cin fin.com the thing
[01:35]
Andrew Feldman
about AI for business, it may not automatically fit the way your business works. At IBM, we've seen this firsthand. But by embedding AI across hr, IT and procurement processes, we've reduced costs by millions, slashed repetitive tasks, and freed thousands of hours for strategic work. Now we're helping companies get smarter by putting AI where it actually pays off, deep in the work that moves the business. Let's create smarter business. IBM,
[02:07]
Advertisement/Commercial Voice
Bloomberg Audio Studios Podcasts Radio News.
[02:23]
Joe Weisenthal
Hello and welcome to another episode of the Odd Lots Podcast. I'm Joe Wiesenthal.
[02:28]
Tracy Alloway
And I'm Tracy Alloway.
[02:29]
Joe Weisenthal
Tracy, I have to say, unfortunately, I don't have AI psychosis. I'm certain of that.
[02:34]
Tracy Alloway
Debatable.
[02:36]
Joe Weisenthal
I'm pretty sure I don't have AI psychosis. I do have to say, unfortunately, the amount of time now where it's like it feels like AI related questions, and there's many of them, are sort of like swallowing up the other thoughts that I have in my head. Whether it's questions about which model's best and why and what are the economics of inference and how much training is pre training versus post training for each model like, it's just sort of like this blob that's growing that's taking up more and more of my thoughts.
[03:11]
Tracy Alloway
What is your definition of AI psychosis? Because one would argue that maybe thinking about AI literally all the time would be a form of psychosis.
[03:20]
Joe Weisenthal
Well, let's just say, like, I'm not the type who thinks that. Like, I don't, like, think that the AI is a friend. For one thing, I'm not in love with the AI models. I don't think that in collaboration with ChatGPT that I'm stumbling on unified theory of physics and things like that. So, like.
[03:39]
Tracy Alloway
But you do spend a lot of time inputting instructions, pressing the button, and seeing what comes out.
[03:45]
Joe Weisenthal
And seeing what comes out. I'm just saying I think I'm aware that I'm talking to machine and that we're not establishing any great breakthroughs, of which we are collaborators and partners and friends.
[03:55]
Tracy Alloway
Recognizing you have a problem is the first step towards healing. Joe. Seriously though, there's. There's a good reason to think about AI more and more, which is that a huge chunk of not just the market, but the real economy is now revolving around AI.
[04:09]
Joe Weisenthal
Right, Totally. So, anyway, again, within the AI conversation, there are a lot of subcategories. One, One of the subcategories happens to be another odd lot's favorite topic, which is chips. Of course, chips are used in multiple different ways. Their chips are used in different parts of the AI supply chain. Different types of chips have different roles. And so we have to learn more.
[04:29]
Tracy Alloway
We have to learn more. And I have to say I'm particularly interested in the company we're about to speak to, partly because the two things I know about them are, number one, they just had a huge IPO.
[04:40]
Joe Weisenthal
Yep.
[04:41]
Tracy Alloway
Right. Raising something like 5.5 billion at kind of insane multiple. I can't even do a price to earnings multiple because they're not profitable yet. But I think just on a sales basis, it was like 67 times forward earnings, which is pretty juicy. Pretty hot. And the second thing I know about the company is they make giant wafers.
[05:02]
Joe Weisenthal
Yes.
[05:03]
Tracy Alloway
Which is just a fun image to have in your head.
[05:06]
Joe Weisenthal
That's right. So if you were thinking it's like, okay, there is a hot entrant in this space. What is their differentiator? Well, one fact about them is their chips are just enormous, about the size of the dinner plate. One might think you're reading an Onion article. But in fact, it's real. And apparently it actually has some real technical advantages.
[05:26]
Tracy Alloway
So, and it's different to what everyone else is doing. So everyone else is, I guess, doing this sort of like modular networking thing where you get together a bunch of chips and you connect them together, and that's how you get more compute, more memory, more power, basically. But this company has done something different in the form of the giant wafer.
[05:44]
Joe Weisenthal
The giant wafer. And if you figure that to get maximum performance, you sort of want to lessen the distance between things, then put it all on one wafer. Anyway, we're going to learn a lot more. I'm very excited to say Giant wafers. About Giant Wafers and more. I'm very excited to say we do have the founder and CEO of Sarah Bross on the podcast, Andrew Feldman, Truly the perfect guest. So, Andrew, thank you so much for coming on the podcast on the week of your ipo.
[06:09]
Andrew Feldman
Well, thank you so much for having me. What a pleasure.
[06:11]
Joe Weisenthal
Absolutely. Why don't you just start us off? The big giant chip, they're apparently real. They're as big as a dinner plate. What is the technical reason why this actually makes sense as a superior form of architecture for at least some aspect of AI?
[06:29]
Andrew Feldman
I think larger chips process more information in less time.
[06:33]
Joe Weisenthal
Okay.
[06:33]
Andrew Feldman
And that produces faster results. And everybody had gone to bigger chips. Nvidia had moved from 400 square millimeters to 800 square millimeters over the course of five or six years for this exact reason. And in the compute industry, wafer scale, which is building a chip for those
[06:54]
Joe Weisenthal
who are, by the way, for those who are just listening, Andrew is now holding up the chip. And yes, it looks. It actually looks bigger than a dinner plate, to be honest. But that is a big.
[07:04]
Andrew Feldman
That's a big chip.
[07:05]
Joe Weisenthal
That's a big chip.
[07:05]
Tracy Alloway
Beautiful.
[07:07]
Andrew Feldman
It's 58 times larger than any other chip that had ever been.
[07:10]
Advertisement/Commercial Voice
Wow.
[07:11]
Andrew Feldman
And what it did was it allowed us to use a different type of memory.
[07:17]
Joe Weisenthal
Okay.
[07:17]
Andrew Feldman
A type of memory that, at the beginning, there are two types of memory. There's memory that can store a lot, but it's really slow. And there's memory that can't store very much per square millimeter, but it's blisteringly fast. And historically, all graphics processing units used this memory that could store a lot, but was really slow. And that's the reason they do inference so slowly. So if you're using Claude right now or you're using anything but ChatGPT, what you'll frequently feel is you'll enter your prompt and you'll wait for an answer. Right. And that's because the memory is slow and they have to move a ton of information from memory to compute. Now, by going to wafer scale, we could use this fast memory. Now, we couldn't make that memory store more information per square millimeter, but we could add square millimeters. And so by building this big chip, we were able to stuff it to the gills with this fast memory. And that's why we're 15 times faster than the fastest GPU. That's why on some problems, we're 50, 100, even a thousand times faster than graphics processing units.
[08:32]
Tracy Alloway
Wait, can you explain how you actually managed to do this? Because I know there have been previous attempts to do wafer scale and I seem to remember there was even like an early attempt in the 1980s or something to do it. How were you able to pull this off?
[08:47]
Andrew Feldman
Yeah, it was an ambitious undertaking, that's for sure. Every previous effort in the 75 year history of our industry had failed, including Gene Amdahl, who's sort of on the Mount Rushmore of compute in our industry. He failed sort of spectacularly in the mid-80s at a company called Trilogy. Not only that, but after we succeeded, people who had visited us, who'd been in our labs, tried to copy us, and they also failed. And so what we were able to do is solve a set of really fundamental problems. And those problems cut across a wide swath of technology. They cut across lithography. So we had to collaborate closely with TSMC and they turned out to be a great partner. We had to make inventions in material and packaging. That's how you put a processor, how you put a piece of silicon on a motherboard, deliver power and IO to it. We had to make inventions in power delivery. When you build a giant chip, you're going to deliver way more power to it than if you do a chip the size of a postage stamp. We had to invent ways to cool it. We had to write new types of software that ran on it. All of these had never been done before. And it was a decade long process. It took us five years and about $500 million to deliver the first one. And it's been an extraordinary run since. In December, we signed a deal with OpenAI, north of $20 billion, one of the largest contracts ever signed in Silicon Valley. And then in March, we signed a deal with AWS where they would deploy our systems in their data centers. In their AWS data centers. And so it's just been an extraordinary run, but it took a long time. It took extraordinary engineering. And there were certainly Long periods of time when it wasn't clear we were gonna make this work.
[10:42]
Joe Weisenthal
Obviously, you've hit this remarkable milestone. You have, in fact, IPO'd and so forth. And right now, market's valuing your company at $64 billion. Early days of the IPO, just for the listener to understand. The chips, are they solely an inference as opposed to, you know, a training? When we think about AI, we think about, okay, there's training. Training the model and then answer giving. That's the inference. Are the chips just for inference?
[11:10]
Andrew Feldman
So a couple things. I think you framed it exactly right. Training is how we make AI, and inference is how we use AI. And so what happened was that in sort of 2025, in the first part of 2025, the models we made were smart enough to be useful, and there was an explosion of use. And we use AI by doing inference. So there was this sort of tidal wave of demand on inference. And that has continued in 2026, and we think it will continue for years and years to come. And so that's what had happened in 2015. When we began thinking about the company, we knew that AI was on the horizon and it would eat a huge amount of compute, right? And we made sort of two fundamental bets. We bet that it would need dedicated silicon, and graphics had needed dedicated silicon. That's how you got the graphics processing unit. Mobile compute had needed dedicated compute. That's where you got ARM processors. We made that bet. And we made a bet that modifying the GPU architecture wouldn't be right. You needed to start with a clean sheet of paper. And so what we started with was a new vision. And that vision could do training and it could do inference, and it was orders of magnitude faster at both. But right now, what we're seeing is such an explosion in demand for inference that a lot of the business this minute is inference, Even though we're just as fast at the same amount, faster than GPUs. On training.
[12:47]
Joe Weisenthal
That's interesting. Maybe we'll get more to the theoretical training market a little later. Just real quick on inference. Ben Thompson, who writes a newsletter about tech, he wrote a piece in which he distinguishes between answer inference and agentic inference. So answer inferences like, you know, format by resume or whatever, or write me an essay on X or Y, or answer some questions. And then agentic inference is like, okay, here's this thing that's going to go around. Do you distinguish and do services for you not producing visual answers? Do you distinguish between those two? Is that a real divide in Your view and can your chips do both?
[13:26]
Andrew Feldman
Our chips can do both. I think it is a divide. I think speed matters equally in both. I think if you are engaged with the AI, if you're writing code, which is agentic, if you're writing code or you're doing work, nobody wants to wait. I mean, we could just turn the question around and say, well, how big is the market for slow search? Zero. How big is the market for dial up Internet? Zero. Why is that? Because nobody wants to wait. So if you're engaged with the AI, speed is of the essence. But if the AI is doing agentic work and your competitor gets 3 times, 5 times, 10 times as much work done in 20 minutes than you do, you're going to get smoked. And so this notion somehow that Ben proposed that speed isn't very important in agentic flows is dead wrong. That speed is important in all aspects of productive work and that your ability to get more done in less time is a fundamental advantage that accrues over time. If while your competitor is doing one unit of work, you can do three, and in the next time they do one unit of work, you do six, this adds up over time to. And you beat them in any line of work. And so speed, which is sort of our specialty, is important across the board.
[14:57]
Tracy Alloway
What do giant wafers and speed in general actually mean for, I guess, the economics of tokens? Because one way I think about it, I have this sort of vision in my head, like, okay, if I'm out shopping for toothpaste, I know I need toothpaste every once in a while, and I go into like a CVS store, I get one thing of toothpaste and then maybe a week later I get some more toothpaste. Or, or I could go to Costco and buy a giant thing of toothpaste and take it home, probably at a cheaper cost. And that's sort of how I think of the giant wafers. Maybe it's a bad analogy, but what does speed actually mean for the cost of tokens?
[15:35]
Andrew Feldman
Well, I think there are a couple observations. I think people have chosen so far to price speed a little higher. For example, Anthropic offered a premium service in which they offered tokens twice as fast and charged six times as much, and they sold it out and they couldn't meet the demand. Now, just to give you an idea, we're 15 times faster than they're twice as fast. And so people value speed because it allows them to do more work and they value their time. And when you can do More work in less time, you are making people more productive. That's why people have chosen to price them at a premium. They don't cost more to make. In fact, the GPU architecture is an extremely good architecture and extremely efficient at building very slow tokens. And if you don't mind slow, the cost per token on a GPU is extremely low. But the GPU has a characteristic that as you try and go faster, the cost and the power used per token increase. Sort of like as you go faster in your car, your miles per gallon decrease, right? So what happens is as you try and get fast enough to be useful, fast enough to be interesting, fast enough to keep users intelligence focused on this product, they become extremely expensive and extremely power hungry. And so the question is not just what people are paying for a token, what people are choosing to price them at, but what they actually cost to make. And GPUs make very slow tokens very cheaply and they're unbelievably expensive at fast tokens. We make fast tokens vastly less expensive than GPUs and we use a tiny fraction of the power.
[17:47]
Joe Weisenthal
Data centers need electricity, AI needs copper, reshoring needs steel. And gold's run may tell you something about how the world is repricing money and debt. All of those point back to real assets. The RAX ETF is an actively managed one stop real asset shop. From gold to commodities to natural resource equities, adjusting as conditions change. Visit vaneck.com raaxpod to learn more. An investor should consider the investment objective, risks, charges and expenses of the fund before investing. To obtain a prospectus and summary prospectus which contains this and other information, visit vaneck.com Please read the prospectus and summary prospectus carefully before investing. RACS is distributed by Vaneck Securities Corporation distributor.
[18:33]
Andrew Feldman
The thing about AI for business, it may not automatically fit the way your business works. At IBM we've seen this firsthand. But by embedding AI across hr, IT and procurement processes, we've reduced costs by millions, slash repetitive tasks and freed thousands of hours for strategic work. Now we're helping companies get smarter by putting AI where it actually pays off. Deep in the work that moves the business. Let's create smarter business. IBM.
[19:03]
Advertisement/Commercial Voice
Everyone has been there. Your team's feedback is scattered across emails, chats and sticky notes. It's a mess, but PDF spaces in Adobe Acrobat gives you one collaborative workspace to streamline every file and comment. So if you need six departments to finally agree on a proposal, you do that with Acrobat need to turn a mountain of feedback into one plan of action. Do that with Acrobat. Want to stop searching for files and finally get everyone on the same page. Do that, do that, do that with Acrobat. Learn more@adobe.com do that with Acrobat.
[19:40]
Joe Weisenthal
Let's say we stipulate that this is all true and everyone wants the fastest and everyone's like, you know what? This is the solution that the Cerberus technology, one big chip. This is really where it's at. How much of like your market share for the inference market? When you look out next year, the year after, et cetera, how much is your market share going to be dictated by your ability to get capacity at TSMC fabs? How much is that a gating mechanism for growth?
[20:12]
Andrew Feldman
You know TSMC is a huge part of the supply chain. Yeah, but we have some real advantages. There are three areas right now that are limiting vendors and building AI compute number one is HBM memory. It's this memory we described earlier that can store a lot, but it's really slow. That's made by three companies approximately, Samsung, Hynix and Micron. And it's under unbelievable supply pressure. It's extremely difficult to get. There are very long lead times. It's unbelievably expensive right now. We don't use it. The second part that's limiting is a process inside of TSMC called COAS. And this is the process that Nvidia and other GPUs use. We don't use it. The third thing is that at TSMC the factory that is under most pressure is their 3 nanometer factory. We don't use it. We use 5 nanometer. So we have managed to avoid some of the most binding supply constraints. Now TSMC still has to give us a meaningful allocation. And they've been an extraordinary partner from the get go. And they are the greatest manufacturing company on earth by far. A FAB is sort of a modern pyramid. It's an unbelievable thing. And I highly recommend you or any of your listeners if you get a chance to go to Taipei, go and see them. They are just extraordinary.
[21:41]
Tracy Alloway
Can you do FAB tours?
[21:42]
Andrew Feldman
You can actually, yeah, you can do fab tours. You can go and they have a museum of innovation and it is an extraordinary thing. They are the sort of the national champion of Taiwan. But I think today TSMC has given us as many wafers as we've needed. Business today is constrained by data centers. And that's the grand irony. You invent technology that has been unbuildable never been invented for 75 years in the history of compute, you write software that is extraordinary. You built a product that is vastly faster than the incumbent. And what are we all constrained by? Buildings. Data centers right now are everybody's constraint in the entire industry. Powered buildings. So real estate, it is an amazing thing right now. And that is true sort of across the board. And that that will not change for the next 15 or 18 months for sure.
[22:38]
Tracy Alloway
I mean, since we're talking physical constraints, I guess I should ask you. We did an episode about helium recently, a helium shortage. Given the situation in the Strait of Hormuz. And one of the things that helium is used for is lithography on semiconductor chips. Has that affected you at all or is that something that you're monitoring?
[22:58]
Andrew Feldman
We monitor, but there's not a lot we can do and there's plenty of stuff to worry about that we can't affect. We obviously are in communication every day with tsmc. We're in communication with our entire supply chain every single day. And we stay abreast of the various issues. But it has had no impact on us. And we put that in the bucket of things that our manufacturing partners worry about also and that we can't help.
[23:28]
Joe Weisenthal
You know, so in addition to manufacturing these chips, you actually, I didn't realize this, you have your own cloud.
[23:36]
Andrew Feldman
We do.
[23:36]
Joe Weisenthal
And. Or you have your own cloud services. We do. Which I have a bunch of questions about that. But you have your own cloud services through which a user can actually get access to various open source models and so forth. It looks a little bit sort of visually it looks a lot like the open router interface. Roughly the same environment, except it's all like, like the open source. What I'm something I'm curious about and maybe you could speak to this, you know, in traditional software. Open source. One nice thing about open source is you don't have to pay for it. So it's free. It's a little bit different when we're talking about there's no really such thing as like free AI software. Because even if it's like free, you still have to like pay for the depreciation of the chips and you have to pay for the electricity to run them. So there's no real such thing as like free open source AI software. But what I am curious about in your experience as a cloud vendor, are the open source models cheaper on a per unit of intelligence basis? If we had some way of saying levelized cost of intelligence, which I don't know if the industry has yet. Are open source models cheaper per IQ point? Whatever. We want. However we want to measure intelligence.
[24:51]
Andrew Feldman
Yes, by a lot.
[24:52]
Joe Weisenthal
Really?
[24:53]
Andrew Feldman
Yeah. I think in the closed source world you're paying a lot for that extra little bit of intelligence. Right. The open source models, there are no open source models that are as good as the closed source models. Okay. Think of it as 3, 4%, 5% different.
[25:08]
Joe Weisenthal
Okay.
[25:09]
Andrew Feldman
Something in that range, it could be a little more, it could be a little less. But the cost to you using them, you can jump up right now and run Kimike 2. It's a 1 trillion parameter model. It's an open source model on cerebras where 10 or 15 times faster than others. And what you're paying for is the cost of our power and some cost of the compute that took to calculate it. What you're not paying for was the cost to train it.
[25:37]
Joe Weisenthal
Right.
[25:38]
Andrew Feldman
And that's a battle that is underway in the market. You have OpenAI with their coding software, you have Anthropic with their coding software. And you've got companies like Cursor and Cognition that are using open source. We power OpenAI and we power Cognition. You have a battle underway between closed source and open source. And I think that the winners of that battle is yet to be determined. What is clear is that the closed source is strictly better by a little bit, by how much varies and it's more expensive.
[26:16]
Tracy Alloway
Yeah, I think we've talked about this before, but like I've heard of a lot of big companies in the US who have been like very quietly shifting from some of the closed source models to the open source models like the Chinese ones, like Kimi. Is that what it's called? Kimi and Quen. I'm sorry to press you on this point, but if you had to make a bet like in 20 years, is the dominant AI model going to be a cheap open source thing or a more expensive, incrementally better closed source model?
[26:43]
Andrew Feldman
I don't think there's going to be one. Right. There's not one SaaS software, there's some big dogs, there's Salesforce, there's some other sort of giant players and there are lots of other specialists. I can't think of many markets where we've settled on to one player. If you look at the semiconductor market, you've got x86 where you've got two major players in AMD and Intel and then you've got a whole adjacent market owned by ARM and the companies that build ARM parts. And then you've got Custom silicon around that. I think that's the way you're going to have this. We're going to have, you know, OpenAI is going to continue to do extraordinary things. There will be competitors to them and they'll be open source. I don't think any of those go away.
[27:28]
Tracy Alloway
Since we're on the topic of software, one of the things you often hear when talking about, you know, new chip entrance going up against Nvidia is this idea that, well, you know, like Nvidia chips, they're great and all, but the real moat of Nvidia's business is Cuda, right?
[27:46]
Andrew Feldman
Yeah.
[27:46]
Tracy Alloway
Software stack that goes with it. What's your take on that? Like, is that a realistic concern for someone who's trying to go up against a company as big and I guess as embedded in the software system as Nvidia currently is?
[28:01]
Andrew Feldman
Nvidia is probably the greatest company in the first part of this century. Right. Jensen's one of the great CEOs of our era, along with Hawk 10 at Broadcom and maybe Lisa at AMD. Just extraordinary. And CUDA was really important in the creating of the AI landscape, but it's not important now and it has no role whatsoever in inference. If you want to move from running a model on GPUs today to running it on us, we can move it in 10 keystrokes. Just move point to our API. So that's the first part. The second part is that a year ago every major Frontier Lab model had been built on a CUDA foundation and today two of three haven't. So they lost 70% market share. There are three leading frontier models, Gemini, Claude and GPT. Gemini, built by Google on TPUs, trained on TPUs, served on TPUs, no CUDA anthropics models trained on Trainium, no CUDA, served on TPUs, on Trainium and, and on GPUs and OpenAI's GPT trained on GPUs in the CUDA environment. So two of the three leading models today use no CUDA. That's a hemorrhaging of share. And so I think what was true three or five years ago, in which CUDA had a dominant position with Central, has shrunk significantly and not important at all at inference and shrinking in its role in training.
[29:48]
Joe Weisenthal
You know, since we're talking about the economic, since we're talking about, you know, the economics of inference and all this stuff, I've actually, I would love to get your take one of the things that Pete that like literally in the last couple of weeks there's been this flurry of announcements of these attempts to financialize the market for compute. And so it's like, oh, you're going to like buy some capacity, the H100 benchmark, et cetera. And people on maybe theoretically hedging it. I'm not entirely convinced. It still seems to me like I. It's not like maybe, but on the other hand, like an inference provider can lock in a very long term relationship bilaterally with a data center and so forth and no need for like these spot hedging markets. Do you think the market is going to evolve in such a way that there will be significant demand for. For financial instruments that allow inference providers to hedge their price exposure?
[30:47]
Andrew Feldman
I don't know, I'm not a financial engineer is the first thing, but we can look a little bit at history. The guys at coreweave were enormously innovative in how to fund some of their massive deployments. They were some of the first to use a debt instrument that had a backstop with the GPU and this enabled them to really leap out and have first mover advantage in the neo cloud space. That was an innovation in financial engineering and extremely creative. Others followed. And now there's a big and active debt market in funding the building and the fit out of data centers. When you have a market that is that big and that active, you have people who want to make bets on either side. And I think over time those bets normalize and regularize and you can wrap them up and you can make it easy to make the bet. When sort of CO2 was one of the first to loan money against GPUs. For Core Weave, this was really innovative. And not only does Core Weave get credit for the creating of the instrument, but so does the other side of the deal for doing it and making a successful, innovative bet. And as sort of more and more people jumped in and these could be regularized, they could be more easily priced. And then once it's regularized and you have a market, then derivatives of that market are easy to make historically. And that's sort of the way I see this unfolding, that as this market for data centers and compute matures, there'll be people making bets on either side and financial instruments will be created to do it. Whether it's a good idea or not, I have no opinion at this time.
[32:38]
Tracy Alloway
Since we brought up finance, I was looking through the IPO filing and looking at some of the actual numbers in there. And I know you have the OpenAI deal now, but a huge chunk of your revenue Comes from This company called G42 in Abu Dhabi. And I think they're both like your biggest customer and also a major Investor. What does G42 actually do with all these chips?
[33:04]
Andrew Feldman
Sure. Last year they were a really important chunk of our business, a lot of it. They're a minority investor. They are the national champion, the national AI champion of the uae. And they build a cloud that is used across the UAE's ecosystem. So it's used by leading universities there, it's used by leading companies there. Companies like Adnoc, they're leading oil company. It's used by G42S9 operating companies. The deployments to date have been in the US we have data centers that massive data centers that run equipment for G42 here in Santa Clara, but also in Minneapolis, in Dallas, Texas, soon in Toronto. And so they're doing training and they're doing inference. The training they're doing. They have pioneered some of the leading English Arabic models. They've done genomic work, they are doing serving of models and they're operating as a cloud, particularly for the UAE ecosystem, but also for global companies.
[34:36]
Advertisement/Commercial Voice
You need to make a huge presentation in an hour. Adobe Acrobat uses AI to take all your documents and generate a presentation with a single click. Build slides quickly and streamline the process. Need a last minute pitch deck? Do that with Acrobat. Need to level up your presentation design. Do that with acrobat. You have 30 plus documents that need to be simplified into a proposal. Do that, do that, do that with Acrobat. Learn more@adobe.com do that with Acrobat.
[35:08]
Andrew Feldman
Support for the show comes from public. Lately it feels like there are two types of investing platforms. Some are traditional brokerages that haven't changed much in decades and and others feel less like investing and more like a game. Public is positioned differently. It's an investing platform for people who are serious about building their wealth on public. You can build a portfolio of stocks, options, bonds, crypto without all the bugs or the confetti. Retirement accounts. Yep. High yield cash. Yes again. They even have direct indexing. Public has modern design, powerful tools and customer support that actually helps go to public.com market and earn an uncapped 1% bonus when you transfer your portfolio. That's public.com market ad paid for by Public Holdings Brokerage services by public investing member FINRA SIPC advisory services by public advisors SEC registered advisor crypto services by ZeroHash. All investing involves risk of loss. See complete disclosures@public.com disclosures being a small
[36:08]
Advertisement/Commercial Voice
business owner isn't just a career, it's a calling. Chase for Business knows how much heart and effort go into building something of your own. That's why they make business growth their priority. The Chase team takes the time to understand your mission, where you are now, and where you want to go. Their broad range of solutions is designed with you in mind so you can bring your ideas to life. From banking to payment acceptance to credit cards, you can conveniently manage all your business finances all in one place with their digital tools looking for tips and advice, their online resources are always available to give you the solutions you need to help your business thrive. See how your business can get stronger and go farther with Chase for Business. Learn more@chase.com business chase for business make more of what's yours the Chase Mobile app is available for select mobile devices. Message and data rates may apply JPMorgan Chase Bank NA Member FDIC Copyright 2026 JPMorgan Chase Co. Do you think that
[37:08]
Joe Weisenthal
over time corporate users, and perhaps individual users, but corporate users will want inference served from a company that's separate from the model maker, such that they can be certain that they are not revealing and thus training the company that might replace them? I mean, look, anthropic every couple days announces some new thing. Oh, we have a new markdown file that could do this for taxes or that could do this for whatever. And then a bunch of companies fall like are companies that use AI increasingly going to want to want to use data centers and inference providers that aren't the model themselves?
[37:50]
Andrew Feldman
Well, first, I think there is a type of professional, a type of job that is most directly under threat from AI.
[38:01]
Joe Weisenthal
Okay?
[38:02]
Andrew Feldman
And they're almost always white collar. And they required you to have expertise over a body of knowledge. That's what an accountant is. You have expertise over a body of knowledge of rulings of previous examples of tax case law, et cetera. That's exactly what AI is good at right now. Exactly. So lawyers, accountants, this sort of these professionals who have stood between sort of the ordinary person who doesn't know anything about IRS tax rules and the tax rules that is under threat. And that is something that it will be very easy for companies like OpenAI and Anthropic to chew through. There are other areas like say drug design, genetics, genomics, where companies like GlaxoSmithKline have remarkable and unique data sets. This is true for one of our large customers, Mayo Clinic. It's true for GlaxoSmithKline and other of our pharma customers. They have unique data and they will be able to find insight in that data and they will be able to get value from that data. And they will certainly not want to share that data with the foundation model makers unless they are guaranteed that it will not sort of make the general model smarter. And these are companies that have spent 20 or 30 years spending tens of billions of dollars a year gathering data. Right. Patient care records or, or test results for drug design. They're going to mine the insight in this work and they're going to provide find extraordinary things. And those are much more protected because the insights in the data and they have the data.
[40:04]
Tracy Alloway
You know, you were talking about fabs in Taiwan earlier and I'm now regretting not going on a fab tour when I was in Taipei. Just didn't cross my mind at that time. Next time. Yeah, hopefully. And there have been various efforts under the CHIPS act and some other industrial policies to try to build more chip making capacity in the U.S. in your view, what's the big, I guess, impediment to actually doing it? Yeah. A, is it happening? And then B, why does it seem so difficult to actually make happen?
[40:38]
Andrew Feldman
Right. The first thing is difficult because it's a difficult problem. They're hard. They cost 30 or 40 billion dollars and take five or six years to build. So that amount of money in that amount of time cuts across administrations. And that's a problem with the politics in the US is it's hard to make policy that's durable across administrations and across time. The first thing, the second thing is these are remarkably complicated buildings and we have a sort of a hodgepodge, a sort of strange latticework of local regional building codes that a fab maker has to negotiate. Third is we're trying. TSMC has dedicated tens of billions of dollars to their fabs in Arizona and have committed hundreds of billions more. Samsung has dedicated tens of billions of dollars and committed hundreds of billions more to their fabs in Texas. But they take a long time. And we have to remain committed to building not just the fab, but the surrounding ecosystem, not just for three or five years, but for 20 years or 25 years. Because you want not just one fab, but you want a whole trajectory of F. You want them working at today's cutting edge, but tomorrow's and next year's and in 10 years cutting edge as well. And those are things that have proven really challenging in the US And I think we need it. They're strategic assets and I think we need to find ways to collaborate with those that have the expertise and to find ways to build policy that is durable over a length of time, that can build a vibrant ecosystem in the FAB and the associated elements.
[42:36]
Tracy Alloway
So the other big political economy theme, I guess, when it comes to semiconductors is this idea that they are, in fact, a strategically important technology. And so the US should place some limitations on their use abroad. And so we've seen things like export controls, export restrictions. You're an actual chip company. And so I'm very curious at an operating level what your experience of these kind of export controls has actually been. How much time does that take up for you? And then also, given that one of your biggest customers is an international firm in Abu Dhabi, how important is the trajectory of those export controls to your future business?
[43:20]
Andrew Feldman
I think three or four years ago, I would have said not important at all. I think today they're really important. In the last administration, I got to know the leadership in the Department of Commerce and in the BIS Division of Commerce, which oversees the licensing. I think this is an extraordinarily difficult job. And we saw really hardworking, smart people doing a job that is very, very difficult. I got to know the people in this administration and I found the same. Every single one of them is earning a tiny fraction of what they could earn in the private sector and is doing this because they believe that this is an important mission. The problem is that there are differing views about the right way to do this, and there are differing views on the right way to achieve the goal, which is to not give your most precious technology to your industrial enemy. I think we can agree that today, in today's environment, China is an industrial enemy. Good. Well, meaning people can disagree on whether the right strategy is to limit them from gaining access. Others argue, as those at Nvidia have argued, is that the right strategy is to give them access and to keep them working on our product, on us made, on us, sort of designed product. I come down on the other side of that argument. I understand there are good arguments in both directions. I think limiting the distribution, the diffusion of our most precious technologies makes sense. And I think we have to do it thoughtfully. And we have to recognize that means some markets will be foreclosed to us. And I'm okay with that.
[45:09]
Joe Weisenthal
Just quickly, on the sort of, like, current business stuff you mentioned the deal with aws, how does that work? Could customers right now, like, can customers of AWS pay them to have inference served specifically on one of your chips?
[45:25]
Andrew Feldman
Not yet, but soon.
[45:26]
Joe Weisenthal
Okay.
[45:27]
Andrew Feldman
It will be served in bedrock, which is their AI as a service offering and they will yes, be able to go down to the click down menu and get super fast inference which will be delivered via a combination of what's called a disaggregated solution which is using some trainium for some of the inference work and using the Cerebra technology in our Systems called the CS3 for other parts of the work.
[45:55]
Joe Weisenthal
And presumably someone who scrolls down and selects that they would pay some premium for that ultra fast inference.
[46:01]
Andrew Feldman
I think they will pay a premium. We will see. This is entirely as Amazon wishes to price it. This is their product.
[46:08]
Joe Weisenthal
So you IPO'd this week. It's May 2026. This is not the first time that you've tried to or look towards going to the IPO market. There were headlines going back to 2020 about wanting to try for the IPO market. And then there were headlines last year, especially because of the relationship with G42 about CFIUS and some of the national security concerns. And maybe that was an issue with the ipo. But also last September you got one of your looks like G Round. G Round. One of the participants in the G round investor was 1789 Capital, which is of course the firm that's associated with Donald Trump Jr. Which is a lot of things. And then the IPO happens. I'm a cynic, so I wonder if the participation of Donald Trump Jr. S investment in your company made it easier to get the green light from these national security concerns to do an ipo.
[47:10]
Andrew Feldman
I wish it were that easy. No, it had no, no role at all. We resolved all CFIUS issues in March of 2025. I believe that was before we took money from 1789.
[47:23]
Joe Weisenthal
Okay.
[47:24]
Andrew Feldman
Moreover, I wouldn't ask. That's not who I am and that's not the way we roll. So we took money because they are a thoughtful venture firm and we don't believe that there's only one point of political view. There are lots of political views. They all have some merit, they all have some weaknesses. And so we have right leaning political some investors, we have left leaning. The fact that this firm had some right leaning in sort of investors. We were looking only at their ability to help us build an extraordinary company. And we have asked and we will not, we have never asked, nor will we ever ask for or political access or anything of the kind.
[48:09]
Tracy Alloway
What's it like to become a billionaire in a single day? This is something I assume will never happen to me, so I might as well ask you.
[48:16]
Andrew Feldman
I think the honest truth is it was a big Nothing for me. I had some wealth before and had some wealth after. I think this is a very difficult way to make money being a tech CEO. I think what you have to do is you have to love the work, you have to love the people and you have to think every day about how to make your team rich. And far more important than sort of some change in my wealth was we made more than 800 millionaires. And that's something I'm proud of every minute of every day. And at my last company we made 100 millionaires. And at this company through our IPO we made more than 800. And that's something that you wake up feeling good about yourself every single day.
[49:06]
Tracy Alloway
That was going to be my last question, but actually you just reminded me in that answer the idea that getting here, I said you became a billionaire in a day. But obviously this was the outcome of years and years and years of work. And if we think about technological hardware, one of the things most people associate it with is really long lead times and really big research and development budgets. Now that you're a public company, how do you sort of balance that quarter to quarter financial performance pressure with the idea that you still need to be investing in capex in new ways of designing chips, new improvements to the existing ones?
[49:50]
Andrew Feldman
Well, first we think the opportunity for innovation based on our way for scale engine, the, the best work is still ahead of us. Number one, we see an opportunity for extraordinary innovation in the years ahead to make leaps every bit as big and often bigger than what we made by building the largest chip on earth. When you love building hardware, the fact that it takes time is part of the deal, right. That what we do can't be done in a week or a month or a year. And that's what you sign up for. And that's true in every profession. You sign up for the good and the challenging. And you have to sort of make peace with that. If you're a person that wants to dive in and sort of begin iterating right away and fail quickly and code up something and look at it and throw it out in the market and see if it wins, Godspeed, that's great. And that's not for me. You know, in our business we, we measure twice before we cut once. And you have to put that in your soul and you have to like it. You have to like that, that mistakes in our business are really expensive. And you have to like the fact that you breathe life into a chunk of silicon.
[51:11]
Joe Weisenthal
Yeah.
[51:11]
Andrew Feldman
And you get it to do things that nobody else has ever been able to make a chunk of silicon do. And if that's for you, then this process, that takes time and money, you love that too. And so I think I would love it less if you could do it in a week. And I think the people that I love to work with, they feel the same way. And they like being engineers. Not because it's a path to money. They like being engineers because they like building things and they like building hard things. Things. And I like working with them for exactly that reason.
[51:45]
Joe Weisenthal
Yeah. You mentioned breathing life into a chunk of silicon. My dad, who's a physicist, always likes to point out how carbon and silicon are right next to each other on the periodic table.
[51:55]
Andrew Feldman
They are.
[51:56]
Joe Weisenthal
And this sort of like here are the two things that we have closest to life and they're literally touching each other. Maybe there's something deep in that.
[52:04]
Andrew Feldman
I think that's a really thoughtful thing your father said.
[52:06]
Joe Weisenthal
Thank you.
[52:07]
Andrew Feldman
And I think that's really cool. And nobody pointed that out to me though. We've stared at periodic tables for a long time, but I think to the extent we can make artificial life, we need silicon.
[52:19]
Joe Weisenthal
Yeah. And they're right next to each other.
[52:20]
Andrew Feldman
Right. Carbon's the heart of all other life and artificial life will be founded, at least the intelligent part will be founded on silicon.
[52:29]
Joe Weisenthal
Right below silicon is germanium. Maybe the next. I don't know.
[52:33]
Tracy Alloway
What does that mean? Ask your dad.
[52:35]
Joe Weisenthal
Yeah, let's keep an eye on germanium next. Andrew, thank you so much for coming on odd lots. Fascinating conversation right in the sweet spot of what we're interested. Really appreciate you taking your time.
[52:44]
Andrew Feldman
Hey, thank you guys for having me and I really appreciate it. Look forward to seeing you again soon.
[53:01]
Joe Weisenthal
That was really fun. I'm super interested in this topic and it does feel to me like the economics of inference in particular and the market for inference, inference capacity, speed, like it's still day one, you know what I'm saying?
[53:16]
Advertisement/Commercial Voice
Yeah.
[53:17]
Tracy Alloway
I just like looking at the giant wings.
[53:18]
Joe Weisenthal
It's so cool. It really does seem like an onion thing, doesn't it? It's like company solved inference with a
[53:25]
Tracy Alloway
giant by building the biggest chip in the world.
[53:27]
Joe Weisenthal
But it is interesting. We did that episode, of course, with Ray Wang from Semianalysis and talking about the role like memory as being this really important part of this sort of cutting edge chipsets. And it's interesting to think it's like, okay, well here is a bottleneck that doesn't run into that they don't have. And the idea that at least as he described it. They're not fighting to get the smallest nanometer chips. And so maybe that gives them a little bit of breathing room on capacity there too.
[53:56]
Tracy Alloway
Yeah, I mean, I do imagine there are some downsides to having giant chips, just as there are upsides that Andrew laid out. The other thing I was wondering, I know he made the case for the reason speed is very important, but I can also imagine a world where maybe it's not that important. I think at some point the incremental speed, speed factor just starts to become less important when weighed against like the incremental cost of generating that extra speed.
[54:30]
Joe Weisenthal
I think it really, this is like one of those things where it probably really depends what you're using it for. Right. So it's like if you're like, you know what, I'm really curious why pterodactyls aren't actually dinosaurs. Can you explain it to me? Then it's like I don't care about that like that fraction of a second.
[54:47]
Tracy Alloway
I would wait five minutes for the chat bot to tell you you're wrong, Joe.
[54:51]
Joe Weisenthal
You just, buddy, you just don't really care that much. But if you're doing some sort of like agent decoding thing or whatever, etcetera, Then like, yeah, that definitely adds up. And I will say, like, as you use it more like it's just like everything else, the hit the treadmill of expectations. Here's some tasks that you can do in 30 seconds which maybe several years ago would have taken you 30 minutes and you get impatient in that 30 seconds and you want it in 10 seconds. And that's just like that competition to shave down seconds. I think it's always going to be there. So no one ever gets satisfied with this is my point. It always eventually becomes like it feels like waiting.
[55:31]
Tracy Alloway
But to me this feels like this is the crux of the AI valuation argument, which is like how much of a premium are we going to place on a model that may be a closed source model that is maybe slightly better than an open source model? How much premium are we going to place on. On compute that is slightly faster than this other type of compute or like other use of compute like that? To me it's an unanswered question and Andrew was pretty upfront about closed versus open source. But I think on the speed question too, like, we're going to find out.
[56:04]
Joe Weisenthal
We're going to find out. And you know, I think one of the things that is going to happen and there have been all these stories about sort of like token shock like how much companies are spending on tokens. My guess is one of the things that will happen some point is there's going to be a lot more discussion about why are we using this ultra premium model when we could have done this. Like there is a lot of just like throw it at the AI, rack up those bills, et cetera. And at some point there's going to be this like, okay, what really needs to be served fast, what really needs to be served on the most premium closed source models. And companies are probably going to get a lot more skilled at allocating from, you know, different forms of inference depending on the need.
[56:50]
Tracy Alloway
Yeah, I think that's exactly it. And at that point, like we could well see some of the dynamics in the market start to change in terms of valuation. Shall we leave it there?
[56:59]
Joe Weisenthal
Let's leave it there.
[57:00]
Tracy Alloway
This has been another episode of the All Thoughts podcast. I'm Tracy Alloway. You can follow me at Tracy Alloway.
[57:05]
Joe Weisenthal
And I'm Joe Weisenthal. You can follow me at the Stalwart. Follow our producers Carmen Rodriguez at carmenarmon, Dashiell Bennett at dashbot kalebrooksailbrooks and Kevin Lozano at Kevin Lloyd Lozano. And for more Odd Lots content go to bloomberg.com oddlots we have a daily newsletter and all of our episodes and you can chat about all these topics 24. 7 in our Discord, Discord, GG Oddlots
[57:30]
Tracy Alloway
and if you enjoy Oddlots, if you like it when we talk about giant wafers, then please leave us a positive review on your favorite podcast platform. And remember, if you're a Bloomberg subscriber, you can listen to all of our episodes absolutely ad free. All you need to do is find the Bloomberg Channel on Apple Podcasts and follow the instructions there. Thanks for listening.
[58:20]
Advertisement/Commercial Voice
If you follow markets, you know the value of long term thinking. You plan, you diversify, you prepare for volatility. But in life, even the best strategies can't prevent every bad day a fire, a loss, a disruption that demands immediate attention. When that happens, what matters isn't just what you planned. It's who shows up. That's where Cincinnati Insurance comes in. For more than 75 years, they've helped individuals and businesses navigate life's toughest moments with care, expertise and personal attention. Together with independent agents, Cincinnati Insurance focuses on relationships, not transactions. Their approach is grounded in experience, follow through, and trust built over time. Bad days happen, and when they do, you deserve an insurance partner who understands risk, respects what you've built, and is ready to help you move forward. The Cincinnati insurance companies Let them make your bad day better. Find an independent agent@cin fin.com find home
[59:21]
Joe Weisenthal
wherever you roam at Sinesta Es and Simply Suites, where longer stays feel comfortable, flexible and easy. Stretch out and enjoy spacious accommodations and home like amenities designed to help you settle in and stay productive or relaxed for however long you need. And when you're a Sonesta TravelPass member, staying at Sonesta Es and Simply Suites means earning points toward free nights, upgrades and more with every eligible stay.
[59:46]
Andrew Feldman
Go to Sonesta.com to book your stay and unlock the best rates with Sinesta
[59:50]
Joe Weisenthal
Travel Pass here today, roam tomorrow. Join now@sonesta.com Terms and conditions apply.
[59:57]
Tracy Alloway
This Dog Salon Operational Excellence thanks to Genius from Global Payments Scheduling Personalized checkouts Instant Absolutely Genius Big league reliability for any business. That's genius.