
Loading summary
A
Hello, I'm Andrew Main and welcome to the OpenAI podcast. Today we're excited to be breaking some news involving Broadcom and OpenAI. Joining me from OpenAI is Sam Altman and Greg Brockman, and from Broadcom, Hauk Tan and Charlie Kawas.
B
A lot of ways that you would look at the AI infrastructure build out right now, you would say it's the biggest joint industrial project in human history.
C
We're defining civilizations as next generation operating system.
D
That is a drop in the bucket compared to where we need to go.
B
That's a big drop.
A
But so what are we talking about today? What brought you all together?
B
So today we're announcing a partnership between Broadcom and OpenAI. We've been working together for about the last 18 months designing a new custom chip. More recently, we've also started working on a whole custom system. These things have gotten so complex, you need the whole thing. And we will be starting in late next year deploying 10 gigawatts of these racks of these systems and our chip, which is a gigantic amount of computing infrastructure to serve the needs of the world, to use advanced intelligence.
A
So this is going to entail both compute and chip design and scaling out.
B
This is a full system. So we worked, we closely collaborated for a while on designing a chip that is specific for our workloads. When it became clear to us just how much capacity it is, inference capacity the world was going to need, we began to think about could we do a chip that was meant just for that kind of a very specific workload. Broadcom is the best partner in the world for that, obviously. And then to our great surprise, this was not the way we started. But as we realized that we were going to really need the whole system together to support this, as this got more and more complex, it turns out Broadcom is also incredible at helping design systems. So we are working together on that entire package. And this will be, this will help us even further increase the amount of capacity we can offer for our services.
A
So Hawk, how did this come about? When did this start? When did you guys first talk about working together on this?
E
Well, other than the fact that Sam and Greg are great people to work with, it's a natural fit because OpenAI has been doing, continues to do the most advanced models, frontier models in generative AI out there. But as part of it, you continue to need compute capacity, the best, latest compute capacity as you progress in a roadmap towards a better and better frontier model and towards superintelligence. And compute is key part. And that comes with on semiconductors and as Sam indicated, modern semiconductors. And we are, even though I say it myself, probably the best semiconductor company out there. And more than that is AI is a very, very exciting opportunity for us in terms of we are, my engineers are pushing the innovation envelope and newer and newer generations of semiconductor technology. So for us, collaborating with the best generative AI company out there is a natural fit.
A
And this isn't just chips. It's going out to scale like 10 gigawatts. And I can't have trouble kind of even understanding that. What does that even mean when you're talking about 10 gigawatts?
B
First of all, you said it's not just chips. Hawk touched on this too. But the vertical integration point is really important. We are able to think from etching the transistors all the way up to the token that comes out when you ask ChatGPT a question and design the whole system. All of the stuff about the chip, the way we design these racks, the networking between them, how the algorithms that we're using fit, the inference chip itself, a lot of other stuff, all the way to the end product. And one of the many reasons I'm so excited about it is by being able to optimize across that entire stack, we can get huge efficiency gains and that will lead to much better performance, faster models, cheaper models, all of that, as you get that better performance and cheaper and smarter models. One thing that we have consistently seen is people just want to use way more. So we used to think like, oh, we'll optimize things by 10x and we'll solve all of our problems. But optimize by 10x and there's 20x more demand. So 10 gigawatts, 10 incremental gigawatts. This is all on top of what we're already doing with other partners and all the other data centers and silicon partnerships we've done. 10 gigawatts is a gigantic amount of capacity. And yet if we do as good of a job as we hope, even though it's vastly more than the world has today, we expect that very high quality intelligence delivered very fast on a very low price. The world will absorb it super fast and just find incredible new things to use it for. So what is the hope with this? The hope is that the kinds of things people are doing now with this compute, writing code, automating more and more of enterprises, generating videos and soar, whatever it is, they will be able to do that much more of it. And with much smarter models, that's amazing.
A
So, Greg and Charlie, when you think about historically, when people have tried to develop chips or hardware to suit whatever was the current modem for using computing at that point, what examples have you looked upon historically to figure out how to plan forward? What's been inspiring you when you think about this?
D
Well, I'd say the number one thing, honestly, is working with good partners. I think it's very clear that we as a company are not able to do everything ourselves. And getting into actually building our own chips for our own specific workloads was not something we could do from a total standstill without working with Hawk and Charlie and Broadcom. So it's just been really incredible to lean on their expertise together with our understanding of the workload. And it's been actually very interesting to see the places where OpenAI is able to do things very differently from the rest of the industry or the way that things would historically be done. For example, we've been able to apply our own models to designing this chip, which has been really cool. We've been able to pull in the schedule. We've been able to get massive area reductions. You take components that humans had already optimized and just pour compute into it, and the model comes up with its own optimizations. And it's very interesting. We're at the point now where. Where I don't think any of the optimizations we have are ones that human designers couldn't have come up with. Usually our experts take a look at it later and say, yeah, this was on my list, but it was like 20 things that would have taken them another month to get to. And that's actually really, really interesting that we were coming up on a deadline, working with Charlie's team, and we were running optimizations. We had a choice of, do we actually take a look at what those optimizations were, or do we just keep going until the deadline and then take a look after? And we decided, of course, you got to just keep going. And so we've really been pretty building up this expertise in house to understand this domain, and that's something we actually think can help lift up the whole industry. But I think that we are heading to a world where AI intelligence is able to help humanity make new breakthroughs that just would not be possible otherwise. And we're going to need just as much compute as possible to power that. Like, one example of something very concrete is that we are in a world now where ChatGPT is changing from something that you talk to interactively to something that can go do work for you. Behind the scenes, if you've used features like Pulse, you wake up every morning, it has some really interesting things that are related to what you're interested in. It's very personalized. And our intent is to turn ChatGPT into something that helps you achieve your goals. The thing is, we can only release this to the pro tier because that's the amount of compute that we have available. And ideally, everyone would have an agent that's running for them 24, seven behind the scenes, helping them achieve their goals. And so ideally, everyone has their own accelerator, has their own compute power that's just running constantly. And that means there's 10 billion humans. We are nowhere near being able to build 10 billion chips. And so there's a long way to go before we are able to saturate not just the demand, but what humanity really deserves.
A
So, Charlie, being very deeply technical and being with a company that's been at a number of forefronts of some of these revolutions, what's it been like working with a company like OpenAI and working with Greg on this?
C
So for us, it's been absolutely exciting and refreshing because the beauty of the work we do together is, is focus on a certain workload. We started actually first looking at the IP and AI accelerator, which is what we call the xpu. And then we realized very quickly that we now can actually go to the workload, all the way down to the transistor. And as Greg was just explaining how we can both work together to go customize that platform for your workload, resulting in the best platform in the world, then we realized, as Sam was saying earlier on, it's not just that XPU or accelerator, actually, it's the networking that needs to go to scale it up, scale it out, and scale it across. And so suddenly we started seeing that we actually can drive next level of standardization and openness that not just only benefits us, I think it actually will benefit the entire ecosystem and it gets Genai to an AGI much faster. So very excited about the technical capabilities of the teams we have, but also the vision and, and I think the speed at which we've been moving.
A
I'm still kind of wrapping my head around the scale of it because it's just from both trying to design something like a chip and to help figure out how you're going to get the maximum efficiency on this to just the size of it, the infrastructure, what's involved in this, this is a global effort, and what comparisons you've been able to draw for this to other examples in history.
B
I always think the historical analogies are tough, but this is as far as I know. I don't know what fraction building the Great Wall was of global GDP at the time, but a lot of ways that you would look at the AI infrastructure build out right now, you would say it's the biggest joint industrial project in human history. And this is like, this requires a lot of companies, a lot of countries, a lot of industries to come together. And a lot of stuff has to happen at the same time. And we've all got to kind of like invest together. But at this point, given everything we see coming on the research front, given all of the value we see being created on the business front, I think the whole industry has decided this is like a very good bet to take. But it is huge. You go to one of these, even 1 GW data centers and you look at the scale of what's happening there. It's like a tiny city. Like this is. It's a big complex thing. So it is just like incredible scale
D
to the point of this being a massive collaborative project. I feel like whenever I call Charlie, he's in a different part of the world trying to secure capacity, trying to find a way to help us build what we're trying to do together.
C
Exactly. Actually one of the coolest thing actually I was thinking about is what we're doing together in this wonderful partnership. We're defining civilizations, next generation operating system. And we're doing it, as you're saying, at the transistor level, building new fabs, building new manufacturing sites, all the way to building these racks and ultimately the data centers. You're talking about 10 gigawatts of data centers.
A
Yeah, I think that's an important thing to keep track of is often people get fixated just on the chips themselves. And it's kind of like thinking the national highway project was about selling asphalt or railroads are about steel. In reality, it's the things become possible on top of that. And you've probably thought a lot about that, like what happens?
E
Well, I think this is like railroad Internet. That's one. I think this is becoming over time critical infrastructure or critical utility. And more than just critical utility for say 10,000 enterprises. This is critical utility over time for 8 billion people globally. That's, I think it's like the industrial revolution of a different sort coming forth. But it cannot be done with just one party or we like to think and done with two. But more than it needs a lot of partnerships, it needs collaboration across an ecosystem. And also because of that it's important to create much, as we say, about developing chips for specific workloads, applications and LLM. It also requires somewhat standards that are open, more transparent for all to use, because you need to build up a whole infrastructure at the end of the day to become a critical utility for 6 billion people in the world. And we're very excited, frankly, which is why we think we make great partners, because I think we share the same conviction. And more than that, it is about scaling computing to create breakthroughs in super intelligence and models is building the foundation of that.
A
You guys have a lot on your plate. Why design chips now?
D
Well, this project, we've probably been working on it for 18 months now, and it's moved incredibly quickly. We've hired some really amazing people and I think what we found is that we have a deep understanding of the workload and we work with a number of parties across the ecosystem. And there's a number of chips out there that I think are really incredible and there's a niche for each one. And so we've really been looking for specific workloads that we feel are underserved. How can we build something that will be able to accelerate what's possible? And so I think that that ability to say that we are able to do the full vertical integration for something we see coming, but it's hard for us to work through other partners. That's a very clear use case for this kind of project.
E
Yeah, actually more than that. And Greg, you put it very well. Really why you want to do your chip is computing is a big part of what's gating this journey towards superintelligence toward creating better and better frontier model. It really a lot of it down to computing and not just any computing. Computing that is effective, high performance and efficient, given especially on power. And what Greg is saying is exactly what we learned and saw here. For instance, if you want to train, you design chips that are much stronger in computing capacity, measured T flops as well as network, because it's not just one chip that makes it happen, it's a cluster, as Charlie put it. But if you want to do inference, you put in more memory and memory access relative to compute. So you are actually over time creating chips optimized for particular workloads, applications as we go along. And that at the end of day, is what will create the most effective models, is a platform that you want to create end to end.
D
Also, one piece of historical context is that when we started OpenAI, we didn't really have that much of a focus on compute, we felt that the path to AGI is really about ideas. It's really about try out some stuff. Eventually we'll put the right conceptual pieces in place. And then AGI, about two years in, in 2017, the thing that we found was that we were getting the best results out of scale. It wasn't something we set out to prove. It was something we really discovered empirically. Because of everything else, that didn't work nearly as well. And the first results were scaling up our reinforcement learning in the context of the video game Dota 2. Did you guys pay attention to the Dota 2 project back in the day? It was a super cool project, and we really saw you scale up by 2x and suddenly your agent is 2x better. It's like, okay, we have to push this to the limit. And at that point, we started paying attention to the whole ecosystem. There were all sorts of chip startups with novel approaches that were very different from GPUs. And we started giving them a ton of feedback saying, here's where we think things are going. It needs to be models of this shape. And honestly, a lot of them just didn't listen to us, right? And so it's, like, very frustrating to be in this position where you say we see the direction the future should be going, but we have no ability to really influence it besides sort of just like sort of trying to influence other people's roadmaps. And so by being able to take some of this in house, we feel like we are able to actually realize that vision. And again, in a way that, like, we hope that we can show a direction and other people will fill in. Because the amount of compute required to bring our vision of AGI to the world, 10 gigawatts is not enough. That is a drop in the bucket compared to where we need to go.
B
That's a big drop, but the bucket's really big.
A
What becomes possible with this? When you're building your own chips for inference and for training, where can you
B
take this to zoom out a little bit? If you simplify what we do in this whole process to, you know, melt sand, run energy through it, and get intelligence out the other end, you're not literally melting sand. Like, it's a nice visual.
D
That's all we have to do.
A
I like that.
B
What we want is the most intelligence we can get out of each unit of energy, because that will become the gate at some point.
D
And
B
I hope what this whole process will show us, which is, you know, from the model we design to the chip to the rack, we will be able to wring out so much more intelligence per watt and then everybody that's using these models in all of these incredible ways will do so much with it. That's what I hope for.
E
And you control your own destiny. If you do your own chips, you control your destiny.
A
Yeah. It's interesting to think about how the things that we're doing today are pretty amazing, remarkable. But we're using stuff that wasn't necessarily designed specifically for the way we're doing it.
E
Oh.
B
I mean, the GPUs of today are incredible, incredible things. I'm very grateful. And we will continue to really need a lot of those. The flexibility and the ability to let us do fast research is amazing. But you are right that as we get more and more confident in what the shape of the future is going to look like, a very optimized system to the workload will let us ring more out per watt. That's great.
C
And it's a long journey that takes decades. So if you go back to Hawk's example, take railroads, it took about a century to roll it out as a critical infrastructure. If you take the Internet, it took about 30 years. This is not going to take five years, it's going to take a long time. So I think as we collectively, especially with this partnership, continue to figure out ways to wring out more tokens out of it, we'll discover that, oh, for this training or research, maybe a GPU is great, or maybe, you know what, we can take whatever we're doing with Greg, it's actually a platform that allows you like a Lego block to take in things and out. And now suddenly we can get another XPU or an accelerator for next gen that's targeted at a training or an inference or a research.
D
Yeah. And to the point that Sam said of GPUs have really come in an incredible way. In 2017, when we started looking at all these other accelerators, it was actually very non obvious about what the landscape would look like in five, 10 years. I think it's really a testament to companies like Nvidia AMD for how much the GPU has just moved forward and continued to be the dominant accelerator. But at the same time, there's a massive design space out there. Right. And I think that what we see is workloads that are not served through existing platforms. And that's where that full vertical integration is, is something unique.
A
It's interesting too, because the idea that you'd want to put inferences close to the user is something Kind of relatively new. We've understood training. But then you think about the number of people every day using these products and how much they need COMPUTE to do fun things or serious things. When you start thinking about the scale of it, like we talked before, I keep coming back to it's a very big thing. Where does it keep going? Is it just a thing that we're going to continuously find new Things to use?
B
Compute for the first cluster OpenAI had, the first one that I can remember the energy size for was 2 megawatts.
A
Adorable.
D
We got things done with those two.
B
I don't remember when we got to 20, I remember when we got to 200, you know, and we will finish this year a little bit over 2 gigawatts. And these recent partnerships will take us close to 30. And the world has done far more than I thought they were going to do. Turns out you can serve 10% of the world's population with ChatGPT and do the research and do SORA and do our API and a few other things on 2 gigawatts. But think about how much more the world would like to do than they get to do right now. If we had 30 gigawatts today, with today's quality of models, I think you would still saturate that relatively quickly in terms of what people would do, especially with the lower cost we'll be able to do with this. But the thing we have learned again and again is let's say we can push GPT6 to feel like 30 IQ points past GPT5, something big. And that it can work on problems not for a few hours, but for a few days, weeks, months, whatever the amount. And while we do that, we bring the cost per token down. The amount of economic value and sort of surplus demand that happens each time we've been able to do that goes up a crazy amount. So you can see, to pick a, I think well known example at this point when ChatGPT could write a little bit of code, people actually used it for that. They would very painfully paste in their code and wait and they would say do this for me and paste it back in and whatever. And models couldn't do much, but they could do a few things. The models got better, the UX got better, now we have codecs. Codex is growing unbelievably fast and can now do like a few hours of work at a higher level of kind of capability. And when that's possible, the demand increase is crazy. Maybe the next version of codecs can do like a few Days of work at kind of one of the best engineer, you know, level, or maybe that takes a few more versions. Whatever it'll get there, think how much demand there will just be for that and then do it for everybody, Every knowledge, work, industry.
D
And one way I like to think of it is that intelligence is the fundamental driver of economic growth, of increasing the standard of living for everyone. And what we're doing with AI is actually bringing more intelligence and amplifying the intelligence of everyone. And so as these models get better, I think everyone's going to become more productive. The output of what is possible is going to just be totally different from what exists today.
A
It's interesting too that going from a point when with GPD3, which was pretty cost expensive comparatively, to where you're at a level of a GPT5, and the fact that you can provide that freely to people and is that a motivating factor for you? The fact that every time you create these new efficiencies that it just benefits so many more people?
B
Yes, absolutely.
E
And from our side on hardware compute capacity, to some extent, rubber hits the road on this. It's really incumbent on us to keep optimizing, pushing the envelope on leading edge technology. And there's still room to go and there's room to grow even on where we are as we go from 2nm going forward, less smaller than 2nm as we start doing all kinds of different technology. It is really great, exciting times, especially for the hardware and the semiconductor industry.
B
What Broadcom has done here is really like quite incredible. It used to be extremely difficult for a company like ours to think about making a competitive chip, in fact so hard we just wouldn't have done it and I think a lot of other companies wouldn't have done it as well. And all of these sort of this customized chip and system to a workload just wouldn't be a thing in the world. But the fact that they have pushed so hard and so well on making it so that a company can partner with them and they can do a miracle of technology chip quickly and at scale, unfortunately they do it for all of our competitors too, but hopefully our chip will be the best. Yes, of course, it's really quite incredible.
D
And I think also not just what they can do for us today, but looking at the upcoming roadmap, it's just so exciting the kinds of technologies that they're going to be able to bring to bear for us to be able to utilize.
E
Well, it's just the excitement of, I mean, enabling Joint collaboratively models ChatGPT 5, 6, 7, on and on. And each of them will require a different chip, a better chip, a more developed chip, advanced chip that we haven't even began to figure out how to get there, but we will.
D
And actually the GPTs are definitely going to be an increasing part of that.
C
We're actually looking forward to that because my software engineers now already use that from a software point of view and it's delivering efficiencies of dozens of engineers.
B
Really.
C
Yes.
B
Great.
C
On the hardware side, we're not there yet, but you know the good news,
E
we'll get there on very.
D
We should talk about them.
C
We should absolutely leverage this. But I was going to say with respect to compute, so when we started building these XPUs you can maximum build a certain number of compute in 800 square millimeter, that's it. Now today we're actually working together to ship multiple of these in a two dimensional space. The next thing we're talking about is tacking these into the same chip. So now we're actually going in the, you know, Y dimension or Z dimension if you want to think three dimensional. And then the last step we're actually also talking about is now we're going to bring optics into this, which is actually what we just announced, which is 100 terabytes of switching with optics integrated all into the same chip. So these are sort of the technologies that will take compute the size of the cluster, the total performance and wattage of the cluster to a whole new level that I think it will keep doubling at least every six to 12 months.
A
What kind of time frame are we talking about? When are we going to first start to see what's coming out of this relationship?
B
End of next year and then we'll deploy very rapidly over the next three years.
E
Absolutely, yeah.
C
Greg and I are talking about this at least once a week. We just had a chat earlier today on this.
D
Yes, good progress today.
C
Yes, exactly.
D
But yeah, we're really excited to get silicon back, starting soon actually.
C
Yes, very soon.
D
Yeah. I think that my view of this whole project is it's not easy, right? It's easy to just say oh yeah, 10 gigawatts. But like when you look at what is required to actually design a whole new chip and to actually deliver this at scale, get the whole thing working end to end. It's just an astronomical amount of work. And I would say that we're very serious. You know, our mission is to ensure that AGI benefits all of humanity. We're very serious about benefits everyone. Like, we really want this to be a technology that is accessible to the whole world, that lifts up everyone. And you can really see that in trying to make the world be one of compute abundance, because I think by default we're heading towards one that is like, quite compute scarce.
A
You ask my wife when she's trying to get more Sora credits. It feels very scarce.
D
No, no. We feel it so concretely, like teams within OpenAI, that their output is just like a direct function of how much compute they get. And so that the amount of sort of the amount of intensity on the like, who gets the compute allocation is so extreme. And so I think that what we really want is to be a world where just if you have an idea you want to create, you want to go build something that you have the compute power behind you to make it happen.
A
Gentlemen, thank you very much for sharing this with us. This is going to be very exciting to see where this goes, and I hope we can keep talking about this as it continues to develop.
B
Thank you. Thank you, guys, for the partnership.
A
Thank you.
E
Thank you for the partnership. We really enjoying it.
D
We are too.
C
Yep.
E
Thank you.
Date: October 13, 2025
Host: Andrew Mayne
Guests:
This episode unveils a major partnership between OpenAI and Broadcom, centered on designing and deploying custom AI chips and integrated systems on a historic scale. The discussion dives into why this collaboration matters for the development of advanced AI—in terms of both technical breakthroughs and global impact—as well as how the partnership seeks to build the foundation for future AI models, including steps toward AGI (Artificial General Intelligence). The conversation is forward-looking, practical, and packed with insights on the scale, challenges, and philosophical implications of building out world-class AI infrastructure.
"You would say it's the biggest joint industrial project in human history."
—Sam Altman [00:15]
"We are able to think from etching the transistors all the way up to the token that comes out when you ask ChatGPT a question and design the whole system."
—Sam Altman [03:28]
"You optimize by 10x and there's 20x more demand."
—Sam Altman [03:28]
"We've been able to apply our own models to designing this chip, which has been really cool... the model comes up with its own optimizations."
—Greg Brockman [05:34]
"The path to AGI is really about ideas... but the thing that we found was that we were getting the best results out of scale."
—Greg Brockman [15:23]
"We're defining civilization's next generation operating system."
—Charlie Kawas [11:02]"This is like railroad Internet... critical utility over time for 8 billion people globally."
—Hauk Tan [11:41]
"We're actually working together to ship multiple of these in a two-dimensional space ... The last step we're actually also talking about is now we're going to bring optics into this...100 terabytes of switching with optics integrated all into the same chip."
—Charlie Kawas [26:09]
"What we want is the most intelligence we can get out of each unit of energy, because that will become the gate at some point."
—Sam Altman [17:26]
"What we really want is to be a world where just if you have an idea you want to create, you want to go build something that you have the compute power behind you to make it happen."
—Greg Brockman [28:06]
"We're defining civilization's next generation operating system."
—Charlie Kawas [11:02]
"Compute is the gating factor on this journey towards superintelligence."
—Hauk Tan [14:04]
"If you simplify what we do to... melt sand, run energy through it, and get intelligence out the other end..."
—Sam Altman [17:08]
“Intelligence is the fundamental driver of economic growth, of increasing the standard of living for everyone.”
—Greg Brockman [23:03]
"If you do your own chips, you control your destiny."
—Hauk Tan [18:02]
OpenAI and Broadcom’s partnership marks a new era in AI infrastructure, culminating in custom, vertically integrated compute systems—pushing the limits of both scale and efficiency. With ambitions to serve billions and underpin the next generation of AI models, both parties see this endeavor as essential, not just for their organizations, but as foundational for society’s future productivity and opportunity. The mood is both urgent and optimistic; advancing compute is seen as key to making AI’s benefits accessible to all.
For anyone interested in the intersection of advanced AI, hardware innovation, and the massive societal shifts being set in motion, this is a milestone episode, rich in technical depth and vision for the future.