
Loading summary
A
Can you tell us about the dot com boom?
B
We did much more technically interesting work in the bust than we did in the boom. There's a degree to which innovation requires some level of desperation, that good economic times are kind of hard to summon that desperation.
A
How have AI tools changed how you're working at Oxite?
B
Certainly we're using cloud code a bunch and people are doing that, but for a lot of the work that we're doing, it is helpful as maybe a polishing tool, but less as at the epicenter of its creation.
A
Can you tell me what it actually means to design or build a computer?
B
Oh, it's very involved. Yeah, it's very involved. So first of all, how have servers.
C
And cloud infrastructure evolved since the late 1990s and what is next? Brian Cantrell was a distinguished engineer at Sun Microsystems during the dot com boom and dot com bust, built a small competitor to AWS called Joyent, and is now the co founder at Oxide. Today we go into the history of servers in the cloud from the late 1990s to today, the challenges of building hardware like the Oxide computer from scratch, how the Oxide team uses AI and why they find it practically useless for hardware engineering challenges, why Oxide builds everything as open sour, and how they manage to work remotely as a hardware startup, and many more. If you'd like to understand more about how the cloud works and learn how nimble hardware plus software startup operates, this episode is for you. This podcast episode is presented by statsig, the unified platform for flags, analytics, experiments and more. Check out the show notes below to learn more about them and our other seasoned sponsor.
A
So, Brian, welcome to the podcast.
B
Oh, it's great to be with you. Thanks for having me.
A
I'd love to jump back in time a lot back in the 1990s because you're someone who's been around the block and back then you worked at some interesting companies, including, including at Sun. And if you could give us listeners and viewers a sense of what was it like in the 90s in terms of software servers? What was the vibe like?
B
Yeah, it was an interesting inflection point because I was interviewing in 1995, I started in 1996. So I would say that the Internet, I mean, HTTP had been developed in like 9394. We had kind of the first web browsers, but it was still very, very, very new. And the Internet was just kind of primed for takeoff. Java had been. Java had come out in maybe in 1995. Java had kind of taken off immediately. So there was a Lot of really exciting energy. But it was nowhere near what would become a couple years, even a couple years later. It became very frothy, of course, and it was exciting. It was very clear to me. I went to school actually in the east coast. But just coming out here to Silicon Valley, the energy was extraordinary. And really knew that I wanted to come out here for my career. So at sun those next couple of years, I mean, I got very lucky really because sun was in the right place at the right time with the right technology, which sometimes you only appreciate in hindsight because it was so explosive. And if you wanted to build a website as part of that.com boom, you, you were buying sun servers, you were buying Cisco Switches.
A
Now why was this the case? Because again, just taking myself back, just being a bit naive, I would assume that let's say I'm in the 1995, I want to build a website. Could have I not just used a PC and spun up a server? Did it not work like that?
B
Or how did it work? I mean, a PC, like maybe. But you didn't really have an operating system, right? Because you. Linux is. Linux is very, very new. Linux is not.
A
I'll back down.
B
Oh yeah, definitely. Linux is, you know, be like Haiku today, which is an operating system you haven't heard of for a reason. It's kind of like a hobbyist operating system. You know what I mean? You'd be like, what? No, you wouldn't run. And then you kind of had the BSDS where the FreeBSD was certainly out there also. Still very much under the shadow though of this lawsuit from att. So the unices are kind of. There's not really open source operating system options. There was the. Actually this was kind of funny because. So where was the GNU option? It was gonna be the Herd operating system. So Herd was kind of like the Duke Nukem forever of its time. It was the operating system that was constantly coming kind of next year and next year and next year. And it was gonna be Microkernel based. And so you know that it's kind of amazing, but you really couldn't do it on PCs because of the lack of system software. And actually part of my attraction to sun was I had used Solaris on Spark, but I never. I knew Solaris existed on x86, but I never used it. So I was excited to use Solarisonic City says.
A
And so what did sun build? You mentioned Solaris. That was the operating system.
B
Solaris is the operating system. We built servers, so we built Spark based Servers. We built desktop machines. So we, sun was a computer company, it was a systems company. So we built desktop machines, built some ill advised laptops. So basically desktop machines, workstations. But then at that time in the 90s, what was really exploding were everything from those kind of work groups, tile servers up to really getting bigger and bigger servers up to the very large machines. Machines are as physically the same size as what oxide makes today. And I remember vividly in what would have been like 97, 98 maybe Greg Papadopoulos then the CTO of Sun giving to the entire company saying, here are the top three applications for Sun Microsystems. Databases, databases and databases. So that gives you an idea of kind of how it was being used. And this is again as that kind of in that, that, that knee up of that dot com build out where if you, again, if you wanted to really build a web presence, you were going to use, you were going to use Java, you were going to use, you're gonna do it on Solaris, you're gonna do it on sun servers. And you were gonna. And it was kind of, it was a wild time for sure.
A
And can you tell us about the.com boom? Because right now I know AI is pretty exciting and it feels like we're in a special time, but what was it like especially working on Sunk? It sounds like it was at the.
B
It was the epicenter of it. And you know what was funny is I did, it was frenetic in a way that was not always positive. So one of the things that is, that is just a point of fact and one can take from what one will. I did, we did much more technically interesting work in the bust than we did in the boom. Because I think that when you're in boom times, you know, everyone kind of like secretly believes that this is because of me. Like that it is because of the thing that I am working on. If I, you know, I once had, you know, one of the, one of the early technologists behind Java once told me with a straight face, every server that sun sells, they sell because of Java. And I'm like, you know what, you know what's most amazing? You believe that is actually the more interesting fact that I mean it is like obviously false, especially with you know, databases, databases, databases being the top three applications. But that kind of reflects the zeitgeist of the time that everyone believes that this is, you know, if I work on the microprocessor, it's because of the microprocessor is perfect. If I work on the operating system it's because, oh, it was the operating system that people are buying the machine for. And that doesn't really lend itself to, really to real innovation. I think, I think there's a degree to which like innovation requires some level of desperation that good economic times are. It's kind of hard to summon that desperation sometimes. So I think that during the boom it was, and it was just, it was frothy and it felt like there was a period of time where I'm like, this obviously can't go on forever. And you know, the Economist is having these very like gloomy covers about how this is all going to end and it's going to be an apocalypse, which I believed. And then I just stopped believing it. I'm like, well, maybe the Economist is right and just went on longer. And you know, one of my early life lessons from the boom and bust is these things go on longer than you think possible. But when they, the growth in terms of the boom, when you're in frothy times, that boom will go on longer than you think possible. And when it switches, it will collapse faster than you can fathom in the boom.
A
Do I understand correctly that customers were just like wanting to buy your servers? They were flying off the shelves. All these companies were everything. And on a day to day work, what did it mean for you?
B
So I'll tell you like in day to day it meant, first of all, it meant that traffic was terrible, that the, you know, there is, you couldn't get housing, you couldn't get, you know, everything was in short supply. You couldn't. Customers are, you know, they are buying. We had a customer that, you know, was, but was going to buy 19,000 servers, which is obviously a big, very big number.
A
And these were these massive, big servers, right?
B
Yeah. Well, in that case those were actually one use servers to build out a broadband initiative that actually was a company called Enron. You know, I remember vividly we were at a, a dinner here in the city at a restaurant called Aqua, which is a very kind of fancy restaurant, long since out of business. And I don't think Aqua survived the bust. And we were at Aqua with a bank who was a customer of Sons. And they were spending a galactic amount of money every year with Sun. And we were at a dinner and I just remember, I mean it was the kind of like 19th century Gilded Age kind of dinner. People are ordering, you know, nine courses. What I remember is at the end of that having Chateau d', Aquimes, which is a Sauterne. So I don't know very much. I don't know very little about wine. I know nothing about Sauternes. What I did know is there was someone who knew wine and is like, we are going to all drink the 1952 Chateau d' Aquims Sauternes. Which is. And I remember being like. I'm like, I'm not much of a drinker, but I was like too drunk at that point to really appreciate it. So I have had this Sauterne that you know that oenophiles kind of live their life to drink. And I'm sad to inform you that there's one less bottle of this precious vintage because it was poured down the gullet of a 20something.comer who really had. And I just remember being back in my apartment, being literally drunk on Chateau d', Aquims, thinking about in Potrero Hill and remember thinking to myself, this can't last. This is not sustainable. And I swear the dot com boom turned to a bust. Like that night that is September of 2000. So the pets.com had kind of busted out and there were a bunch of. NASDAQ had busted out early in 2000. The traffic got lighter early in 2000. Anyone who was here would be like that. The absolute spookiest thing is it went from like gridlock to like Covid, like traffic in the span of like a.
A
Month without Covid happening.
B
Without Covid happening, with only the NASDAQ collapsing. And you're like, okay, that's very odd. And then 2000 kind of muddled along and then the. That dinner was in September of 2000 and the. What really stopped was the telco buildout. So there was a lot of telco buildup because people are like, the Internet.
A
Is the future and telco build up, meaning the towers, the servers, the servers, the infrastructure for.
B
And then all of the concomitant, the fiber. Like JDS Uniface was a huge company. You had these companies that were, you know, Global Crossing and MCI, WorldCom and all these companies were explosive. And everyone believed that the Internet is the future and this is like an important thing.
A
And they were right.
C
Brian just said how an important lesson with the dot com boom was that people who believe the entry will be the future future, they were right. Today we're in a similar stage with AI. It's pretty likely that AI will be part of the software stack in the future, even if timing is harder to predict. The latest shift is how AI agents are becoming a lot more commonly used for development. And this is a great time to Talk about our season sponsor, Linear, and how they think about collaborating with agents. Linear has taken an interesting approach here. Instead of building one proprietary AI assistant and locking you into it, they built an open API and SDK that lets any agent plug into your issue tracker. That means you don't need to wait for Linear to build the features that you need. You can connect the best coding agents on the market, like Cursor, GitHub, Copilot, OpenAI codecs and Devin, or you can build your own agent for your team's specific workflow. It's a fundamentally different approach from most issue tracking and project management tools on the market. You get optionality and the experience is surprisingly natural. You assign an issue to an agent the same way you assign it to a teammate. Or you can simply mention the agent in an issued thread.
A
Cursor then can pick up a bug.
C
Understand the context from the issue, open a pr CODIS can explore a fix while you're focused on something else centric and root cause analysis. When something breaks, it's pretty powerful what you can get these agents to do. And here's what I like you. The human stays the accountable owner. The agent works for you, not instead of you.
A
You review the work.
C
You decide when it's good and when it ships. If agents are going to be a part of the toolset of building software, and it feels to me they increasingly are, you'll want a system that's actually designed for them. Linear is a system like this. To learn more, head to Linear app Agents. And with this, let's get back to the point where Brian was saying how.
A
Those believing the Internet will be the.
C
Future back in 2001 were right.
B
This is the other thing. It's like they're right. And so like a very famous impact crater from the.com boom is Webvan, right? Webvan was delivering groceries, which many people today are going to get their groceries delivered right instacart. It's like they weren't wrong, but their timing was off and they lost track of the underlying economics completely. And so when it busted out, so in the fall of 2000, in November of 2000 in particular, there were, there were zero orders from telecoms at Sun. Like it went to zero.
A
Wow.
B
And every. And you know, you're kind of used to kind of ups and downs, but that's like just like off a cliff. And from that point we, you know, going thousand and then, and then 2001. And it was then very, very grim. I would say that the thing that, that happened through the bust and layoff after layoff after layoff. And because companies had kind of built themselves and geared themselves around these fat times lasting forever, and now they were gone. And expectations, as frothy as expectations were during the boom, they were that much negative in the bust. People are like, everything is, it's the end of days.
A
And were you a software engineer back?
B
Yeah, software, Yep.
A
And then so as a software engineer, like both you and also thinking about your colleagues back at the time or friends, how did it impact you? Were you kind of just chugging along or.
B
I would say that, like lots of people left and you had like the statistic of, you know, the u hauls were 10 to 1 out of the Bay Area. So you just moved away. They moved away. And the thing that I noticed is that the people that had moved out to Silicon Valley because they were, they really had an interest in the technology, all were there, all stayed. And we're not adversely affected, honestly. I mean, I. The. Yes, we, every one of us. If you had equity in your company, which of course you all did, like, you try not to overthink it, right? You just try to like, you try to remind yourself, like, I never had it to begin with. So like, it's hard to, you know, but it's definitely gone. Sun lost 98% of its value, so it's like definitely gone. And you know, there was some thinking and I think it also, like a boom can get you to care about things that you actually don't care about. And a boom can get you to. Because in a boom, everyone is so financially driven that it's hard not to become financially driven. But it's like, that's actually not why I got into this. And so during the bust, I'm, you know, definitely able to put, you know, put a meal on my table and a roof over my head. But the, it was really a reminder about, like, what's important. And again, because we did, we did do better technical work in the bust than we did in the boom. And I think it's because in the bust it's like, okay, now we have to focus. We have fewer resources. The fewer resources actually force more creativity. So all of the things that we did, certainly speaking at sun, in System software, so ZFS and DTrace and the service manager ability, all these things that were really revolutionary for the operating system all happened in the same kind of post bust period of time. So all of those things happened from 2001 to say, 2005.
A
And so what were these specific innovations?
B
So I'd Gone to work at sun to work with Jeff Bonwick. And as long as I'd known Jeff, from the mid-90s, Jeff had wanted to rethink file systems. And now finally, in the early 2000s, he and Matt Aarons were able to really go take a clean sheet of paper from the file system, and that's zfs. I had a chip on my shoulder about the way we understand debug systems, by the way we observe systems. So I, along with two other colleagues, did DTrace, which allowed us to dynamically instrument systems. And you can kind of go down the line. And there were a bunch of things like this where we. And I don't know that all of this is related to the bust. It's just that the timing lined up such that it was all happening during the bust. And what we ended up with was a whole bunch of interesting technology coming together, actually in a single version of the operating system. And then very, I mean, fortunate for us, and I do think this is a bit of a consequence of the bust because sun was definitely open to new approaches. We open sourced all the operating system. So that happened in 2005. And that was very important to give these kind of technologies eternal life.
A
But I think we can never predict the future. But to me, it is pretty positive in the sense that even in the bus, hearing the stories that innovation did not stop, sure, you know, sounds like it was probably harder to get jobs and they might have been fewer of them, but, you know, industry kept innovating. And what you said, I didn't expect to hear that it was a bit easier to innovate.
B
It's just less manic. We were able to focus more. And so not that now, I mean, not that one should necessarily pine for a bust because busts are brutal, but there is a clarity that you get too. So, I mean, ideally you would like to have just like, can we just be like, normal economically? But like, nope, apparently in high tech we've got to be like, on or off.
A
So, bust aside, in the early 2000s leading up to this Internet boom, the way to, you know, most companies went about buying sun servers with Solaris installed, and everything was hardware and software came together. It was beautiful. It worked well. Again, I heard from folks who did it what happened then, because when I got into sec in 2000, I did not hear about Solaris, and that was not how it worked.
B
No. Right. That's right.
A
What was the shift?
B
So the shift was first of all open source. Right. So then, so, you know, we said in the mid-90s Linux was kind of still very much the hobby project. Not so by the 2000s.
A
Right.
B
So grew up. It grew up, absolutely. And it grew up because you had a bunch of companies that really backed up the truck. And you know, the things that at first, IBM and sgi, Data General, some other companies, those companies were very important because they decided to contribute their technologies like XFS. Right. XFS many people still use today on Linux. That's from SGI. XFS was SGI on Irex. That was happening in kind of the late 90s and then in the 2000s. I mean, Google was always built on Linux. Right. And so you had kind of the companies that became that next boom were all built on open source and indeed needed to be built on open source. So they economically relied on open source to be able to build what they build. So then it became much more practical to certainly run Linux. And I think the other bsds or we open sourced Solera, so there were a lot of options that were now available. So that shifted. I think the other thing that that shifted is that, I mean, Spark bluntly lost to X86 and Sun.
A
And Spark is a Harvard architect.
B
Spark is a microprocessor. Yeah. And, and there was, because there was a time in the 90s when if you wanted the fastest microprocessor, it was a RISC microprocessor. It was, it was from. It was a Spark microprocessor or it was MIPS or it was alpha. And x86 was, was a commodity but, and obviously available with a personal computer, but was not faster than those, those RISC microprocessors that shifted, that shifted in the late 90s. And we, you know, because we ran the operating system on that, what was in Solaris, on Both Spark and x86, we could see how fast these x86 machines were and could see frankly how like, you know, you talk to the microelectronics folks and they really did not. They kind of dismissed x86 and dismissed intel and you shouldn't do that. And in particular intel was, was very focused and architected the way around what was called the memory wall. And they were able in part because they used speculative execution, they were able to actually make these microprocessors that became much faster than the RISC microprocessors. So by the time, say you are in 2004, 2005, if you want a leading edge microprocessor, it's x86. So that was a big and important shift. So by the time you're coming up it's like, okay, yeah, if I want this, if I'll just like, I don't know, get like a Dell box or Super Micro box and then I'll put Linux on it or maybe FreeBSD and away I go. Then the next kind of big and important shift that happened started in 2006. You could argue with S3, but then especially in those next kind of AWS 7, 8, 9 with the introduction of EC2. And now you have like the cloud that starts to come into play and now like people are like, why would I even like screw around on the server at all? I mean it was so great to be able to just spin up infrastructure.
A
Yeah. I remember one of my early companies, mid 2000s, we had a server room. We had server administrators. The server room was always hot. And this was a small company, mind you, this was not a big one.
B
Every company needed to do that. It's kind of amazing to think it's like that. Every single company, no matter if you were a website, you had your own.
A
Server room and if you were a dev, you wanted to be friends with the server admin because when you wanted to deploy your stuff, know they could do stuff for you, they could do.
B
Stuff for you or that's it. Totally. And so I think that cloud computing was really important. This is not a deep thought that elastic infrastructure was really important. The ability to have API driven infrastructure and that. So for me personally, so I was, I was at sun and then from. In 2006 I started a storage group inside of sun which was great, really successful group, but so successful that it actually attracted Oracle as a customer for the first time in a long time. I kind of, this is like a little bit of like residual, like shame that I have that like did I attract the marine apex predator that ate the company?
A
Because Oracle, Sun. Right.
B
And then they kind of bought son in and that closed in early 2010. I left shortly thereafter because I could see what Oracle was.
A
Well, I never heard a story of your potential role here.
B
Right. So I, Yeah, and Oracle and I gave some. Maybe a year later I gave a talk in 2011 with some rather unvarnished opinions about Oracle and Larry Ellison in particular. I caution people about anthropomorphizing Larry Ellison. You have to treat Larry Ellison as a machine, like a lawnmower. You stick your hand in the lawnmower, it'll chop it off. Well, this isn't so. All right, so I'm giving this talk in 2011 again. This is after I've left what Was then Oracle. And I was just saying things that I felt were obvious, but the audience is kind of gasping and it's like. And people are coming up to me after the talk, like, do you think there's gonna be, like, it's gonna be retribution from Oracle? I mean. No, no, you're misunderstanding. Like, there's no, the lawnmower's not angry at you. It's a, it's a machine. It doesn't, it doesn't have, it doesn't have the mirror neurons to be. I would almost, I, it would almost show me that I'm wrong for Oracle to resent what I'm saying about them. Oracle. Anyway, so. But all the videos for that conference go up and my video does not go up. Oh, right, okay. And so my colleagues were like, this is an Oracle conspiracy. I'm like, this is not an Oracle conspiracy. Wasn't. It wasn't orchestrated by Oracle. But what I did, I, what I underestimated was the fear of the conference organizers. So they themselves were terrified of offending Oracle.
A
Yes. Even though it probably would have been fine.
B
No. So the talk did finally go up. Before the talk starts, there is a disclaimer. The views in this talk do not represent the views of the Usenix Association. And you're like, all right, I get it. Like, I've never seen this disclaimer before. But fine. Then during the talk, you know, the format of the talk is you got a slide and then you've got like a little blank strip and then you've got this talking head in the lower right corner. There's like kind of this dead space above the speaker. They took this disclaimer and they re justified it and they put it above my head the entire time I'm speaking. So if you. And in, I mean, and maybe in this regard they were prescient because to this day, if Ellison is mentioned on Hacker News or Oracle is mentioned on Hacker News, someone will immediately cite minute 33 of this talk, which is when I go on this kind of Oracle again. I don't view it as a rant. I view it as just like me describing what is obviously true, that we all know. But anyway, so I had left, I left Oracle after they bought Sun.
A
So we're now around like 2000. So cloud has taken off. X86 architecture is everywhere. Linux is now winning both for small time servers, but also on the cloud. And then what happens? This was an interesting time when Google started to figure out that, hey, they could do something interesting on their cloud, right?
B
Yeah. That's right. So this is still a little bit before that. So this is in kind from, I would say from 2010 to about 2014 is when is a period of relentless execution from AWS. AWS is executing so extremely well. There are not really other public cloud options. There's like kind of Azure's kind of drifting out there.
A
People forget that that, you know, like GCP on paper has been around from 2009, but up to like 2014, it was like, it was almost like a joke.
B
It was a joke, I would say before it was. It like it existed, but it was a joke. And, and the, and in particular at every single reinvent, Amazon would announce a new price cut. And if you were a competitor to aws, you are like dreading reinvent because here comes another price cut. If you are a partner of aws, you're dreading reinvent because here comes the announcement of a new service that competes with what you're making.
A
I think people who've not been around have forgotten, but it really has happened and because it's not been the norm the last like, let's say five, ten years or so.
B
Well, and in particular they did a couple things are just like, man, you got to, to tip your hat to just. I mean Jeff Bezos is the apex predator of capitalism. Like Larry Ellison may be the lawnmower, but Bezos is ultimately the apex predator because the thing that was so impressive is they were able to give people the idea that this was a terrible business. So in particular they did not break out their financials. So everyone's like, oh my God, what an awful business. Like they're cutting the price every year. Like you do not want to, like, this is a classic red ocean, it's bloody. You don't want to compete. And so we were at Joyent, we were actually competing head to head with aws.
A
So you were offering a public cloud.
B
So we have a public cloud. And then unlike aws, taking the software that we had used to run the public cloud and making it available for people that wanted to run a cloud on prem on their own hardware. So people that would buy Dell or HPE or Supermicro, they would buy our software and they would run it on there and get a cloud. So we, we ran a public cloud and we knew what the economics of a public cloud were, namely pretty good, margins were good. And so what we knew that Amazon, that Amazon wasn't volunteering, but what we knew is that AWS S3 was underwriting a war on big box retail S3 was paying for your prime shipping. It was a genius move.
A
And so also some, some insider information that you had because you did your own thing.
B
Well, we don't know that the margins are very good. And then of course, I mean we did. I. You will be unsurprised to learn that several of Joyent's most prominent customers were retailers. Retailers. This was not lost. Retailers were like, gee, I wonder what's happening. Retailers are like, if you think I'm gonna take my dollars and spend them on aws, so AWS can I. So Amazon can go to war with me? Like, no thank you. There was a period of time when it felt like in order to be in the cloud you have to implement every AWS API. So there's this idea that you had to be API compatible with EC2. There's a company called Eucalyptus that tried to do this. It was just a disaster. And part of the reason it was thought that GCP and Azure could never compete with AWS because they could never be API compatible. And so I am convinced that the. Because what changes? What changes is like 2015. What starts in 2015? Kubernetes. And I think that part of that initial attraction to Kubernetes is that people wanted to get some optionality around their cloud and they felt locked into aws. They're like, I'm not using all this stuff. I'm not using elastic beanstalk, I'm not using Greengrass, I'm not using kind of these more, I'm not using redshift. What I actually want is this kind of basic infrastructure and Kubernetes now gives me this layer upon which I can deploy and get some sort of true cloud neutrality. So multi cloud didn't really exist, I would say, before Kubernetes. And I think a lot of that, especially early momentum behind Kubernetes is around this idea of like I need to get some optionality in here. I want to actually be able to go to gcp. So I think, you know and I don't. I think it's giving Google slightly too much credit, but only slightly too much credit to say that it is masterstroke.
A
On the podcast I had Kat Cosgrove who's released a project manager on Kubernetes and you know, she's been in the project for a long time and I asked her, she's not, she was never a Google employee, but I asked her, why do you think Google open source Kubernetes, which, you know, they have Borg, which is amazing and they kind of built Honestly a better version for external. And they just released it just like that. They put a lot of work in it and to me it didn't really compute. Like why would Google like what is the business reason? And she told me that she thought again, speculation from the outside that she thought that they probably thought that it would help Google cloud.
B
That's right.
A
To have a container which is now portable. And now you can give the promise that if you run this on Azure, especially aws, you could come over. So it kind of makes sense. Is this your thinking?
B
Yeah, absolutely. But I think that is definitely, definitely the argument that Kubernetes proponents would make inside of Google in terms of like why they did it. Nobody prevented it, you know what I mean? It's like they kind of open sourced it.
A
Google was a pretty cool place in the sense that it was very bottoms up as I understand back then still.
B
And then I think part of their, you know, it was Craig McCaughey who really pushed for the CNCF, the formation of the CNCF around Kubernetes to give it kind of a foundation home. I did, I do remember one conversation with Craig and I were talking early as he's contemplating the CNCF and he's like, well, I think this is going to allow Kubernetes to get the marketing dollars that it needs. I'm like, don't you work for the most profitable company on earth? Like do you really? Isn't it just like gushing cash over there and you can't get like, you know, a couple million bucks for marketing for this thing. But no, apparently you can't. So, but, so I think that the argument that people were making internally was about we should be encouraging cloud neutrality because we are the ones that have something to win. And they're right and they did. And GCP is now not an afterthought. GCP is very important. It's a very big business. And I think that they've got is Kubernetes to thank solely no, but I think it's played an important role for sure.
A
And where are we today in terms of the hardware and the software stack running? Specifically thinking of these big clouds, what's happening inside the likes of Meta? These giants, as I understand, you know, they're no longer just like, you know, ordering servers from Dell or wherever.
B
Never work, were, never, never were, never were.
C
What did they do?
B
They. So it's kind of funny because for all of these folks they took a somewhat similar path. They never were. Because in Google's earliest Days they were assembling machines from Fries, you know, RIP Fries, Fries being a local electronic shop that has long since disappeared. But they were kind of famously velcroing machines together and finding.
A
So they bought like the processor, the. That's right, the different car networking switch, whatever.
B
And they had this idea that like, like it doesn't matter what junk we run on because you know, our, our, our software is gonna run as a distributed system. It actually doesn't matter. We don't need ECC protected memory because it doesn't matter if your dibs fail. And so I think they learned. Well, it does matter a little bit if your dimms have rampant data corruption like dims failing. That's actually not a problem. Dimms your memory returning the wrong thing like that is a problem. You can actually like, you turn that like next thing you know, like your software inserts that into a row into a database and like. Yeah, now you got it.
A
Yeah, that is. Correctness is a problem.
B
Yeah, yeah, correctness is a problem. It's like, okay, overshot the mark. So by the time they're like, okay, we're not gonna have Velcro machines together, we're not gonna. But by that point in time, you know, the business was established enough that they actually did, they built the machines that were fit for scale. So they have a great book that was written in the kind of the mid-2000s, the warehouse size computer where they talk about all the things they did. DC bus bar, really thinking about power across the entire dc. So they kind of, they went from being kind of too cheap for kind of Dell or even Soup Micro to then being much better engineered than those systems ever were. So they were never really meaningful customers. And ditto for Facebook Meta. They were never really meaningful. I mean they kicked them out very early and did their own stuff.
C
Brian just talked about how Facebook built their own servers because off the shelf solutions didn't work at their scale. And what's interesting is that companies like Meta and Google didn't just build better hardware. They also built incredible internal tools. Tools for safe deployments, feature flagging, experimentation, debugging, analytics, the whole stack. That lets teams shift fast and with confidence. Most companies never get access to this level of infrastructure. You either build it yourself, which takes years and large engineering teams, or you make with scatter tools that don't talk to each other. That's exactly where statsec comes in in Static is our presenting partner for the season. And they give every engineering team access to the kind of tooling that only the biggest Tech companies used to have internally. At its core, static is a toolkit for safer deployments and experimentation. You ship a new feature to 10% of users behind a feature gate, you validate that it behaves correctly. Watch the metrics and expand to remaining 90% only when you're confident. And if something goes wrong, you can turn it off instantly long before it affects everyone. And safe deployments require visibility. Statsic includes analytics, both product analytics and infrastructure analytics, so you can actually see what your code is doing in production. Errors, performance changes, funnels, user behavior. Because you cannot ship safely if you can't see what's happening. Companies like Microsoft and Notion run hundreds of experiments per quarter or statsig Velocity that used to require entire platform teams to build and maintain. This used to be infrastructure available to maybe 10 or 15 tech giants. Now startups and midsize teams use static to ship quickly without breaking things. If you want to give your engineering team world class tooling from day one, go to statstick.compragmatic. there's a generous free tier a $50,000 starter program and affordable enterprise plans. And now let's get back to the conversation about the history of computing and what might be coming next.
A
And this was independent. So like both Google and Meta both came to the conclusion of like, we should just build our own stuff.
B
And Microsoft and Amazon all came to the independent conclusion because the scale at which they needed to run was not at all the scale at which Supermicro and Dell one HP were geared. What they were geared to do was to run the servers in your server room where you needed to know the devs, right? Where it's like, I'm going to have a little rack, it's going to have six servers, then maybe it's got 12 servers. Okay, maybe we grow to 24 servers. That's what they were designed to do. If you're like, no, I want to buy servers by the thousands because I've got a public cloud business. Like, if you want to buy servers by the thousands, there is no product from those companies for you. And in very, very basic ways. Well, like the D.C. bus bar at every juncture, they've been designed to be a personal computer that you happen to be slapping many personal computers together. But they're not designed to actually run infrastructure at scale. So and that was happening inside of effectively all the hyperscalers. And Joyent meanwhile was bought by Samsung in 2016. Joyent was bought by Samsung because their cloud bill was off the charts parts.
A
And they bought, they bought you to bring it on bring it in a house.
B
Yeah. And, and there was not a product they could go buy. So they went to go buy a company. So you're like, wow. And then it's like, wow, that's a big AWS bill. It's like yes, very big AWS bill. But then that was not a product that our company that was available for, for you know, the next. What does the next Samsung do? It's like, well that's one less company available to buy. So when we were contemplating the next thing in 2019, one of the things that we had seen is that and we felt, we earnestly believe that one, cloud computing is the future of all computing. Not a deep thought. That elastic infrastructure, API driven infrastructure. That is modernity. One, two, you shouldn't be able to only rent that. You should be able to buy that, own it, run it in your own data center. Why would you want to do that? Well, you might want to do that for risk management, for security or for economics. Because it's it, you know, if you're at a certain scale, you'd rather own it than rent it.
A
And I think, you know, before oxide or like in 2019 or even in like you know, 2020, 2021, if you were like a mid sized company, you know, like not big enough to build out your own custom cloud and build everything that the hyperscalers did, you could like buy some off the shelf like HP or Dell, like a bunch of them. I think that's what Basecamp did. I think they posted that they bought a bunch of these things. They rented a space in one of these share or I think two different locations. They put in their boxes with all the memory and then you know they kind of set it up and put it together. So I guess those were the two options, right?
B
Yeah, those are the two options. And I think that, you know, Basecamp ended up being a real poster child for the economic advantage because I mean DHH obviously outspoken and the economic advantage was really, really, really clear. They're also at a scale which is like not the scale that we're targeting. Right. That the scale we're we're looking at is a much larger scale. And so the economic argument is actually even more compelling when you're at that larger scale. I love it when you're the VCs that passed on us because they felt there was no market then would send me the DHH blog post. It's like, why are you sending this to me? I should be sending this to you. I know this. We just knew the Economics of it and we knew, couldn't predict exactly what the trends would look like, but believed that there would be folks that were born on the public cloud that would outgrow the economics of the public cloud and want to go on prem.
A
Economics aside, what does it take to build one of these things? And I saw one of these things, we'll put in a picture of it. It's like a proper like you know, like nine feet tall rack. It's, it's big. It feels like you're putting like, I don't know, like 16 or 32 of those, of those like, you know, Dell things in terms of size just to get sensible.
B
Yeah. We would three, two compute slots in there.
A
That's right. And what does it take? What did it take to actually build? What did you need to design in terms of, of hardware and then software?
B
Yeah. So. Well, and we knew this too, that going into the company, we knew we were taking a clean sheet of paper. Right. And so we were deliberately like, no, we're going to start with a problem, we're not going to build it out of Dell HV Micro. We are going to start with a problem and how do you best solve the problem? And as it turns out like there were a whole bunch, there's a lot of technical debt that had been accrued by this kind of PC ecosystem. So I mean, you know, God, where do you start? Just on the environmentals, like on power. Right. The fact that you've got AC power in each of these Dell, HP Super Micro.
A
Yeah. So if you like put 16, you have like 16 seen separate AC times two times two.
B
Because you have two power supplies per one U2U chassis, two power supplies. By the way, there are two fans sitting on those power supplies and those fans are actually what wear out if you go to the like in terms of like the worrying fans, it's not just coming from the computer, it's coming from the power supplies. Because those power supplies are dense, they're packed with stuff. So they've got to overcome a huge amount of static pressure. So like that's not the way anyone does it at scale. The way people do it at scale is you got to got DC bus bar, you've got a power shelf that is much more efficient and that rectifies from AC to DC and then you run DC up and down and then you blind made into that. So we knew we were going to do that.
A
So that's a law of electronics engineering right there.
B
Yeah, yeah, yeah. Power engineering for sure. And we knew we were going to do that. We also knew that by taking a clean sheet of paper that we would have opportunity made available to us that we weren't necessarily thinking of. And that manifested pretty early. So we blindmate into power, which is to say that when you feed a sled in that power connector, you don't see it.
C
It's at the back.
B
You lock the sled. Blind mates into power. And we had assumed that we were going to do what Facebook and Google and others had done, Amazon done and had networking out the front in the cold aisle. But as we were taking clean sheet paper, talking to some connectivity vendors, they asked us like, why are you, wait a minute, you guys are taking a clean sheet of paper? Why are you putting cabling in the front? Like, why wouldn't you also blindmate in the network and the networking connection? And we were like, can you do that? They're like, oh, you can definitely do that. It's like, well, why don't the hyperscalers do that? It's like, oh, they would all tell you that if they could start over today, they would blind me, the networking. And they're just too afraid to do it at this point. Which is like, I mean that was like catnip for us, you know, like they're too afraid to do it. Like, okay, we gotta. And one of the very early. Holy God. We're gonna bet the company decisions was blind nading networking. Because if blind mating networking doesn't work, you've got nothing.
A
You don't have a problem. And so what is the difference in blind meeting networking?
B
It means there is no cabling in the system at all. So when you've got a sled, you are blind mating into a cabled backplane so that the, it's cabled in the factory. So the, the operator.
A
So when the box comes in, that's why I didn't see any cables. It's, it's inside it, it runs inside.
B
It runs down the back. And so versus when I look at.
A
The pictures of a data center of let's say Google, you see, they're very neatly organized. It's like, I love organ. So it's like beautiful. But it's cables everywhere and you can see. So you don't have that.
B
We don't have that. And in particular, so because there's no cabling, there's also no miscabling. Right? So every computer is not actually on just one network. It actually needs to be on three. It's on a power detect, a presence detect network. It is on a Service manager, a service processor network. And then it's on that high speed network that you really care about. Like the actual network in any facility. You need another network for power, environmentals and so on. On. It's very easy to have miscabling there. That's got to go to a different router. It's like you. There's a bunch of just complexity that we eliminate because we do. And then part of that decision came out of an arguably earlier bet, the company decision, which was we did our own switch. So we also did, in addition to doing our own compute sled, we did our own switch.
A
And last time you told me about this, in our deep dive, we did a little bit that like at first you said we did our own switch. And I was like, yeah, okay, cool. You did your own switch. And then you told me that actually like that is a second computer to build. Can you tell me why it's funny?
B
Because when he went through Sandhill initially raising money for the company, nobody asked us.
A
Sandhill World Roll the Sandhill Rodeo.
B
Exactly. And we were definitely. So people would be like, I've got a technical question for you. And you're like, oh God, here it comes. It's the switch question. But then we know some other random ass questions like, all right, that's not a very good question. But nobody was asking us about the switch. And we were concerned about the switch because we'd already come to the conclusion in order to make this thing really work, we had to do our own switch. And the reason you have to do our own switch. If we didn't do our own switch, it would be a third party integration nightmare and we wouldn't be able to actually solve the problem that we're trying to solve, which is when this thing shows up in your data center, we want this thing to come out of the crate, we want you to wheel it up, we want you to put in power and networking and go. We do not want you to have to cable anything. It should be. The level of operator involvement should be really minimal. So we'd already come to the conclusion that in order to make this thing operable and manageable, we need to do our own switch.
A
So you're saying that like buying. Because a switch to me sounds like a somewhat simple component and you're going to tell me why it's not?
B
Oh yeah, it's definitely not. No, but. But that attitude is very important. If you want to go build your own switch. I encourage you to have that attitude as long as you possibly can. Because otherwise you won't go do it.
A
So, so what is your switch? What is a switch being obviously the networking switch. What does your networking switch do or that made it so important for you to build it as opposed to like going to one of suppliers and saying let's get your.
B
Not many suppliers. Oh, if you actually go to the actual switching silicon is coming from like it was like one and a half providers. Oh, it's all Broadcom. And so what you're actually talking about is Broadcom Silicon. What we discovered is this actually interesting piece of actually intel silicon from a company they had bought called Barefoot. And we found Intel Tofino which allowed us to have true programmable networking. So we use intel to Tofino. Intel later killed Tofino. So complicated relationship with intel over this. We fortunately have procured enough Tofino to be able to. We bought ourselves the time we need to kind of design our next gen switch. But that programmability was very, very important for us in that we were not going to get from Broadcom is a very proprietary company. We were not going to get a bunch of the things that we needed in building that switch. We were not going to get out of Broadcom. So it ended up being very important. We were concerned. I mean again another one of these kind of bet the company decisions very, very about, about having our own switch. Integrating our own switch. And what we found is that was a, that was a win in so many dimensions, so many dimensions that we did not anticipate and is now you can't imagine the company without having done.
A
Sometimes do stuff and you might get some wins.
B
Absolutely. Well, I think also like whenever you're deliberating something big like that, the fact that it is big kind of forces you to really deliberate and then once you commit to it, to taking that big risk, you often see unexpected dividends because well, as long as we're going to do this, as long as we are taking a clean sheet of paper, as long as we're doing our own switch, we can blindmate the networking. If we were not doing our own switch, we really couldn't blindmate the networking. We really needed to be able to own both sides of that in order to be able to do our own switch.
A
Or a lot of us listeners, viewers are software engineers. So we don't know as much about hardware. Obviously we know how the things work. But can you tell me a bit on what, what it actually means to design or build a computer? Because you know, I'll give you the novice approach which is obviously going to be wrong. But the novice approach is like, oh, here's a processor, here's a few chips, here's a main board. I'll just put it on there and I'm done. But when I was in your lab in oxide, you told me that one of the first engineers turned out to be a radio frequency engineer. You told me how this is great because of all the FDA approvals and all these things. And I was like, okay, this is way more involved than I, I, yeah, it's very involved. How do you build a new computer?
B
It's all, I mean, it would be a lot easier if it were all slower, right? The problem is it's very fast, it's high speed. So the connection to memory, be it now DDR5 double data memory 5 is ridiculously high. Throughput is very, from a signal integrity perspective, really complicated. These boards, by the way, ultimately this is all analyzed. We think of it as digital and it is digital. But digital is like a lie that double E's allow us to tell ourselves. It is actually like you are talking about signals that are racing through a substrate and whether PCIe or DDR5. So those signals are very complicated to lay out. That's complicated. The actual, like, how does the computer start? Like, this computer is like a triple system, right? Or, you know, I.747 used to be my favorite jet to kind of pick on. But now the 747 is retired, so I got to pick something else. And I'm not going to pick another Boeing aircraft, I don't think. I don't know. An A380, I guess. Right? I should pick an Airbus. But you think about like the, okay, an Airbus doesn't just like come by itself. Like it needs an airport. It needs like a Runway. It needs, it needs all the infrastructure to feed it well, so too for a microprocessor, it. It doesn't like just the power sequencing for those things is very complicated. It needs another surround that manages the power distribution network, that actually manages its power on sequencing, that manages all of its environmentals, that manage its connection to memory to IO. So it's just fractally complicated to the point that people often just take reference designs and iterate on them. They don't actually really innovate on this stuff because it takes so long.
A
And you told me this was really interesting last time that, that as I understand reference design means, correct me if I'm wrong, that you're an electronics engineer or hardware engineer and you want to build a new hardware and you take an existing reference that has been tested, measure it out, it doesn't create accidental, all sorts of radio frequency things and then you implement that. But you told me that this is not what you did. You also told me that it's pretty hard to find electronics engineers who are used to not doing reference design but who are brave enough to like.
B
Who are brave. Yes. I would say that in, in, in computer design in particular, the high speed designs are so hard. People got very accustomed to taking the reference designs and it was harder to find folks that were willing to take a clean sheet of paper. And we, we ultimately found them. I mean, and we've got a, A, EE team that is extraordinary.
A
And Double E is electronics engineer. Right.
B
And yeah, and absolutely fearless. And in part because like they're actually. But they didn spend their careers at Dell and hpe. Like they're coming. No, they're like coming from like GE Medical where they worked on CT systems.
A
Wow. How did that happen?
B
How did they come to oxide?
A
It's, it's not. But it feels like such a different field. I would have assumed naively that you know, if you're building a computer you'll, you'll try to get electronics engineers who have built computers.
B
You would think. And then, and that was probably our thought as well and then we discovered that we were, were not getting along with those engineers very well. We didn't hire them because we were, but we were just like finding like there's a lot of friction because there wasn't a real first principles approach from those folks. And this is where you get, especially you get to talk to folks that like been at Dell for a generation and like for any design they're used to calling what's called the fae, which is the field applications engineer for you know, for the voltage regulator. It's like well, well the FAE gives me the design. It's like all right, well how do you know that it's the right design? Well, no he there. So it's like all right, so like let's go hire that person then let's forget you. And we were really just, we were struggling, I was struggling to get outside of my own personal network to find the right engineers. And we were kind of brainstorming like how can we get people to see the company who wouldn't otherwise see it.
A
And specifically for hardware engineers, like we're talking about just.
B
Yeah. And just in general, but especially for doubles.
A
Especially. Yeah, yeah.
B
For doublies we're, it was feeling especially acute. One of the things you were Kind of brainstorming as a team. And one of our engineers said the values are very important to us at Oxide, which they are. And I relay Oxide's values and our principles to people outside of Oxide, and they're like, that's just bullshit. And I explained that. Normally I would agree with you, but it's when I get to the compensation people, their heads turn because our compensation is transparent and uniform. And people are like, wait, what? And, oh, my God, I could write a blog entry on it. Yeah, that'd be great. I'm like, okay. And so up to that point, we had not talked about it at all. We had not talked about it publicly at all. I just came up with the idea that compensation is just private, just not something you talk about with people.
A
You go to levels, FYI or some of the forums you're anonymously asking, people are anonymously sharing that that's how you. Good information.
B
That's a good information. And so I kind of had this idea that it was. That it just is not something that you. And so we wrote this blog entry in March of 2021, and it sent our hiring nonlinear. It wasn't that people were like, oh, my God, I want to work for a company where everyone's paid the same. Like, that is like, that's like, yeah.
A
Because your composition was both the same. And you also put the number specifically. I think it was something like $200,000 back then.
B
Yeah, there was a little bit less back then, but more than that. Yeah, the now we just got another raise. So now I've lost Track. It was 207. But now it's more than that. I actually don't know. Because the one thing is when compensation is uniform, like, you don't keep total track of, like, oh, like, literally, people were like, wait a minute, like, I got. There's an error in my paycheck. I just got paid more. And people like, no, no, we got a raise. Like, when was that? Like, no, it was at the last all hands. Like, oh, you know, I did have to go to the bathroom, like, at the end of the last all all hands. I didn't listen to the recording. I guess I missed my raise. Like, yeah, yeah, you gotta pay attention around here. But it was more that. What. What drew attention was that people, engineers in particular were. But just in general, people drawn to a company that would be so nuts as to do that. And it ultimately, like, that engineer that made the suggestion was absolutely right. It was the compensation that convinced people that we take our values really seriously. That we're a really principled company, which.
A
Is you're paying everyone the same base salary. It's exactly the same. Yeah, they're making the same as you, the electronics engineer, software engineer, the whatever other role you might have.
B
That's right. And I don't know if you should just go ahead and say it if you want to, but many people are like, would you pay support engineers the same amount? It's like, why do people always pick on support?
A
They would ask.
B
Exactly. Answer to that is yes. And the answer to that is if you do that, you find supportive support engineers. And so we have got. I think we've got the best support engineers in the business. I think we've got. Got really, really phenomenal folks in support.
A
I heard a small company called Gumroad do this where they paid their support staff really high. Again about same as software engineers. And then they got support staff who were software engineers and they could fix the code or like write tools for themselves.
B
And you get people for whom. Because you know, there's a certain thrill in support because. Because you got someone with a problem, it's technical, you get to come up, you get to be technical, you get to solve a hard problem and then immediately you get such gratitude. And that's a rush. And if there are people that are really drawn to that. I love helping other people. I love that feeling that I get when I resolve a problem for someone. That immediacy. So one of the things that we've heard repeatedly from several of our support engineers is my heart was always in support but. But my career path was forcing me into a different career path. And I love the fact that I can get back to where my heart is.
A
Yeah, that's nice. Because now like it. Yeah. You're not going to make more by doing something that you're not as into. I love that. So going back to where we were, which is like you build the hardware, you build this like really complicated piece and you went through electronics engineering, putting it together. Together. Let's talk about a software because that's super exciting. What does it take to build software for this? Did you start from. Let's talk from the low level. Did you start from scratch from operating system? Did you have to or could you use.
B
Yeah, and there's kind of different answers at different levels of the stack. So on our service processor we did start from scratch. We did our own de novo operating system in Rust, appropriately called Hubris because we had the hubris to do it. The debugger, by the way, for Hubris is called humility. Feels like appropriate for a debugger. So that was de novo.
A
And this is open, open source, right?
B
Open source, yeah, the entire stack is open source. Everything we've done is open source.
A
We can go on GitHub and check out.
B
Go on GitHub and check it out. And yeah, I mean, we've got God's own revenue model. Because like, you're like, well, what if somebody like can download it, run it on a different computer? It's like, knock yourself out. Because, you know, we, we think the best way to run this is on, on the machines that we make. And those are not free. An oxide machine is not, you know, that's not freely downloadable, but all open source. So that was for the Surface processor, for the host cpu. We really had it kind of at a quantity like, what are we going to do in the host? And with that to say like, on the actual, like, what was then AMD Milan, now AMD Turin Silicon. We knew that we wanted to do in the product. We would do our own hypervisor and our own control plane. It was very. So this is not something that you run.
A
The control plane is that controlling multiple, like the whole. Like you have a bunch of processors and memory and all that, and control plane controls all up.
B
You plug this thing on, you power it on, you put a networking. What you get is a console that looks a lot like a. Well, would like look like AWS if AWS looked better? I mean, it's a console. I mean, not to just. I mean, look, not just spare aws, but like, we know that like design is not really the strong suit.
A
We agree with that.
B
Yeah, exactly. So it looks gorgeous, of course, but it's. And it's also got. You've got your API, you've got your CLI and your provisioning instances. Where are those provision instances provisioned? It's the control plane that makes those decisions. You are attaching virtual storage. Those. Where does that storage live? It's the control plane that makes that decision. So just like with aws, you don't need to know that stuff. That's just happening. You're using Terraform to spin up your cluster. You're running kubernetes on it, you're knocking yourself out. So we are delivering all of the software from that lowest layer, that service processor, the operating system that's running on the host cpu and then that distributed system, very importantly, that distributed system which we called Omicron before the Omicron variant of COVID which was feeling very like Ill timed. For a very brief period of time it was feeling ill timed and now I feel like the Omicron rate of COVID is just like, is just forgotten. And now it's a good name again. So it's like, you know, it was.
A
A really short lived.
B
It was a short lived, yeah. So, you know, we lived longer than the Omicron variant of COVID and that is our control plane and that is a very sophisticated body of software in addition to. Because it's not enough just like provision an instance, right. And you need to do that robustly. You need to do that via API, CLI and so on. But then you have all the software that does that and keeps track of your instances and so on. It's very important that you can actually update that software. That whole distributed system, you need to be able to update to a new version of the software. And this gets really thorny, right, because in a pool, in a public cloud, you do that with a runbook, right? I mean even the, you know, we don't feature it prominently, but even in GCP and aws, yes, there's a lot of automation, but there's also humans involved and there are humans that are taking the responsibility for actually updating. Updating software for sure.
A
Really?
B
Yeah.
A
I mean, again, for the most part, yeah.
B
I mean there's a lot of automation involved, but in particular, if something goes wrong in an up, you know, you've got DevOps that can hop in and figure out what's going on and get it rectified. We are shipping a distributed system across an air gap in an oxide rack that's potentially running in a secure facility. We cannot be there if it goes wrong. So we need when, when especially because.
A
A lot of your customers are buying it because they want to do it themselves.
B
They want to do it themselves. So in many ways the thorniest software problem for us, we had actually several thorny problems. Problems take between them because they're all thorny for different reasons. One of the very, very thorny problems was how do we ship a distributed system that we can then update? And one of the things we did that was important was like, okay, because it's very easy to paint a roadmap that is very complicated for update. You'll never ship anything. So what we needed to ship in that first product that we shipped when you were in Emeryville two years ago, we needed the minimum viable update. We needed an update where the software could be updated even if it was painful. So what we did is we had this thing called mUpdate, which is the minimum update. And mUpdate in particular required the control plane to be parked. So we're going to take this rack that's running instances, take it offline, we're going to update it and then bring it back online. And that was robust.
C
It was great.
B
And we got that working. That's great that it's great in that you can update it, but that's actually not what you want in a cloud, right? You're like, I, sorry, I'm like using this thing 24 7. Like, I actually these instances need to remain up while I update it. But that gave us the platform to go build that update functionality into the software. Extraordinarily sophisticated and really an extraordinary body of work. And actually just recently we had at our internal meetup, the engineer who led the charge on that, Dave Pacheco, gave a presentation on looking back of two years of update. And I, I gotta tell you, I think this is one of the best single talks on software you'll ever see.
A
And we will link this. But can you give me just a short overview of like why this update is so difficult? Because some listeners will be used to just building applications, for example, on the iPhone and an update there, what it means, obviously, I know this is way more complicated, but an update is. There's a new binary version and it replaces the old binary version. Now of course you're saying this is an operating system update or with the car. And of course you might think like, well, you could just replace the old version with the new version and there's some downtime, but where is the complexity that actually puts all this thorn? Because I'm sensing this is like I am missing something, something very obvious.
B
So because it's a distributed system, when you've got an app on an iPhone, it's not a distributed system.
A
Oh, and distributed system meaning that you've got a bunch of different nodes, components.
B
That are going to speak to one another.
A
Those might need updating as well.
B
Oh, they definitely need updating.
A
Oh, they all need updating.
B
Yeah, the whole thing needs to be updated. You gotta be able to update all of the software in the rack. Oh, this is not just updating the operating system. This is updating absolutely everything.
A
So you might need to update some parts or all parts.
B
You update the service processor, the root of trust, the drive firmware, the host operating system, and then all of the components that speak to one another.
A
Okay.
B
And then it's like, okay, So I mean, this is challenge is fractally complicated. I mean, one of the very basic ways it's complicated is like, so when we're updating, we are moving the system from one version to another version in between, it's going to kind of be in both versions. Like, what does that mean to have the system that's operable while you've got some new components and some old components? What if you change your database schema from one version to the next version, which we definitely have. Like, you have to have a method of doing that. And for every one of these components, how is it updatable? How we gotta reason about the system when it's in this hybrid state? And then it needs to be done in a way that's very, very robust. So first and foremost we had to develop the foundation that allowed us to do this absolutely robustly. And so the way Dave and team did this is, you know, with that foundation and then very slowly lighting up different aspects of the system and making it more and more automatic over time. And, you know, first started running that on what we call our dog food ratio rack and did our first automatic update on the dog food rack. It was a really great feeling for that team because this has been a very long software road and it has been one that has been very deliberate and ultimately like. And, you know, full credit to Dave and team took us about the amount of time that we thought it would, which is kind of very rare for software because I think software's so fractally complicated. But that's only because they've been very carefully managing scope versus schedule making any. Because quality's got to be the constraint. And Dave's talk goes into that in detail in a way that I think is just extraordinary.
A
So I'd like to talk about the topic that is on a lot of people's mind these days, AI specifically and AI tools. How have AI tools changed how you're working at oxide specifically? Think about software engineering, maybe even hardware. Are you using these tools? Are you experimenting with them?
B
For sure, we've been early on in terms of using them. Yeah. I mean, you use them for different. For a different. And people are using them in different ways. I mean, no part of the oxide stack is vibe coded. I think that that is. That. That is safe to say. But we are using it and we're using it to. And again, different people using different ways. We are, you know, using it to do things that are tedious. We're using it to do generate test cases, you know, generate the. I use it for. Because I think the thing that is just like unmatched at is just document comprehension. We've got a very writing intensive culture. We've got a lot of documents. It is great.
A
You always had that.
B
Yeah, always had that. And if you've got a writing intensive culture, like you're LLM ready not to generate those documents but to consume them and to, you know, one of the things that I've always wanted to do and it's still like now is possible. I haven't quite found the time to do it. Early on I wanted to make an RFD glossary. So RFDR requests for discussion. We've got a lot of technical terms. I want to make a glossary. I tried to do that for like three hours. This is like in 2020, I'm like, this would this spreads to the horizon. This is so just making a glossary is so complicated. A glossary is something that an LLM can just churn out. And so there are lots of things that we're doing to use LLMs in particular. It's clearly a very real, very, very big shift in lots of different aspects of software engineering. Engineering. I think that. But of course there are people that are being kind of reductive about it. I am definitely not a doomer. There are a lot of doomers that are out there. And I tried to give this talk about building oxide itself, the oxide rack, and in particular the problems that we had along the way that an LLM was never going to be of any assistance on. The title of the talk was Intelligence is Not Enough. And one of the prominent doomers actually did a reaction video to my talk. It's like the only time I've ever had someone. And my daughter who was then like 11 was just like, thought it was hilarious that someone had held their own time in such low regard that they would spend it recording a reaction video to my talk. And so she was like, I want to watch this. I'm like, oh God, I do not want to sit there and watch this again. Ultimately, the thing that was really frustrating is this person obviously disagrees with what I was saying. But then when I was giving these very concrete examples of. Here are the specific technical problems that required more than intelligence to resolve that an LLM was not going to be able to resolve. He literally fast forwarded through those parts. He's like, we just don't need this. This is like, da, da, da. This is just. You're like, bro, this is the talk. Like, you can't do this. Like you're fast forwarding over to the actual, like me to the Top.
A
Can you give an example of like a problem which you felt was this like even you know, if we fast forward to like the arbitrary future.
B
Yeah, yeah, yeah. So yeah, super simple. I mean like the, I mean we've had many, many scary problems, but we had a, the CPU when we did our first bring up of our first machine.
A
And what does a bring up mean?
B
A bring up means taking a board and powering it up and trying to get it to work for the first time.
A
I think you mentioned that the term smoke test comes from electronics engineers.
B
Oh, I, they, they, I mean a smoke test. I think of a smoke test more from, from aerodynamics. But yes, I mean aeronautical engineers. But yes, I mean you're definitely like smoke is definitely a possibility as a very bad. You do not want smoke. That is bad. But no smoke please in bring up.
A
So the bring up.
B
But we are doing bring up and we are unable to get the CPU out of reset. And after 1.25 seconds the CPU resets itself. What's going on? Is the power network bad? We're doing all. And like when you have something like that happen, it's like, well, what's happening happening? It's like, I mean it's just not working. I mean like what do you tell your LLM to be like it's not working. I mean and it can maybe give you some suggestions, but in this case it wouldn't. So we are going deep into this understanding. Like maybe the power network is like marginal. No, no, no, we resolved that. No, no, we're, we've got a man actually we're working with AMD at the time and AMD's no, these power numbers are amazing. Like your margin is very good.
A
You're measuring it out. You're like eliminating that one, eliminating that one.
B
You're going through eliminating, eliminating, eliminating and couldn't get. We, and this was weeks. And you're like, we are, we don't have a company. Like we're wow, we are absolutely dead. And I feel like this is the kind of thing that desperate, you know, you get desperate, you know, like we're going to try kind of anything. And what we, the engineer was working on this actually looked at the protocol between the CPU and the voltage regulators. There's a protocol that it goes back and forth says hey, hey, I need this voltage. And you know, this is voltage. And one of the things he notices is that there is no acknowledgement packet from the regulator. So the CPU asks for a voltage to be set to a certain level and he's noticing that there's no acknowledgement packet back from the regulator.
A
Which should come.
B
Which should come. And the test, they've got something called sdle, which is this great test goober that you take the CPU off, you put on the sdle and it will measure the power for you. Well, the SDLE didn't care whether it got an acknowledgement packet or not. The CPU definitely did. And the cpu, so the CPU says, I want you to go to 0.9 volts. It never gets an acknowledgement back. And meanwhile sitting at 0.9 volts and it's just like, well, I never got an acknowledgement. So we're going to reset and I'll do it again. And that was due to a firmware bug on the Renesas controller. And so they. We got a firmware update from Renesas and done. And I mean, to be fair, the Renaissance FAU is great. Was like, well, you guys really should reach out a lot sooner. Like, yeah, I know. We really wanted to make sure that we got like everything. And that's the kind of problem. And there were many, many problems like this where it's not merely intelligence. It's not building a board is not an IQ test. It's more. I mean, you need to be intelligent to do it. But intelligence is not enough. You need these other kind of characteristics.
A
Then I feel you also need a team in this case, right?
B
You absolutely need a team, 100%.
A
You need to see like you're gonna solve these problems with. You know, you had that engineer who just like thought of measuring this out, right?
B
Well, engineer who was desperate, you know, because we were all getting desperate and you know, we. And again, we've had many of these over the history of the company. And you're right, you absolutely need a team. You need, you need a team. And you see also the value when you have a team. People have different ways of approaching a problem. That diversity is really important because you need. And actually sometimes this has happened more than once to the company where somebody kind of like is just kind of like walking through the problem and like someone's like, hey, I'm just joining. You know, nice about a remote company. Anyone joins them, you know, they join in the Google Meet. Yeah, I'm just joining because, you know, I think that I'm following along. And you get someone who'll be like, just make an like, hey, I got like a dumb question. Are those virtual addresses? Like, those look like similar virtual addresses. You get something where someone's making and you need someone to kind of like, like come and make that observation that is maybe less grounded in it. And people are like, oh, wait a minute, no, that's actually like, that's something to go check. And so you need that different kind of approach that is really a team kind of uniquely summons.
A
And you know, I think you might have alluded to it, but on the previous podcast, Armin Ronicher mentioned to me, he's the creator of Flask. He's been around the block for quite a while, while. And he's now doing a startup. And he said that right now it's just him and his co founder and he's got an army of AI interns right now that he's prototyping. But he told me, I'd like to start to hire people soon because people bring energy and you need energy for a company to live and thrive. And I'm kind of sensing the same thing that.
B
For sure. No, for sure. And I just listened to this great piece with Richard Sutton, who was the inventor of reinforcement learning. And I think rightfully, and I agree with them, it's like you guys are conflating an LLM with artificial intelligence. It doesn't have goals. This is really important. So like a prompt is not a goal and guessing the next word is not a goal. And. But like us together as a startup and like wanting to make it together, not wanting to die here together, that's a goal and that. So we can use that creativity. Maybe we, you know, use an LLM certainly as a tool to help us achieve our goal. But I do think that that's a very important distinction.
A
And can you tell me like, what kind of tools you use and what are the areas that you find it helpful? I understand you're experimenting with stuff and you know, this is all work in progress, but where areas that. And you mentioned, like, the summarizing was one example of gloss stories.
B
Yeah. Oh yeah. I mean, and I mean, I use LLMs as an editor all the time. I find it to be a really. I mean, actually it was funny. I had a blog entry that went on Hacker News and someone's like, oh, this is LLM. I'm like, actually it is LLM edited. But the only thing that I did based on the LLM is I deleted an entire paragraph. So there was a paragraph that like, wasn't working. And the LLM was like, this paragraph's not working. And I'm like, you know what, I'm just gonna delete the paragraph. So it's like, I gotta know. You wanna say that's LLM edited because like every word there is written by me. But there were some words that there was written by me that an LLM I deleted that which I deleted. So I mean, I use it for. In writing for sure. I mean I also like to use it. I mean, this is like a stupid reason, stupid thing. But when you're writing rust, and we write a lot of rust, especially when you're new to rust, you wonder like the way I just phrased this, is this like idiomatic? Is there a better way to do this? That's a great little problem for like I've got this small little snippet of code. Is this an idiomatic way of doing this? Is there a better way of doing this? And that's a great thing for an LLM to be. To make a suggestion or not or tell you like, nope, that's an idiomatic way of doing. Maybe I would make this small adjustment. So I find it really Val. I find LLMs to be more valuable in the small than in the large. So like again, this kind of I might, you know, hats off to people who want to spend their lives acting as a middle management for robots. But like, that's not necessarily for me, certainly at Oxide, I mean our belief is that people take responsibility for their own work. So if you want to have an LLM help you out on that, that's fine. But ultimately, like if there's a bug in this, like, you can't blame the LLM. The LLM broke. My code is like not, not interesting. That's LLMs don't have accountability.
A
And so one thing that is starting to spread across I think a lot of engineering is engineers using LLMs either inside your IDE with autocomplete and also kicking off now agents, now there's more advanced ones with like cloud code and codecs where it can actually run command prompts and run your tests. Are you seeing engineers use some of these tools? And there's a little bit of back and forth as well, you know, like it's very clear that when you're doing kind of more boilerplate things that are so called on distribution, which is they've learned like reactor types, script, it can spit out a bunch of stuff. But you strike me as someone who's doing a lot more nuanced things.
B
Yeah, I mean you're writing a bunch of, you're writing a bunch of C code in the operating system kernel. It is less valuable.
A
Yeah, but so what are you seeing across the team in terms of, I.
B
Think you know, across the team, I encourage people to experiment and I would say we're seeing a wide variety of experimentation. Certainly we've got, we're using cloud code a bunch and people are doing that. And, but I would say, you know, know, broadly speaking, for a lot of the work that we're doing, it is helpful as like maybe a polishing tool, but less as a kind of the, at the epicenter of its creation. It's not true of everything there's some software for.
A
No, but, but that, that's also nice to hear because I'm, I'm kind of asking you more to putting on your CTO hat, who's also very like, you know, you're very hands on and you know what's going on with the industry because a lot of non hands on executives are kind of looking their finger and thinking, oh, we must be 10 or 20 or 30% more productive. But what I'm hearing is like, things are kind of same as before, right?
B
Yeah, I mean, I mean, my big belief is it's a tool, it's a powerful tool. I mean, I will say that the thing I, you know, occasionally get, people are like, well, I don't want to use it at all. And I'm like, you should. So like, you should try. Right? Yeah. Like, let me get you off of that position and let me, you know, we had Simon Wilson on our podcast. Simon's delightful and you know, one of the lines that he has that I really love is people should run these LLMs on their own laptop where they run slowly and poorly so they can see the bad output that they generate, so they can understand what some of the limitations are. So I definitely, I love that. I do think that people should use them enough to know where they are valuable. It's a very important tool in the toolbox. You want to be aware of it. But it's definitely reductive to think it's the only tool in the toolbox because it isn't.
A
Now you're in such an interesting company because like, you know, you don't not just do software, but you do a lot of hardware.
B
Yeah.
A
Have you found any use?
B
No, no, no. Zero. I mean, okay, zero is a bit reductive. I have found it to be useful when, for example, you know, you've got a waveform of an i2C transaction. It actually, amazingly, you can send that to an LLM and have it like interpret this like, hey, what am I, am I seeing I2C kind of compliant behavior and it can help you out on that a little bit, but it's like absolutely at the edges.
A
Okay, so that's the 0.01.
B
Also like I think people don't realize like there are already tools for that. Like that's what EDA is you spend a lot of money on. Like we're not laying this stuff out like by hand with graph paper. Like this is like you've got, you know, when you do layout for a board there are a bunch of rules that are automatically checked for si. You know, we, we've got a, we do a bunch of simulation work. Like we're not doing that by hand, we're using software.
A
Yeah. And I saw a few of those machines in there. Like I, I saw that. I think it's a bit reassuring to hear because I think it's very clear. Like maybe we don't realize software engineers, but programming is such a great use case for LLMs. It's a simple grammar, you can validate it.
B
Yeah.
A
And I think it's sometimes nice to just, you know, touch sand of like an area that is very, very different.
B
Yes, but, but it's, it's cool that.
A
You'Re checking and you know, you're seeing if, if, if it changes over time. I guess you always keep checking.
B
Yeah, and I, I for sure. And I think that like I, I, it is frustrating to me because programming is such a good use case for certain kinds of programs. So as a result you end up with certain kinds of programmers who just in part because of their own self centric view of the universe, believe that, oh, this is just gonna replace every job. And it's like, no, not even close. Not even close. And you need to spend more time, you need to get outside a little bit more.
A
Yeah. So speaking of getting outside and you know, meeting different people, what I noticed when I went to Oxide is just like, it was great. We had double E's, as you say, software engineers. People used to work on Virtual Real, Alias at Oculus, all in the same room. Can you tell me about how big is the team? What's the composition?
B
Yeah, so we're on, you know, we've, I think, you know, we got some more offers going out tonight. So I think we've got on the order. We'll be at like 85. I should probably keep better, I should keep better mental track of it. But we've got like 85 plus minus and we, you know, we've been very blessed by, we've really put a beacon out there. We've got a lot of people rooting for the company. We've got a lot of people and as a result, we got a lot of people who want to work for the company. So, you know, we, as we talked about last time, we really put a lot on folks to describe, you know, the work they've done, what's important to them, why they want to work for Oxide. I mean, a lot of my LLM use is I will look at someone's materials. You can imagine we've started to see materials that are heavily LLM authored. Potential applicants to Oxide, please do not do this. We get people who, like, who, who human author their entire materials. And then they get to the last question. Why do you want to work for Oxide? Why do you want to work in this role? And they have an LLM spit that. And you're like, do you think you want to work here? Like, I mean, just like, let's leave aside whether this is like, you know, is this right or wrong or cheating or not. It's like, fine, I guess, but like, I don't think you want to work here. Like, you're not going to get a job here because I don't think you actually want to work here. Put it in your own words. But that process really has allowed us to attract people who themselves are attracted to the company and attracted to the culture, the problem, the team. And it's just extraordinary. I mean, I just feel so lucky to be with such an unbelievable group of people across more and more and more and more disciplines. I mean, the great thing about our approach is it brings people in who are, you know, God, it's like, I love this approach. For we talked about support engineering, people who are like, God, I love this approach. Like, finally, QA can stand on its own two feet. I feel that QA has been kind of subjugated by, by these other disciplines. Now. QA is kind of really thought to be as important as anything else in the company. And it is, because at some monetary perspective, it is as important as anything else.
A
Yeah, but I remember when I worked at Microsoft back like 15 years ago or so, the QAs were just on a lower pay grade. The senior QA was at the same as, I think Software Engineer 2 or something, which just kind of implies, yeah, you're less important, you're less important, you're just less important.
B
And so, like, if you tell the world that we think it's as important, you know who you get. You get people who are extraordinary at qa, you get the best of the best. And so that has been really exciting and now we've got people coming. I mean, I do love how many different companies, because my belief is that, like, every company has something to teach us that there is something positive you can take from every company. Now, there are some companies just like, ooh, you're really scraping the bottom of the barrel.
A
Maybe not Adam Ronaldo, they did buy some.
B
Yeah, yeah, that's right. That's like, there are, like, even Oracle, you can find. There are. That maybe a bit of a challenge. Let's not do that one. But you know what? The. And at the time, I thought this was a negative, but now I'm like, I see it. Larry Ellison makes every hiring decision in Oracle.
A
So what's positive about that?
B
Exactly? Which.
A
What's.
B
I really. The. I really think that the kind of the founder mode, the Paul Graham essay on founder mode is talking about founders that lost track of their own hiring. So I think, no, I don't like the way Ellison does it. I think that you want to have. You want to trust a team to make a decision. But ultimately I believe that the, that the CEO of a company bears responsibility on every single hire. And I think should be looking at every single hire coming into your company. And that is, to me, that is a very important change check on these kind of companies. That, that. So that is. There you go. Something that I've. Something that positive, I take it from Oracle, and it's telling that your immediate reaction is like, wait, what's positive about that?
A
Yeah, I'm not sure. Like, I'm not sure you undid that. That talk on Oracle.
B
Yeah, fair enough. Exactly. Yeah. And they're from some companies more than others, but I think that there are. And so I love having all of these different experiences present at Oxide, because I do think that there's so much, much to learn. And we're trying, you know, you want to take all the positive things. Because I also think that every company, including, you know, people I had. Actually, one of the questions I love that I got once is like, what do you not want to emulate from Sun? I'm like, oh, thank God. Because, like, think people think of oxide as kind of the second coming of Sun Microsystems. And like, I. There are lots of things I loved about Sun. There are lots of things I did not love about sun that I did not want to emulate. And so I think for any. Also any company, there are things we want to leave behind. And I think when you've got a big diverse team, you get to go do that.
A
And one thing that really surprised Me, last time I was at your office is it turns out that most people are not in the office and they work remotely. And I would understand for software, but how do you make that work for hardware development where physically you do need to be at the hardware sometimes I understand you measure stuff. I saw a lot of units. Sometimes you need to go to check on manufacturing. How does that part work?
B
Yeah, so I mean a lot in people's basements. So, you know, fortunately we're making, you know, this is the advantage of making a server and not making like, you know, a tractor or like, you know, we're not making like a, you know, I don't know, like a wind turbine or something. You know, this is something that people can actually model in their basements. So that helps. But then a lot of even hardware engineering is using these software tools, using EDA Tools, you're using SolidWorks, you're using Altium, you're kind of putting this thing together. When you're doing layout, for example, which is very important task when you're laying out a board, all of that is, that can be done anywhere. That's all just software.
A
Okay.
B
And so the, the there are things that are where that physicality is very important. And then when you're doing bring up, you actually need to be at your manufacturer when you do that. So like that is also not in an office.
A
You would need to travel anyway.
B
Yeah, you need to travel anyway. And anyone coming to the electronics industry is like, okay, I mentioned an oxide, but please tell me I never have to go spend any time in Taipei or Beijing. Because, because you go out there for Shenzhen or wherever and you're out there for two weeks in a windowless office trying to get this thing brought up. And all of our assembly is done here in the United States of Minnesota. So we are all, in fact, we've got a bunch of folks out there this week for Benchmark Electronics and Rochester.
A
This is wonderful. And one thing that you told me is one of the things that's on top of your mind right now, as oxide is growing, you still have this culture of the same compensation, full remote. So it's kind of been the same since the start. What will be the challenge in maintaining it? Because again, you worked at large companies. Do you see how it goes? It can get tricky. What are the things that you're seeing and what are the things that you're trying to do to keep this kind of startup vibe, even as you might be just bigger?
B
Yeah. So I think that the thing that is top of Mind right now for me is. And especially because we raised a big Series B, which is great. I think much more importantly, we're seeing a lot of customer traction, which is great.
A
So we've seen paying off.
B
Yeah, I know. It really is. It's really great. And we kind of knew that was gonna happen in the abstract, but it's fun to actually see it happen and fun to actually see the customers that are, you know, like, you know, I bought one rack and I mentioned, but now I wanna buy a lot more racks. I love what I'm seeing, and I want, you know, that's great. Very, very, very exciting stuff. That means we're growing the company a bunch. And one of the things that's very important to me because I've seen this happen so many times, is companies take their eye off the ball when it comes to hiring in particular. And it is very important to me that we continue to have absolute discipline in the way we hire. And we're doing that. And fortunately, the nice thing about our hiring process is every single Oxide employee has gone through it. So it's like I'm not having to persuade anyone about the importance of our process because everybody has gone through it. And the thing that we've got overwhelmingly in our favor. Favor is because we've used our values as a lens for that hiring. Oxide's culture is important to every single person at Oxide. That's what it takes to really preserve that. And it doesn't mean that it won't change at all, but the bones aren't changing. What will change is it will be bigger. And it will be, I think. And I love the fact that even at 85, we're already so big that Steve and I know everybody at the company, but very few other people know everybody at the company. Company. So when we get everyone together, it's like the best party you've ever been to. Because, you know, when I in college, I used to throw the best parties in college. And the reason I threw the best parties in college was not because of me. It was because of the roommates that I had. So I was a computer science student who played Ultimate. My roommate was an engineer who was on the water polo team. My other roommate was a. Was a history student who was in the chorus. That's six different demographics that don't normally overlap. And then, very importantly, we made sure that the women's swim team was always invited. The women's swim team. But they were like the foundation Waterfall player. Yeah, exactly. Waterfall. You always check their calendar to make sure, they can make whatever. And people loved the parties we had. Why? Because they would meet people that they'd never met before who were really interesting. And what I love about Oxide is we've got this. When we get the whole team together, people get all these delightful surprises. So people will take me aside and be like, God, you know, Rye is awesome. I'm like, yeah, no, I know, I know you know too now. That's great. But like, you know, you know, whomever it is, it's just, it's really exhilarating. And I think that also serves to reinforce how important what we've got is. As I tell the team, like we have lightning in a bottle and we cannot take it for granted. And that means that every single one of us, we need to rise to the moment. We need to do what our customers need us to do, but we need to do it in a way that protects and preserves what got us here.
A
So, thinking a little bit ahead, let's assume that, you know, these AI tools will just get better eventually they'll be able to help. Even on your kind of low level things. You've been in the industry for quite a while, you've seen a lot of shifts. What do you think are some of the things both in software engineering or in hardware engineering or just in general engineering that will probably not change even if we predict these things being more capable.
B
Yeah, I think that what we, I mean, I think that it's certainly a revolution. I think it's going to allow us all to do more. I do think that we are going to hit, hit a point where people understand that this is a tool where, because there's a little bit where we're still had this tension of like, oh, is this going to be AGI? Is this going to replace all jobs? And this is like nonsense as far as I'm concerned. And it's distracting kind of nonsense. And we actually need to get back to putting the tools in the toolbox of the human that's building it. Now. These tools have become, become much more powerful. And I think that that's going to be, I think it's extraordinary. I think it's important. I think that also we've got a lot of experiments right now. We, humanity that I'm not sure are going to make economic sense. So we'll be figuring that out as well. But I think that one of the things I am a little bit worried about is a little bit of despair from younger software engineers in particular who are like, what's the point? Like an AI can do all this well.
A
And there's also the news, even for more experienced software engineers in the mainstream media. There's this news that Company X is laying off healthier workforce because of AI. And by the way, when we look closer, it's not because of AI, but it is coming across and it does give not younger people a lot of anxiety. Tons even like mid level folks or even some more experience. It does give a sense of, I think it's the first time in computer history that most of us remember that there is this thing that could threaten my job. And I think we've just never had to deal with this. I think, you know, there are other industries that might have been a bit.
C
More used to it.
B
Yeah, I would say that we, I mean there have been busts before. The dot com bust was a bust. Like it, like a lot of jobs did disappear. Right. So I think that we, but the bust has really come in what feels to be a broader and more permanent way. I mean my view is like this is an opportunity for, I mean, I think one of the things we should be societally really encouraging is new company formation. Because now, I mean just like you're talking to Armin about how, you know, just a small group, you know, just Armin and his co founder were able to do so much together. Right. We should be really encouraging that. And what are some of the gaps that we can all go fill? Because ultimately like we all need to find a livelihood, we need to find meaning. And the way we do that as engineers is we build useful things. And so we can now build many more useful things. What would we go build? If you could build anything, what would you go build? And that's kind of the question that people need to ask themselves. It's scarier. It's scarier than like going to this school, get concentrated in this and then mama Google will hire you and take care of you and feed you breakfast. It's like, no, that's not going to, like that's not what's going to happen. And it feels a lot scarier because it feels like there's at some level like less security, less job security. But yeah, that's true and that's scarier. But there's also a lot more opportunity.
A
And for a college student or someone in school or with little experience who says like, look, my goal would be one day in like five years time to be as good that I could get a job at a place like oxide. It doesn't need to be Oxide, but again, a place that Has a high bar are they often hire experienced people. But I want to get there. And yeah, there's all this AI stuff as hell happening. What would you advise them in terms of what to focus on, what areas to study, what things to do or how to think about? They have the goal is there. What advice would you have them part with?
B
Yeah, so I think that you need to have a different mindset and that mindset needs to be not around how do I create as much as possible, but rather how do I get better? How am I getting better every day? And I think LLMs are a great tool to get better. How can I learn about something new, go deeper, go into something that I wouldn't go into before, get over that kind of that fear. And one needs to, especially if you're coming out, you're in school now, you want to work at a place like Oxide, it's like you kind of have to view it as like, all right, you want to play major league baseball. That's. That's great. You're a great high school player. You want to play major league baseball. It's really hard. Got to get better every single day. And you're going to need to be really focused on getting better and you need to be really realistic about what I need to go do to get better. And it's hard and it's chancy because you might not get there, but you could get there. And you're certainly not going to get there if you don't focus on that kind of self improvement. So I really think that there is a shift in mindset. Mindset that needs to happen or that one needs to have. I put that way, you really got to have a mindset towards getting better, understanding more. What do you not understand? There is lots that you don't understand. I mean, I think one of the challenges of modernity is that we delude ourselves into thinking that we understand it all. You don't. I don't like one of the things that I've learned. I've joked at Oxide that I keep waiting for the day that I know how computers work. And it's like, it wasn't today, definitely wasn't yesterday. It's gonna be.
A
You understand how.
B
But I mean that earnestly in that the amount of complexity that I definitely, I mean I knew, but also didn't know. It's like every day I feel I'm still learning new facets and not just like a computer, but actually delivering a computer to people. There's so much to learn out there. So Many. And now, now with the way you've got to view LLMs is not like this thing is coming from my job. You got to view it as like, no, I've got now this private coach tutor, what have you that I can ask any question to. It's not going to necessarily, I got to fact check its answers for sure. But now you've got the opportunity and it is easier to get into this domain than it ever has been. And that's great and it's powerful, but it can also be scary.
A
And as closing, what's a book or two books that you would recommend to folks and why oh so many good books?
B
You know, I've got a 21 year old, an 18 year old and a 13 year old. And when the 18 year old was in his, he's now a freshman in college, he was a high school senior. He got this assignment, great assignment from his English teacher, namely to go to someone that you know and ask them for three books that they would recommend that you read. And I'm gonna assign you one of those three books to read and you're gonna read it and then you're going to talk with them about that book. I'm like, oh, I love this assignment. So he's like, dad, I'm coming to you. And I'm like, oh, you have. Thank you so much. And of course my wife was like, why didn't you come to me? Like, hey look, I'm, you know, sorry, you know, look, it was great. So yeah, I'll give you those three books that I gave to him. And I think that each of these is really terrific. First is Soul of a New Machine by Tracy Kidder. So this one won the Pulitzer Prize in 1980 or 1981, but about the building of a new computer at Data General. And it's extraordinarily well written. And even if folks think, well, I'm not like what do I have to do with a computer company in the late 70s and early 80s? Any engineer will see something of themselves in that book. It is just masterfully told. Tom west, who is kind of a complicated figure, but that is soul is still, still. I mean it's literature for us. So I would absolutely solve a new machine. Every engineer should read Solvenew Machine by Tracy Kidder. For me personally very influential was Skunk Works by Ben Rich. So about the history of skunk works, Clarence Kelly Thompson was kind of the originator of skunk works at Lockheed Martin. Extraordinary story about what engineers can do when they kind of task themselves on the impossible.
A
It's such a good book.
B
It's such a good book. Amazing book. And then the, the other one is Steve Jobs and the Next Big Thing by Randall Stross. So Steve Jobs is kind of like lionized by the industry, but people forget about a very important chapter of his life, namely next. And I believe it was just an anniversary. Maybe it was the 30th anniversary. It must have been. Or maybe the 40th anniversary. Jesus. Of the announcement of the NEXT machine. So Steve Jobs left Apple, was fired from. From Apple, started a computer company called next. Really interesting company, a lot of ways was at Next for a very long time. It was a 13 year journey before Next was bought by Apple. NEXT is bought by Apple. Steve Jobs returns to Apple when they buy next. This book, Steve Jobs and the Next Big Thing is written before Apple buys next. And it is at Steve Jobs lowest moment. It is not here to praise him, it is here to bury him. And it is, is very interesting about all the missteps at next. And the thing that we cannot know because Jobs obviously died. But I believe having read the book, which gets basically Next gets essentially no treatment in the Isaacson biography. Next is like six pages of glory. It's like that's not what it was. But Randall Strauss's book is masterful. And in particular I believe that Jobs failures at Next were essential for the resurrection of Apple. And there, because you look at the way he handled himself coming back to Apple was very different from the Jobs that got fired from Apple. And I think that when people look at Jobs, they don't really take him apart. And I think you should because I think he's a really interesting guy. He's enigmatic, he's someone. He did things that I think are really fascinating and also things that I really strongly disagree with. So just to be clear on that, but I think that he's indisputably an important figure and that book is by far the best book. So Steve Jobs in the Next book.
A
No, I'm adding that I actually want.
C
To read that now.
B
Oh, it's extraordinary. It's very good.
A
Well, Brian, this was such a fun discussion.
B
Oh, my pleasure. I mean we knew this was going to be long and wide ranging, so hopefully it delivered. But I really appreciate the.
A
We went from the 90s all the way to the future.
B
There you go.
A
Love that.
B
Awesome. Well, thank you so much for having me, Gergey.
A
It was terrific.
C
I've got to say, Oxide is one of my favorite companies. And I say this as someone who.
A
Has zero affiliation with them.
C
It's just so rare to find a startup that built both hardware and software and are world class in doing both of these and are so open about talking exactly how they do it all. Honestly, the only downside I can think about Oxide is how their server racks are built for pretty large companies and are definitely out of reach for Hobby's devs. In this episode I really appreciated how much of a straight shooter Bryan was, especially about the impact of AI tools. Yes, AI, everyone at Oxide uses them and they do find use cases for coding and working with documents, but it's eye opening how it gives them basically zero help with hardware engineering.
A
This is a good reminder that LLMs.
C
Might be the single best fit for coding related tasks, and as devs we should know that these tools might be more specialized than many people think. I hope you enjoyed the stories in this episode as much as I did. If you'd like to learn more about Oxide, I did a two part deep dive about the company and you can.
A
Read it linked in the show Notes.
C
If you enjoyed this podcast, please do subscribe on your favorite podcast platform and on YouTube. This helps the podcast a lot. A special thank you.
A
If you also leave a rating on the show.
C
Thanks and I'll see you in the next one. In the next year.
Episode: The history of servers, the cloud, and what’s next – with Oxide
Host: Gergely Orosz
Guest: Bryan Cantrill (Co-founder, Oxide; former Sun Microsystems, Joyent)
Date: December 17, 2025
In this episode, Gergely Orosz is joined by Bryan Cantrill, for an in-depth conversation about the history and evolution of server infrastructure—from the dot-com boom in the 1990s, through the rise of cloud computing, to Oxide’s mission to rethink hardware and software for modern data centers. They discuss lessons learned from industry cycles, innovation in hard times, cloud economics, hardware startup challenges, the open-source stack at Oxide, nuanced uses for AI tools, and advice for engineers in a rapidly changing field.
Audience: Especially valuable for software engineers, hardware engineers, and engineering leaders interested in infrastructure, cloud, and startup culture.
Bryan Cantrill is candid, technically deep, and animated—offering both practical details and industry meta-lessons. The conversation moves fluidly from war stories of the '90s, to the nuts-and-bolts (and philosophy) of modern hardware, to the realities of AI engineering and what’s left to human creativity. Oxide’s radical openness, hardware-first principles, and unconventional approach to company culture make it a beacon for ambitious engineers.
Summary Verdict:
For engineers hungry to understand both history and future of infrastructure (and the human factors that shape both), this rich episode provides not just a window into technical evolution, but a strong dose of inspiration for building and learning in a changing era.
For further resources & Oxide deep-dive:
Check the show notes and pragmaticengineer.com.