wavePod

Get Wave AI

Bonus: The DeepSeek Reckoning in Silicon Valley - Big Technology Podcast | Wave AI Podcast Notes

Back to Big Technology Podcast

Bonus: The DeepSeek Reckoning in Silicon Valley

Big Technology Podcast

Mon Jan 27 2025

Summary

Big Technology Podcast - Bonus Episode: The DeepSeek Reckoning in Silicon Valley
Host: Alex Kantrowitz
Guest: MG Siegler, Writer and Investor at Spyglass
Release Date: January 27, 2025

Introduction

In this exclusive bonus edition of the Big Technology Podcast, host Alex Kantrowitz delves into the seismic impact of DeepSeek R1, a Chinese open-source AI model that is shaking up the generative AI industry and affecting global markets. Joining Alex is MG Siegler, a renowned writer and investor from Spyglass, who provides in-depth analysis and insights on the ramifications of DeepSeek’s advancements.

DeepSeek R1: An Overview

Alex Kantrowitz opens the discussion by highlighting the significance of DeepSeek R1 in the AI landscape. He underscores the model’s impressive performance metrics and its cost-effectiveness compared to industry giants like OpenAI.

Performance Benchmarks:
- AIME Mathematics Test: DeepSeek R1 scored 79.8%, surpassing OpenAI’s best model at 79.2%.
- Math 500 Test: Achieved 97.3%, compared to OpenAI’s 96.4%.
Cost Efficiency:
- Input Tokens: $0.55 per million (DeepSeek) vs. $15 (OpenAI)
- Output Tokens: $2.19 per million (DeepSeek) vs. $60 (OpenAI)

Alex emphasizes, “DeepSeek R1 has created models that are as performant as the state of the art... at just 3.5% of the cost of running OpenAI's models.”

Market Reactions and Implications

MG Siegler assesses the immediate market impact, likening the release of DeepSeek R1 to an earthquake in the AI sector.

Market Impact Analysis (00:03:49):
- Nvidia: Stock down ~11%
- Microsoft: Down ~4%
- Google: Down ~3%
- Meta: Down ~2.6%
- S&P 500: Down ~2%

MG remarks, “From a pure market perspective, it seems like it's an eight. It's not going to totally destroy the stock market, but it’s going to be rough today.”

Key Points:

Nvidia: As the primary supplier of GPUs for AI models, the significant drop reflects concerns over reduced demand.
Microsoft & Google: Facing challenges in monetizing AI advancements amidst cost-effective alternatives.
Meta: Slightly better positioned due to its open-source philosophy, aligning closely with DeepSeek’s approach.

Adoption Among Startups

The conversation shifts to how startups are responding to DeepSeek R1’s release.

Alex questions the extent of adoption, asking whether startups are outright replacing OpenAI or Meta’s models with DeepSeek.

MG responds, “I think this is just beginning. People will experiment with it to see how much they can benefit from the cost differentiation. However, there are concerns regarding censorship and the transparency of DeepSeek’s training data, which makes adoption cautious.”

Notable Quote:
MG Siegler [07:01]: “If DeepSeek can release another iteration and maintain its performance, startups may take it more seriously. For now, it's a wait-and-see approach.”

Technical Innovations Behind DeepSeek R1

The duo explores the technological breakthroughs that enable DeepSeek R1’s efficiency and performance.

MG explains the technical underpinnings, noting that DeepSeek R1 was developed by a hedge fund in China with access to substantial Nvidia hardware before export restrictions took effect. The key innovation lies in the use of model distillation and reinforcement learning.

Alex highlights, “They’ve moved from self-supervised learning to pure reinforcement learning, allowing models to determine the right answers autonomously.”

Notable Quote:
MG Siegler [11:41]: “They used distillation to bring larger models down to smaller, more efficient versions, enabling them to run on a variety of hardware with significantly reduced costs.”

Challenging the Scaling Hypothesis

A pivotal segment discusses whether DeepSeek R1 invalidates the prevailing scaling hypothesis in AI development.

Alex posits, “DeepSeek has shown that you can achieve high performance without exponential increases in compute and data. Does this challenge the scaling hypothesis that has driven massive investments?”

MG concurs, suggesting that the scaling hypothesis might be reaching its limits. He notes that while scaling has been the cornerstone of AI growth, models like DeepSeek R1 demonstrate alternative paths to achieving high performance efficiently.

Notable Quote:
MG Siegler [16:39]: “DeepSeek R1 calls into question the necessity of massive scaling, presenting a fundamentally different economic model for AI development.”

Impact on Big Tech and Investment Strategies

The discussion delves into how major tech companies and investors are recalibrating their strategies in response to DeepSeek R1.

MG observes that companies like Microsoft and Google are now grappling with the changing economics of AI. With DeepSeek offering a cost-effective alternative, these giants must revisit their investment spreadsheets and AI deployment strategies.

Nvidia: Faces short-term revenue drops but might benefit long-term as AI adoption grows continuously.
Microsoft & Google: Need to innovate beyond current models to maintain their market positions.
Meta: Potentially more resilient due to its open-source approach, which aligns with community-driven advancements.

Notable Quote:
MG Siegler [19:21]: “If DeepSeek just pointed to the nail already hammered, we're moving into the next phase of the AI revolution.”

Economic Implications and Future Outlook

Alex raises concerns about the broader economic implications if AI intelligence costs plummet, making advanced AI accessible and potentially disrupting existing business models.

MG agrees, suggesting that DeepSeek R1 could catalyze a fundamental rethinking of AI deployment, moving towards more practical and economically viable applications rather than sheer scaling.

Notable Quote:
MG Siegler [34:18]: “This moment with DeepSeek is forcing a fundamental rethinking of how much money to spend and what to focus on.”

Investor Perspectives and Startup Ecosystem

In a segment focused on investment, Alex queries whether reduced AI costs might enable the emergence of new startups that were previously uneconomical.

MG responds cautiously, noting that while lower costs could theoretically foster new ventures, the current ecosystem may not yet see a significant surge in AI startups due to existing barriers and a lack of immediate profitable applications.

Notable Quote:
MG Siegler [39:19]: “If DeepSeek is truly transformational, it could lead to new companies emerging, but it's not apparent yet.”

Conclusion and Looking Forward

As the episode wraps up, both Alex and MG reflect on the potential long-term impacts of DeepSeek R1. They acknowledge the uncertainty surrounding whether DeepSeek is a temporary blip or a harbinger of lasting change in the AI industry.

MG emphasizes the importance of monitoring market reactions and corporate strategies in the coming months to fully understand DeepSeek’s implications.

Notable Quote:
MG Siegler [42:43]: “If this is just a step on the road and not a fundamental change, companies might still keep their foot on the gas.”

Alex concludes by reaffirming the podcast’s commitment to providing in-depth analysis on such pivotal moments in technology, hinting at future discussions and interviews, including a forthcoming episode with Reid Hoffman.

Key Takeaways

DeepSeek R1 presents a cost-effective and high-performing alternative to existing AI models, challenging established norms in the industry.
Market Reactions indicate significant short-term impacts, especially on companies like Nvidia, Microsoft, and Google.
Technological Innovations such as model distillation and reinforcement learning underpin DeepSeek R1’s success.
Scaling Hypothesis in AI is being questioned, potentially shifting investment and development strategies.
Future Outlook remains uncertain, with potential for both market stabilization and further disruptions.

Notable Quotes with Timestamps

MG Siegler [03:49]: “From a pure market perspective, it seems like it's an eight. It's not going to totally destroy the stock market, but it’s going to be rough today.”
MG Siegler [07:01]: “If DeepSeek can release another iteration and maintain its performance, startups may take it more seriously. For now, it's a wait-and-see approach.”
MG Siegler [11:41]: “They used distillation to bring larger models down to smaller, more efficient versions, enabling them to run on a variety of hardware with significantly reduced costs.”
MG Siegler [16:39]: “DeepSeek R1 calls into question the necessity of massive scaling, presenting a fundamentally different economic model for AI development.”
MG Siegler [19:21]: “If DeepSeek just pointed to the nail already hammered, we're moving into the next phase of the AI revolution.”
MG Siegler [34:18]: “This moment with DeepSeek is forcing a fundamental rethinking of how much money to spend and what to focus on.”
MG Siegler [39:19]: “If DeepSeek is truly transformational, it could lead to new companies emerging, but it's not apparent yet.”
MG Siegler [42:43]: “If this is just a step on the road and not a fundamental change, companies might still keep their foot on the gas.”

This episode of the Big Technology Podcast offers a comprehensive analysis of DeepSeek R1’s disruptive entrance into the AI market, exploring its technological advancements, economic implications, and the resulting shifts in market dynamics. For those keen on understanding the evolving AI landscape and its broader economic consequences, this discussion provides valuable insights and forward-looking perspectives.

Loading summary...

Transcript

A (0:00)

It's time for a bonus episode exclusively about DeepSeek R1 as the Chinese open source AI model roils markets and threatens to upend the generative AI industry. That's coming up right after this from LinkedIn News.

B (0:15)

I'm Jessi Hempel, host of the hello Monday Podcast. Start your week with the hello Monday podcast. We'll navigate career pivots. We'll learn where happiness fits in. Listen to hello Monday with me, Jesse Hempel on the LinkedIn podcast network or wherever you get your podcasts.

A (0:34)

Welcome to Big Technology Podcast. We're doing a bonus edition today exclusively on Deep Seek. What it means for the AI industry, what it means for markets. We're going to touch on technology, we're going to touch on business. And so thrilled that you're here for a bonus episode with us. We're joined today by MG Siegler. He's a writer and investor. He writes Spyglass. You can find it@spyglass.org It's a great newsletter. It's a must read for me. And he has a great piece out called Finds a Way As Deep Seek Changed the AI Game or just some Equations. MG Great to see you. Welcome to the show.

B (1:10)

Great to see you, Alex. Thanks for having me back and sorry for my crazy winter beard. It's. It is very cold and rainy right now in London, so I'm not. Not ready for spring yet.

A (1:18)

If he. It fits the season. I was just out in London to interview Demis from DeepMind.

B (1:23)

That's right. I listened to that. That was very good. Yeah. And very timely.

A (1:26)

Now, yes, I can confirm the sun does not shine in that city this time of year. So first of all, I want to talk a lot about, I mean, only about Deep Seek and deep seq R1 and what it means for the AI industry. Right now. It's. We are just about. We're gonna. The markets will open on this show, so I'll have a sense as to what it's gonna do today. But it's looking pretty bad, especially for Nvidia and some others as we get going. I just wanna thank all the podcast listeners who pointed me to DeepSeek because we had some comments that came in over the past few weeks. I was able to ask Demis about it. I was able to get it in as the lead story on Friday's show. So thank you. I appreciate all of you for pointing me towards deepseek. So let me just talk a little bit because we didn't touch on this Friday and we're going to definitely Fill some holes that were left. On the Friday show, we talked a little bit about how much it costs to train this model, but not necessarily about the benchmarks it hit and about the cost it costs to use this thing. So first of all, it's an open source model. It's much smaller than any of OpenAI's model. Yet on the AIME mathematics test, it scored 79.8% compared to OpenAI's 01, scoring 79.2%. So it bests OpenAI's best model on that. It scored 97.3% on the math 500 and it beat OpenAI, which scored 96.4%. Look, these are lots of different benchmark tests, but you can tell that just by these numbers it holds its own. And now the most remarkable part about this, it costs $0.55 per million token inputs and $2.19 per million token outputs. Just to give you a sense, OpenAI costs $15 per million input tokens and $60 per million output tokens. That's 3.5% of the cost that it costs to run OpenAI's 01 models. And you can do it again. It's open source. You could download it onto your computer and run it. So basically what, what Deepseek R1 has done in a nutshell, and then we'll turn it over to MG is it has created models that are as performant as the state of the art. Right. It's ranked number three in the chatbot arena at 3.53 to 5% of the cost. And that has huge implications for the technology, for the business. And we're going to get into those. So mg, first question for you. If there was an AI Richter scale, right, assessing how big of an earthquake this is, what would you give this development?

B (3:49)

So, I mean, it depends on what, I guess level. You're sort of measuring the magnitudes, right? Because as you noted, the markets will open. And that's going to be right now. Last I looked in pre market trading, Nvidia was down, I think 1110 to 11%. And that's the biggest hit right now. Microsoft, a bunch of others are like in the 3% range. So, you know, from a pure market perspective, it seems like it's, let's, let's call it an eight. You know, it's not, it's not going to totally destroy the stock market right now, but it's going to be rough, it seems like today from a bunch of other perspectives, I think, you know, it's, it's probably a little bit Less of a, of a shake in these earlier days. And I think that's because everyone's still even now sussing out what exactly this means for all different sorts of things. You know, you noted how much cheaper it is to run than say, OpenAI's models. And you know, over the weekend, just reading all of these sort of reports about the model and how many individual startups are even just changing, swapping out, right already because it's so much cheaper to do what they're doing right now by swapping in deep seats models. And so what does that do immediately? Like, you know, do we have to have price cuts immediately? And, and you know, I think you could sort of see OpenAI doing some stuff. I think Sam Altman tweeted, you know, maybe on Friday, like, about how they were like bundling, rejiggering some of the bundles, right, that they have, like what's in the free offering and stuff. And it sort of feels like we're going to see more of that, you know, as a response obviously to some of this. But then, you know, there was a, there was a big report, I think, in the information about Meta's response to this in particular, which seemed pretty interesting in that, like, you know, it's all hands on deck certainly. And there's like all these different teams. You and I remember that from the old school days of Facebook. And so, yeah, it's just like all of these companies are now scrambling. You have Satya Nadella tweeting out things, you know, which seemed directly aimed at the, at the market to try to, you know, ease that, that pain a bit. But anyway, going back to the original question on the, on the Richter scale, you know, overall, I think a lot of people are still figuring this out, but right now the market thing is going to be the most acute one because that's obviously going. And I think it's going to be pretty hard for, you know, this day at least. And then I think I, I read some of the early analyst reports on this and, and they're all over the place, right? Like they're, there's some folks who are saying like, oh, this is, this is awful for Nvidia. Some folks are saying that, you know, this is not a big deal, this actually could be good in the longer run for Nvidia in, in ways. And, you know, and then from big tech on down, what the ramifications are there.

B (7:01)

Yeah, I think this is just beginning. I think, you know, people will experiment with it, right, Just to see like, how much you could, you know, get by swapping them out, given the price differentiation you were talking about. But also there's downsides, of course, like people have noted sort of the, you know, the censorship within China and of certain terms. And so, you know, I don't think everyone is quite certain what's in there. You know, it's an open source in that it's open weight, but it's, you know, it's not clear exactly everything that's going on in there right now. And so I do think that if this proves out, say if, if deepsea can release another iteration of the model and it still is on the same sort of, you know, footing, I think that then you'll start to see more startups potentially taking it really seriously. I think now it's just a wait and see approach for sure and just people trying out to see if it is, in fact as good as they say. Because I think, you know, part of this, like my initial gut reaction, you know, Deep Seek, obviously, as you noted, had been around for, you know, basically since December and didn't really get all of the massive pylon until sort of Friday, right, when R1 came out. And in part it's like, you know, I've just, I don't know why my mind was drawn to this, but it's sort of like when they were talking about the, the room temperature conductor, right? Like, and everyone was talking about, oh my God, like there's this, there's this huge breakthrough that's happened and this is going to revolutionize everything. And then it turns out, oh, you know, maybe there was some, some funny business in that claim and, and maybe it wasn't, you know, all was cracked up to be. And of course that turned out to be the case. And so I'm not saying obviously that's not the case with Deep Seek. It seems like now this R1 release has legitimized it. And as you note on leaderboards and whatnot, people have been testing this and again, the startups are part of that, that pressure test, right?

B (23:36)

So I think it's different for each company. Probably Microsoft and Google are closest, you know, aligned in terms of where they Net out. And it's sort of interesting, you know, the numbers you just rattled off with where the stocks are at, that feels, you know, just like a very clear picture from Wall street what they think now. Right? Like they think Nvidia is going to get hit fast because in this, in this doomsday scenario because obviously they're the beneficiary from everyone, from all those companies, all those other companies that you mentioned, Big Tech is, is pouring as much money as possible as they can. They can't get it, get enough chips fast enough into Nvidia and if they pause that, that obviously is bad news for Nvidia in the short term. Again, I think there's longer term stuff that, that's different for Nvidia, which we can talk about. But to just hit on the rest of this question right now, I think that Microsoft and Google, which are, as we just mentioned, you know, are trying to sort of figure out the right models for how to charge for AI. I think that this puts them in a really tricky situation if the underlying economics just totally changed overnight of what AIs yeah. Underlying economic model should be. And so they were, you know, moving around different pieces trying to get to the right, the right end state so that, yeah, they could ultimately prove to Wall street like look, we're adding, you know, X amount on top of what we were already doing revenue wise thanks to AI. And a little bit, there's a little bit of weird obfuscation stuff going on there, right? It's like, well, it's bundled in now to 365. And so, you know, we don't necessarily need to tell you exactly what the uplift is but, but you can just, you know, assume that it's, that it's a part of this because it's all baked in and AI is like, you know, the new Internet and blah, blah, blah. And so, you know, there's ways that they can, they can finesse the messaging around that. But you know, to your exact question, I do think that there's, there's varying degrees of being worried certainly within Google and Microsoft Meta is more interesting because their open source philosophy, open weight philosophy and model is so similar to what Deep Seek has done. Right. And so the problem there in my mind at least is again they're spending whatever Zuckerberg just threw out 65 million or whatnot. He said, you know, at the end of last week that they're going to spend on, on Capex. And so why are they spending that amount now if, if you know, Deepseek can do it for, you know, pennies on the dollar, if not even less than that. And so what does that, that mean for their world? So in my view, high level. I think that Meta's probably in a bit better position than the other ones just because they, at the end of the day they do want like, you know, their whole philosophy is to open sources not for necessarily altruistic reasons, but because they know that it's historically helped them help their business. You know, to open source these things. The question of if it's not them open sourcing, it becomes pretty complicated. If someone else's, you know, you have to use someone else's models but they can pull back spend, it feels like a little bit easier than the other folks can. On the other end of the spectrum, OpenAI like they're, you know, the entire business is, is sort of built around being at the frontier and they've done a great job with that. They're a little bit different than, than Google and Microsoft in my mind just because they've done a good job getting mind share both in terms of brand and product. Right? Like Jet TBT is number two in the app Store right now behind Deep Seek, you know, for a reason. People are interested. It's a brand and they know it. And so what does it look like though if they're not the ones sort of powering the models? I don't think that they would give up and you know, go with Deep Seeks model necessarily. But what does it mean if, if they're not sort of the only one or the main frontier, you know, model maker providing that like. So there's all sorts of interesting offshoots and ramifications of that.

B (28:29)

That's a nice thing to say and like a nice high level mantra. And many of, you know, many of the leaders of these companies will be saying that today to sort of try to calm Wall Street. But at the end of the day, you know, aside from sort of OpenAI, which obviously is again tied with Microsoft and now Oracle, but besides them, the rest of these are public companies and Wall street, you know, like it or not, they have a say sort of over what they're going to do, like if they're gonna get hammered. And this is something I've sort of been harping on for a while, not because I think that they were doing the wrong thing necessarily with the spend, but it's just obvious that like it always comes back around, right where it's like I equated it, you know, last year to when all the movie studios during COVID and TV studios were just bulking up on streaming, right, and just spending as much money as possible as they could in order to build up their streaming services. And Wall street loved it at that time because, you know, Disney and everyone else was just gaining millions and millions of subscribers and it seems like they had a path to take on Netflix. And you know, this was the future of the industry. It's still, by the way, the future of the industry. But Wall street then all of a sudden turned on all that spend and decided like, you need to cut like spend X amount. You need to, you know, unfortunately cut the employee base and, and basically just become way more efficient while doing the same high level thing. And it was, you know, always obvious that at some point they were going to do that to the tech companies as well with regard to AI spend. And so again, they can all have the right mentality about like this is the future and say the right things, that this is the future and this spend is important. And I don't disagree with any of that. But still, they have to answer to Wall street, you know, to some degree, maybe Zuckerberg less so because he, you know, controls the, controls the company so strongly. But like, certainly Microsoft and Google to a lesser extent are going to have to answer for a lot of that spend. And this is the first real, real test. Meta had some of it, right? Like there was some backlash last year around their spend and certainly back dating back to the, the, you know, VR and AR and XR spend. And so they had to answer for Some of that. And Zuckerberg did, right, and he got rewarded for it after the fact. And that's like the game they're playing here. They know that if they cut spend because Wall street doesn't like to see all the AI spend, they'll get rewarded in the form of the stock going up and then all the ramifications from that. And so it's natural that that is going to play out that way. And so I think the narrative then shifts to other levels of not necessarily obfuscation, but other ways of framing it. It's like, okay, we agree that we shouldn't spend tens of billions of dollars on Nvidia server farms, but we need to build out our in person AI Robotics arms, right, in order to keep these models and keep sort of the next phase going as we march towards AGI and yada yada.