Your Guide to the DeepSeek Freakout: an Emergency Pod - Hard Fork

Summary

Podcast Summary: Hard Fork – "Your Guide to the DeepSeek Freakout: an Emergency Pod"

Episode Information:

Title: Your Guide to the DeepSeek Freakout: an Emergency Pod
Host/Author: The New York Times – Kevin Roose and Casey Newton
Release Date: January 27, 2025

1. Introduction

In this emergency episode of Hard Fork, hosts Kevin Roose and Casey Newton delve into the sudden upheaval caused by DeepSeek, a relatively unknown Chinese AI company, and its significant impact on the U.S. stock market and the broader tech industry.

2. Understanding DeepSeek

Background of DeepSeek: Kevin Roose introduces DeepSeek as a burgeoning Chinese AI startup that has recently made waves globally. Spun out from a hedge fund named High Flyer, DeepSeek has rapidly gained attention with its AI models, V3 and R1.

AI Models V3 and R1:

V3: Released around Christmas, V3 is an AI chatbot comparable to American counterparts like ChatGPT. Notably, it showcases its internal thought process while generating responses, a feature not present in OpenAI's models.
R1: Released a week prior to the episode, R1 is DeepSeek's attempt at a reasoning model, akin to OpenAI's more advanced iterations like GPT-4.

Casey Newton states, “V3 is just kind of like a chatbot like a lot of us have already used,” highlighting its functional similarity to existing models while emphasizing its unique features.

3. Technical Achievements

Cost-Effective Training: DeepSeek has astonished the AI community by developing high-performing models at a fraction of the usual cost. Roose mentions, “DeepSeek claimed that they had spent just five and a half million dollars training V3. ... something on the order of a hundred times cheaper than what something like an OpenAI model of equivalent performance would cost to train” ([04:56]).

Innovative Hardware Utilization: Due to U.S. export controls limiting access to top-tier AI chips, DeepSeek employed less advanced, restricted AI chips to train their models. This approach not only circumvented regulatory barriers but also demonstrated that high-quality AI models could be developed without the latest hardware.

4. Market Impact

Stock Market Reaction: The release of DeepSeek's models has led to significant volatility in the U.S. stock market. Notably, NVIDIA’s stock plummeted by approximately 18%, erasing hundreds of billions in market capitalization ([03:19]).

Implications for American AI Companies: The affordability and competitiveness of DeepSeek's models pose a threat to American AI giants. Casey Newton observes, “This really stood in stark contrast to [American companies']... [spending] tens of billions of dollars this year to increase our capacity and data centers” ([06:08]).

Kevin Roose emphasizes the broader implications: “The barrier to entry in this market is just not nearly as high as we thought it was” ([10:53]).

5. Geopolitical Concerns

Chinese Tech Dominance: The success of a Chinese AI company like DeepSeek raises alarms about potential shifts in global tech leadership. Roose notes, “people who worry about China in general are worried about this DeepSeek announcement because DeepSeek is obviously a Chinese company” ([11:21]).

Cultural and Censorship Implications: DeepSeek’s models reflect Chinese values and censorship norms, restricting responses to sensitive topics such as the Tiananmen Square incident. Roose explains, “... the model just refuses to answer them” ([12:18]).

6. AI Safety Issues

Open-Source Concerns: DeepSeek released R1 as an open-weights model, allowing public access and modification. This openness exacerbates fears among AI safety experts about uncontrolled dissemination and potential misuse of advanced AI technologies ([13:53]).

Casey Newton criticizes the lack of a safety-centric approach: “... DeepSeek’s approach is, yeah, let's just like build AGI, give it to as many people as possible, maybe for free, and see what happens” ([14:22]).

7. Industry Reactions

Meta’s Response: Reports indicate that Meta is in a state of urgency, with multiple war rooms activated to address the DeepSeek challenge. Newton remarks, “Meta is coming back and they say, oh, you think you're good at ripping people off. Just wait until we have plumbed the guts of V3 and R1” ([20:36]).

Microsoft’s Satya Nadella on Jevons Paradox: Satya Nadella tweeted about Jevons Paradox, suggesting that as AI becomes more efficient and accessible, its demand will surge, potentially neutralizing cost-saving measures. Newton connects this to broader industry skepticism: “... what Satya is trying to say here is that DeepSeek is not actually a threat to companies like Microsoft...” ([21:46]).

8. Hosts' Perspectives

Kevin Roose: Roose underscores the significance of DeepSeek’s impact, viewing it as a pivotal moment in AI history that challenges existing market dynamics and safety paradigms.

Casey Newton: While acknowledging the gravity of DeepSeek’s advancements, Newton urges caution against overreaction. She points out that the trend of decreasing AI costs was anticipated and that foundational American innovations remain integral. Newton states, “...things have already been moving in this direction. And I think most people who work in AI expected that it would continue to go there” ([15:01]).

9. Conclusion

Kevin and Casey agree that while DeepSeek’s achievements are noteworthy and disruptive, stakeholders should approach the situation with a balanced perspective. They emphasize the need for ongoing discussion and analysis, promising further exploration in future episodes.

Kevin Roose aptly captures the duality of the situation: “I predict that this is going to be something I'm going to hear about at every single party I go to for the next six months” ([22:14]).

The episode concludes with a light-hearted exchange, highlighting the hosts' commitment to addressing critical tech developments as they unfold.

Notable Quotes:

Kevin Roose ([03:19]): “...Nvidia stock... is down about 18% today. That represents hundreds of billions of dollars wiped off the market cap of just one company by this announcement from DeepSeek.”
Casey Newton ([14:36]): “I think as I'm doing my reading and having conversations with folks this morning, my sense is I am freaking out a bit less than some other folks that I'm talking to.”
Kevin Roose ([22:58]): “DeepSeek is not actually a threat to companies like Microsoft, because as the cost of building and using AI models comes way down, people are just going to want to use them more and more.”

This detailed exploration of DeepSeek’s emergence underscores the disruptive potential of innovative AI developments and the multifaceted responses they provoke across markets, geopolitics, and industry safety standards.

Transcript

Oracle Ad (0:00)

This podcast is supported by Oracle.

Kevin Roose (0:02)

AI requires a lot of compute power and the cost for your AI workloads can spiral. That is, unless you're running on OCI Oracle Cloud Infrastructure. This was the cloud built for AI, a blazing, fast, enterprise grade platform for your infrastructure, database, apps and all of your AI workloads.

Casey Newton (0:18)

Right now, Oracle can cut your current cloud bill in half if you move to OCI. Minimum financial commitment and other terms apply. Offer ends March 31. See if you qualify@oracle.com hardfork oracle.com hardcore.

Kevin Roose (0:31)

Hard fork well, Casey, the last time we recorded an emergency podcast, you were at gate E8 of the San Francisco airport and we were talking about OpenAI and how Sam Altman had just been fired. Are you at the airport today? And if not, would you like me to mail you an Auntie Ann's pretzel so you feel more comfortable?

Casey Newton (0:52)

Yeah, that'd be no. All things being equal, Kevin, it's actually much more comfortable to record here in my home studio and not have to compete with the PA system announcing flights to Houston.

Kevin Roose (1:24)

Casey we are here today to talk about a little company called Deepseek, which probably most people had not heard of, but that is causing a major series of events in the US Stock market and around the US Tech industry this week.

Casey Newton (1:39)

That's right. By now our listeners have probably seen that the stock market dipped on Monday and that some companies whose fortunes are closely tied to AI dipped quite dramatically. But they also might have just noticed it in the App Store, where Deepseek has hit number one this week, which is a rarity for a Chinese consumer app to do in the United States. So yes, suddenly everywhere you look, there are sign of this Deep Seek affecting the world.

Kevin Roose (2:08)

And you know, we should say to maybe talk directly to the things some listeners may be thinking about, like why we are interrupting our normal production schedule to do a special emergency episode about Deep Seek. Like, it is not unusual for people in the AI world to start freaking out about some new development or breakthrough or some new model that was released. But I believe that this is the real deal. I think this is a big moment in the history of AI development and it is really taking a toll on stock markets in ways that I think are really interesting. I mean, you said dip, but Nvidia stock, one of the highest performing stocks on the market over the past few years, and certainly the one that is most closely correlated with people's feelings about AI, is down about 18% today. That represents hundreds of billions of dollars wiped off the market cap of just One company by this announcement from Deep Seek. So I think this is a broader story, just the stock market. I think it has tons of implications for other companies developing an AI and also for concerns that a lot of people working on AI safety have about how this technology could get out of hand.