Loading summary
Jessica Mendoza
So what was one of your first clues? That Monday was going to be an interesting day?
Gunjan Banerjee
Yeah, it's funny. I was actually commuting in from Long island on Monday morning. I was on the Long Island Railroad and I looked at my phone, I was checking my emails, obviously checking the markets.
Jessica Mendoza
That's my colleague Gunjan banerjee. She hosts WSJ's take on the Week podcast and has covered markets and investing for the Journal for years.
Gunjan Banerjee
And I see that futures contracts tied to the NASDAQ composite were down more than 4%, which was really an eye popping move. We had not seen a move of that magnitude in quite some time.
Jessica Mendoza
Something was going down in the stock market and by the time Gunjan made it to the office, things had gotten even wilder.
Gunjan Banerjee
I get to my desk and, you know, around the time the market opens at 9:30, I think everyone is kind of glued to their screens at that point and they see that this really ugly day for the stock market is beginning. All three major indexes are down a bunch. Nvidia, of course, is down double digits.
Jessica Mendoza
Nvidia, the AI chip maker, its stock was tanking. We could be looking at the biggest drop in market cap on record here for Nvidia, when you take a look.
Gunjan Banerjee
At the red across the screen, specifically.
Jessica Mendoza
The NASDAQ 100 futures and the intense.
Gunjan Banerjee
So Nvidia was down more than 10% shortly after the opening bell. It ended the day down 17%. And just to put that into context, that is a market value loss of almost $600 billion.
Jessica Mendoza
So much money, right?
Gunjan Banerjee
That is just an insane amount of wealth and amount of value that evaporated within hours. In fact, it is the biggest one day market value drop on record.
Jessica Mendoza
By the end of Monday, about a trillion dollars of value had been wiped from the stock market. But while the drop was historic, Genjun also had a pretty good idea of why it was happening.
Gunjan Banerjee
So what traders, people on Wall Street, Silicon Valley was pointing to was this upstart artificial intelligence company, Deepseek.
Jessica Mendoza
Deepseek, it's an AI company out of China. And over the last few days, its chatbot has been blowing people away. Experts say Deepseek's AI is just as capable, maybe even more capable than leading AI chatbots like ChatGPT. But its creators claim it was made for much less money. And that set off a major shakeup in Silicon Valley and on Wall Street.
Gunjan Banerjee
Deepseek, this new artificial intelligence competitor, forced everyone to take a look at their portfolios, take a look at their AI products and really rethink who the winners and losers of this artificial intelligence trade we're going to be. How? All of a sudden, investors were going, hey, are the stocks that we own is Nvidia? Is it worth what we think it's worth?
Jessica Mendoza
Welcome to the Journal, our show about money, business, and power. I'm Jessica Mendoza. It's Wednesday, January 29th. Coming up on the show, how Deep Seek. See, thank the stock market. So what is Deep Seek?
Stu Wu
Deep seek is an AI chatbot. If you've tried ChatGPT, it's just like that. You go to the website, you log in, and. And you ask it a question, and it'll give you an answer as if a pretty smart human were answering it.
Jessica Mendoza
That's my colleague, Stu Wu. He covers tech in Asia. And how did you first hear about Deep Seek?
Stu Wu
I was doing this video interview with somebody in San Francisco who was the founder of an AI company, and we were talking about something else, and he didn't know something. So he shared his screen with me and said, let me look it up. And what I thought was weird was that he didn't go to Google or ChatGPT. He. He went to something I'd never heard of, Deepseek. And what he said was that he'd been playing with it for the past couple days, and he and his coworkers were just talking about how it was amazing and probably just as good as all the American competitors that he's been looking at.
Jessica Mendoza
So tell us a little bit about the company itself and the AI model who made it.
Stu Wu
So it's the brainchild of a Chinese guy named Liang Wenfeng. He co founded this hedge fund in China. It's based in Hangzhou, which is also the same tech hub where the Chinese company Alibaba is based. So deepseek grew out of that. So Liang's a pretty smart guy. He studied AI at one of China's top engineering programs. And what I thought was really interesting about the company was that it had this really unusual hiring practice. Liang wants creative people, but he doesn't really care that much about experience. And he says his hiring principle is hire people with the least amount of experience. Because his idea is that if you ask someone with work experience to solve a problem, they're going to say, well, we should solve it like this, because this is how I've done it in the past. But if you ask people without experience to solve that same problem, they'll have to sit down, think about the problem, and then they'll figure out the best and freshest and most efficient way to do it. So that's why a lot of people who work at Deepseek are either fresh graduates or people with just a year or two of work experience.
Jessica Mendoza
And so this sort of takes us to the AI model that that approach kind of created. So before Deep Seek, what was sort of the going assumption about how you make a cutting edge AI model?
Stu Wu
Yeah, so the conventional thinking was that if you wanted to make a world class AI chatbot or AI system, you needed a lot of the world's best, best AI chips that are super expensive as well.
Jessica Mendoza
In the us, AI development has been dominated by a handful of big tech companies who've trained their AI models using tons of top line AI chips. Those chips are largely made by, you guessed it, Nvidia. The assumption was if you didn't have enough of the right kind of chips, you couldn't build a world class AI model.
Stu Wu
And the other assumption was that a Chinese company could never do that because the US government had set these restrictions on what kind of chips US companies could sell to China. The thinking was that China would never catch up.
Jessica Mendoza
So let's take a look at those assumptions. The first of those assumptions is that like you said, you need a lot of chips to create these high powered AI models. How did deepsea sort of undermine that assumption?
Stu Wu
Yeah, so Deep Seq released this research paper that explained how it did what it did, and it said that it spent a fraction of the money developing its advanced chatbot, and it did so using less advanced chips. So how can we understand that? So I think a good analogy is that let's look at the first chatgpt that many of us have used and let's try to understand how that was trained. So imagine that ChatGPT is like a librarian that's read all the books in the library, and when you ask it a question, it'll give you an answer because it's read that book. But the problem is that to read all those books, that requires a lot of time and a lot of electricity for those computer chips to read those books. So Deep Seek didn't have those resources, so it tried a new approach. So imagine you're still in a library and Deep Seq is a librarian, but it hasn't read all those books. What it does instead is that it's focused on being really good at figuring out what book has the answer after you ask it the question. And it turns out that's just as effective as what ChatGPT originally did. It was just as good, but it used a fraction of the resources it.
Jessica Mendoza
Makes me think a little bit about kind of expert versus journalist. In some ways, it's like what we do is we know who to ask and what questions to ask. Instead of actually getting the PhD, we go to the experts ourselves versus the expert who has to learn everything about that subject.
Stu Wu
Yeah, that's a good example. There's very few of us who can just read all those books and just maintain all that information in their head. And then when we have to figure out it, we just kind of like stress out and call everybody we know and try to answer that question within an hour.
Jessica Mendoza
Exactly.
Stu Wu
But Deepseak does that in just a few seconds.
Jessica Mendoza
And then the second assumption here is that a Chinese company couldn't do this because they wouldn't have access to the best chips to Nvidia's chips. First of all, when and why did the US Start restricting the export of AI chips to China?
Stu Wu
So the thinking during the Biden administration was that AI is going to be really important for military purposes. So just imagine you can use it for developing a nuclear weapon or a biological weapon or helping a general make a decision on the battlefield. It could give one side an absolute advantage. So that's why they decided, we got to stay a couple years ahead of China on AI. We can't lose an edge with AI on the battlefield.
Jessica Mendoza
In 2022, the Biden administration put restrictions on the kinds of chips US Companies could sell to China.
Stu Wu
So what they said was that if you're a US Company that wants to sell these chips to China, you have to restrict this parameter called interconnect bandwidth. And the analogy that I would use is that if you were designing a race car, this restriction would constrict how much gasoline ran through the fuel line.
Jessica Mendoza
Nvidia followed that rule, but it also figured out a workaround for its Chinese chips.
Stu Wu
It complied with that fuel line. The fuel line was constricted, but it increased performance in other parts of the car engine to compensate for that, to make the most out of the fuel it did have.
Jessica Mendoza
The result was that the chips Nvidia was selling in China were more powerful than the US Government would have preferred. The Biden administration eventually cut off that workaround, but it took about a year.
Stu Wu
So they gave Deepseek and other companies a year to buy these pretty powerful chips. And if you look at one of Deepseek's research papers, it said it used about 2,000 of these powerful China only chips from Nvidia to train one of its advanced AI models.
Jessica Mendoza
Last week, Deepseek released its most advanced AI model yet called R1. And what has the reaction been?
Stu Wu
Well, I can't remember anything quite like this. I mean, I think the closest thing is when ChatGPT came out three years ago and that like, kind of like changed the world. Everybody's trying to write poems on it, you know, immediately and. But this had some serious financial consequences, right?
Jessica Mendoza
That financial fallout is after the break. For the past few years before Deepseek crashed onto the scene, investors had been piling into AI stocks betting on big returns. They called it the AI trade. Here's Gunjan Banerjee again.
Gunjan Banerjee
Basically, investors had latched onto this idea that artificial intelligence was going to unleash this wave of productivity in the economy among US workers and lead to gobs and gobs of profits for a handful of big technology companies, including Nvidia.
Jessica Mendoza
At its most recent peak, the company was worth more than $3 trillion. And Nvidia wasn't the only company people were betting on. Who else did people think were the winners of AI really?
Gunjan Banerjee
The big technology stocks even think of like Meta, Microsoft, which, which also has a competitor to ChatGPT. People were thinking of some of these huge technology companies in the US as the key winners from the AI boom.
Jessica Mendoza
And when we're talking about sort of people piling on, how big did this get?
Gunjan Banerjee
The AI trade completely ate the stock market. It just took over almost every corner of financial markets that you can imagine, like energy stocks.
Jessica Mendoza
People bought up shares of energy and utility companies because training AI models uses a lot of power. Did it feel like a bubble?
Gunjan Banerjee
It's interesting. There has been no shortage of investors the past few years saying we think this artificial intelligence trade is a bubble. And one of the reasons for that is just the amount of exuberance we've seen surrounding this trade and, and the levels of speculation. There was a lot of yoloing out there. You know, you only live once, right? Let's go for it. This was another flavor of like let's get really, really rich from trading these AI stocks. Let's pile into their options, which are super risky and can provide kind of these boomer bust returns. Let's pile into really risky exchange traded products. So there was just this mountain of speculation building and building and building while the AI craze continued.
Jessica Mendoza
But it kept growing.
Gunjan Banerjee
It kept growing and growing and growing. It kept snowballing.
Jessica Mendoza
And then came Deepseek, a cutting edge AI product that wasn't built by a US tech giant and seemingly didn't require a ton of chips. Investors who thought they knew who the winners of AI were suddenly weren't so sure.
Gunjan Banerjee
I think the Deep Seq news really spooked a lot of people about the valuations that they were assigning to some of these technology giants. It was a moment that made people question where they had been putting their money the past few years.
Jessica Mendoza
And why were investors backing away from Nvidia specifically?
Gunjan Banerjee
The one thing that a lot of investors were fixated on is that it seemed like Deepseek needed a lot less computing power. So that would mean that the AI models of the future might not require as many high end Nvidia chips as investors have been counting on. I mean, the way one investor put it to me was, we've been banking on Nvidia being the disruptor. Are they being disrupted now?
Jessica Mendoza
In a statement Monday, Nvidia praised DeepSeek's advancements. It added that serving up these kinds of AI models to users requires large numbers of its chips. Since Monday's chaos, the market seems to have stabilized. Tech stocks rebounded on Tuesday, with Nvidia up 9%. But my colleague Stu Wu says that the AI industry is just beginning to wrestle with Deepseek's model and its implications. For example, there's still a lot of questions about how Deepseek pulled this off.
Stu Wu
So Deep Seq published some research papers that explained how it accomplished what it accomplished, but it hasn't revealed all of its secrets. So we don't know exactly what the training data it used. We don't know what that looks like. And there's a lot of people in Silicon Valley who are wondering aloud, without evidence, I might add. But this is informed speculation that maybe Deepseek actually had even more powerful Nvidia chips than it's letting on. So there's still a lot to figure out. DeepSeek disclosed some of its secrets, but not all of them.
Jessica Mendoza
OpenAI, the maker of ChatGPT, has said it's looking into whether Deepseek used large volumes of OpenAI data to help develop its model. Deepseek didn't immediately respond to requests for comment. I'm curious. I mean, were all of those people also as surprised as the rest of us about this? I'm just trying to figure out, like, how did everyone, you know, Silicon Valley, those people in Wall street, how did everyone miss this?
Stu Wu
Yeah, so how did everybody miss this? Okay, that's a good question. So after Deep Sea came out last week, a lot of prominent people in Silicon Valley, whether they're AI researchers or venture capitalists, went on X or some other platform and said this is really innovative, right? Like, they just found a new way of doing this. And one of the guesses was that resource constraints breeds creativity. Right. If you think about the book or the movie Moneyball, how did the Oakland A's 20 years ago compete with the richest baseball teams despite having a fraction of the budgets? Well, they looked at undervalued strategies in baseball and they figured out how to win despite this handicap. So that's one theory, that resource constraints breeds creativity.
Jessica Mendoza
Do you think part of the problem here was that people underestimated China?
Stu Wu
I've been thinking a lot about this question, so I think a lot of people are in general surprised at how far China has come in technology. But in America, you don't actually get to see a lot of this because of effective bans on Chinese technology. In America. A lot of Americans have never touched a Chinese cell phone made by Huawei, or electric car made by byd, which is one of the world's biggest car companies. Right. These things basically don't exist in America. So I think what happened was that when Deepsea came out, anybody could download it, ask it a question in English and see the answer in English, and they're like, wow, this wasn't supposed to happen. How did this happen?
Jessica Mendoza
That's all for today. Wednesday, January 29th. The Journal is a co production of Spotify and the Wall Street Journal. Additional reporting in this episode by Asa Fitch, Rafael Huang, Karen Langley and Sam Schechner. Thanks for listening. See you tomorrow.
The Journal: How DeepSeek Sunk The Stock Market
Released on January 29, 2025, "The Journal" podcast—co-produced by Spotify and The Wall Street Journal—dives deep into the unprecedented stock market crash triggered by the emergence of DeepSeek, a Chinese artificial intelligence (AI) company. Hosted by Jessica Mendoza, along with contributions from Gunjan Banerjee and Stu Wu, this episode unpacks the intricate dynamics between AI advancements and financial markets.
The episode opens with Gunjan Banerjee recounting the initial signs of the impending market turmoil.
As the market opened, the decline intensified, marking the start of one of the most significant drops in stock market history.
Nvidia, a leading AI chip manufacturer, experienced an unprecedented decline.
By the end of the day, roughly $1 trillion had been wiped from the stock market, marking the biggest one-day market value drop on record.
The catalyst behind the crash was identified as DeepSeek, a Chinese AI firm whose innovative chatbot rivaled Western counterparts.
Stu Wu explains how DeepSeek's advancements were catching the attention of Silicon Valley and Wall Street alike.
DeepSeek distinguished itself through unconventional methodologies, challenging established AI development norms.
This strategy fostered a workforce of fresh graduates and individuals with minimal work experience, promoting novel problem-solving techniques.
DeepSeek disrupted key beliefs about AI development, particularly regarding hardware reliance and geopolitical limitations.
Contrary to the prevailing notion that cutting-edge AI requires the most advanced (and expensive) chips, DeepSeek achieved comparable performance with fewer resources.
Despite US government restrictions aimed at limiting China's access to advanced AI chips, DeepSeek navigated these barriers effectively.
Nvidia initially complied with these restrictions but later found workarounds, temporarily giving DeepSeek access to powerful chips.
Prior to DeepSeek's emergence, AI stocks were soaring, driven by optimism about AI's transformative potential.
Major tech companies like Meta and Microsoft were also seen as primary beneficiaries of the AI boom, with Nvidia's valuation peaking at over $3 trillion.
The introduction of DeepSeek forced investors to reassess their AI-centric portfolios.
Questions arose about the sustainability of previous AI investments, with concerns that companies like Nvidia might be facing disruption from new entrants like DeepSeek.
In response to the turmoil, Nvidia acknowledged DeepSeek's advancements but maintained its critical role in AI infrastructure.
Subsequently, the market began to stabilize, with tech stocks rebounding by Tuesday, exemplified by Nvidia's 9% stock increase.
Despite DeepSeek's public research, many details about its AI model remain undisclosed, leaving industry experts curious and cautious.
Speculations include whether DeepSeek might have utilized even more powerful Nvidia chips than disclosed or accessed proprietary data from competitors like OpenAI.
The episode concludes by reflecting on how DeepSeek's rise highlighted a broader underestimation of China's advancements in technology.
This revelation underscores the dynamic and competitive nature of global AI development, challenging Western dominance narratives.
Conclusion
The episode "How DeepSeek Sunk The Stock Market" serves as a comprehensive exploration of how a breakthrough from a relatively unheralded Chinese AI firm can ripple through global financial markets. By challenging established technological and economic assumptions, DeepSeek not only precipitated the largest single-day market value loss but also sparked a critical reevaluation of AI investment strategies worldwide. As the AI landscape continues to evolve, the episode underscores the importance of adaptability and vigilance in the face of rapid technological advancements.
Notable Quotes:
Additional reporting in this episode was provided by Asa Fitch, Rafael Huang, Karen Langley, and Sam Schechner.