Deep Questions with Cal Newport
Episode: AI Reality Check: Did the LLM Job Apocalypse Begin Last Week?
Date: March 5, 2026
Host: Cal Newport
Episode Overview
In this special “AI Reality Check” episode, Cal Newport scrutinizes recent headline-grabbing stories about AI causing massive workforce reductions, AI capabilities as measured through academic benchmarks, and the practical ways professional programmers are adapting to new agentic coding tools. Newport pushes back on hype-laden media narratives, offers grounded industry analysis, and shares insights from hundreds of real-world programmers to reveal the true state of AI’s impact on jobs and work.
Key Discussion Points & Insights
1. Block's AI-Fueled Layoff Claims: Separating Fact from Hype
Segment starts: [03:00]
- Headline: Block (led by Jack Dorsey) laid off 40% of its workforce (~4,000 employees), citing "AI tools enable new ways of working" as rationale.
- Cal Newport’s Analysis:
- Describes an industry trend where pandemic hiring led to overstaffing, now correcting across tech (Amazon, Microsoft, etc.).
- Skepticism About AI as a Cause: Dorsey’s statement lacks specificity: “No specific reference of this particular tool has taken on this role. So we fired, we shut down this division because we don’t need employees there... very vague.” [10:15]
- Media “Vibe Reporting”: Newport notes that “journalistic tricks” are used, attributing bold claims to executives to avoid fact checking:
"So you just make the claim, then you put a comma and attribute it to someone else." [06:20]
- Alternative Explanations:
- Block’s headcount ballooned from 4,000 to 10,000 from 2019–2025, mainly through risky acquisitions, now unwinding.
- Financial analysts unimpressed with Block's recent results despite media claims of “strong financials.”
- Quotes from Industry Voices:
- Ethan Mollick (AI commentator):
“This isn’t about AI, but that is a smart way to sell it. If you want to see your stock jump 20%...” [13:00] “Given that effective AI tools are very new and we have little sense of how to organize work around them, it is hard to imagine a firm wide sudden 50% efficiency gain... CEOs with Vision... should use AI for expansion and augmentation, not decimation.” [14:20]
- Ron Shevlin (Fintech Analyst):
“Block lays off 40% of staff and blames it on AI. Don’t buy the excuse.” [15:40]
- Ethan Mollick (AI commentator):
- Key Takeaway:
- AI will impact jobs, but the “AI job apocalypse” is not here yet. Most observed layoffs reflect economic and industry corrections, with “AI-washing” used as PR spin:
“We have to stop the vibe reporting on the AI job apocalypse. It’s not here yet and we don’t know if it’s going to come at all. But the best we can do is try to be accurate about what we’re saying.” [17:30]
- AI will impact jobs, but the “AI job apocalypse” is not here yet. Most observed layoffs reflect economic and industry corrections, with “AI-washing” used as PR spin:
2. Can AI Pass a Freshman CS Class? A Reality Check on LLMs' Capabilities
Segment starts: [18:30]
- Setting the Stage:
- Anthropic’s CEO, Dario Amodei, repeatedly compares LLMs' intelligence to human levels: “having an army of PhDs in your data center” or “a country of geniuses.”
- Cornell Experiment:
- A TA for the advanced freshman CS course (CS 2112) gives ChatGPT, Claude, and Gemini every graded assignment, quiz, and exam from the course, grading them exactly as human students.
- Memorable Quote from the Video:
"Can AI pass a first semester freshman CS class? To answer this question, I ran every single assignment, every exam, every quiz... through the three best models... so that I could give each AI the most accurate possible grade in the class." [20:30]
- Performance Results:
- Early/simpler assignments: AIs score very high (ChatGPT: 102/104, Claude 99/104, Gemini 101/104).
- Some projects and more complex assignments: Scores drop drastically (e.g. Assignment 6: ChatGPT 32/100, Claude 20/100, Gemini 13/100).
- Humorous Hallucinations:
- On a string concatenation assignment, Claude outputs “hello world world” instead of “hello”, reflecting overfitting to AI grading datasets [22:40].
- Final Grades:
- ChatGPT: B+ (below class median)
- Claude & Gemini: C+ (below threshold needed to declare the CS major at Cornell)
- Cal Newport’s Interpretation:
- On Amodei’s Claims:
"It was stupid all along... to try to use human education levels as a way to describe a large language model. This is just different.” [27:30]
- LLMs solve very specific problems but aren’t general-purpose intelligence equivalent to people.
“Hopefully we can stop using terms like having a data center full of PhDs.” [29:00]
- On AI/Human Hybrid Tool Use:
- Real value comes from interactive, guided workflows—not one-off, autonomous brilliance.
- On Amodei’s Claims:
3. Agentic AI and Coding: What Programmers are Really Doing
Segment starts: [30:30]
- Cal’s Ongoing Research:
- Put out call for reports from professional developers. Has reviewed 100+ of 350+ responses detailing first-hand experiences with AI tools.
- Two Real Programmer Perspectives:
- The Enthusiastic All-in User
- No longer writes any code manually; instead, tasks and code managed through AI agentic tools (Claude, Codex, etc.)
- Workflow: Plan feature/bugfix with AI → iterate with chatbot → AI outputs plan/code → human verifies and uses Git for control
- Quote:
“I have developed things in the past week that would have taken me months before.” [34:00]
- Key Insights:
- Heavy chatbot use in planning, not just code generation.
- Multi-agent approaches (many agents working in parallel) are “exhausting”; rarely used outside of hobbyist projects.
- The Reticent, Yet Appreciative, Professional
- Uses LLMs for “scaffolding, boilerplate, repetitive tasks” and to check documentation.
- Quote:
“When actively using these coding agents, I found that it generally slows me down. Using them introduced tasks I didn’t have before: composing a prompt, checking the output, re-prompt, manually refactor...” [37:45]
- Reviewing AI-generated code is more laborious and risk-prone.
- The Enthusiastic All-in User
- Cal’s Synthesis:
- Adoption: About 45% of programmable respondents now produce most of their code via agentic tools.
- Best Practices Unclear: Wide spectrum of adoption; no consensus on efficient workflows.
- Professional Workflow Diverges from Online Hype:
“I do not think based on these interviews that the hyper multi agent approach that we see most talked on the Internet is going to become some sort of standard for serious programmers...” [42:30]
- Actual Coding Transformation is Nuanced:
“Something is happening. It’s more complicated than other people make it seem... what’s working, what’s not, what’s hype, what’s not—let’s try to figure out what’s actually happening.” [44:10]
Notable Quotes by Timestamp
- On Block’s AI Layoff Narrative:
- “If you are announcing the layoff of 40% of your staff, can you use capital letters at the beginning of your sentences?...it feels a little disrespectful.”—Cal Newport [04:10]
- “We have to stop the vibe reporting on the AI job apocalypse. And it’s not here yet and we don’t know if it’s going to come at all. But the best we can do is try to be accurate about what we’re saying.” —Cal Newport [17:30]
- On LLMs and "PhD-level" Intelligence:
- “It was stupid all along...to try to use human education levels as a way to describe a large language model.” —Cal Newport [27:30]
- “You could get these chatbots the right answers if you’re willing to be sufficiently interactive and hold their hands... but that’s not really the right takeaway here.” —Cal Newport [26:50]
- On Real Programming with AI:
- “There’s a lot of just chatbot discussion happening in these workflows... they’ve entered a more interactive way; they want to talk back and forth...” —Cal Newport [36:15]
- “Are we sure that this [agentic approach] is actually producing the best code?” —Cal Newport [39:00]
- “None of the serious programmers I heard of so far are doing anything like that [hyper multi-agent workflows] for the most part.” —Cal Newport [42:40]
Summary Takeaways
- AI is not (yet) the direct cause of mass layoffs in tech despite media narratives—most recent cuts reflect economic corrections and over-hiring during the pandemic.
- Heavily anthropomorphizing LLM capabilities (“PhDs in data centers”) is misleading; actual competence is inconsistent and highly dependent on guidance, context, and human oversight.
- Professional programmers are integrating AI tools, but real-world practices are nuanced and hybrid; solo, multi-agent workflows touted online remain rare.
- The technology’s impact is substantial but overhyped in the press—critical thinking and firsthand analysis are needed to understand what’s truly changing.
Further Listening
- Follow Cal’s AI Reality Check series for ongoing, grounded updates
- Monday’s main episode will resume broader “Deep Questions” topics
