Summary6 min read

Deep Questions with Cal Newport

Episode: AI Reality Check: Did the LLM Job Apocalypse Begin Last Week?
Date: March 5, 2026
Host: Cal Newport

Episode Overview

In this special “AI Reality Check” episode, Cal Newport scrutinizes recent headline-grabbing stories about AI causing massive workforce reductions, AI capabilities as measured through academic benchmarks, and the practical ways professional programmers are adapting to new agentic coding tools. Newport pushes back on hype-laden media narratives, offers grounded industry analysis, and shares insights from hundreds of real-world programmers to reveal the true state of AI’s impact on jobs and work.

Key Discussion Points & Insights

1. Block's AI-Fueled Layoff Claims: Separating Fact from Hype

Segment starts: [03:00]

Headline: Block (led by Jack Dorsey) laid off 40% of its workforce (~4,000 employees), citing "AI tools enable new ways of working" as rationale.
Cal Newport’s Analysis:
- Describes an industry trend where pandemic hiring led to overstaffing, now correcting across tech (Amazon, Microsoft, etc.).
- Skepticism About AI as a Cause: Dorsey’s statement lacks specificity: “No specific reference of this particular tool has taken on this role. So we fired, we shut down this division because we don’t need employees there... very vague.” [10:15]
- Media “Vibe Reporting”: Newport notes that “journalistic tricks” are used, attributing bold claims to executives to avoid fact checking:
  
  "So you just make the claim, then you put a comma and attribute it to someone else." [06:20]
- Alternative Explanations:
  - Block’s headcount ballooned from 4,000 to 10,000 from 2019–2025, mainly through risky acquisitions, now unwinding.
  - Financial analysts unimpressed with Block's recent results despite media claims of “strong financials.”
Quotes from Industry Voices:
- Ethan Mollick (AI commentator):
  
  “This isn’t about AI, but that is a smart way to sell it. If you want to see your stock jump 20%...” [13:00] “Given that effective AI tools are very new and we have little sense of how to organize work around them, it is hard to imagine a firm wide sudden 50% efficiency gain... CEOs with Vision... should use AI for expansion and augmentation, not decimation.” [14:20]
- Ron Shevlin (Fintech Analyst):
  
  “Block lays off 40% of staff and blames it on AI. Don’t buy the excuse.” [15:40]
Key Takeaway:
- AI will impact jobs, but the “AI job apocalypse” is not here yet. Most observed layoffs reflect economic and industry corrections, with “AI-washing” used as PR spin:
  
  “We have to stop the vibe reporting on the AI job apocalypse. It’s not here yet and we don’t know if it’s going to come at all. But the best we can do is try to be accurate about what we’re saying.” [17:30]

2. Can AI Pass a Freshman CS Class? A Reality Check on LLMs' Capabilities

Segment starts: [18:30]

Setting the Stage:
- Anthropic’s CEO, Dario Amodei, repeatedly compares LLMs' intelligence to human levels: “having an army of PhDs in your data center” or “a country of geniuses.”
Cornell Experiment:
- A TA for the advanced freshman CS course (CS 2112) gives ChatGPT, Claude, and Gemini every graded assignment, quiz, and exam from the course, grading them exactly as human students.
- Memorable Quote from the Video:
  
  "Can AI pass a first semester freshman CS class? To answer this question, I ran every single assignment, every exam, every quiz... through the three best models... so that I could give each AI the most accurate possible grade in the class." [20:30]
Performance Results:
- Early/simpler assignments: AIs score very high (ChatGPT: 102/104, Claude 99/104, Gemini 101/104).
- Some projects and more complex assignments: Scores drop drastically (e.g. Assignment 6: ChatGPT 32/100, Claude 20/100, Gemini 13/100).
- Humorous Hallucinations:
  - On a string concatenation assignment, Claude outputs “hello world world” instead of “hello”, reflecting overfitting to AI grading datasets [22:40].
- Final Grades:
  - ChatGPT: B+ (below class median)
  - Claude & Gemini: C+ (below threshold needed to declare the CS major at Cornell)
Cal Newport’s Interpretation:
- On Amodei’s Claims:
  
  "It was stupid all along... to try to use human education levels as a way to describe a large language model. This is just different.” [27:30]
  - LLMs solve very specific problems but aren’t general-purpose intelligence equivalent to people.
  “Hopefully we can stop using terms like having a data center full of PhDs.” [29:00]
- On AI/Human Hybrid Tool Use:
  - Real value comes from interactive, guided workflows—not one-off, autonomous brilliance.

3. Agentic AI and Coding: What Programmers are Really Doing

Segment starts: [30:30]

Cal’s Ongoing Research:
- Put out call for reports from professional developers. Has reviewed 100+ of 350+ responses detailing first-hand experiences with AI tools.
Two Real Programmer Perspectives:
1. The Enthusiastic All-in User
  - No longer writes any code manually; instead, tasks and code managed through AI agentic tools (Claude, Codex, etc.)
  - Workflow: Plan feature/bugfix with AI → iterate with chatbot → AI outputs plan/code → human verifies and uses Git for control
  - Quote:
    
    “I have developed things in the past week that would have taken me months before.” [34:00]
  - Key Insights:
    - Heavy chatbot use in planning, not just code generation.
    - Multi-agent approaches (many agents working in parallel) are “exhausting”; rarely used outside of hobbyist projects.
2. The Reticent, Yet Appreciative, Professional
  - Uses LLMs for “scaffolding, boilerplate, repetitive tasks” and to check documentation.
  - Quote:
    
    “When actively using these coding agents, I found that it generally slows me down. Using them introduced tasks I didn’t have before: composing a prompt, checking the output, re-prompt, manually refactor...” [37:45]
  - Reviewing AI-generated code is more laborious and risk-prone.
Cal’s Synthesis:
- Adoption: About 45% of programmable respondents now produce most of their code via agentic tools.
- Best Practices Unclear: Wide spectrum of adoption; no consensus on efficient workflows.
- Professional Workflow Diverges from Online Hype:
  
  “I do not think based on these interviews that the hyper multi agent approach that we see most talked on the Internet is going to become some sort of standard for serious programmers...” [42:30]
- Actual Coding Transformation is Nuanced:
  “Something is happening. It’s more complicated than other people make it seem... what’s working, what’s not, what’s hype, what’s not—let’s try to figure out what’s actually happening.” [44:10]

Notable Quotes by Timestamp

On Block’s AI Layoff Narrative:
- “If you are announcing the layoff of 40% of your staff, can you use capital letters at the beginning of your sentences?...it feels a little disrespectful.”—Cal Newport [04:10]
- “We have to stop the vibe reporting on the AI job apocalypse. And it’s not here yet and we don’t know if it’s going to come at all. But the best we can do is try to be accurate about what we’re saying.” —Cal Newport [17:30]
On LLMs and "PhD-level" Intelligence:
- “It was stupid all along...to try to use human education levels as a way to describe a large language model.” —Cal Newport [27:30]
- “You could get these chatbots the right answers if you’re willing to be sufficiently interactive and hold their hands... but that’s not really the right takeaway here.” —Cal Newport [26:50]
On Real Programming with AI:
- “There’s a lot of just chatbot discussion happening in these workflows... they’ve entered a more interactive way; they want to talk back and forth...” —Cal Newport [36:15]
- “Are we sure that this [agentic approach] is actually producing the best code?” —Cal Newport [39:00]
- “None of the serious programmers I heard of so far are doing anything like that [hyper multi-agent workflows] for the most part.” —Cal Newport [42:40]

Summary Takeaways

AI is not (yet) the direct cause of mass layoffs in tech despite media narratives—most recent cuts reflect economic corrections and over-hiring during the pandemic.
Heavily anthropomorphizing LLM capabilities (“PhDs in data centers”) is misleading; actual competence is inconsistent and highly dependent on guidance, context, and human oversight.
Professional programmers are integrating AI tools, but real-world practices are nuanced and hybrid; solo, multi-agent workflows touted online remain rare.
The technology’s impact is substantial but overhyped in the press—critical thinking and firsthand analysis are needed to understand what’s truly changing.

Further Listening

Follow Cal’s AI Reality Check series for ongoing, grounded updates
Monday’s main episode will resume broader “Deep Questions” topics

Loading summary

Transcript1 lines

[00:01]
A
Did the fintech company Block just lay off 40% of its workforce due to AI automation? Can the best AI models pass a freshman computer science class? Programmers love agentic AI, but how exactly are they using these tools? For those of you who followed the tech news this past week, these are all pressing questions and we're going to try to find some answers. I'm Cal Newport, and this is the AI Reality Check. Now I want to do a quick aside before we get into this week's stories. Because this is a new format for my podcast feed, I want to give you a quick explanation more and more on the main Monday episode of this show, I've been reacting to the latest AI news where I put on my computer science hat and I try to push back on hype and vibe reporting and surface the deeper trends in these topics that I think really matter. But not everyone who listens to that Monday episode wants to hear about this. So I decided I would move the AI discussion to its own mini episodes on Thursdays. This is an experiment. Maybe I'll move it back. Maybe I'll move it to its own feed. Maybe I won't do it every week. So just bear with me. But keep in mind, if you want to share any of these episodes, we're also putting up on YouTube so you can send the video link to someone who might need to hear some of this reality checking. All right, that's enough logistics, so let's get into our first story of the week. All right, late last week, Jack Dorsey, the CEO of the FinTech company Block. You know they're responsible for Stripe and Cash app, among some other products, posted a note on X announcing massive layoffs at his company. Let me read you from this note, Dorsey said. Today we're making one of the hardest decisions in the history of our company. We're reducing our organization by nearly half, from over 10,000 people to just under 6,000. That means over 4,000 of you are being asked to leave later on. He says the following we're not making this decision because we're in trouble. Our business is strong, but something has changed. We're already seeing that the intelligence tools we're creating and using paired with smaller and flatter teams or enabling a new way of working which fundamentally changes what it means to build and run a company. And it's accelerating rapidly. Can I make a quick aside? This is like a hint to CEOs. If you are announcing the layoff of 40% of your staff, can you use capital letters at the beginning of your sentences? It really caught my attention in this tweet that he doesn't capitalize any of his words. I don't know, it feels a little disrespectful. But let's get back to the actual story here. The traditional media was quick to embrace and amplify Dorsey's claim that these layoffs were because AI made these positions redundant or unnecessary. Here is the headline, for example, from a New York Times article about the layoffs. The headline read, Block cuts 40% of its workforce because of its embrace of AI. Here's the subhead from that article. About 4,000 workers will lose their jobs as the payment company does more work with new artificial intelligence tools, its top executive said. Another quick aside, because this is a journalistic thing, I began to notice more and more, I think, really starting around the COVID coverage era, where you have a claim that feels right that you want to put in your subhead because there's a point you're trying to make, but either it's hard to fact check or you don't want to fact check it because you're not quite sure what you're going to find. It'll be complicated. So you just make the claim, then you put a comma and attribute it to someone else. We didn't used to see attributed claims in sub headlines or headlines, but we began to see it more. It's a good way of. I'm trying to make a point here, and I don't actually want to go and directly verify. Did they lay off all these people? Because AI tools. I'll just say they lay off people's AI tools, said someone. So you add as a comma. So just keep in mind that sort of reporting trick. If we read the article itself, the framing makes it super clear what they're implying here. Here's from the article. The cuts made as Block reported strong financial results for its most recent quarter are perhaps the most striking example so far of a technology company's making plans to eliminate employees because of AI. I don't mean to pick on the Times. A lot of publications had similar coverage and the stock price went up 20%. For block, this is an important article to look at, in part because I got sent it a lot of times. When I get sent an article a lot of times, that means it is catching people's attention and is either exciting or upsetting them. So it's worth some closer scrutiny. I think there's a general vibe that this article is trying to verify or validate, which is the vibe of something big is happening. Yeah, We've been talking about AI could get rid of jobs or whatever, but now it's happening. See look, this is the first shoe to drop of a major crisis like the first company that laid off almost half of its workforce. This is the thing we've been warning you about. Major economic disruption. It has begun. That is a story that is very sticky and very attention catching. But is it true? Well, if you dig a little deeper, there's a lot of commentators online who know this industry sector a little bit better who are not at all convinced. Let me give you a few bits of contextual information about Block and its layoffs. Between 2019 and 2025, Block's employee count grew from around 4,000 employees to over 10,000. So they had massive growth during the pandemic. A lot of this growth actually came from acquisitions in the crypto and blockchain space earlier in the pandemic when those things were still hot. Those acquisitions are now of course floundering as those technologies, especially the blockchain based software technologies, are having a hard time. A lot of their startups are really struggling despite the fact that the Times said that they had quote, strong financial results, end quote. If you actually read the industry analysts who study the quarterly reports from Block, they're not impressed because the last two quarters they actually fell short of their earnings target. So here's an alternative explanation for what might be going on here. Like just about every major tech company in America block over hire during the pandemic when that industry was booming. Also like just about every major tech company right now in the last two years, they're shedding jobs to try to right size back because they had overhire during the pandemic. We've talked about on this show before, Amazon doing this, Microsoft is doing this. This is a common trend in recent years. But how do we know it really wasn't AI? AI is the reason why they laid off these 4,000 people. Well, there's a couple things going on. One, a lack of specificity in Dorsey's statement. He just says like well we have these intelligence tools and then he talks about non AI things again. We have like different types of teams and we just, we don't need as many people anymore. No specific reference of this particular tool has taken on this role. So we fired, we shut down this division because we don't need employees there or in this division. What we did is we laid off the entire entry level class because the managers can now get by with less. It's very vague what he said. Two, as we'll hear later in today's episode though there is major changes happening in computer programming because of new agentic AI tools. Basically every serious commentator who is studying this industry says yeah, we're not yet. We haven't figured out. The companies haven't figured out exactly what this means. We're certainly not laying off, ready to lay off half of our workforce yet. These tools are very new, the versions that people are getting excited about. But maybe the most telling reason why we know this is not AI is that Ethan Malik didn't buy this claim. Ethan Mollick from PIN is a respected AI commentator who is very much on the booster at side. He's very AI is going to change everything. And even he didn't buy this idea that AI was responsible for the layoffs at block. On a LinkedIn post Ethan Molik said the following referring to the layoffs this isn't about AI, but that is a smart way to sell it. If you want to see your stock jump 20% then on X Ethan Mollick said the following in response to Dorsey's tweet two things. One given that effective AI tools are very new and we have little sense of how to organize work around them, it is hard to imagine a firm wide sudden 50% efficiency gain. Two CEOs with Vision who hired well should also use AI for expansion and augmentation, not decimation. I'll just say as an aside, I've been hearing this from the managers and programmers I've been talking to in the last couple weeks about how they're using agentic programming. I am much more likely to see the effect to be. I mean I haven't had any of them say we're laying people off. But I have heard a lot of people say, like Malik implies here, the reaction to these tools at a lot of these startups has been do more work. Great, now we can do more work with the same people. Let's make more money out of the same people. Not let's lay people off. All right, we have another voice of skepticism here. This one comes from Ron Shevlin, sorry, who is an industry analyst who specializes in the fintech sector. So he specializes in the sector where Block is and he writes and covers Block professionally as a financial journalist. He wrote a column right after this that was titled the following block lays off 40% of staff and blames it on AI. Don't buy the excuse. And he goes on to say yeah, they over acquired, they made some bad acquisitions, they need the right size and they're Blaming AI because it sounds better than saying, yeah, we made some bad calls during the pandemic and now we have to adjust to it. So what's the bottom line here in terms of reality checking this story? AI will have an impact on jobs. I'm not one of these skeptics that says this is a fad that's going to go away, that this is going to be like blockchain based software that really just failed to catch on. But we're not really there yet. Outside of some narrow instances, the tools have not matured to the phase where we really understand what's going on, where we're really seeing major changes to the way companies are structuring themselves. Most of the commentators I can find who follow this closely say, yeah, sure, there is going to be things happen with jobs. We don't know if it's going to lead to expansions or contractions or what sectors get hit more than yet, but we're not there yet. There is a tendency, I think among coverage right now to lean into the debt vibe that AI is going to affect jobs and try to keep making the claim as happening right now. And what's happening is the CEOs of these companies, especially tech companies. So CEOs like Jack Dorsey are seeing the tendency towards that vibe reporting. This is very tempting for journalists. And so they're trying to. There's a term Annie Lowrey introduced. I think it was something like AI washing. They're trying to justify layoffs that are due to things like pandemic over hiring by saying, well, AI we're being smart so they look better, like better decision makers and they're more forward thinking. It's important that we cover AI's impacts on jobs accurately so that when real impacts come, we can see them with clear eyes and react to them honestly and hold to account the actual why are you firing these people? What's happening here? What leaders doing this? We really do need to cover that accurately. So we have to stop the vibe reporting on the AI job apocalypse. And it's not here yet and we don't know if it's going to come at all. But the best we can do is try to be accurate about what we're saying. All right, second story. This one's kind of a fun one. All right, so Anthropic CEO Dario Amadei famously said in recent, I guess this is all this last year famously said that their LLM products have the intelligence of someone with a doctorate. That before, like, well, it was as smart as a high School student, then as smart as a college student. Now it's as smart as someone with a doctorate. He described this product, deploying his product like having an army of PhDs in your data center. Last month he used a related terminology. He said, we can offer you a country of geniuses in a data center. Well, I was thinking about this approach of sort of describing AI with human education levels when I came across an interesting video that was posted in January which did a really cool experiment, a TA for Cornell University's freshman computer science course, CS 2112. They probably call it 2112. This is their sort of advanced freshman fall CS course. So if you come into the CS program there as a pretty advanced student, this would be the course you would take. But it's for freshmen in their first semester. He was taing it. So he said, here's what I'm going to do. I'm going to take the three leading AI models and I'm going to give them every graded thing we do in this class I will give to the models and then I will grade their results. At the same time, I'm grading the real students in the class using the exact same rubrics. And then at the end I will, you know, weight the grades. Just treat them like a student in my, in this class and see how they do. Let me play a quick clip here. This is the intro. The intro to that video. Can AI pass a first semester freshman CS class? To answer this question, I ran every single assignment, every exam, every quiz, every graded interaction the students got this semester through the three best models I could get my hands on from ChatGPT, Claude and Gemini. Then I graded each result with the exact same rubric we use on students so that I could give each AI the most accurate possible grade in the class. All right, so this was a very entertaining video if you watch the whole thing because he goes through specific assignments, he's like, whoa, look, this is really cool. Oh my God, look at this crazy thing it did. It's well edited. I thought it was really cool. In the end, they have a competition in the class where you create these like critters that evolve. And they had the AI models, critters compete with the critters from the class. A couple things I noticed from the videos, sometimes these models did very well on assignments. Sometimes they really struggled, Sometimes they made very revealing, baffling mistakes. Like in an early assignment where they were doing some simple string concatenation and the assignment had you write a program that was going to output the word you're going to create a string concatenation, but basically you're going to output the word hello is what it asked you to do on the screen. And Claude's submission outputted hello world world. Because what's going on here is there's a lot of AI assignments out there. I mean CS assignments out there that famously say, hey, write hello world as the first thing you do when you're using a new programming environment. And clearly it was just trying to statistically grow out its answer. It's like, well, if I'm printing hello in an assignment, I got to print hello World and then add another world just to be safe. But how did they end up grade wise? Okay, so I have the grades in front of me here. They used the latest greatest models from ChatGPT, Claude and Gemini. They actually upgraded during the fall. They did this last fall. They were using the very most expensive version of the Cloud LLM available. I forgot which one. And then when a new one came out, they upgraded to that new one. On some assignments these things did pretty well. Especially the early assignments we got like on the first assignment, ChatGPT got a 102 out of 104. Claude got a 99 out of 104. Jim and I got a 101 out of 104. They also did well on the final exam because this was an in class final exam where you're just writing answers, right? So like you, you just have to use the knowledge in your head. That's a good setup again for LLMs. And so like ChatGPT got a 93 out of 100. Gemini got an 84. There's other assignments where they, they really struggled. Assignment six, ChatGPT got 32 out of 100. Claude got 20 out of 100. Jim and I got 13 out of a hundred. On assignment five, ChatGPT got 60 out of 100. Claude got six out of a hundred. Gemini got 67 out of 100. There's a lot of issues it had with hallucinating. It had a hard time. If you watch this video where you would, the assignment would give you multiple, you know, some rules for what to do in the assignment and it would just sort of skip some of the rules Sometimes I think in the example where Claude got 6 out of 100, it just kind of made up its own assignment and solved that one instead. So it's sort of a mixed bag in terms of its final grades. Two of the models, Claude and Gemini, ended up getting a C plus in the class. This is a freshman computer science, you need a 2, 5 to declare in your. In the initial classes. You need a 2. 5 GPA at Cornell to declare yourself as a computer science major. A C plus is like a 2, 3 something. So they weren't doing well enough to actually even major in computer science. ChatGPT did better with the B plus. It was below the median for the class, but it did somewhat better. Anyways, here's what's interesting about this. I mean there's the kind of. The catchy thing is like this is an army of geniuses. This is a PhD level, whatever this is. They're struggling with the first class you take as a freshman in computer science, which is the topic that these models are best suited for. So there's that sort of like gotcha moment. But that's not really what this is about. Right. Because I'm sure you could get these chatbots to get you the right answers to these assignments if you're willing to be sufficiently interactive and hold their hands and get the prompts in just the right way and correct them. That's not really the right way. The right takeaway here. I think the right takeaway here was that it was stupid all along. For Dario Amadei that try to use human education levels as a way to describe a large language model. This is just different. The human brain, we have a general purpose integrated brain that does lots of things. The whole person is educated. It makes sense to talk about the education level of a person, but not really a language model. It turns out a lot of these claims like Guendario Amade, I went back and checked this out. Excuse me, why did he originally say that their language models were now PhD level? It's because they had the original time. He started saying that is that they had given it math problems like a problem set and it was doing well on the math problems from this problem set. And one of the professors who worked on creating those problem sets said, those are hard problems. Those are the type of problems I would assign to my graduate students. That's where they originally got the claim that this is a PhD level. Right. So this idea of just generally talking about the intelligence level of language models, I think is anthropomorphizing and is not useful. The reality is these are very specialized tools. They tend to get tuned for specialized purposes and to get their real value. It's a combination of the tool and learning as the human how best to use and deploy the tool and check its work and redeploy it towards that particular goal. That is a very different tool use scenario. It's a tool use scenario. It's very different than imagining just an anthropomorphic that has a general education level. So hopefully we can stop using terms like having a data center full of PhDs. Also, that was a clever video. So you know, kudos to that TA for putting that together. It's a hard it was a hard CS class. It was definitely harder than the intro CS classes I took at Dartmouth, but it reminds me of the type of classes we had at mit. So, you know, it was a hard class. All right, one final story here. The story actually comes from me. Obviously there's a lot going on in the last four or five months with new agentic coding tools being enthusiastically embraced by computer programmers. A lot of these viral essays are going around that just keep and articles that are influenced by those essays and podcasts where people are talking about oh my God, huge changes are happening in the world of computer programming and this is really going to be. This is like ground zero for the Long Promise. We're about three years in now. The Long Promise claim that the language model based tools are going to have massive disruptions. But what actually is going on? I've been trying to find out. As people who subscribe to my newsletter@calnewport.com know, a week or two ago I put out a call for professional computer programmers to send me detailed reports about exactly how they and them teams use language model based AI tools and how this has changed in the recent past. I have over 350 such reports in so far. I've carefully made my way through 100. I'm reading really trying to get my brain around what's really happening with professional programmers and these tools. I thought it would be useful today to read you excerpts from two responses that I think are very typical of the type of responses I'm reading that try to give you a better picture of what exactly does it mean for these programmers to be using these new tools. I cut out details in these and have some elision to get rid of identifying details. All right, so here's my first excerpt. I'm a software developer working at a tech startup. Our use of AI varies by person at the company, but my use has skyrocketed starting in the fall of 2025. So much so that I don't write any code anymore, but I'm still heavily involved in oversight and architecture. I used cursor quite a bit last year, but have moved on to working directly in the terminal with codecs at work. The workflow goes Something like this plan a feature or start a discussion about a bug fix with AI, discuss until I'm satisfied, have it output a plan, iterate on the plan, then execute the plan. After execution, I verify the outcome. I use Git extensively throughout this process. Git is a repository software for managing code that multiple people are working on. I've tried the multi agent approach where multiple agents are working on different git work trees at the same time. I can't do it. It's too much context switching and I end up just accepting things I wouldn't normally accept because it's an exhausting process. The quality dips dramatically. I love my current workflow. I've developed things in the past week that would have taken me months before. All right, let's pause there before I do the second excerpt. This I would say is very typical of what I would call the enthusiastic all in user from among the subset of professional pro programmers. Most of the code they're producing is now actually being generated by an AI agentic tool. Typically it is Claude code where they switch the model behind it. I don't know if it was Opus to Sonnet or Sonnet to Opus in the fall and that really seemed to be make it good enough now that a lot of people wanted to use it though I would say I also see ChatGPT Codex is also commonly used. But an interesting thing about this or I want to point out two things. One, there's a lot of just chatbot discussion happening in these workflows. Remember you talked about making a plan, iterating on the plan. That's all actually like chatbot interaction. So sort of or related to using these tools to produce more code. These programmers have entered a more interactive way. They want to talk back and forth. It reminds me a lot of the research I did for the New Yorker about how students are using chatbots to write paper. They find talking back and forth with the chatbot as they write is less straining. So that's picking up here. But also notice this programmer is not really big on the multi agentic approach. Which is what you see most often told in the sort of breathless online articles and YouTube videos is this idea of I have 20 agents working at the same time and this agent checks this agent and there's a supervising agent that looks at those agents and then it reports over here to the hierarchy agent and then that agent is on open openclaw so that it can it can send recommendations to my YouTube channel and then make sure that it pays that you know these super complicated Trees of different agents supervising other agents. You really aren't seeing that, at least in my study here. It's. You're not seeing a ton of that in professional programmers. You tend to see it more in people who are like working on their own personal bespoke projects and find it really fun. But I don't see as much. And that's what we saw reflected here. All right, let me read you one other typical excerpt here from a real professional programmer. I think this captures well another very common type of response which is a little bit more reticent but still appreciating the power of these new tools. Let me read this. I'm a software developer working at a tech startup. Our use of AI varies by person at the company, but my use has skyrocketed starting in the fall of 2025. Oh, wait, that was the last one. I'm sorry, this is the new one. I don't want to just reread the last one. All right. I'm like a language model here, just sort of randomly hallucinating the same answer twice. No, no. Here's the real second excerpt. I'm a staff software engineer at a tech startup. The AI models have made the easiest tasks even easier. Scaffolding a solution, boilerplate code, replacing variables, or moving an import. Repetitive tasks are good candidates. LLMs are also useful as a way to quickly investigate the documentation of a tool and or get a reminder on syntax for something I'm trying to do. But the easy stuff, the task that AI can do well was never the hardest nor most time consuming part of my job. When actively using these coding agents, I found that it generally slows me down using them introduced tasks I didn't have before. Composing a prompt, checking the output, reprompt manually, refactor when it isn't quite right. It also slows down the code review process. I'm much more detailed in my reviews when I know a coworker used an LLM to generate some or all of the code. That's also a very common response as well. That's pointing out this idea, which I think is a fair criticism that the people like our first excerpt, which is doing most of their code generation with agentic AI like this is saving so much time. They're noting the more reticent users are noticing you are downplaying the huge amount of time that now surrounds. Yeah, you don't write the code yourself. That's faster. But now you have to do so much other work. All of this iteration with the model and the prompts and try the prompt again and work on your agent markdown file and your skills harness and then all of the review on the other side. And if it was produced with AI, you really have to review it. And he's like, there's all of this other work that's surrounding this workflow, which is. None of it's very fun. I mean, this is taking a lot of time. Are we sure, are we sure that this is actually producing the best code? So there's sort of this tension going on in the computer programming world. Here's the takeaway from this one. Agentic coding tools passed a threshold of usefulness with the Claude codecs update in the fall. That has made them much more heavily used in my survey. Something like 45% of the people I talk to are now producing the majority of their code with an agentic tool such as CLAUDE code. All right, two, it's really unclear exactly what the best practices are for this. Are there seems to be a spectrum of enthusiasm of the users of it in the space for sure. On one end there's way too much AI interaction going on. This can't be the most efficient way to do it. On the other end, there's a lot of reticence. The reality is going to fall somewhere in the middle. We don't yet know what the future computer programming looks like. I think by the summer there's going to be some best practices. They'll have some clever acronyms to go with them. There'll be some best practices about how best to use these. There will be automatic code production. I think we're going to pull back a little bit on how much AI Chatbot should be involved in review as well as planning. I think that's a little bit of just enthusiasm there. I do think a lot of code will still be generated, but we'll be better at where we deploy the code. I think there'll be more standardization about planning and architecture documents, et cetera, which will have a high overhead at first, but it'll allow us to deploy these tools better. I do not think based on these interviews that the hyper multi agent approach that we see most talked on the Internet is going to become some sort of standard for serious programmers in most places. And the vibe coding like you see, you know, talked about a lot, give me this app and I come back a week later and it's done. That really is in the realm of like hobbyist and apps for personal apps for yourself or people who are doing experiments. None of the serious programmers I heard of so far are doing anything like that for the most part. All right, so there's a lot to be done here. But what I'm trying to do, that's why it's reality check. I am not interested in breathless accounts of what's happening online because that's engagement hunting. I'm not interested in hearing sort of like non technical reporters who have just heard a lot of those accounts and then are like, look, I don't know the details, but I think we can all agree that like there's not going to be programmers in the future. I think we got to talk to real programmers. What is really going on? Something is happening. It's more complicated than other people make it seem. Let's keep listing. I'll read you some more of these reports in weeks ahead. Let's figure out the old fashioned way. Turn every page, learn what's going on, what's working, what's not, what's hype, what's not, and let's try to figure out what's actually happening. I think we will and we'll get on it, especially if you follow me here. All right, that's all the time I have for today. Remember, take AI seriously, but not necessarily everything you hear about it. I'll be back on Monday with the main episode. I hope they'll do another one of these next Thursday. See you then.