
Companies are using A.I. as a reason for layoffs, but the truth may be more complex.
Loading summary
Casey Noon
So there's a lot of noise about AI, but time's too tight for more promises. So let's talk about results. At IBM, we work with our employees to integrate technology right into the systems they need. Now a Global workforce of 300,000 can use AI to fill their HR questions, resolving 94% of common questions. Not noise. Proof of how we can help companies
Jasmine Sun
get smarter by putting AI where it
Casey Noon
actually pays off, deep in the work that moves the business.
Jasmine Sun
Let's create smarter business.
Casey Noon
IBM. I just read the most heartwarming news this morning that I wanted to share with you, Kevin.
Kevin Roose
What's that?
Casey Noon
The UK government has withdrawn a proposal to let AI companies train on copyrighted works after a backlash from artists like Dua Lipa. Did you see this?
Kevin Roose
No.
Casey Noon
Dua Lipa said, don't start now with this AI. My sugar boo, she litigating, Kevin. She's making some new rules and she's saying we're not going to train on my copyrighted works. Wow. And that's why she is a queen. And so, Dua Lipa, if you're listening, we salute you.
Kevin Roose
Yeah. Dua Lipa. You're a Dua Keepa, period.
Casey Noon
Dua Lipa said artists rights.
Kevin Roose
Wow. I'm Kevin Roos, a tech columnist at the New York Times. I'm Casey Noon from Platformer, and this is Hard Fork.
Casey Noon
This week, a big wave of tech layoffs is raising the question, has AI job loss truly begun? Then writer Jasmine sun is here to help us answer the question, why are chatbots bad at writing? And finally, it's token maxing time. Why tech companies are building leaderboards to measure who is spending the most on AI.
Kevin Roose
Well, Casey, for years now, we've been monitoring for signs of an AI job apocalypse.
Casey Noon
Yeah, we've been monitoring the situation, it's true.
Kevin Roose
And. And over the past few weeks, I think we've gotten some early indications that something is happening in the labor market, especially for tech workers.
Casey Noon
Yeah, we have certainly heard CEOs of companies announcing layoffs, invoking AI as a reason that it is happening. And so that has gotten our attention.
Kevin Roose
Yeah. So just a couple examples from the last few weeks. Last week, atlassian announced a 10% reduction in its staff, about 1600 jobs that they said were going to help them fund further investment in AI and enterprise sales. That came on the heels of a big round of layoffs at Block, the financial tech company formerly known as Square, which said that it was cutting its staff by about 40% or about 4,000 jobs, saying that they were shifting the way that they were working to use smaller and flatter teams. And then the big one that folks are expecting maybe as soon as this week is that Metta is reportedly poised to lay off 20% or more of the entire company. This was reported by Reuters last Friday, who said that their sources had told them that Meta was preparing to cut as many as 16,000 jobs, the largest layoffs at that company since late 22 or early 2023 when they laid off 20,000 people. So as of this recording, that hasn't happened yet that we know of. But I know that people at Meta are very on edge and are awaiting the further news about their jobs.
Casey Noon
Meta, after this story came out, told Reuters that it was, quote, speculative reporting,
Kevin Roose
which, if you're not familiar with the language deployed by Meta Communic staffers, means this is happening, but we don't want to tell you it's happening yet.
Casey Noon
Correct.
Kevin Roose
So, Casey, I want to hear what you make of these layoffs, but first we should do our disclosures. I work for the New York Times, which is suing OpenAI, Microsoft, and Perplexity,
Casey Noon
and my fiance works at Anthropic, so.
Kevin Roose
Okay, Kasey, what do you make of the fact that all these companies are referencing AI in some way as a reason for their layoffs?
Casey Noon
Well, I think it's a little different at each company, Kevin, and I think we can make a decent case for and against the idea that AI is really driving the show at each of them. So maybe we should get into that. But at the highest level, I would say companies do continue to tell us now that AI is a significant factor in the reduction of these workforces. And sooner or later, I do think we're going to have to believe them.
Kevin Roose
Yeah, I think this is the early warning sign for a lot of people, especially in the tech industry, who are, I think it's fair to say, going to be some of the first people to see their jobs change or disappear because of these new AI tools. But let's get into some of the specifics here. So, Kasey, let's start with Atlassian, the first company I mentioned. Their CEO, Mike Cannonbrook, said in a company blog post that the bar for what great looks like for software companies on growth, on profitability, on speed, on value creation has gone up. He said we are choosing to adapt thoughtfully, decisively and quickly to drive durable, profitable growth. He claimed that AI was not replacing people, but he said it would be disingenuous to pretend that AI doesn't change the mix of skills we need or the number of Roles required in certain areas.
Casey Noon
Yeah. So I take him at his word. It seems like he himself is trying to walk a middle path there. Right. And sort of not denying that AI is a factor here, but also not saying, like, this is the only reason this is happening. I think some other context that is worth having is, is that Atlassian is one of the companies that could be part of what we've been calling the SaaS apocalypse around here. Right. This is a company that makes tools for businesses. A lot of its products are essentially structured workflows. And there are those who believe that sooner or later you're just going to be able to code your own pretty cheaply. Now, maybe you will still choose to buy a product from a company like Atlassian, but maybe you're not going to be willing to pay nearly as much as you would before. And so the company's stock price has just been battered over the past year. And I think that has left them, one, hurt for cash a little bit, but two, and probably more importantly, looking for a different story that they can tell the stock market about what they're doing. And so today that story is we're going to get rid of some of these workers and we're going to figure out how to make our remaining workers more productive.
Kevin Roose
So there's this term that's been floating around called AI washing, which is basically when a company wants to lay a bunch of people off or maybe they don't feel like they need as many people.
Casey Noon
And I thought it was when a software engineer finally took a shower.
Kevin Roose
And basically the thesis is like, these aren't really layoffs about AI. This is just sort of a convenient excuse that these companies are using. Do you think Atlassian qualifies as AI washing?
Casey Noon
I would like to get a little bit more detail on exactly who they are laying off here, which is a detail that we do have about some of these other companies that helps us answer that question. So I don't know exactly how it is happening inside of Atlassian, but I think that their CEO was relatively straightforward, as these things go, in saying, like, it's a little bit about AI, it's entirely about AI, but like, yes, keep your eye on AI. So to me that just reads as honest. And so I'm going to give them a pass.
Kevin Roose
Okay, let's talk about Block. Jack Dorsey, the CEO of Block, gave an explanation about their layoffs. He said, quote, we're not making this decision because we're in trouble. Our business is strong, but something has changed. I had two options cut gradually over months or years as this shift plays out, or be honest about where we are and act on it now. I chose the latter. Casey, your take.
Casey Noon
So something to know about me and Jack Dorsey is I have a bit of a bias against him. As a former Twitter user who misses that website dearly at this point in 2026, I would not hire Jack Dorsey to run a lemonade stand. Okay. But if you want to talk about Block specifically, this is a company that tripled its headcount from about 3,800 people in 2019 in what seems like just kind of classic like inattention to what was happening in the business during pandemic era boom times. And I wonder if you saw this detail because it truly took me out. Kevin. Five months before they announced the layoffs, Block spent $68 million to fly 8,000 people to an in person event with Jay Z.
Kevin Roose
Come on.
Casey Noon
Yeah, so that's the kind of famous attention to detail that has turned Jack Murphy into one of the greatest visionaries in tech. So look, is this about AI again? You know, what does Block really do? They have those little iPads at the coffee shop and then they have Cash app. Okay. How many people do you really need to run those products? Probably fewer than 10,000. Is that about AI? I don't know. Maybe if you squint. But again, this is a company whose stock price was cratering. They needed a different story to tell the market. And I do think you can make a case that AI will make the remaining workers more productive. So again, this is another one where it's like you could use AI to justify what's happening, but you also could just say this company has been mismanaged for a while now.
Kevin Roose
Yeah, you could use AI washing or Jay Z washing, which just seems to be what they. What they are doing here.
Casey Noon
Yes.
Kevin Roose
So this did seem to have an effect on their stock price. In fact, the day after Jack Dorsey announced the layoffs, Block stock shot up 17%. It's gone down a little bit since then, but they're still up from where they were before these layoffs. And I think we should just say, like, this is also a part of the equation here. Right. These are companies, largely public ones, that have investors attention. And right now there's sort of this narrative power around AI where if you seem like a company that is investing heavily in the AI tools and the AI way of working, your investors say, oh, that company is really forward looking. They must have a plan for how to navigate this transition. And so I think there's sort of they're seeing the power in telling the story that all this is related to AI.
Casey Noon
Yeah. Which by the way reminds me of like the peak of cryptomania when like some public traded companies would just add like a crypto term to their name and their stock price would shoot up by like 40,000%.
Jasmine Sun
Yes.
Casey Noon
It turns out that the public markets actually can just be tricked that easily.
Kevin Roose
Yes.
Casey Noon
That would give me some relief if I was a CEO, just knowing that I could fool people like that. But anyways, so let's talk about the
Kevin Roose
third large tech company that is reportedly conducting layoffs. Metta, we don't know exactly who or what teams are being affected by these layoffs, but this is a significant part of their workforce and they seem to be saying in their communications with the public what all of these other companies are saying, which is we are going all in on the new way of working and we are going to have to make some cuts to. To make that work.
Casey Noon
Yeah. On a recent earnings call, Mark Zuckerberg said that, quote, projects that used to require big teams now can be accomplished by a single very talented person. And, and we should also say that this cut is coming alongside this massive AI infrastructure investment. Right. They're going to spend $135 billion on capital expenditures this year. And even for a company of meta size like that is real money. Right. So I know that they're trying to be careful again, trying to not spook the stock markets too much. This is obviously the biggest bet in the company's history. And I think that making some substantial cuts are going to signal to the market like, hey, don't worry, we're not like completely losing our minds here, like we're going to keep some of these expenses under control.
Kevin Roose
Yeah, I think that's a really important point because what we're seeing here at some of these companies is that they are not actually sort of cutting costs in the aggregate by using these tools. They are just shifting the cost from human labor to AI.
Casey Noon
Right.
Kevin Roose
They are plowing this money that they are going to save by laying off these thousands of people into the building of data centers and other AI infrastructure. And basically the bet they're making is these new AI workers are going to be faster, more efficient, maybe cheaper in the long run, maybe not, but they are going to be able to do the work that used to require many thousands of people. And that is a profound shift in the way that companies are talking about their workers. I recently talked to a venture capitalist who said that a lot of the AI startups that he sees the most AI native companies are spending more on AI tools than they are on payroll. And that may be an outlier, but I think that is sort of where these companies believe that we are headed. Where the majority of your expenses will not go to paying the salaries of human workers. It will go toward buying the AI tools and the tokens that your company runs on.
Casey Noon
Yes, I think that's absolutely the bet that they're making. I also just think it is worth noting that this is still purely, mostly speculative. Right. Like in the case of Meta specifically. This is a company that has arguably been struggling when it comes to AI. They had to abandon their last model, Behemoth, because it wasn't very good. The Times reported last week that it's delaying the release of its latest model, Avocado, because it hasn't been hitting its performance targets. It's apparently barely outperformed Gemini 2.5. What is this, Last March?
Kevin Roose
Yeah, that model is really the pits. That's an Avocado joke.
Casey Noon
That's very good. Thank you. So, again, this is not as simple as saying they're able to cut 20% of their workforce because they've just made these massive gains. I'm sure there are individuals there who have made massive gains, but as a company, it still seems like it is somewhat mired in dysfunction. They just did yet another partial reorg of their AI teams, and that just always sort of makes me raise my eyebrows.
Kevin Roose
Yeah. I will say, like, one thing that's been surprising to me about this recent round of layoffs is that the companies that are making them are not. Not the ones on the frontier. Right. It is not the open AIs, the anthropics, the Googles, those companies are not laying off people en masse because of these AI tools which they are building and presumably have even better models than the ones they're releasing to the public. So you have to think that part of this is just companies that are sort of lagging behind their competition, saying, well, maybe if we just use a bunch of AI, it'll help us catch up.
Casey Noon
Yes. But also, like, OpenAI and Anthropic are much smaller companies than some of the ones that we've been talking about today. At least a number of workers. Right. Like, I think it. It is interesting to think that Atlassian is, like, bigger than OpenAI in terms of the number of people who work there. When you look at the, you know, relative, like, value of what. What they're generating.
Kevin Roose
DocuSign has 7,000 employees.
Casey Noon
There is no funnier sentence that is true in all of tech journalism. As somebody who. Who has a paid subscription for DocuSign that I. That I truly resent paying for. Get to work over there people or
Kevin Roose
get not to work.
Casey Noon
Get not to work. Here's another question that I would ask Kevin. Okay, so we're seeing a bunch of layoffs. Like, are these AI related or not? Does it actually matter if the effect on workers is the same? Right. Like, you know, if you're the worker, like, whether it's about AI or not, you're still out of a job. Yeah.
Kevin Roose
And it's not clear to me what workers can or should be doing to sort of protect themselves against these layoffs. One person I talked to said, you know, they're. They work at one of these big tech companies, and they're like, well, there's just a lot of jostling and fear and anxiety right now. People don't know if they should be, like, using the AI tools a ton because then it shows that they're, like, getting with the program or whether that just means that they're proving that their work can be automated. Like, I think there's a lot of fear and suspicion and mistrust inside these companies right now, and for good reason. Their executives are planning to lay them off.
Casey Noon
Yes. And by the way, I think at at least some of these companies, that is. Is maybe not an explicit reason for these layoffs, but some of the executives there would see that as a positive byproduct.
Kevin Roose
Right.
Casey Noon
Because, you know, if you're like Mark Zuckerberg, you lived through the 2020 era. You had these restive employees that, like, wanted a lot of things from you, and they wanted to have a lot of control over what the company could and could not do and how it did it. And, you know, I just know that executives over there really resented that sort of thing. And once Meta entered this new era of massive layoff, employees over there did get really scared for all of the reasons that you would assume they were like, oh, God, like, you know, maybe I actually am going to lose my job. And all of a sudden, they got a lot more quiet, and you started to see a lot fewer protests over there. So I'm not going to say that, like, these occasional mass layoffs are a way of, like, keeping the workforce in line, but I have noticed that it seems to be having that effect totally.
Kevin Roose
And it makes me wonder whether something that I predicted was going to happen, you know, a year or two ago that did not happen, which is the sort of sudden and Mass unionization of workers at these companies may actually start to happen in the next year or two. I think one major difference between what's happening now at these tech companies and what has been happening for decades at manufacturing companies, car companies, you know, factory workers, is that those workers were by and large unionized. And so when the employers said, hey, we're going to lay a bunch of you off, they were able to negotiate. They were able to say, hey, maybe instead of, of laying us all off, maybe you could find other jobs for us. If our jobs are being automated, maybe we should be allowed to sort of retrain to do something else. And that was largely successful. There were still layoffs, of course, but not the number that we're seeing today at these tech companies. So do you think there's any possibility of that, or is that just sort of a union fever dream?
Casey Noon
Here's what I will say. I cannot think of anything that would make Mark Zuckerberg more mad than a union of software engineers at Meta. And I think the software engineers at Meta should use that information how they will.
Kevin Roose
You think that would make him more mad than getting booed at a UFC fight?
Casey Noon
ABS Absolutely. I think that probably just made him really sad.
Kevin Roose
Well, there you have it. If you want to make Mark Zuckerberg mad that employees sign your union card
Casey Noon
when we cut back, why aren't chatbots as good at writing as I am?
Kevin Roose
We'll ask Jasmine Son.
Advertisement Voice
Most all in one HR systems are a patchwork of disconnected and manual tools. Rippling is totally automated. If you promote an employee, Rippling can automatically handle necessary updates from payroll taxes and provisioning new app permissions to assigning required manager training. That's why Rippling is the number one rated human capital management suite on G2, TrustRadius and Gartner. If you're ready to run the backbone of your business on one unified platform, head to rippling.com heartbreak and sign up today. That's RIP P L-I-N-.com hardfork to sign up. Framer is a website builder that turns.com's from a formality into a tool for growth. Whether you want to launch a new site, test a few landing pages, or migrate your full.com, framer has programs for startups, scale ups and large enterprises to make going from idea to live site as easy and fast as possible. Learn how you can get more out of your.com from a framer specialist or get started building for free today@framer.com hardfork for 30% off a framer pro Annual plan rules and restrictions may apply. The right technology can strengthen human judgment. That's why Deloitte brings together AI and data analytics with multidisciplinary teams who can help you connect the dots across your enterprise, from risk to operations to customer needs. So opportunities don't slip by and surprises don't spread. Because the smarter your systems, the sharper your instincts. That's how technology makes people better at what they do best. Deloitte Together makes progress. Learn more@deloitte.com Together makes progress.
Kevin Roose
Well, Kasey, over the last couple of years we've talked on this show about how AI models are getting better at so many things. They are getting better at coding, at competition math, at solving novel physics problems,
Casey Noon
mass domestic surveillance, autonomous weapons.
Kevin Roose
And I think the story of the last few years in AI has been one of sort of rapid, steady progress. But these systems are still sort of jagged and they have flaws and weaknesses. And one place where they arguably haven't improved that much is in writing.
Casey Noon
Now, that's our domain.
Kevin Roose
Yes. At least that is the argument that Jasmine sun made in the Atlantic this week. She is a freelance journalist. Her piece was called the Human Skill that Eludes AI and it's her attempt to understand why, despite so much progress in all these different areas, the models of today don't seem to be writing anything particularly good or compelling. Yeah.
Casey Noon
And while I think the question of our LLMs good at writing is highly subjective and dependent on the use case, I do think Jasmine makes a really interesting technical case for why these models write the way they do.
Kevin Roose
Do. Yes. And we should say before we bring her in, Jasmine is a friend of mine. She has also been my researcher on the upcoming book that I'm working on. And I just think she's like one of the best people writing about AI today. She writes on her substack, which is called Jasmine News. It's J A S M I News, and you can read much more of her writing there.
Casey Noon
All right, I'll allow it. But I do want to balance it out by next week, bringing on one of your enemies.
Kevin Roose
Okay, let's bring her in. Jasmine Sun. Welcome to Hard Fork.
Jasmine Sun
Thanks for having me. I'm excited.
Casey Noon
Hi, Jasmine.
Kevin Roose
So you wrote this great piece in the Atlantic this week about the human skill that eludes AI and I want to start by challenging the subtitle of your piece. Why Can't Language Models Write well? Can't Language Models Write well?
Jasmine Sun
So I do say in the piece that most writing period is very bad. And so I think that language models are definitely better at writing and language than most humans are. But the question that I was really curious about is why can't they write at a sort of literary, creative fiction level? Because the thing is, if you listen to these AI leaders talk about their aspirations, they say, we're going to cure cancer, we're going to solve physics, we're going to build a superhuman coder. They are not shy about, oh, our AI models are going to be better than 75% of human coders. They're saying, no, we will literally build a self replicating factory tomorrow. And then Tyler Cowen asked Sam Altman in an interview from last October, when do you think GPT will be able to write a Neruda poem? And Sam Altman says, maybe in the future ChatGPT will be able to write, quote, a real poet's okay poem. So that was the thing that fascinated me is even these guys who are more bullish than anybody else about the capabilities of their technology, they are very reserved about how much literary writing their models can do. And so that was the gap that I was really interested in.
Kevin Roose
And you start your piece with this interesting, interesting provocation, which is that in some ways, GPT2 was the peak of AI when it comes to creative writing. So explain that.
Jasmine Sun
Part of what got me interested in this piece was I was actually doing research for your book and I was going through all of these previous generations of models and reading the outputs. And the thing that really shocked me is that like, in a way, the writing style of GPT2 and GPT3 I found so much more compelling than ChatGPT today. It doesn't have any of the annoying ticks, it doesn't have the M dashes, the tripartite list. That it's not this, but that the tone was much more variable. Like, it would actually surprise you. It would be funny, it would be poetic. And that shocked me to sort of like go back a few generations and realize that maybe, you know, they were also lying all the time and all sorts of other things. But from a writing style perspective, I kind of preferred it. And I wanted to investigate.
Kevin Roose
They were weird.
Casey Noon
That shocks me. To me, talking to GPT2 was like talking to somebody who had just fallen down the stairs. You know what I mean? Where it was like, I think, do we need to get you to the hospital?
Kevin Roose
Do you smell toast?
Casey Noon
Yeah.
Kevin Roose
There are these amazing prompts for this, like, early opening AI prompt library where they would say, like, you know, I just won $175,000 in Las Vegas, what do I need to know about taxes? And GPT2 would say like start just writing some short story about like an orphanage.
Jasmine Sun
And like it was like surprising. Yeah, like nutty. They were, they were, they're weird. They would absolutely be a terrible corporate assistant. Horrible, like coding intern. It can't do any of the things that modern alums can do that I'm very grateful for. But like from pure writing style perspective, they're very good. So GB3 in particular, like they. There's. I found this like set of samples that some guy did where it's like, oh, write in the style of Paul Graham, write in the style of Richard Dawkins, whatever. And it could style match much better than modern LLMs can. And particularly because so much of sort of literary writing comes from voice and style. That was one of the things I was really interested in is like what did we lose that the LLMs can no longer emulate Paul Graham's style or whoever's style? Because I would put in the same exact prompt prompt that this guy gave GPT3, put it into ChatGPT5.4 thinking or whatever and it would be God awful. And I was like, that's really weird.
Casey Noon
So tell us about what you learned about what happened after the GPT 2 and 3 era that changed the way that these models respond to us.
Jasmine Sun
Yeah, I mean, I think the answer is post training basically. So they started adding a post training layer which is basically saying we have these like crazy, unpredictable, like nut job concussed models and they need to learn how to behave because a model that can't behave is a very bad corporate assistant. And so the AI researchers give them example dialogues and scripts to learn from. They give them words that they can and can't say. They do RLHF, which is a process by which human graders will rate which response is the most helpful sounding or something like this. And so now these post trained models have been trapped in a way or trained or guided towards a very particular character or Persona that is a very helpful assistant, but might be very, very bad at writing in creative and surprising ways.
Kevin Roose
I mean, the way that you described it was that there is a phase within the post training phase where these AI models are evaluated by humans. And that's part of what they call RLHF or reinforcement learning from human feedback. And what struck me in your reporting is that you actually talked to some people who have done this kind of feedback to the models who say that they're just being asked to grade things in ways that don't make Makes sense. Yeah, right. Tell us about that.
Jasmine Sun
Yeah, I mean, this is super interesting because, like, these jobless things you'll see on like, places like Mercur or Xai Elon's company will list them directly. I'll be like, Creative writing expert, $45 an hour. Must be a New York Times bestseller. And have like a star Kirkus review or something like this.
Casey Noon
Have you ever gotten a star Kirkus review?
Kevin Roose
I think so. Okay, good. Sure.
Casey Noon
All right.
Jasmine Sun
You might qualify to help. To help Annie from Grok, Right? A little bit better.
Casey Noon
Yeah, we're going to get in that job listing. But okay, you were saying?
Jasmine Sun
Yeah. So anyway, so these companies, because they realize that these AI research, they're really good at knowing what good coding is, but they don't actually know what good writing is. So they're like, why don't we hire some humans to find out? And so they'll commission like, MFAs and published authors and sometimes just like random guys with a blog or whatever. And one of the people I talked to who was a contractor for scale AI as a writing evaluator, and he was doing this for one of the bigger labs, he said that the rubrics just didn't make any sense. He would be told things like, you have to grade them based on the number of exclamation marks that there are. And so if something has three exclamation marks, that's too many, and so you have to ding that one.
Casey Noon
Yeah. And I have to say, generally not bad writing advice. I mean, I guess it depends on the length of the text, but three feels like a lot for many scenarios.
Jasmine Sun
This is what they tell women in business communications. It's like, take all those exclamation marks, replace them with periods. Like, we are going to remove all
Casey Noon
of the ideas we teach women to shrink themselves.
Jasmine Sun
Exactly. But yeah, so he was sort of like being asked to grade these things or another one was he got a bunch of fan fictions and he was supposed to grade them on their factuality. Since that was one of the criteria. I do imagine that one could, you know, devise better rubrics than this particular evaluator was given. But I think it does show at least that some of these, like very big companies that are very well resourced, simply do not know how to think about what good writing is.
Casey Noon
Briefly, like, like, I want to underline that because to me that seems like the whole story. We are taking the entire Internet and we are grading it on factuality. And like, so the. The LLM that you're going to get out of that is just probably not going to be all that creative.
Kevin Roose
Well, and I wonder how much of it is related to this sort of verifiable RE system that a lot of these companies are using where you, you have a system generate a bunch of code and then you have another evaluator model check the code to see whether it's good or not. And that works in domains like programming, where the code either runs or it doesn't. But creative writing doesn't work that way. You can't have an evaluator tell you, you know, with any sort of consistency whether something is good or not. And so it may just come down to preference. And so I guess I'm curious, like, do you see this as a technical problem that the labor are frustrated trying to solve or is this just demand related? Is this just what people want chat bots to sound like? And in every test where they pit different models against one another, the one that sounds like a bland corporate assistant wins. And so they go with that.
Jasmine Sun
I think both are true. It's like the majority of writing that we are asking the models to do is write this email for me, right? And like they excel at that. They are truly great corporate email writers. They are much better at the whole like passive aggressive thing than I am. At the same time, I do think, like you said, there is a technical challenge that has to do. Do largely with verifiability is like there are people who have spent decades of their lives attempting to articulate what makes Shakespeare Shakespeare or what makes a Neruda poem a Neruda poem. And they will still not know in any kind of certain way, they will still get into debates with their fellow academics and literary critics about which writer is better than the other. And because these things are subjective, because they are ineffable, because they are hard to put in a rubric. And that is the nature of art.
Casey Noon
And to that point, you know, you started this segment by talking about Sam Altman saying, like, hey, you know, we just basically can't write a great poem yet. Sam Altman a year ago said the company had trained a good creative writing model and posted a short story on X. Many people found it compelling. Is Sam Altman just not being consistently candid with us? Jasmine?
Jasmine Sun
Ooh, wouldn't be the first time. But that short story, if you remember, I'm sure you guys recall, had some great lines like talking about, about the seams of mirrors or Thursday the. What was it?
Kevin Roose
It was like the liminal almost Friday or something.
Jasmine Sun
Yeah, the liminal day that takes almost Friday.
Kevin Roose
Wait, I had to actually look this one up because it was so good.
Jasmine Sun
While you're looking it up, like, you know when the thing about AI writing is like, it comes up with all these fun metaphors and they are like kind of surprising sometimes with the metaphors, but also the language is not grounded in the life. And that was my other thing is, aside from the verifiability, fundamentally, when I think about the writers who I really love, when I think about whether it's journalists or poets or whatever, like they are writing from life. Right? Like a journalist goes out and talks to and they like see stuff and observe the color of the sky in a particular way. Or a poet is thinking about personal experiences that they've had. Their writing has stakes. It comes from an emotional place. And the fact that LLMs, while being very talented, grammatically pristine, whatever, they don't have lives. That means that all of the metaphors they choose, all of the words they choose, the examples they choose, they're just ungrounded. It's not coming from a point of view or a particular experience or a particular community that makes the writing believable. I think part of what voice and style is that it is very specific to the life that a person has had and LLMs cannot get there in the same way a human who hasn't really lived that life, like, cannot get there.
Casey Noon
I don't know. I feel like it's case dependent. You know, I'm a big music fan and over the past few months I have enjoyed putting questions about music and in particular the sounds of certain bands to an LLM, which sounds like a joke prompt because an LLM has never heard anything. Right, Right. And yet I find that in general, the models can have good conversations with me about the sound of music. Now, it may be that they are just pattern matching based on a bunch of public writing on the Internet by people who do have ears and have heard. Right. Like, I'm very open to that. But I, I, I again, I, I have just sort of been struck about the way that it is able to like, sort of write about sensory topics in an evocative way that at least to me like surpasses what I want would predict they would be able to do.
Kevin Roose
Yeah, I want to pose a couple objections that I think someone might make to your article. One of them is, this is Cope, this is Jasmine, a writer, a very talented writer, sort of finding the things that AI in her view is not good at yet and saying, this is categorical proof that they will. It will be very hard for AI to do these things. This is the same reaction that software engineers had when models started getting really good at code. They would say, oh, well, I can't do these other 10 things that I do. And that basically, basically just wait a few years and the models will be better than all of us at everything, including writing.
Jasmine Sun
I would love for it to be cope because I try to automate myself away all the time. I have no sort of deep attachment to having to. I like writing, but I have tried over and over and over for the past three years to automate my own job away and to get Claude to do my job for me. It cannot do it. This is very frustrating. It's not out of a lack of trying. And again, I'm going back to the, the CEOs themselves and the things that they themselves are saying, right? Like it's not just me a writer, it's Sam Altman saying, this thing will cure cancer and solve physics, but it will not write better than a real poet's okay poem. And so like, I think, like that suggests that there is something that is at least perceived as a little bit different. I think it's very possible the models will get much better at writing over the next few years. I don't think it's like a never thing. I do think that, you know, like, reporting is hard to replicate. I think that, like having life experiences that are real and verifiable is hard to replicate. I think the style stuff can be improved, especially if you fine tune the models. But I think what's also interesting to me about this piece is that it shows how the, the market incentives, the demand incentives of these companies do shape what we see their abilities are today.
Kevin Roose
The other objection I'm imagining people might have who are very AI pilled is, is that this is all in the eye of the beholder. Right? There have been several studies now that have shown that if you give people a blind taste test of AI writing versus human writing, they prefer the AI writing until you tell them that it's AI writing and then the value in their eyes plummets? I did one of these in a New York Times quiz just recently. So is it possible that the models have already become superhuman at writing, but that the minute we learn that they are AI models generating text and not humans writing words with their fingers, we lose all interest in it just because of the source, not because of the quality of the writing?
Jasmine Sun
I mean, I think it's definitely interesting and true that people don't want to like AI writing. And that is part of what bothers them when they see AI text. That is obviously AI. Even though, like you said in these quizzes and tests, AI can outperform human writers in those narrow scenarios. I mean, my quibble with a lot of these quizzes and tests is that as a writer, and you guys are writers too, how much of your job is actually text generation? I think AI is a superhuman text generator, right? My job, I am generating text probably 25% of the hours in my day. I spend a lot of time interviewing people. I spend a lot of time coming up with ideas. I spend a lot of time reading. And not just reading indiscriminately, but reading very particular sources that feel like the right ones. And so like, you know, usually at the point that you are doing one of these tests, you're saying like generate like one paragraph very specifically about like why Trump won the 2016 election, 500 words or less. And like you've already given the prompt, which I think is a critical part of writing is like, what are you going to write about? You've often like supplied some of the evidence and the guidance in the form of it saying like 500 words or less. And at that point I do think that AI is probably a better text generator than almost all humans are. But again, when I think about, about it, you know, AI is still very bad at coming up with ideas for articles. It is still very bad at reporting the non text generation parts of the role feel further away from automation. Again, like I'm not a. Never say. Like, like I'm, I'm sort of like never say never, like maybe I'll get there. I would be totally happy if, you know, Claude was able to give me good ideas for my next essays, but it's not there yet.
Casey Noon
Well, we're, we're already seeing the LLMs make huge progress in genre fiction, right? So like recently on the show, we talked to the author of a story in the Times about how authors of romance novels are now able to generate dozens of novels a year using LLMs. In fact, much of the discussion that we had was around how you just have to prompt them differently and sort of relentlessly in order to get what you want, you know, Your piece, Jasmine, made me wonder like how much of getting a model to just write weird can be achieved by repeatingly telling it in different ways. Hey, be a little weird, Weirder.
Jasmine Sun
Some of it, but not all of it. I mean, so I talked to for example, James Yu, who is the Co founder of Pseudo Write, which is one of the earliest creative fiction AI writing assistants. I talked to some other folks who similarly were in the fiction writing LLM space and like you said, to an extent, a lot of writers are already using these. A lot are already leaning on LLMs to generate large amounts of text and it can be very successful and it can meet readers needs and whatever. But like, even these people who I was talking to, they were describing the to me how freaking hard it is to undo all of the post training that the labs have done. So they are applying immense amounts of engineering effort that clearly, in my conversations with them, clearly frustrates them that it is so hard to get these models to stop being so chirpy, so sycophantic, so PG13 and everything in order to get them to this sort of base model state where they're able to be weird again. So I think it's certainly possible, but I think the labs have made it quite challenging just because of the way that these models are trained. The other thing that I think is important is I tend to think that writing and a lot of creative work is actually like the perfect use case for these centaur models, right? Like the idea that the human plus AI collaboration is where you can get the furthest. And when I listened to the interviews that you guys did about the fiction authors, I was thinking, this is a centaur model, right? Because without the human prompting and bullying the AI into getting weird and getting sensual and whatever, it was not going to do that on its own. And I myself, I do use LLMs as a research assistant. I wrote about that Inside the Atlantic piece about the way that Claude has now sort of helped me edit my own work in a way that I found incredibly useful. But I do feel like the collaborative element is important for any domain where the personal perspective, lived, experience, whatever really matters.
Kevin Roose
Talk about that a little bit. You mentioned your editing process. How are you using AI to help you edit your work and are you finding it useful?
Jasmine Sun
Yeah, so I feel like I really cracked this over the last like couple months, which I'm very excited about, because again, I've tried to make these things like write and edit for me over and over and over, and they've never really been able to do it. So the thing that I realized was if I make Claude into an editor that is not just trying to grade and give feedback on my work against some genericized standard of what good writing is, but actually we. What we did against basically what my personal. Jasmine's personal aspirations for writing Are it can give feedback that I find much, much more helpful. So what I did was basically I fed Claude my entire substack archive of the writing that I've previously done, as well as some of my freelance work.
Casey Noon
And just to get real specific, is this inside like a cloud project or how have you set this up? Because I know, like, our listeners are going to want to try this.
Jasmine Sun
Yes, I did it in a project, but on Claude's advice, I was like, do I need to cloud code something? Claude was like, no, that's overkill. So you don't need to code or anything. So in a cloud project project, I gave it my whole archive of writing. I also personally write retro notes to myself after everything I publish. So I have a notes app that's just like me writing what was good and bad about everything I've ever written. Just a few bullet points.
Kevin Roose
This is why Jasmine's gonna be our boss.
Jasmine Sun
I mean, these are very low quality bullet points. But I also gave it that because I wanted to learn my taste, I wanted to learn what do I aspire to be and where do I see myself falling short and where. What am I proud of, right? And so from those two things, plus a little bit more information about, like, here's my audience, this is my beat, this is my goals, we were able to co develop a rubric of. Instead of like, how many exclamation marks does it have? It would say things like, does this take advantage of your quote unquote, like, insider anthropologist position in Silicon Valley? Because that's one of the things that Claude and I think distinguish my voice or it'll also notice like, oh, Jasmine, you tend to move between registers. You'll switch between, between startup jargon and Internet slang and whatever. And I think the fact that you can do the high, low or move from policy to personal scene, this is something that is characteristic of your writing. And so again, we're co developing these qualitative criteria and then I split it into phases of ideation, phase, rubric, structure, rubric prose, rubric, final fact checking. And so what I do now, I put this all in a cloud project. I said, your job is to evaluate my drafts based on this criteria, but not to do the writing for me and to make sure to prompt out of me what I can do better. Better. I dumped a draft into Claude. Claude will run like phase two structure on it. It'll say things like, your conclusion is just a summary and this is really boring. In fact, in your piece about this and that, you actually ended on a Scene. And I thought that was much more powerful. So why don't you try ending this one on a scene? And Claude will say, rather than inventing a scene, it will say, what were you thinking when the plane took off? What were you feeling inside? Can you think of a scenario where you had a conversation with, say, a kid's safety advocate about AI that really resonated with you? Because right now it sounds like Dry Policy explainer. And that feedback I actually found incredib useful. Like, I'm still applying my own judgment to say, do I take it or not? But I'm like, you know, this is about me becoming the best version of myself as a writer. It's about, like, me self improving and Claude pushing me to do that, which I found much, much more helpful.
Kevin Roose
I want to ask you both a question. As fellow writers. Do you feel the impulse to make your writing weirder because of AI to sort of stand out from the sea of slop? Because I find myself feeling this tug of like, oh, that's a little weird aside that probably I should cut, but I think I'm gonna leave it in. Cause, like, Claude would never do that, right? It's like. It's like a marker that I am typing these words, and I feel like that's sort of my imprimatur that I'm leaving.
Casey Noon
My answer to you is, yes, I absolutely feel that way. And I've, like, gone back and tried to edit sentences to like, make them feel a little bit more, like, weird or like, in particular, to make them sound colloquial in a way that I know, like, an LLM generally would not be. And like, yes, it is for that reason, I think that writing right now, like, we're all, not all, many of us are on such high alert for the prospect that we might be reading slop that I think if you are a writer who does not want to be producing slop, like, you should be asking yourself that question.
Jasmine Sun
I. I think it makes me a lot more comfortable writing the way I want to write in the first place. Like, I think, like, maybe unlike both of you, I didn't sort of come up through newsrooms where I was, like, learning a very specific house style and all of these norms. Like, I can do news writing now. It's something I've learned now. But, like, I'm actually much more, quote unquote, like, Internet blogging native, which is a form that is voicey and irreverent and not as pristine and will make inappropriate jokes. And it's just a looser form of writing. And so I think what it's actually done is made me more comfortable doing the bloggy thing instead of sort of always trying to write in a more professionalized journalistic tone.
Kevin Roose
So I think we should leave this with a question for you, Jasmine, which is your piece makes the case very convincingly that today's AIs are not very good at the kind of writing that I think we all value. Do you think they will get there? And what should the companies do to make their models better at writing?
Jasmine Sun
I think that if we separate out text generation from reporting, which I'm not that bullish on the models doing, and we are just talking about say literary fiction or here's a bunch of interview transcripts, write a magazine feature or something, I think that if they applied as many resources towards that task as they do towards coding agents and things that actually make them money, I think that they could get there. Will the companies ever find it financially advisable to spend all their resources on that instead of automating 23 year old software engineers? Probably not. I would be grateful for that world. I don't need them to take my job or these folks jobs. But I think it's possible.
Casey Noon
Look, they're going to get around to it eventually. Okay? You know, it's like, I mean, I
Kevin Roose
hear you see what writers make in this economy, Casey.
Casey Noon
Eventually. Like those aren't going to pay for
Kevin Roose
a lot of data centers.
Casey Noon
No, there is equity, economic value in writing and eventually the AI companies will want that all to themselves.
Kevin Roose
You know what would be a very funny outcome of this? Taking your point about the sort of guardrails of the models, maybe the next great American novel will be written by Grok.
Jasmine Sun
Oh God, that's.
Kevin Roose
And with that. Jasmine sun, thank you for joining us.
Jasmine Sun
Thank you very much, Kevin and Casey.
Casey Noon
When we come back, everyone's spending money on tokens.
Kevin Roose
Kevin, Great. Keep going.
Casey Noon
You've gotta be token. Token maxing, that is.
Kevin Roose
Keep going. Yes.
Casey Noon
And when we come back, what are you token about? That's the question being asked by a leaderboard that's sweeping Silicon Valley.
Kevin Roose
The sweeping. It's really sweeping there she.
Advertisement Voice
Most all in one hr systems are a patchwork of disconnected and manual tools. Rippling is totally automated. If you promote an employee, rippling can automatically handle necessary updates from payroll taxes and provisioning new app permissions to assigning required manager training. That's why Rippling is the number one rated human capital management suite on G2 TrustRadius and Gartner. If you're ready to Run the backbone of your business on one unified platform. Head to rippling.com hardfork and sign up to today. That's rippling.com hardfork to sign up over the last two decades, the world has witnessed incredible progress from dial up modems to 5G connectivity, from massive PC towers to AI enabled microchips. Innovators are rethinking possibilities every day. Through it all, Invesco QQQ ETF has provided investors access to the world of innovation with a single investment. Invesco QQQ let's rethink possibility. There are risks when investing in ETFs, including possible loss of money. ETFs risks are similar to those of stocks. Investments in the tech sector are subject to greater risk and more Vol the more diversified investments. Before investing, carefully reading Consider fund investment objectives, risks, charges, expenses and more in perspectives@invesco.com Invesco Distributors, Inc.
The Browser is your business's first line of defense against online threats. Keep your employees and data safe with Chrome Enterprise, the most trusted enterprise browser. Create controls that enforce company policies like rules that prevent employees from printing, pasting, or sharing company data access in depth reports that show the apps and extensions your employees are using and where data is going and company and prevent phishing and malware attacks from reaching your employees with automatic proactive protections. Visit Chrome Enterprise Google to learn more.
Casey Noon
Well Kevin, you've recently returned from Book Leaf and are once again writing in the New York Times. How does it feel to see your name in print again?
Kevin Roose
Feels great. Hasn't happened yet, but when it does, it'll be great.
Casey Noon
Well, I got to take an early read at a story that you are publishing about the fact that tech companies have now created leaderboards to show show which employees are using the most AI tokens in their work.
Kevin Roose
Yes, it's a token frenzy out there, and the employees of these companies are competing among their colleagues sort of informally and sort of for fun. But they're taking it very seriously. They want to be the people at their company who are using the most AI tokens.
Casey Noon
So let me just ask a basic question for listeners who may not be familiar. What is a token? And why is that something you might start keeping track of?
Kevin Roose
So a token is the basic atomic unit of AI labor. It's basically a fragment of a word, and it is how AI model providers measure their consumption. So if you type in a prompt, you know, help me write this essay. An old model might have given you a couple hundred tokens in response. That would be a couple Hundred words. And what has been happening over the past year or so as these agentic coding tools have started taking off, is that the models are just much more token hungry. You can use now hundreds of thousands or even millions of tokens in a single session. And so that is what is propelling these leaderboards is the idea that the more sort of coding you're doing, the more agentic tools you're using, the more simultaneous processes you're running, the higher your token count will be.
Casey Noon
Measurement I found useful was that apparently it takes about 10,000 tokens to generate 7,500 words, if that sort of, you know, helps to ground you at all. But as you just said, and I want to hear more about this, the more advanced systems are using way more tokens than that. So tell me about some of the numbers that some of the sort of token all stars are putting up on the, on the boards.
Kevin Roose
So I don't know all of the exact numbers, but I did learn that at OpenAI, where they do track this kind of leaderboard, the highest employee token count over a seven day period recently was a guy who used 210 billion tokens. And this is for rough scale, about 33 Wikipedia's worth of text. And now all of that is not sort of typing and receiving a response. Some of that is what they call cashed tokens. So it's not all sort of, you know, being extruded from the model for the first time. But these are the kinds of numbers that I think even a year ago would have sounded completely insane right now.
Casey Noon
Was this guy working on a new mass domestic surveillance program for the Department of Defense?
Kevin Roose
I don't know. And OpenAI did not make him available for interviews. But what I wanted to do in writing this column was to try to call up a bunch of people or talk to a bunch of people who are in this sort of billion token club, right? The sort of extreme power users and just act, ask them like, hey, how are you guys using all those tokens? And isn't that very expensive? And how are you paying for it all? And I learned a lot.
Casey Noon
Yeah, well, okay, well, so tell us first of all just how expensive it is.
Kevin Roose
Very expensive. In fact, I heard that the top user of CLAUDE code, the top individual user of CLAUDE code as measured by anthropic, spent more than $150,000 on tokens last month. So extrapolate that. That is like an employee making more than a million dollars a year and they are burning that in a month. And I heard similar figures from Some of these other extreme coders who are spending something on the order of thousands of dollars a day on tokens from these models. Now, we should also say the employees of these companies get their tokens for free. Right. So they're not shelling out. Their companies are not shelling out, selling out. But at other companies, this is starting to become an issue because they are sort of outstripping their. Their budgets for these things.
Casey Noon
So there are companies where there are engineers who legitimately are costing their employers maybe $150,000 a week because they're getting tokens from one of the big providers.
Kevin Roose
Yeah, I talked to a software engineer in Sweden who said that he probably spends more than his salary on Claude. So this is essentially becoming like, a very expensive job perk for some of these coders.
Casey Noon
So talk to me about why employers want to create leaderboards to promote this to employees. Because I could see other companies saying, if you spent $150,000 on tokens last month, you actually don't work at this company anymore because we're bankrupt.
Kevin Roose
Right. So this was a big question that I had is like, why is this going on? And it seems to be some combination of. Of sort of employee motivation and worker tracking. Right. There are executives at these companies who think that the more tokens you use, the more productive you probably are. And as we discussed in a previous segment on this show, these companies are very eager to have their workers start embracing the AI tools. And so at a number of these companies, I talked to people who said, yeah, this is just basically them trying to see who is really all in on the new way of programming. Programming.
Casey Noon
And you've talked to a number of people who are ranking high on these leaderboards. I realize you probably haven't dug deep into their code, but what is your sense of how productive they actually are? Like, what is the relationship between token usage and taking my company to the next level?
Kevin Roose
I mean, it's very unclear. Right. Some of these people may be just generating, like, worthless projects. I think the thing that worries a lot of the people I talk to about these leaderboards is that they just incentivize you to, like, run up your token count, right? Yes. Because then you look like the special, you know, 10x engineer or 100x engineer who's, like, outperforming all your colleagues. So I think there are a number of companies that. That see this leaderboard business as a little strange and maybe counterproductive. But I do think that there is a feeling among the most sort of heavy token users that they are being productive.
Casey Noon
Yeah. I have to say, when I read your column, I thought this just seems like it would create the worst incentives. Right. There's this idea of Goodhart's law, right. Like when a measure becomes a target, it seem pieces to become a good measure. I can't think of a better way to ensure that tokens usage becomes a bad measure than creating a leaderboard for it.
Kevin Roose
Totally.
Casey Noon
What are the people inside the company saying about that?
Kevin Roose
Well, some of them are opposed to this whole leaderboard thing. I also talked with some folks who defended the leaderboards. They said, look, it's never been all that easy to track the productivity of programmers. Some people have had their productivity measured by like how many lines of code they generate or how many poll requests they made. These are sort of these imperfect proxies for like, how hard are you working, how much are you doing? But the employees of these companies also see this, I think wisely, as a key to their own success. A number of these companies are now using AI token use and consumption as part of the performance review cycle. So you go in for your annual review, your boss says, hey, it looks like you only use, you know, 70 million tokens, like last month. What's going on? And so I think the engineers of these companies are getting wise to the fact that if they want to have a long, successful career, they better start using some tokens.
Casey Noon
Yeah, but I imagine that some of them are really nervous about that though, right? Because like, it seems clear to me that at least some of these companies want to incentivize token usage because the companies themselves suspect that the more we can get them using this stuff, the less long we will have to employ the humans maybe.
Kevin Roose
Although I think it's less about, about like the AI systems replacing the humans and more about like it is just a radically different way of working. Right. These are people who most of them have had long careers in software engineering. They grew up writing code by hand. They maybe grew up using some sort of like AI assistant, like GitHub, Copilot. And what people at these companies are saying is that these agentic engineering systems are just really different. You have to approach them in a different way. You have to spend a lot of time with them to understand what they're good and not good at. That and to them this is sort of a way of motivating their employees to say, hey, go out and try the new thing.
Casey Noon
Yeah, I, I don't know. I've been thinking a lot about this question of like, if I were an engineer at one of these companies, and I had this incentive to get on the leaderboard. Like, how would I approach it? And I do think that, like, the instinct to just, like, waste a bunch of tokens, to. To, like, rise higher on the leaderboard, like, ultimately, if you rise too high, people are going to ask you what you did with all the tokens. If you're number one at like, 10 billion tokens and you only manage to, you know, like, you know, vibe co calculator or something, people are probably going to get mad at you. Yeah.
Kevin Roose
And I actually did talk to one person who speculated that actually the people at the top of the leaderboards are all doing side projects. They're starting their.
Casey Noon
They're starting hustle. They started a new company with. With the boss's money. And if you're doing that, I just want to say, I salute you. Like, that is the right way to work. Yeah.
Kevin Roose
Maybe don't be the number one on the leaderboard. If you're doing that, maybe try to stick around 6 or 7.
Casey Noon
Middle of the pack is kind of where you want to aim yourself. I mean, let me ask, is there any kind of token tracking that you think offer a reasonable signal? Like, do you think that if you're like a tech company, you should create a leaderboard?
Kevin Roose
No, I think that's a bad idea for all the reasons that we just talked about, including Goodhart's law, which is, I think this is just going to lead to people just wasting tokens doing side projects. But if I'm the budget manager at a company and I'm seeing that people are spending multiples of their salary on AI tokens, I'm asking them some questions about what they're doing with that all. And if their answer is not, I, I built an amazing new product that's going to generate billions of dollars a year in revenue. I'm trying to say, hey, could you maybe use a little less next month?
Casey Noon
Yeah. I have to say, I have been struck at how this idea of the token leaderboard just represents a new incarnation of something that the software industry has been trying to figure out for a long time, which is how can I figure out if my software engineers are productive? You know, I was talking recently to this very handsome software engineer who I'm engaged to about your column, and he was telling me that you. He used to be evaluated on how many lines of code he contributed. And he told me about all the games that people used to play back in the day with. Oh, you know, I, like, wrote a quick algorithm to, like, you know, translate a bunch of stuff into some new languages and it's like completely worthless. But it makes me look like I had a very productive week. And so I went back and looked into this and they were doing this in the 60s and 70s and there's this saying from the early days of computer programming that eventually a arises that says, quote, measuring programming progress by lines of code is like measuring aircraft building progress by weight. And I have to say I think that the same thing kind of applies here, right? That like, yes, if you squint and at the right level of abstraction, it's probably true that some people who are using a lot of tokens are more productive than some people who aren't. It just doesn't quite seem like the right way to measure these things. And I just wonder how quickly the industry is to going going to figure that out.
Kevin Roose
Yeah, I think it's going to be pretty soon in part because the budgets are just getting very ridiculous and, and especially the AI model providers are now seeing individual users consuming amounts of their services that entire companies would have consumed just a few months ago.
Casey Noon
You know, maybe the kind of last question I have for you about this is just what implications do you think it has for the broader economy? Right, because we know that in so many different sectors of the economy managers are, are saying I want to incentivize my employees to use AI and I want to track how they're using AI. So do you think that as knowledge of these leaderboards spread, we're going to see people in non technical fields try to adopt their own version of them?
Kevin Roose
I hope not. I think it's really a bad move not just for tracking actual productivity and output, but just for morale. Right. Like I remember years ago when like Gawker would have like a traffic leaderboard at their office so you could see how many clicks your stories were getting relatively relative to other people. And I don't think anyone who like worked there at the time thought that was like incentivizing the right things or creating like high morale among employees. Basically everyone was just competing with each other all the time. And I think in this case it's even worse because it's not necessarily even correlated with like any success. It's just pure sort of like, you know, how many agents can you run in a parallel swarm to sort of work 247 doing tasks of uncertain value?
Casey Noon
Which is a great question to ask on a first date in San Francisco too, by the way. But anyways, I have to say I worry that this idea of like Token maxing is going to spread into the broader economy. I was talking with somebody who works in marketing this week, and she was telling me that, you know, her job used to be evaluated solely on creativity. And then recently the performance review got a new AI section and everyone is being evaluated on how much AI did use. And like, from her perspective, she was kind of like, this was working fine, you know, like, I didn't need to use like an AI tool to help me, but now, like, you know, my bonus might be based on how much of it I use. So I think this thing is sort of already seeped out of the labs and is like getting into the water elsewhere. And I just hope that managers are like, really thoughtful about what they are incentivizing and that maybe, like, AI use for the sake of AI use is not going to be the boon to your company that you're hoping it is.
Kevin Roose
Yeah, I think it's going to be very case by case. I think there will be people who are token maxing who are way more productive than their colleagues and doing way more projects way more quickly. I think there will be other people whose managers look at their, like, token budgets and see, say you spent this many tokens on what? And we'll have to have some hard conversations. But I think it's very hard to draw with a broad brush and say, like, all of this token maxing is pointless productivity theater. It sounds to me from my conversations like some of it really is working for people.
Casey Noon
Yeah. Well, I will say on the flip side, I've also heard of people in my social circle who have gotten in trouble for spending too much on cloth. Really?
Kevin Roose
Yeah.
Casey Noon
And when I heard that, I was like, oh, like, your company's not going to make a bro. Like, you got. You got to spend on this stuff.
Kevin Roose
Well, what's so interesting is now it's becoming part of job conversations for engineering jobs. People are going into new jobs and saying, well, what's my token budget? And for the employees of these big AI labs who have unlimited free access to the models, some of them are using so many tokens that they effectively can't afford to quit their jobs. Right. Because anywhere else they would work would have to pay for their tokens and it would be completely unaffordable to employ them.
Casey Noon
Yeah, I mean, those sounds like real incentives and better than the ones that matter. Do you remember when Meta was spinning up a super intelligence labs and they said, you can sit really close to Mark Zuckerberg? If I were them, I'd be like, I'll Take the tokens. Thanks. Well, just to wrap this up, exactly how many tokens should a person use?
Kevin Roose
I think. I think that's you have to look within yourself.
Casey Noon
Look within yourself.
Kevin Roose
Yeah. Okay.
Casey Noon
Yeah. That's between you and your God.
Kevin Roose
Yeah.
Casey Noon
Yeah.
Kevin Roose
Do what Marc Andreessen will not and Introspect.
Advertisement Voice
The browser is your business's first line of defense against online threats. Keep your employees and data safe with Chrome Enterprise, the most trusted enterprise browser. Create controls that enforce company policies, like rules that prevent employees from printing, pasting or sharing companies company data access in depth reports that show the apps and extensions your employees are using and where data is going and coming from and prevent phishing and malware attacks from reaching your employees with automatic proactive protections. Visit Chrome Enterprise, Google to learn more.
Over the last two decades the world has witnessed incredible progress. From dial up modems to 5G connectivity, from massive PC towers to AI enabled microchips. Innovators are rethinking possibilities. And every day through it all, Invesco QQQ ETF has provided investors access to the world of innovation with a single investment. Invesco QQQ let's rethink possibility. There are risks when investing in ETFs, including possible loss of money. ETFs risks are similar to those of stocks. Investments in the tech sector are subject to greater risk and more volatility than more diversified investments. Before investing, carefully reading consider fund investment objectives, risks, charges, expenses and more in perspectives@invesco.com Invesco Distributors Incorporated for Small Businesses
Every hire matters, but the time and resources required to hire right are Limited. Luckily, LinkedIn Hiring Pro is built for that reality. Posting a job isn't always the hard part. It's finding, connecting with and screening the right candidates. Hiring Pro streamlines the entire process. Nearly 60% of hires find a candidate to interview within a week. Hire right the first time, post your first job and get $100 off towards your job. Post@LinkedIn.com hardfork that's LinkedIn.com terms and conditions apply.
Kevin Roose
Hard Fork is produced by Rachel Cohn and Whitney Jones. We're edited by Viren Pavic. We're fact checked by Caitlin Love. Today's show was engineered by Katie McMurran. Our executive producer is Jen Poyant. Original music by Rowan Niemisto, Alyssa Moxley and Dan Powell. Video production by Sawyer Roque, Pat Guenther, Jake Nickel and Chris Schott. You can watch this full episode on YouTube@YouTube.com Special thanks to Paula Schumann, Pui Wing Tam and Dahlia Haddad. You can email us as always@hardforkytimes.com send us your token budgets.
Advertisement Voice
If you, your parent or spouse served in the military, you could join our family. Our members saved an average of $70 a month on auto insurance when they switched. Tap the banner or visit usaa.comjointoday to check your eligibility restrictions.
Kevin Roose
Apply.
Episode: ‘A.I.-Washing’ Layoffs? + Why L.L.M.s Can’t Write Well + Tokenmaxxing
Hosts: Kevin Roose (The New York Times), Casey Newton (Platformer)
Guest: Jasmine Sun (Freelance Journalist)
Date: March 20, 2026
This episode explores seismic shifts in the tech industry brought on by AI, focusing on three intertwined topics: whether recent tech layoffs are truly caused by AI (“A.I.-washing”), why large language models (LLMs) seem to hit a ceiling on good writing, and the rise of token “leaderboards” that track who is spending the most on AI tools within tech companies. The hosts, joined by Jasmine Sun, dig deep into the reality behind the headlines and examine the human and economic stakes of AI’s next wave.
[02:00–17:55]
Guest: Jasmine Sun, Author of “The Human Skill That Eludes AI”
[19:46–44:59]
[47:21–62:53]
On ‘A.I.-Washing’
On Creative AI
On Editing with AI
This episode is a deep dive into the murky realities behind AI’s market hype: major tech layoffs justified (maybe opportunistically) under the AI banner, the struggle of even “superhuman” LLMs to write with real voice and soul, and the wonky new culture of “tokenmaxxing” that’s reshaping incentives and office politics. The conversation with Jasmine Sun is particularly rich for anyone interested in how AI is (and isn’t) changing creative work.