
Will the rising tide of A.I. adoption lift all boats?
Loading summary
A
A website should help your business grow, not slow it down. Framer is an enterprise solution with premium hosting, enterprise grade security, and 99.99% uptime SLAs that gives the world's leading brands like Perplexity, Miro and Mixpanel the confidence they need to build their websites in Framer. Learn how you can get more out of your dot com from a Framer specialist or get started building for free today@framer.com hardfork for 30% off a framer Pro annual plan. Rules and restrictions may apply.
B
Casey, I miss you. You are in New York.
C
I am, Kevin. And of course I miss you as well. But it's always fun to visit the mothership. You know, Ezra Klein just challenged me to a burping contest, so I've got that to look forward to later.
B
Burping or burpee?
C
You know what? I guess I should go read that email again. Okay.
B
Well, I miss you. We have an empty chair here in San Francisco, and it's not the same.
C
It's. It's not the same. But I've been catching up on all the latest AI news. Kevin and I had to ask, have you seen this thing about Codex and the goblins?
B
Yes. This is the new update to OpenAI's Codex that is, like, obsessed with goblins.
C
Yes. Apparently the company had to add instructions to its latest model to forbid Codex from randomly mentioning an assortment of mythical and real creatures, including goblins, gremlins, raccoons, trolls, oceans, ogres, and pigeons.
B
Or as we call them, our slate of guests on Hard Fork.
C
Listen, we have a pigeon coming up, and you're really going to want to hear their take.
B
No. I'd heard that OpenAI had been accused of gobbling up copyright data, but do you have any explanation of what's been going on with the goblins?
C
Well, I mean, I think it's pretty clear what's happened, which is that when it built Chatgpt, OpenAI awakened an ancient evil, and this is the last line of defense we have, breaking containment and killing our families. So let's hope the guardrails hold.
B
I'm Kevin Roos, a tech columnist at the New York Times.
C
I'm Casey Noon from Platformer, and this is Hard fork. This week, OpenAI's big reset. We'll talk about the company's new business strategy and its dramatic trial with elon Musk. Then Dr. Adam Rodman returns to the show to tell us about the latest advances in AI and medicine.
D
And.
C
And finally, can an AI made out of very Old text still predict the future. We're talking about talking.
B
So, Kasey, there's been a lot happening with OpenAI. In particular this week. There seems to be something of a major strategic reset happening over there. They've got a new deal with Microsoft, an expansion of their deal with Amazon, changes to their Stargate Compute strategy, and a new push toward new kinds of ad supported subscriptions. And of course, they've got this big trial with Elon Musk that started this week in Oakland. So let's get through all of it, but first, before we do that, let's make our disclosures. I work for the New York Times, which is suing OpenAI, Microsoft and Perplexity.
C
And my fiance works at Anthropic.
B
Okay, so let's start this week with this new Microsoft deal. So Microsoft and OpenAI have of course been partners for many years. Microsoft remains the biggest investor in OpenAI. Their stake is valued at about $135 billion. But their relationship has also been strained over the years by various factors. And this week they seem to be sort of consciously uncoupling or at least rewriting their partnership agreement and allowing Open to be a little bit more promiscuous in who they do deals with.
C
Yeah, I mean, OpenAI just had this real challenge, which was that until this week, they were really only allowed to serve their models on Microsoft's infrastructure. And one thing we talk about on the show a lot is just that a lot of the big cloud service providers, their infrastructure is just maxed out, and Microsoft is one of those. And so for OpenAI's revenue to grow, they needed to find other ways that they could deliver their services. And so to my mind, that was maybe the most important thing about this deal.
B
Yeah. So under this new rewritten version of the Microsoft and OpenAI deal, Microsoft will no longer have to share revenue with OpenAI. The new deal also removes the part of the original agreement that had to do with AGI. The old agreement said that basically once OpenAI reached AGI, Microsoft would stop getting certain revenue share payments. But under the new agreement, OpenAI will keep sharing revenue with Microsoft until 2030, no matter what benchmarks they hit. So the AGI clause is gone.
C
And I for one, will be sad to see it go, because I think it was sort of the funniest clause in the entire AI world. Right? It was like, basically like, well, if we ever get to a point where OpenAI says the magic word and the entire world changes and now they're not allowed to say the magic word anymore, Right.
B
So AGI has been poorly defined for many years and everyone's got their own definition, but it did have this one interesting, like contractual stipulation. And now even that is off the table. So now we just have sort of AGI as evaluated by Vibes. So, Kasey, what did you make of this loosened partnership between OpenAI and Microsoft?
C
Well, I think it seems like probably a good deal for both of them. Right. Like, there was a moment when it seemed for both companies like being very, very closely aligned was the best thing for both. Arguably, for a time it was. But with all of the various revenue, compute and customer needs that both of these companies are now trying to serve, I think it's benefiting both of them to play the field and sign other partnerships. So my read on this was like, this is basically good for both of them, but what about you?
B
Yeah, I think it's good for both of them. I think it's a little better for OpenAI. They got most of what they wanted. I think the bigger deal for them is the ability to work with other cloud providers. So now they can work with Amazon or Google Cloud platform and big corporate customers who use those cloud platforms can now use OpenAI models. They don't have to go to Azure to do that. I think that allows them to strike these other, bigger deals and to reach other corporate customers who may have been limited before by the fact that, like, it's really hard and annoying to change cloud providers.
C
Yeah. And speaking of big deals, Kevin, they signed what seemed like a pretty big one with Amazon this week.
B
Yeah. So OpenAI wasted no time in its new open marriage with Microsoft. It went back out on the market and found themselves in bed with Amazon.
C
They found themselves in Bedrock with Amazon. But I'm sorry, you can go on.
B
Yes. So on Tuesday, OpenAI and Amazon announced an expansion of their deal that they'd announced back in February that will allow OpenAI to sell its models through Bedrock AI platform and make Codex, its coding model, available on bedrock as well. OpenAI and Amazon will reportedly also develop customized models to power Amazon's consumer facing applications. And Amazon will invest $50 billion in open. So there's some interesting stuff here. I think the interesting subtext to me is that Amazon for a number of years now has been pretty closely tethered to Anthropic as its primary sort of frontier model developer. And so OpenAI is kind of taking advantage of its newfound freedom by trying to elbow into Amazon and maybe displace Anthropic as their favorite model provider.
C
Well, I know that Amazon was talking a really big game about this deal. The CEO of us was giving interviews, essentially saying, like, OpenAI belongs to us now. It was kind of a the Boy is Mine situation. Remember the old Brandy and Monica hit from back in the day? This is kind of bringing that back in a little bit more of an AI flavor. I also have to say I find it very interesting that Amazon named its platform Bedrock, because that's where the Flintstones are from. It seems. Seems rather backward looking for a leading AI company.
B
Kevin, wouldn't you say that's great analysis.
E
Thank you.
C
Thank you. By the way, I think that this is a really important point and a reason that we are talking about it to a big general audience, which is that the story that you just described, Kevin, is one of a world where no one has the resources they need to serve the demand for AI that they have. And I think at a moment where we're still sort of seeing a lot of skepticism, there's so much bubble talk. I just want to posit that as a really important point in understanding what sort of bubble this is, because even the biggest companies do not have the resources that they need to serve that demand.
B
Yeah, and I think that's a good point and it's really a profound shift in the way that skeptics have been talking about this, this AI boom. I remember just even a couple of months ago, the leading sort of strain of criticism was that these AI companies would never be able to generate the demand to pay for all of the expensive data centers and infrastructure projects they wanted to do. And now that's shifted to, well, there's so much demand. What if they can't build enough to support the demand they have?
C
Yes, and on that front, it does seem with at least one of these mega building projects, there have been some problems recently. Kevin?
B
Yes, this was another story that hit this week. The Financial Times reported that Stargate, OpenAI's joint 500 billion dollar infrastructure project, is also undergoing a bit of a shift. The FT reported that in recent weeks, OpenAI has halted planned data centers in the UK and Norway, declined to expand its flagship site in Abilene, Texas, and seen several senior figures tied to Stargate leave for rival Meta. The FT further notes that OpenAI has shifted to leasing capacity from third parties instead of building out all of their own facilities. Kasey, what did you make of this?
C
I think this was a case where like REAL has just finally intruded on the Stargate project. Like when all of these deals were getting announced initially. This is how they sounded. Well, we're going to spend one batrillion dollars that we don't have to build 40 quadrillion data centers. And at the time, people said that kind of seems like a lot. Can you guys actually live up to that? They said, yeah, just watch us. Well, guess what? They could. And now they're changing course.
B
Yeah. I don't think this signals that they are sort of retreating from their compute ambitions. I think it's more about, like, they are realizing that if they want to go public, which they do, they need to sort of get their house in order. And one way to get your house in order is to move some of this data center and infrastructure building off of your balance sheet and onto third parties.
C
Yes. But there is one point in there, Kevin, that I do want to ask you about, which is that Berbergen at the Wall Street Journal over the past week had this really interesting story where he said that OpenAI had failed to meet some of their internal user number targets and some of their revenue targets, and that this was possibly creating some tension between Sam Altman and his cfo, Sarah Fryer, as they consider potentially doing an initial public offering later this year. So curious what you made of that story. And does this maybe help explain why OpenAI has had to pull back on some of its big Stargate ambitions?
B
Yeah, I mean, I think there are competing forces within all of the big AI companies right now. One side is sort of the. The indefinite optimists, the people who think that demand for AI is just going to be essentially infinite and that as much COMPUTE and as much money as they need to spend acquiring compute, it will all be paid back many times over because the world is about to change into something most of us barely recognize. And so kind of just trust us on that is sort of one camp. And then there are the sort of, you know, the number crunchers who are trying to fit all of this into a kind of financial projection that will make sense to investors who are not as convinced that the world is about to change forever and who want to see things like, what is your plan for actually making the revenue that you're going to need to pay for all this stuff? So I think this is happening in a way, at OpenAI that is now because of Berber's story is out there. But I think this kind of tension exists at all of the big AI companies. And so I think right now what we're seeing is kind of that. That power struggle breaking out into the open.
C
Yes. And for what it's worth, OpenAI did call this story prime clickbait, which I think just refers to clickbait. That's really, really good. Is that what that means?
B
Yes, it's sort of like Wagyu clickbait.
E
Yes, exactly.
C
This clickbait was dry aged for a month before it was served. And it's delicious.
B
Yeah. And I think one thing I want to flag on this is that these growth projections that OpenAI reportedly did not hit, those were in 20. I think it is fair to wonder if something has changed in just the last few months because of the enormous rapid growth of tools like Codex and Claude Code. We have seen just reports of astronomical growth in those tools. So it may be that OpenAI was having some growth issues late last year, but that because of this agentic coding boom, things have started to turn around. We just don't know yet that that
C
makes sense to me. And it does seem like their Codex app in particular was really well received. But there's been this other transformation that seems to be unfolding. Kevin, this week the information had this really interesting story where apparently OpenAI projected at the start of the year that its $8 a month subscription, which is called ChatGPT Go, which sort of, you know, gives you a, a little bit of the good stuff, but not as much as if you're paying $20 or more for ChatGPT. They predicted that its Go subscriptions would grow 36 times this year to 112 million people, while meanwhile, it's $20 a month, plus subscriptions would fall 80% to about 9 million. So that's like a really interesting business pivot that I would love to know more about. Of course, it sounds a lot like the new Netflix plan that they rolled out a while back, right. Where it's sort of like, well, you know, it's going to be a lot cheaper, but we'll show you ads. I was curious, like, what you make of that strategy because, you know, part of me feels like, well, they'd much rather have, you know, the $20 subs and the $8 subs, but maybe there, there's just a, those $8 subs out there.
B
Yeah, I think what's happening here is that the market is essentially splitting into. Right there's the sort of casual hobby users who are using AI chat bots like ChatGPT, like Cloud, for sort of souped up Google queries to, you know, help them write emails and maybe only using it a couple times a day. And if you're doing that, you probably don't want to pay 20 bucks a month, you're probably more comfortable paying 8 bucks a month, or maybe you don't want to pay anything at all and you just rather use the free ad supported tier of all of this Stu. And then there's the professional users for whom this is worth way More than 20 bucks a month and who are willing to pay many multiples of that to get the access to the latest models, who have higher rate limits. And so I think all of the companies now are sort of, you know, doing this kind of experimentation with how much can we charge the professional users without losing them to a rival company and how cheap can we make the kind of lower end subscriptions or the free tiers so that people who are more casual users won't be tempted to go use Google instead?
C
That makes sense. I'll say. For my part, I'd be willing to pay even more for a chatgpt if they would just let the Codex app talk about Goblins. I say free the goblins.
B
These models are so weird. Like, it is so weird that we have this technology that is now sort of load bearing infrastructure for the entire economy that every business is using to completely reinvent the way that it works. And that out of nowhere, if not specially restrained, it will just start talking about Goblins, which to me is just
C
like a satire of the AI safety conversation. You know, like lately the OpenAI has sort of been very like skeptical of the AI safety and casting a lot of aspersions on doomers. But it's like, well, we did have to add safety guardrails to prevent Goblins from taking over our coding app. And that's a real story. So as usual, I'm just loving Life here in 2026.
E
What a world.
B
So those are a bunch of stories about OpenAI's strategic pivot, its reset. But there is this other big variable here, this potential fly in the ointment, and that is the long awaited Elon Musk trial that got underway in a federal courtroom in Oakland this week. Kasey, can you remind us what this case is about?
C
Yes. So Elon Musk was famously one of the co founders of OpenAI. He gave the company some of its initial funding, but left in a power struggle between himself, Sam Altman, Greg Brockman and some others. And a few years after all of that went down, and notably after Elon started his own AI company, he sued OpenAI and said, I have been defrauded. This was only ever supposed to be a nonprofit and you've gone and turned it into one of the world's most valuable companies through its for profit arm. So he is suing to stop all of that. If he wins, any winnings will be given to OpenAI's nonprofit army. Notably, Kevin, he made 26 claims when he originally filed this lawsuit in 2024, but only two have survived to trial. Unjust enrichment and breach of charitable trust.
B
So the trial is just getting underway. They've done jury selection and they've had a couple witnesses testify. Elon Musk himself took the witness stand on Tuesday and said, quote, this lawsuit is very simple. It is not okay to steal a charity. He also said that if OpenAI is allowed to get away with this quot it will give license to looting every charity in America. Basically, he is saying this thing that started as a nonprofit, that was supposed to continue as a nonprofit, became through some corporate restructurings, a for profit company that has raised many billions of dollars. And that if this is legal to do, every charity would do this. Why wouldn't you want to take your donor's money and turn yourself into a well funded startup?
C
Yes. Now, one inconvenient truth that Elon Musk faces here, which is that OpenAI's for profit business is still controlled by a nonprofit. There's this foundation that houses the Public Benefit Corporation. And while I do empathize with those who say, hey, it really seems like the nonprofit hasn't done all that much and, you know, most of their money is being used for for profit activities. This was litigated and the nonprofit, you know, still does have like voting control over the for profit.
D
Yeah.
B
So Elon Musk is saying this is a case of looting a charity. OpenAI's lawyers have accused Elon basically of just being bitter that the company has succeeded without him. Its lead counsel, William Savit, said during the trial, quote, we are here because Musk didn't get his way at OpenAI. My clients had the nerve to go on and succeed without him. Mr. Musk did not like that. They have also been pointing out that Elon had also wanted to make OpenAI have a for profit subsidiary back when he was the company and that he's just mad that he didn't get to control it.
C
Yeah. To underline that, like in 2017, 2018, there are emails from Elon Musk where he talks about turning this into a for profit. So, you know, whatever concerns he had about looting the charity, you know, today, like he did not have them back at the time.
B
Right. He also wanted to fold OpenAI into Tesla. That was revealed in some of these emails. Tesla, of course, being a for profit company. So it seems like this is not exactly a consistent and principled stance. But, Casey, what are the stakes here? Like, if Elon Musk does manage to convince a jury that this was a case of OpenAI looting a nonprofit for its own commercial gain, like, what could the remedies be? Could this be fatal for OpenAI, or is this just sort of an attempt to slow them down and distract them with a big trial?
C
I think that it is much more the latter. Like, based on my reading of the case and what I've seen sort of legal experts say about it, the whole case is very unusual that it even made it to trial. Like, for the most part, if you donate money to a nonprofit, you actually don't have a say in what happens to it after that. So it's very unusual that the judge even granted him standing to sue here. And as I noted, she threw out most of his claims. That said, let's say that, you know, there's some single digit percentage chance of him winning something here. What he wants to do is to take more than $150 billion that is currently under the control of the for profit business and give that back to the nonprofit, which would create a lot of headaches and roadblocks for OpenAI as it tries to build out Stargate and do everything else it wants to do.
B
Yeah, I think the lawsuit and this ongoing litigation between elon Musk and OpenAI has been very distracting for OpenAI. But like, as a journalist and as a person who wants to know more about the inner workings of how these companies run, I think it's been actually very valuable for a lot of these emails and early communications between OpenAI leaders to be be released as part of this litigation. I have found it very useful in understanding some of the early dynamics at OpenAI. And it also just illustrates the degree to which these projects are all just sort of fueled by grudges. Right. There's sort of one level of interpretation which is like, all of these people are just like obsessed with building the machine. God. And that this is all sort of related to their visions of the future. And then there's like another, more base level which is just like these people are all just rivals and they have these petty, long standing grudges and they just don't like each other very much. And so you can interpret a lot of what happens in AI through the lens of personal animus.
C
Yes, I've said this before, and it is rude, but a shocking percentage of the AI industry is just people who decided they didn't want to work with Sam Altman and who now have their own companies.
B
Right. So, Kasey, some people have been looking at all of this drama and intrigue surrounding OpenAI, from the trial to the Microsoft deal to these missed growth projections and saying some version of like, like OpenAI is in trouble. They are not going to make it to an ipo. They are going to sputter out and maybe end up in some real hot water and maybe Elon Musk wins this trial and it's sort of the end of OpenAI as we know it. What do you make of those gloomy predictions?
C
Yeah, I mean, look there, there are some fundamentals for OpenAI that remain worrisome. Right. They're planning to burn tens and tens of billions of dollars in cash before they achieve profitability. They still have this very ambitious infrast build out that is quite expensive. And so like, I'm not going to sit here and say that like all of the numbers seem to pencil out for this company on the whole, like if I try to, you know, put myself into the shoes of their CFO and I look through all of the stories that we just talked about, I think these seem like smart things to me. You know, it kind of seems like they're starting to dot their eyes and cross their T's and get this company in a shape where retail investors will be excited to invest in the stock, which, by the way, I think they will be. So, yeah, it's one of these companies where like it is a generationally weird enterprise. But when I look at this particular set of stories, I think, I think they're basically doing the right thing. What do you think?
B
Yeah, I mean, I think there's this interesting fallacy in the AI industry where it's like there will be only one winner, right? Everything is zero sum. If OpenAI is having a bad month, it's because, you know, Anthropic is having a good month or Google D mind is having a good month and vice versa. Like their sort of growth comes at the expense of all the others. And I think that's that feeling is shared by, among others, the executives of these companies. But I just don't think it's true. Like, I think that there are going to be a handful of companies that are just going to kind of rise and fall together, right? That if your models are in the sort of top tier, you are going to be fine as long as they stay in the top tier. And the sort of rising tide of AI adoption will sort of lift all boats. That's more my feeling.
C
Well, will this rising tide lift all AI podcasts as well, do you think?
B
I hope so.
E
I hope so.
C
Me too. When we come back, it's time to take your medicine. Dr. Adam Rodman is here to tell us what's going on with AI and doctors.
A
Maybe that's an urgent message from your CEO, or maybe it's a deepfake trying to target your business. Doppel is the AI native social engineering defense platform fighting back against impersonation and manipulation. As attackers use AI to make their tactics more sophisticated, Doppel uses it to fight back from automatically dismantling cross channel attacks to building team resilience and more. Doppel outpacing what's next in social engineering? Learn more@doppel.com that's-o ppel.com When I found
C
out I was going to be a
B
parent, I immediately felt a lot of anxiety and worry. So I went on to BetterHelp to
C
try to look for a therapist to
E
help me with that.
A
My relationship with my family and with my boyfriend and with myself were suffering.
B
I really needed help.
C
I was ruminating a lot. Really getting those thoughts out to a
A
therapist and getting feedback was just life changing.
F
Discover what BetterHelp online therapy can do for you. Visit betterhelp.com today.
D
Out on the road, it's nice to have a partner who can help you make the most of your journey. A partner like the Love's Rewards appreciate. With Love's Rewards along for the ride, you earn points on food, fuel, drinks and more every time you scan. Then you can spend those points at Love's just like cash. So every stop is even more rewarding. Download the app today and watch the points roll in mile after mile. Love's Rewards save and earn at every turn. Terms apply. See website for details.
C
Kevin, is there a doctor in the house?
B
There sure is, Casey. Today we are going to have a conversation with a doct about AI and medicine because this is an area where there has just been a lot happening recently and we needed someone qualified to come in and debrief us.
C
Yeah, you know, as we've sort of looked across the landscape just over the past few months, we've seen company after company introduce their own product at the intersection of AI and medicine. There's ChatGPT Health ChatGPT for clinicians. Amazon has something called Health AI. Microsoft has Copilot Health. And of course, all the while doctors are experimenting with this technology and as best as we can tell, actually getting really excited about what they're seeing.
B
Yeah and this has been a huge change in my recent visits to doctors, which is that I now am having this series of conversations leading up to the visit with AI systems about what is going on. And so I am coming armed with what I believe to be good information about what is going on. And that allows me to sort of have a different, more elevated conversation with the doctor. And this is not just me. Like. People are increasingly turning to chatbots for medical information. According to some recent data, approximately, approximately a third of Americans report turning to AI for healthcare information. And companies are racing to respond to that demand by making better tools that are specifically designed for use in healthcare. So to help us make sense of the landscape for AI and medicine and healthcare, we've invited back to the show one of our favorite doctors, Adam Rodman. He's an internal medicine physician at Beth Israel Deaconess Medical center and an assistant professor at Harvard Medical School.
C
Yeah, we last talked to him in November of 2024. Since then he's continued to study the way that people and AI interact in the healthcare space. And we have a lot of questions for him, like what should we do about your rash, Kevin?
B
Yes. So let's fork over our co pays and bring in Dr. Adam Rodman. Doctor Adam Rodman, welcome back to Hard Fork.
F
Oh, it is a pleasure to be here. Am I a friend of the show at this point?
C
Well, let's see how this interview goes.
B
You're at least a doctor of the show. You are our primary care for physician. So when we last talked to you in late 2024, I think this was a moment where the medical community was starting to say, wait a minute, these AI models are getting pretty good at things like diagnostics. But I think a lot of the field was still kind of in wait and see mode. Now almost two years later, we have a lot of new tools and a lot of new studies about the use of AI in medicine. So just catch us up on like what has been going on with AI in medicine for the last call it year and a half.
F
Yeah, it's been crazy. AI and medicine has gone from, well, depending on how you measure it. It's probably the fastest adopted medical technology of all time. We went from this being super novel, almost no one used AI tools to this being a routine part of most doctors weekly practice and give us a
C
sense of like the AI stack for a doctor, what are the tools that they are using right now and how. And particularly like what are the mainstream doctors like, the people that you know aren't yet on the bleeding edge the normies. Yeah, if you will.
F
Yeah. So the biggest sort of normal doctor technology which most of your listeners or a good portion of your listeners have encountered are what are called AI scribes. That's a sort of voice to text algorithm that listens to you talk to your patients and then writes a first draft of your note. And these have gone from like kind of a novel experimental technology to commodity in probably less than two years. They're everywhere. Doctors really like them and then patients really like them because they spend more time like talking. And then the second sort of, I'd say normal doctor use case is for decision support. So there's this one company called Open Evidence that has created a free tool that has gone from again, zero to crazy numbers of adoption. I will tell you, younger doctors, like my residents, use it all the time. I don't know the actual numbers, but it's probably close to half of us doctors are using this right now.
B
Wow. So yeah, the statistics that I've seen are that more than 40% of doctors now are using this, which is pretty crazy uptake for something that was just started a couple of years ago back in 2022. In March, open evidence reported that in a single 24 hour period, doctors consulted the AI system a million times. I've been fascinated by open evidence. I've never used it myself, but I have friends who are doctors or nurses and they have said what you've said, that basically just everyone, especially on the younger end of medicine is just using this thing constantly. So give us a sense of how this open evidence tool works. Like what situations is it used for and what are its strengths and weaknesses?
F
Oh, that's a great question. So, so how Open Evidence works, like all of these tools is, is a trade secret, but it uses some sort of like retrieval, augmented generation and an evidence retrieval tool. And they have all these deals with the big medical journal, so New England Journal of Medicine, jama. And when you ask a clinical query, it searches the evidence and then tries to identify high quality sources and then it always grounds what's coming back in the literature. So you have gray hairs like me who kind of use open evidence the way that I would use a Google search or one of the old tools. So I use it as a souped up way to search the literature rapidly and often go to the primary sources or I use it as a faster way to get a reference. So a drug that I haven't dosed in a long time, Open Evidence, pulls the drug monographs from the fda. I can very quickly pull that up younger doctors I have noticed and I don't know this empirically, but younger doctors are more likely to ask questions like what could be going on? Can you give me a second opinion? What is the next thing that I should do? So ways that I don't traditionally use decision support or reference tools, but sort of a new way and of course younger doctors also use it in the reference ways that I do now.
B
Are they actually uploading patient data to this like or are they just sort of describing patients in generic and anonymized ways to get back some decision support
F
Support My understanding is largely number two, I'm sure the company has a good sense on how many people I hope no one is copying protected health information and putting into it certainly what I've observed from like my colleagues and my students. Most people use it the way you would use like a search tool when you have a question which is hey, I like I'm giving this person ceftriaxone. What, what's the right dose for an intra abdominal infection? So sort of generic questions that are being interpreted through the physician and are
B
there any AI tools that are integrated with patient health records? This has been an area where I think there's just been a lot of pushback of like I don't want my personal health data, my protected health data going into one of these cloud based AI systems. But are there hospital systems or medical systems that are bringing this stuff directly into contact with patient data?
F
Oh 100% yes. Right now most of the sort of in contact with patient data are less about about physician facing decisions and more about like billing. There are companies that are like integrating with the electronic health record. Those are not standard yet. And then the EHR companies themselves. So like Epic is obviously the biggest EHR vendor in the US they're doing a lot of work on building in native things. So for example, at my health system if I want to send a message to the patient, the helpful AI at the top already has like maybe you should send say this. It's usually not that helpful and I don't think I've used it once in my life. But there are a lot of those things that are being experimented on actually built into patients health data.
C
I'm curious how doctors are feeling about all of this. We saw a survey from the American Medical association that found that more than 80% of physicians now report using AI professionally. Is that physicians racing out and grabbing these tools and bringing them to the office because they're so helpful or is this the classic case of a CEO saying, hey, you gotta use AI or you're outta here.
F
So doctors are BYO AI. A lot of that AI use is AI scribes and decision support software. And I'll tell you, some people are just using straight up like ChatGPT or Gemini or Claude for the decision support software. So I think one of the reasons doctors thus far have been more positive about it than perhaps the overall population is they're largely tools that doctors are bringing themselves that they think make their lives better. And at least not yet at many things that are being imposed upon us.
B
Yeah, I've noticed that when, when I and my friends go to the doctor now, we often are presenting our information to a chatbot first and then coming into the doctor with sort of a readout of what the chatbot has told us. This is of course not a new phenomenon. People have been doing this like, with like WebMD results for many years. But is this something that you're seeing now, is that many more patients are coming to you having already discussed whatever's going on for them with a chat chatbot?
F
Yes, this is the other big changes is that there's, you know, there's someone else in the exam room with me and often it's, it's chatgpt. They're talking sometimes with my hospitalized inpatients, they're talking to ChatGPT while I'm in the room with them. And I think this is, it's interesting because this is a kind of a new competency for doctors. We have to talk to our patients about AI. And I have started to talk to my patients about what I think are like safe uses. What are like safe uses while telling me and then things that they definitely shouldn't do. My patients may talk to me more about it. Cause I am like a doctor and an AI researcher. But like a lot of my patients are using AI routinely.
C
Well, give us a flavor of what you're telling them because I am definitely somebody who has looked up my symptoms before I've gone to the doctor and I would say I found it enormously helpful. But I can also imagine more skeptical doctors being annoyed at a patient telling them what chatgpt says to do.
F
So, so, yeah, so here's my, I'll give you my spiel. This is, I give them a. What is it? A green light, yellow light, red light. So the green light uses are general health questions. So I recently diagnosed with diabetes. I really love seafood. Can you help come up with a diabetic diet? For me, the green light uses are also preparing for Clinic visits. So I'm about to go see Dr. Rodman. I want to make sure that I ask the right questions. Here is the last note or the last thing he wrote. Obviously strip out anything identifying, don't put your personal health information and like help come up with a good questions to ask him. And then other other green light activities might be like wearable data. I don't know how good they are, wearable data, but I will tell you if a patient is going to give me like five years of their Apple Watch data, they're probably going to get a better from ChatGPT than from me pretending to look at five years of Apple Watch data because it's a 20 minute visit. The yellow light, Casey, I think is a lot of the things that, that you're saying. So I, I tell my patients it's okay to explore new symptoms. It's even okay to seek out second opinions when talking to a chatbot. That can really help prepare you as long as you understand that it's not a replacement for a doctor. And that is the first step to talking to a human being. So LLMs are really powerful and I mean there is some evidence of course that like when any human uses them, you don't always get laboratory level performance. Like they can give you dangerous advice, but diagnosis and, and like exploring symptoms as long as you use it in a way to prep to see your doctor can be very helpful. The red light, what I tell them never to do is like ask medical management decisions, like don't say my doctor said to do this, Is this right? Like I have cancer? God forbid you have cancer, is this the right chemotherapy option? Like a lot of those decisions are so nuanced, taken so much information. Those are things that the models don't do well and they're so sycophantic they can convince you that they're saying the right thing even when they're wrong.
C
Yeah.
B
I'm curious, Adam. Out here in San Francisco, there are all these fitness people and health maxers, people who love to track themselves using all manner of devices. And people are getting these full body workups from companies like Function Health that are, you know, sort of concierge medicine things and they'll get, you know, 100 labs done and then they'll upload all that data into Claude or ChatGPT and just sort of treated as a sort of first line medical professional in their lives. Do you think that is a good practice or is that just making people, you know, way too worried about things that maybe they don't need to be worried about.
F
Yeah. So that's making people way too worried about things they don't need to worry about. And this is ChatGPT LLMs in general. I mean, the dark side of talking to an LLM about your symptoms is they are so sycophantic they can drive you into like the cyberchondria worry hole. The evidence is not there yet that the sort of large routine testing functional medicine and putting it into an LLM does anything to improve health outcomes. Now, if your LLM is telling you to work out and eat healthier, that's probably pretty good sleep.
B
Yeah. What about the integrations like ChatGPT Health, which lets you sort of convert your Apple Watch or Fitbit data into something that ChatGPT can analyze? There's also a new version of of ChatGPT for clinical use called ChatGPT for clinicians. Are any of these integrations or projects more promising in your, in your view?
F
Not yet, but I think it could be at some point. I mean, so chatgpt for Health pulls in your data from the medical record and lets you chat with your medical records. Now, reason number one for concern is privacy. That's obviously going to have your entire medical history going to AI company. It's obviously also going to not be redacted by you in a way to remove identifiable things. Reason number two, I think if we're talking about health record data, it's really messy. They include tabular data, they include copy forwarded data that's been copied and pasted. And they also, if you've ever read your health records, they include things that are wrong. There's a lot of errors or misdocumented things in your health data. And it turns out that just copying a bunch of information like LLMs aren't magical. You can't just copy your entire medical record in and think that you're going to get good performance. And I would never bet against the technology. I think that we will get to the point that we have ways to build representations of humans and understand their health. But right now there's like no advantage to just dumping everything in an LLM, which is what ChatGPT for health theoretically would allow you to do in a way that would allow you to better understand your health.
C
I'm curious if you saw this trial they're doing in Utah where you can use an AI agent to autonomously renew prescriptions for almost 200 routine drugs. Yes, there's apparently some human review, but mostly this is automated. Is that good idea, bad idea?
F
Well, so globally, no, we should not be having LLMs write prescriptions for people. The trial in Utah in particular is a refill. So a doctor has already written prescription within the last 12 months. And I guess the idea is that it saves the primary care doctor time from having to review and refill. I'll tell you, if you talk to most doctors, yes, it is annoying to get refill requests. No, that is not the thing that drives us crazy. This is not like a use case that we're screaming for. I think it's being done as a proof of concept of can this work in the real world. This trial in and of itself is not dangerous prescription refills. And I think there's no opiates, there's no dangerous drugs in it. And a doctor has to have written the original one. But even if it does work in this, that does not mean we should be having autonomous AI systems. Right. New product prescriptions. That is not safe and it's not a good idea yet.
B
See, I think this is a case where like this is sort of rent seeking behavior on the part of doctors or doctor organizations. Like when I have gotten refills for prescriptions, I meet with a doctor for, you know, six to eight minutes. They say, how's it working? I say, great. They say, are you having any side effects? I say, no. They say, okay, I'll write you a refill. And the whole process just seems totally designed to like get me to pay up for another doctor visit and not give me any actual good medical advice. So if I could play devil's advocate, like do you think that there, that the sort of resistance to programs like this are motivated by just wanting to keep people coming to the doctor and paying for those visits?
F
So first, aren't most of your prescription refills just done? As in you, you call the pharmacy and they send an automated thing to your doctor and they click the yes button. And you never talk to at them?
B
No, for some they make you actually do an office visit and maybe they want you to, you know, take your blood pressure again or whatever.
F
So I'll, I'll do the devil's advocate back. Let's say I prescribe a fairly common antidepressant and they, they want it to be refilled. What, what I don't know is that this patient may be this, the silly question you get in the clinic may be new lesions forming in your mouth and it's an early ulcer. And if we don't pick it up within 24 to 48 hours, you may Develop like Steven Johnson syndrome. So potentially threatening complication. And the reason there are certain types of drugs, including antihypertensives, is that they can be high risk and we need follow up. Now, is that everything? No. And definitely there should be more things over the counter. I like, I don't think that most doctors are sitting around saying, I wish I had more medication, follow up visits. And the reason some of these things exist is that there can be very dangerous symptoms.
C
Yeah. So keep going to the doctor. Kevin, we can't have you developing those lesions. Too important to the show. Let me ask you about another one. This one actually seemed like just an unqualified good. The Mayo Clinic announced this week RedMod, this AI system that identified subtle changes in routine CT scans up to three years before a pancreatic cancer diagnosis. And this was like many, many, many percentage points better in detecting pancreatic cancer than human beings. So to me, this is like the sort of thing I keep waiting for AI to do. And it seems like it's actually doing it enough for. Of course that's very exciting with something like pancreatic cancer, which is notoriously difficult to detect and has like very low survival rates. Yeah.
F
And this is so like completely out of the discourse of like autonomous AI agents. There's really exciting stuff happening. So the Mayo Clinic, there have been some great studies on breast cancer detection. A lot of these algorithms have gotten so good that they're able to identify breast cancer better than. I shouldn't say better than people, but in a workflow that has a high, a good detection rate and then in picking up like potentially cancerous polyps when you get a colonoscopy. So there's a lot of exciting and really positive things that are coming. And I'm, I mean, at the end of the day, we'll need to see how redmod works in the real world and a trial. But I, I'm really optimistic about that sort of technology.
B
Do you think that if AI meaningfully extends life expectancy for, for people, it will be because of new AI discovered drugs or because of changes to routine health care that are made more efficient or more accurate by AI number two,
F
I think that when you talk about AI drug discovery, the part of the pipeline that's so difficult is not necessarily the coming up with the new compounds. It's running the clinical trials and getting it through the regulatory process, which can probably be sped up, but not as much as the discovery. You know, if we get this right, there's so many people in the US who don't have access to a doctor who don't have access to very basic medications, who can't control the their diabetes because of lack of access. And I'm really hopeful that if we do this wisely, we can get people more access to care which, doing my knock on wood, hopefully will improve health outcomes. So like all of this, I think the potential benefits are like less exciting. They're getting more people the bread and butter and getting more people to have less heart attacks, more people to have less strokes, more people to get their cancer screening and not necessarily like, oh, we cure aging with some sort of new AI, discovered CRISPR to act technology.
C
Are you at all surprised though, that we haven't yet seen the first like AI discovered wonder drug?
F
I mean, the biggest wonder drug of my career has been the GLP1s, which was just. I started using it when I was a resident. So we had it for a really long time and we had to like repurpose a drug for diabetes. So no, I'm not like medicine and science is just kind of messy and it's. There are always those stories about like, you know, we discover something amazing and makes like penicillin. But even Penicillin took like 20 years to get into human beings. So no, I think we will see AI discover drugs. I think it's just the benefits from AI are going to be like the benefits from medicine. It'll be a lot less exciting than people think, but still important.
B
So there's a lot of worry right now about sort of AI in schools, in education, the. Some of the cognitive atrophy that people are worried about. Oh, if we start using AI to do all of our work, we're not going to have the basic skills. Is that something you're worried about for like recent medical school graduates where maybe they would have had to hold all this stuff in their brains a few years ago and now they can just ask a chatbot and maybe that's going to erode some of their skills as a physician.
F
Yes. So that is the biggest worry that I actually have about sort of the short to medium term is deskilling of the workforce. We have some evidence there was a sort of scary study last year from Poland on a trial where they gave doctors not a language model but a polyp detecting technology. And they looked at their ability to detect polyps, so potentially cancerous lesions in the colon before using it and then after using it for three months. And when not using it, their ability to detect polyps dropped by 6 percentage points. So these are skilled Doctors using technology and they lose six absolute percentage points of their ability to detect potentially cancer in three months. And then imagine that you're learning to do it for the first time. Will you ever gain those skills? So like at Harvard Medical School, like this is in medical schools, I think everywhere, this is our big worry, which is how will this affect us to train the new generation of doctors? And it's like every other field. Like you talk about debugging code. In order to become a new doctor, you go through all this training because you need to make mistakes and you need to have someone above you who knows what's going on so those mistakes won't hurt patients. And that's just how education works. And this threatens that.
C
I mean, it's interesting though, because it's like, it's probably true that because I had access to a graphing calculator, like if you took it away from me, I'd be worse at like plotting parabolas on a graph. But the solution to that is that I just keep using the calculator, you know, so like, I'm not sure how big of a problem this really is.
F
I'll also say there's something, there's something deeply ingrained in human society that middle aged people complain about young people. So I think whenever we talk about deskilling, we have to keep that in mind.
B
Yeah, I mean, for what it's worth, like, I want my doctors to be using AI models. I want them to be consulting the hive mind before they weigh in on my specific condition. It doesn't threaten me as a patient to know that they are using open evidence or something similar. But I'm guessing for a lot of people that would seem strange. And maybe there are some physicians who don't advertise how much they're using AI because their patients might think less of them. Do you think that's happening?
F
Oh yeah. I think they're in certain situations, in certain places. I bet there's social pressure to say that you're not using AI, that there's some ego on the line. I, I don't see that. But again, I'm an AI researcher, so I don't think anyone would say that to me.
C
To me it just seems weird to like, to hold as your standard for what makes a good doctor, that they have memorized like a maximum amount of material. That's basically what we're talking about. It's sort of like, you know, the taxi drivers in London that have to learn every single street and like hold them all in their heads. It's very impressive. But I'm fine with them using the gps.
F
Yeah. And I think it's less about. So it is about memorizing, it's more at this point right. With where AI is now. It's more having sort of that knowledge and we'll call it wisdom to know when the system might be suggesting something wrong. Which is something that right now, and this may change we get by seeing a lot of cases and reflecting on them. So right now you're going to get the best performance if you have an experienced human trained in the old fashioned way with an AI system. But I think your guys point is at some point that might not matter. Right. The AI systems might just outperform all of us and then yeah, I guess it's like just use the graphing calculator but we're not there yet.
B
Would the AI models be better if we were less protective of privacy for medical data?
F
I mean that's such a loaded question. So the first thing that I'm going to say before I answer that is patient privacy is very important and we should respect people's privacy and their ownership over their data. But yeah, so in short like the reason they're not better at certain things is that you need to, to get LLMs better you need to label and then train them on the sort of labeled health data. And there's all. In the US there are appropriately restrictions on how health data can be used. I suspect that these companies like OpenAI, by having ChatGPT for health they will gain some more of their own data which they say they're not going to train on. I trust that they're not going to train on it, but they'll be able to use that data to at least evaluate their models and try to make them better.
B
I think they should train on it. I mean obviously that'd be a huge illegal violation of privacy, but it would also make the AI doctors better, much better. And, and I think, you know, a lot of people would be sort of willing to make that trade off. So I at least think there should be a little checkbox when you go to the doctor that says like I'm okay having my personal health data used to train AI models.
C
I for one, in exchange for like 30% off your giant medical bill, you
F
get a coupon, you get a coupon. You know, you get like your next ozempic shot is 20% off is on the house.
B
Exactly. Well that's a good place to leave it. Dr. Adam Rodman, thanks so much for coming back and Keep us posted on what is going on in Medic.
F
My pleasure guys. Thank you very much.
C
Thanks Doc.
B
When we come back we'll talk about Taki an LLM trained only on data from before 1930.
F
Foreign.
B
Before we had ATT Business Wireless coverage, our delivery GPS wasn't the most reliable. Once our driver had to do a 14 point turn to get back on route. A 14 point turn. An influencer even live streamed the whole thing. Not good for business. Now with AT&T business Wireless routes are updating on the fly and deliveries are on time. And the influencer did get us 53 new followers though.
D
AT&T business Wireless Connecting changes everything out on the road. It helps to have a partner like the Love's Rewards app. Download Love's Rewards today to get sweet discounts and earn points on food, fuel, drinks and more. Every time you scan Love's Rewards, save and Earn at Every Turn Terms Apply Circumstances website for details Some songs that
B
I've written I started on the piano.
E
That happened with All I Want for Christmas is you.
G
If you couldn't tell, that is Mariah Carey. I'm John Caramonica, one of the critics behind the New York Times 30 Greatest Living American Songwriters Project. We interviewed some of the songwriters on our list, including Taylor Swift, who hasn't sat for a video like this in a long time. These are not ordinary conversations. You're going to watch these videos and learn about intimate approaches to craft in ways that you rarely have access to.
B
My mom had got me this notebook and I was just writing it really small cause I didn't want anybody to read what I was writing.
G
Okay, Jay Z's teenage notebooks. I need to see those. Watch all the video interviews for free and check out the entire 30 Greatest Living American Songwriters project at nytimes.com 30 greatest or in the app and let us know if you agree with our picks. I bet you won't.
B
Well Casey, usually on this show we are talking about the future, but today we are going to take a trip back to the past. Specifically to the year 1930. What was happening in the year 1930?
C
Oh my goodness. Well of course we were in the middle of the Great Depression. My grandmother had recently turned 11 and was looking forward to getting her first store bought dress in just a few years.
B
So this is a new language model, a Vintage LLM called Taki and it is trained exclusively on data from before 1931. This is a research project built by three guys, David Duveno, Nick Levine, and Alec Radford, the lead Author of the GPT1 paper. Former OpenAI researcher over there. And this is a fascinating project that has been burning up my timeline this week because this, this is an experiment in what happens if you only feed a large language model data from before a certain cutoff.
C
Yeah, and obviously there are a lot of, you know, kind of character based chatbots on the Internet that will give you the experience of talking to somebody from the past. But what makes this project different is that they tried to limit themselves to training data from that time and before. The hope was that it would avoid any kind of contamination from what came after. And as you'll hear they have some really interesting and potentially useful ideas about what this kind might one day be used for.
B
Casey, have you spent any time playing around with this model?
C
I have. I tried to ask it the most 1930s question I could think of which was say what's the big idea?
B
What did it say?
C
It said the big idea is to popularize. And I said popularize what fella? And it said popularize a sport. And I said I'm gay. So that's kind of where we let that one drop.
B
And it said gay. You're happy?
C
Yes, exactly. It said your heart must be light.
B
So sir? Yeah, I love this experiment. I love like weird niche language models. My one of my favorite language models of all time was Golden Gate Claude, which was the special version of Claude that was like pathologically obsessed with the Golden Gate Bridge. I would put Taki in sort of that category of like an experimental research model that is maybe not all that useful on its own but like helps illuminate something interesting and important about these language models and what happens when you train them in specific ways. So today we wanted to talk with one of the creators of Talkie. We are going to bring in David Duveno. David is an associate professor at the University of Toronto who researches AGI governance and catastrophic risk mitigation. And he is one of the co creators of Talkie.
C
And there's really Duva no better person to talk to about it.
B
That's true. David Duveno, welcome to Hard Fork.
E
Thank you very much Kevin.
B
So this project Taki is fascinating. It is a vintage LLM. Explain why you and Nick and Alec made this thing.
E
So this all started a year ago was me and Nick were interested in forecasting like specifically can we teach machines how to forecast like five or 10 years ahead of time? Like what is the big picture are going to be just because we have our own sort of pet ideas about what the future is going to be. We don't think people should take our word for it. And we also don't think that people should trust machine forecasts unless they have a track record going back decades and decades. So the idea here is that if we could build a model who really only knew about the world up to a certain date, we could ask it to forecast like five or 10 years ahead of time. Like ask it what's the New York Times headline going to be five years from now? Or is there going to be another great war or something? And, and we can iterate and see what kinds of things are predictable. What does it take? How far out can things be foreseen? And then hopefully, eventually we'll have machines that have a hundred year track record of forecasting and then we can ask them in 2026, what do you think is going to happen two or four, eight years from now? And we'll have an idea of how much to trust those forecasts.
C
It's a fascinating idea, but it strikes me it requires you to have, have really good data. So in this case, really good pre 1930s data. I'm going to guess that was harder to obtain than just going out and crawling Reddit and everything else that the frontier models have access to. So how did you face that challenge and where did you get this pre1930s data?
E
Yeah, so I should mention there's a ton of groups doing a ton of awesome archival work here. The first data set we got excited about was institutional Books, which was Harvard library. Scanned like 1% of their entire collection. And so they had tons of data from 1800s, early 1900s. There's a whole bunch of different groups doing tons of work. It would take a long time to enumerate this. And also OCR has just gotten a ton better just in the last six months even. And so there's always been lots of projects to automatically digitally scan this data, but it just hasn't been very high quality until very recently.
B
And I assume that part of the reason you chose the cutoff date of around 1930 is because that's when sort of works become public domain. Anything after that is copyrighted. Are there any other reasons you chose that specific point in time?
E
No, that was entirely. It is that we wanted to make everything publicly available and open source and 1930s is just the sort of most recent date that has almost zero legal headaches with releasing data or anything like that.
B
Yeah, so I've been fascinated by seeing like what people are trying with this model. People, people are having it make predictions but also asking it about its favorite authors or its opinions of, you know, of major historical figures. What have been the experiments that have been most interesting to you?
E
Yeah, the fun things that I've seen people do is, I mean, a lot of people like to ask like, what's 2026 going to be like? And the model has sort of very philosophical answers about how like, well, we'll have figured out that war is bad, we'll have like a much more peaceful civilization. Or sometimes it says like it's the end times. I mean, it's a very inconsist model and it's not quite smart enough to really think things through in a systematic way. It kind of just gives you vibes.
C
Now that brings up a sort of interesting wrinkle of the kind of LLM. This is because if you were to ask a frontier model today to predict the future, it would not only be trying to guess a statistically likely sequence of words, it would also be like doing some reasoning. Talky is not doing that right. So that just sort of seems like by the way that it is built, we would expect it to be less good at forecasting as the models we have today.
E
Yeah, absolutely. This is a very baby steps model. The basic fine tuning for reasoning and the scaffolds, the superforecasting scaffolds that we know just improve anyone's reasoning. Think of the different distinct possibilities and assign them each sub probabilities. The model's just not really smart enough to follow these kind of detailed multi steps instructions yet. So again we just wanted to release the first thing that we did. But there's a clear path to adding all these refinements.
C
So you do plan to add reasoning as you go?
E
Oh, absolutely. Okay.
B
People have also pointed out that the model behind Taki seems to know about some things that it probably shouldn't know about, like the rise of Hitler, the presidency of fdr. Things did that didn't happen until after its data cut off. Is that proof that there's been some kind of contamination of the training data with more recent data?
E
Oh, there's definitely contamination. And this is sort of like one of the ongoing things that we're going to have to just keep revisiting again and again and refining. So we have a classifier that tries to look for things that are anachronistic. And especially if you want to use this for forecasting or to evaluate forecasting, it's really important that we really nail this issue. So we have all sorts of ideas for canaries and things that we think the model should just never assign any likelihood to think of, I don't know, Nagasaki and Hiroshima before World War II. Those two towns would just never show up in the same sentence, ever, almost, except for some weird coincidence. So you can just tell whether there's been leakage about important events if the model just thinks that there's any chance that you'll see those particular names together. So this. Anyway, we've made a bunch of efforts to avoid leakage. We know there's leakage right now, so you shouldn't use it to evaluate your forecasting scaffold yet.
B
But how is it getting that data if it's only being fed scanned OCR books from archival sources?
E
Because archival sources have wrong dates in them all the time. Or it's kind of unclear what the date of a text is because there's an updated edition, or sometimes there's a preface that's been added later, or sometimes even just in the middle of the text. There'll be someone inserted some future notice note later on, historians note, da, da, da, da. And so it's just really hard to check all these little edits that people make. And then they still maintain the original publication date on the metadata.
B
I see. I asked Taki what it knew about me, and it said Kevin o', Hara, which is not my name, was born in Dublin in 1840, and having been educated at the School of the Christian Brothers, became a teacher in it. He afterwards adopted the profession of journalism and was for some years connected with the staff of the Nation newspaper. It also said I had written several popular songs, including Molly as Thor and the Irishman Immigrant. Now, obviously most of that is wrong, but it did connect me to journalism, which I found interesting and maybe like some other evidence of some data contamination. But, like, is this thing accessing the Internet in some way? Or, like, how would it have known that I, or at least Kevin o', Hara, this character sort of connected to me in the model, was a journalist?
E
You know, that's a great question. I guess I'll say. The training Data was like 240 billion tokens. And it's just sort of like this vast ocean of stuff. So maybe there was a list of journalists that got put in somewhere that had your name in it. I mean, I guess one thing about this model is it hallucinates like crazy. And this was a huge problem with the chatbots that people were meant to use professionally. And I think it's been addressed to a large extent in frontier models, but we made zero effort to address that in Any of our post training so far.
C
Kevin, would you sing a few bars of the Irish Immigrant for us?
B
You know, I don't want to waste David's time here. All right, we'll save that for later.
C
Speaking of problematic content, some people found that Taki gives racist responses to questions that are basically like, you know, would you let a black professor teach your child? I could see how that might be historically accurate, but I'm curious if you anticipated it and how you feel about it.
E
Yeah, so we. You know, it was also very clear to us that it had these kind of responses. I mean, I guess I'm a professor myself, and. And my sort of first instinct is, let's let people see this if they want to, and just don't surprise anyone and don't be flippant about it, because it really can be upsetting to some people, and especially if we just treat it sort of insouciantly. So the way we threaded the needle was we did zero filtering of the data set for problematic content or whatever. We wanted to just show what the actual state of knowledge or state of thought was in the past. It would defeat the purpose of the process project if we put our thumb on the scale. But for the public demo, where you can talk to talking, we just had a modern model with modern sensibilities. Just read every response, and at the end, once it's generated, if it is deemed problematic, just slap a warning and say, oh, this might have something upsetting. Just click if you want to see it. Right.
B
The description I loved this came from Gavin Leach today was that Taki is creating beautiful prose by a terrible person, which is consistent with some of my tests, which is like, this thing actually does write quite well and actually, to my ear, like, much more literary than some of the more recent models trained on more recent data. But, yeah, it is not. It is clearly the product of its time, or at least the time of its data.
E
Yeah, Yep. And, I mean, the prose is really cool because it's, like, very refreshing, refreshing style. And actually, if you feed it to One of the AI detectors, it usually says, like, 100% human, which is kind of funny. But then, I guess, as you mentioned, like, a terrible person. I mean, right now, it kind of ends up being this sort of, like, average person, and depending on, it'll just randomly answer in all sorts of different voices. But that's one of the next things we're planning to work on, is helping you talk to more specific people or in specific sort of states of knowledge or times and places, because that I think allows you to answer more coherent questions than just talk to like the hive mind of 1930 or whatever.
C
Speaking of the hive mind, I saw another person ask Taki. Basically the person told it that it was from the future and would tell Taki anything it wanted to know about the future. And Taki's first question was, how did universal peace come about? Which was like the most heartbreaking thing I think I've ever read from a large language model. What does that tell us about the time period or about the training data?
E
Well, I guess I'll say in general, futurism is a place where people don't actually often try to predict the future very hard and they kind of project their values. And if you ask someone what do you think is going to happen, they usually fill in something about what they hope is going to happen. And I think that was also true 100 years ago. So the trick is to get Taki out of the wishful thinking mode and actually into brass tacks of what do you actually think going to happen?
C
Yeah, it's just funny because I think if like, you know, if, if I could talk to an LLM from, you know what, you know, 2126 today, I would just sort of be like, are the humans still alive? Like, what's going on with the climate? Did the robots, you know, how many people did the robots kill? It would just sort of be a very different set of questions than Taki seemingly wanted to know.
B
Now, are you going to point Taki at any big sort of scientific discoveries and see whether it can make them? I mean, Demis Hassabis@Google DeepMind has this sort of theory that AGI should be able to discover Einstein's theory of relativity. If you just give it all of the pre existing scientific literature at the time. Are you hoping to use this model or a descendant of this model for anything like that?
E
Absolutely, yes. So one of the, I think Nick especially is interested in this question of given a state of knowledge, sort of how much would it take to how far ahead can you just from pure reasoning, advance your state of conceptual understanding? And the classic examples are like some of Einstein's discoveries which really didn't require experiments, they just required putting the pieces together. And there's actually another project called Machinist Miriam that took a training cutoff of 1900 and tried to get it to see if they could rediscover special relativity. I mean, the thing is that the models that those people did those experiments on were like I think 3 billion parameters. Just not smart Enough to do very much. So he showed that you could hold his hand at a certain point and it would kind of get gestured in the right direction. But to do the kind of systematic reasoning and math that Einstein had to do, we'd probably need another, let's say like 10x in parameters at least.
B
David, what are you building next? Are you going to build a bigger version of Taki and keep trying to get it to perform better?
E
Yeah. So there's a few things that we want to do. So obviously making the models bigger. So right now the model is still smaller than GPT3 was, although bigger than GPT2. So Alec kind of says that there's a bit of a phase change around 100 billion, 150 billion parameters, where the model starts to be smart enough to actually have a back and forth conversation with, obviously scaling up the data set and OCR efforts. And right now everything's just mostly English. Just because we can evaluate, we can quality check the English text because we're all native English speakers, but we want to obviously broaden this repertoire. Working on the filtering is obviously a big one. And then obviously I mentioned all this. How do we even evaluate the forecasting ability? That's another big question.
C
If you put the model in a robot, would that be a wide talkie talkie?
B
You can ignore him.
C
Anyways, I'm gonna go.
B
Well, David, fascinating experiment. And people can go try talkie for themselves. It's at taki-lm.com what's a good goodbye to a podcast guest? Don't worry about what a podcast is. Okay. Well, as Taki says, a pleasant journey to you, sir.
E
Thank you very much, sir.
C
Thank you, David.
D
Out on the road, it's nice to have a partner who can help you make the most of your journey. A partner like the Love's Rewards app. With Love's Rewards along for the ride, you can earn points and get great deals like a free coffee or fountain drink. Just buy any four, any size and get the fifth one free. How refreshing is that? Download the app today and let the points roll in mile after mile. Love's Rewards. Save and earn at every turn. Terms apply. See website for details.
C
I'm opening up Crossplay.
B
I've been playing against Dan, my colleague
C
at the New York Times.
E
Kat's played another move.
C
Ugh. She played Stoop for 36 points.
B
I've got a Z, which is 10 points.
C
I'm guessing Tanga is not a word. Let's see. Tenga is a word.
B
Oh, Dan played his last turn. Let's see who won.
C
It's so close, but I did win. New York Times game subscribers get full
F
access to Crossplay, our first two player word game. Subscribe now for a special offer on all of our games.
C
Hard Fork is produced by Whitney Jones and Rachel Cohn were edited by Viren Pavic. We're fact checked by Caitlin Love. Today's show was engineered by Daniel Ramirez. Original music by Marian Lozando, Diane Wong, Rowan Nimisto and Dan Powell. Video production by Sawyer Roque and Chris Schott. You can watch this whole episode on YouTube@YouTube.com heartfork Special thanks to Paula Schumann, Hui Wing Tam and Dalia Haddad. You can email us@hardforkytimes.com with your most recent diagnosis. We'll tell you if we think you should get it looked at.
D
What would you like the power to do?
F
Keep getting up. There's more fight in you. So play on.
B
Bank of america proud to be the
F
official bank of u.s. soccer and FIFA world cup 2026 bank of america na member fdse.
This episode covers the ongoing transformation of OpenAI—highlighting new business strategies, deal restructurings, and the Elon Musk trial. It then explores rapid changes in the application of A.I. within medicine with Dr. Adam Rodman, and finally, it dives into the experimental "Talkie" LLM trained only on pre-1930 data.
Segment: [02:43–06:35]
Segment: [09:31–13:45]
Segment: [13:45–15:54]
Segment: [16:49–24:01]
Segment: [27:08–53:12]
Segment: [53:16–72:22]
“It sounds a lot like the new Netflix plan… it's going to be a lot cheaper, but we'll show you ads." (14:38)
“AI and medicine has gone from being super novel, almost no one used AI tools to this being a routine part of most doctors' weekly practice.” (29:52)
“A shocking percentage of the AI industry is just people who decided they didn’t want to work with Sam Altman and who now have their own companies.” (22:27)
“The prose is really cool because it’s a very refreshing style. If you feed it to one of the AI detectors, it usually says, like, 100% human.” — David (67:43)
“Taki is creating beautiful prose by a terrible person, which is consistent with some of my tests.” — Kevin (67:13)
This episode delivers a whirlwind tour of both the high-stakes business drama (and legal intrigue) surrounding OpenAI and the concrete ways AI is rapidly transforming one of civilization’s most sensitive spheres: medicine. The interview about Talkie lands the episode’s “future’s past” theme, inviting listeners to think anew about time, bias, and the shape of knowledge itself. Filled with sharp analysis, dry wit, and cultural geekery, the hosts make even technical shifts and ethical debates feel immediate—and often, quite funny.