
Why does it matter that US government datasets are being deleted and discontinued?
Loading summary
Tim Harford
This BBC podcast is supported by ads outside the uk.
Advertisement Voice
You can make a difference in someone's life, including your own, with a job in home care. These jobs offer flexible schedules, health care, retirement options and free training. They also provide paid time off and opportunities for overtime. Visit oregonhomecarejobs.com to learn more and apply. That's oregonhomecarejobs.com.
Ryan Seacrest
Hey, it's Ryan Seacrest for Albertsons and Safeway Cough and cold season is coming, so make sure you're prepared and stock up on your family's favorite personal wellness products. Now through October 7th. Shop in store and online for savings on products like Mucinex Kickstart Combo, Zyrtec Allergy Relief Tablets or Liquid Gels Halls Cough Drops and Mucinex Fast day and night so you and your family are armed and ready for the season ahead. Offer ends October 7th. Restrictions apply. Offer offers may vary. Visit albertsons or safeway.com for more details.
Tim Harford
Hello and thanks for downloading the More or less Podcast with A program that delves into data, steeps itself in statistics and mulls over mathematical mysteries and I'm Tim Harford. In early February 2020, something strange started happening across US government websites.
Maggie Levenstein
Error.
Tim Harford
Decades of data began disappearing from webpages for agencies such as the Centers for Disease Control and Prevention, the National Institutes of Health, and the Census Bureau. In many cases, the entire website went dark. Within a few days. Some 8,000 government pages and 3,000 data sets had been taken down. Since then, many have been reinstated, but some have not. A curious case indeed. As data sleuths, we have questions. Why was this data taken away? And why does any of it matter? Let us delve into the case of the missing US Data.
Maggie Levenstein
Lots and lots of data sets disappeared from access because entire websites for agencies of the government came down because of executive orders from the president questioning what could be made available. I'm Maggie Levenstein. I'm a professor at the University of Michigan and director of icpsr, the data consortium that's based here.
Tim Harford
At the start of President Trump's second term, he signed a flurry of executive orders, for example, eradicating measures promoting diversity, equality and inclusion in federal government. The order targeted websites over language about minorities, indigenous peoples, race, gender, sexuality and disability. Climate change was another banned topic because.
Maggie Levenstein
Agencies weren't sure that their websites would be consistent with the policies. They just brought the entire websites down, and that meant that data that had been collected and was accessed through those websites became inaccessible.
Tim Harford
While this was happening, the Department of Government Efficiency was plowing on with their goal of cutting $2 trillion from government spending. They haven't got anywhere close to those savings.
Maggie Levenstein
By the way, lots and lots of people were fired. And those people are often people who produce data or people who provide access to data. And without those people in place, access was impossible. And there also were budget cuts that were affecting data access and data collection going forward.
Tim Harford
Now, most of the government websites did eventually reappear. Some data sets were fully restored, but some data sets were discontinued and some reappeared. Although researchers soon noticed they'd been changed.
Maggie Levenstein
There were actual decisions for particular data sets to alter them to obscure information about gender identity or about vaccines or about sexual behavior or about climate, that those were policy areas where there's a lot that the administration didn't want people to have access to that data about those questions. And so those data were removed or were altered.
Tim Harford
When data goes missing or is altered, it can have very real life consequences. For example, some might see the eradication of the non binary gender category as purely a symbolic move. But we know that teenagers who identify as non binary more likely to be more vulnerable to certain risks than their peers. That's something we couldn't have known without collecting the data in the past. And without the data now, are we.
Maggie Levenstein
Providing the appropriate social supports to them? It's very hard to know without the data that was being collected about the mental health of young people and their gender identity and their risk and their sexual behaviors. When we don't have that information, it's much harder to provide the resources that people need to live full and healthy lives.
Tim Harford
The data at risk spans all areas of life and society. Recently, the Bureau of Labor Statistics announced that when calculating cpi, the consumer prices index, they would stop collecting the data from three cities due to budgetary cuts.
Maggie Levenstein
Businesses rely on these and they rely on them to be high quality measures. When we reduce spending and the resources that the BLS has, that we get lower quality measures. And as a result, businesses are kind of much more likely to be flying blind.
Tim Harford
Many websites also now have government mandated disclaimers on their homepage throwing doubt on the data. For example, if you go on the cdc, the center for Disease Control and Prevention's homepage for national HIV surveillance systems, you're met with this.
Maggie Levenstein
Per a court order, HHS is required to restore this website as of 11:59pm Eastern Time, February 11, 2025. Any information on this page promoting gender ideology is extremely inaccurate and disconnected from the immutable biological reality that there are two sexes, male and female.
Tim Harford
The fate of other data Sets is uncertain. For example, a decades old dataset that monitors birth outcomes in the us. Prams.
John Cubali
Prams, or the Pregnancy Risk Assessment Monitoring System, is kind of the gold standard for assessing and monitoring maternal and child health in the us. That's an area where the US has consistently lagged behind other high income countries.
Tim Harford
That's John Cubali, a research assistant professor at the University of Michigan who specialises in epidemiology. The Department of Health and Human Services states that the purpose of PRAMS is to reduce infant morbidity and mortality by influencing programmes and policies aimed at reducing health problems among mothers and infants. As an infant is around three times more likely to die in the US than in Finland, Japan or Sweden, this seems like a pretty important data set.
John Cubali
It's something that is used both by policymakers and by researchers really to understand how are different aspects of maternal and child health changing over time and what populations are most impacted. That's really important information that you need to have if you're going to be considering how to design interventions to address some of those challenges.
Tim Harford
This data is collected by local states and governments and then sent to the CDC to be merged and for population weighting to be added. To access the data, researchers needed to apply for access and sign a declaration stating they wouldn't share the data.
John Cubali
I started getting contacted by researchers and essentially these requests were just going unanswered. I submitted multiple requests myself. I reached out to that office directly. You know, I never heard any response. And so essentially any of the data from the last 10 years, so 2016 or later, that's not publicly available. And so if you did not have access to that already, you're kind of out of luck at this point.
Tim Harford
Do we know whether they're still collecting the data?
John Cubali
Yeah, there is some uncertainty around that. There was an announcement, I believe in early to mid February saying, oh, we're going to discontinue this data collection. And there was a massive backlash and outcry and there was kind of a clarification, oh, no, we're not going to discontinue this, but we're going to kind of push it back until April 2025 and then we'll start it up again. But April 1, the entire prams office was essentially fired. And so now it's really uncertain what this means for this data collection moving forward.
Tim Harford
Since February, there's been a scramble to try and safeguard as much data as possible. Both Maggie and John are involved in this process, but snapshots of old web pages do not a full data set make.
Maggie Levenstein
So there's some data that you can't get to at all. There's some data that we don't know is gone yet. There's also data that's been preserved, but we don't know for sure that it hasn't been changed. And you also don't have the context, what we call metadata, the description of the data, all the things that allow you to make inference from the data.
Tim Harford
The case remains open. Thanks to Maggie Levenstein and John Cubali. That is all we have time for this week, but if you have any questions or comments, please email in to more or lessbc.com until next week. Goodbye.
Advertisement Voice
This is the story of the One As a maintenance supervisor at a manufacturing facility, he knows keeping the line up and running is a top priority. That's why he chooses Grainger, because when a drive belt gets damaged, Grainger makes it easy to find the exact specs for the replacement product he needs, and next day delivery helps ensure he'll have everything in place and running like clockwork. Call 1-800-GRAINGER click granger.com or just stop by grainger for the ones who get it done.
Maggie Levenstein
America is changing, and so is the world.
Tim Harford
But what's happening in America isn't just a cause of global upheaval. It's also a symptom of disruption that's happening everywhere.
Maggie Levenstein
I'm Asma Khalid in Washington, D.C. i'm.
Tim Harford
Tristan Redmond in London, and this is the Global Story.
Maggie Levenstein
Every weekday we'll bring you a story from this intersection where the world and America meet.
Tim Harford
Listen on BBC or wherever you get your podcasts.
Podcast: More or Less: Behind the Stats
Host: Tim Harford (BBC Radio 4)
Date: September 27, 2025
In this episode, Tim Harford and the More or Less team investigate the mysterious disappearance and alteration of thousands of US government data sets in early 2025. Joined by data experts Maggie Levenstein and John Cubali, they explore the execution, motivations, and real-world consequences of government actions that have rendered key public data sets unavailable or unreliable. The episode examines how political, budgetary, and ideological forces can jeopardize not only transparency but public wellbeing and research.
The episode closes on a sober note, highlighting how the case of the missing US data is far from resolved. Data loss and alteration undermine not just academic research and policy analysis, but the ability to provide targeted support to vulnerable populations. Both the cause and effect of political disruption, the erasure of data represents a fundamental challenge to evidence-based governance.
If you have questions or input, contact the More or Less team at moreorlessbc.com.