Summary7 min read

Data-Smart City Pod: "Federal Data, Local Impact: What Cities Need to Know"

Date: June 17, 2026
Host: Stephen Goldsmith, Bloomberg Center for Cities
Guest: Denise Ross, Director of Federal Data Policy at the American Federation of Scientists

Overview

This episode delves into the intersection of federal data and local government needs, exploring how cities can leverage federal data assets, the challenges (and opportunities) of maintaining high-quality data pipelines, and the essential role of public engagement in both the creation and application of government data. With her extensive experience at all levels of government, Denise Ross provides practical insights and real-world examples that highlight the critical value—and current vulnerability—of the federal data infrastructure for cities.

Key Discussion Points and Insights

Denise Ross’s Career Thread: Data, Advocacy, and Equity

[01:00-02:17]

Ross recaps her journey from analyzing and democratizing census data in New Orleans, to advocating for robust public data policies after Hurricane Katrina.
Emphasizes a lifelong focus on making public data accessible and ensuring it serves community needs:
- "What I’ve been doing since is really about public data policy and making sure those data supply chains are robust and that they're meeting the needs of the American people." — Denise Ross [01:11]

Relationship of Federal and Local Data

[02:17-03:45]

Federal data serves as a backbone for local data by providing standards, continuity, and comparability.
Standards (e.g., Uniform Crime Reporting) make it possible to compare data across cities and over time.
Federal data can lag behind in fast-moving local crises, creating tension between stability and responsiveness.
- "That's a huge gift. ... but then the federal data aren’t quite as adaptive when things change rapidly." — Denise Ross [02:53]

Data Quality and Feedback Loops

[04:01-07:13]

Uniform Crime Reporting as a case study of data quality:
- Local agencies sometimes maintain two sets of data—one for operational use, one for federal reporting.
- Local innovation (e.g., recording non-fatal shootings) can prompt overdue updates in national data standards—eventually benefiting the entire country.
Example: Addition of non-fatal shootings to federal crime statistics began locally and was later adopted nationally, now rolling out to 18,000 agencies.

Community-Driven Data Use Cases and Comparisons

[07:26-10:48]

The power of small-area and neighborhood-level data is often under-realized due to privacy and sampling limitations.
Civic-data intermediaries, like the National Neighborhood Indicators Partnership, fill gaps by reshaping public data for hyperlocal comparisons and benchmarking among cities.
Innovative use of commercial and postal data (e.g., junk mail deliveries, USPS "nostat" data) helped cities track housing recovery after disasters.
- "All of these different cities were using the junk mail data to track repopulation after the population declines." — Denise Ross [09:57]

Finding and Using Federal Data: Challenges and Strategies

[10:53-14:17]

No easy inventory exists for all federal data sets; city leaders largely rely on instinct, networking, and issue-driven “recipes.”
Under-used but highly valuable datasets (e.g., American Community Survey, postal HUD data) are at risk of disappearing before local governments even exploit their potential.
Networks like the Civic Analytics Network serve as knowledge hubs and recipe exchanges for city leaders.

Preparing for Census 2030

[14:29-15:37]

Now is the time for cities to prepare for the next decennial census, especially via the Local Update of Census Addresses (LUCA) program.
Census data is foundational for eligibility and allocation in nearly 350 federal programs, distributing trillions in funding.
- "The basis of the decennial is the addresses and knowing where people are. And that is a local government responsibility." — Denise Ross [15:06]

The Fragility of Federal Data: The "Weather Forecast" Analogy

[15:37-18:09]

The American Data Index tracks the health of the federal data ecosystem, likened to "a weather forecast for government data sets."
Recent years have seen degradation via loss of data elements, reduced capacity, staffing cuts, and increased bureaucracy.
Essential data sets are being terminated, delayed, or stripped of details (e.g., food security, gender identity data).
- "Because of all of that... we’re starting to see some datasets be terminated completely... Or [we see] reductions in access." — Denise Ross [17:17]

The Everyday Impact of Federal Data

[18:13-20:48]

Ross tells a story of receiving a recall warning for peanut butter—an everyday safety benefit enabled by open federal data and commercial application.
- "A few years ago I got an email from the online grocer... they told me that the peanut butter in my pantry might have salmonella... it was because of the Consumer Product Safety Commission's dataset on product recalls." — Denise Ross [19:03]
Urges local data leaders to speak up when data sets are cut, as public input is required by law and can have outsized effects.

Data Quality, AI, and Fitness-for-Purpose

[20:53-23:51]

Advocates robust public engagement in using city data for AI, being vigilant about matching data quality to the importance of its intended use.
Cautions against repurposing data for high-stakes AI decisions if the data was collected for less rigorous applications (e.g., SNAP social security numbers).
- "For every data set that we're thinking about putting into an AI model, we need to think about what is the use of that model and what are the stakes and is that data high enough quality for that purpose..." — Denise Ross [21:56]

Advice for City Data Leaders

[24:38-26:20]

Public engagement is crucial: models, however technically sound, can fail or create inequities without community buy-in.
Flint, Michigan water crisis as a cautionary tale: predictive models successfully located lead pipes but lost effectiveness (and fairness) when public engagement faltered and the remediation strategy changed.
- "If there had been a participatory process where the neighborhood residents were involved... it would have been politically untenable to change the method..." — Denise Ross [26:05]

Communities as Producers of Data (Community Science & 311)

[27:05-28:53]

Community-collected data can add legitimacy and fill gaps but can also create new disparities if less-resourced neighborhoods can’t participate.
Examples: After Katrina, resource-rich neighborhoods collected better data—compounding social inequities; similar effects can be seen in 311 reporting patterns.

Denise Ross’s Current Work and Call to Action

[29:00-30:06]

Ross is now focused on monitoring federal data health, identifying moments for public input, and telling the stories of how federal data impacts daily life.
Invites local government colleagues to share favorite data sets and subscribe to the Data Index US newsletter to receive updates and advocacy opportunities.
- "It is literally written into the law, into the Evidence Act, that the chief data officers in federal agencies need to engage with state and local data leaders..." — Denise Ross [29:30]

Memorable Quotes & Moments

"The innovation started at the local level, got adopted at the national level, and now propagates across the nation." — Denise Ross [06:58], on adding non-fatal shootings to national crime stats
"You would hope you could just take the data that's being produced by the federal government, assume it'll keep flowing... But right now, what we're seeing is an unprecedented degradation of federal data capacity." — Denise Ross [15:55]
"As the significance of the use escalates, the quality of the data needs to escalate accordingly." — Stephen Goldsmith [22:34]
"Community science is one of the best ways to establish legitimacy of data. It's tricky though..." — Denise Ross [27:24]

Important Segment Timestamps

01:00 — Thematic introduction: Ross’s career and the thread of public data service
02:17 — Complementarity of federal vs. local data
04:53 — Data quality and user-driven improvement in federal datasets
07:26 — Benchmarking at the neighborhood level and using unconventional datasets
10:53 — How cities can discover and leverage underutilized federal data
14:29 — Preparing for Census 2030 and LUCA participation
15:53 — The "weather forecast" for federal datasets: fragility and advocacy
19:03 — The peanut butter recall: everyday safety powered by federal data
22:34 — The link between purpose, stakes, and necessary data quality
24:38 — The Flint water crisis: public participation in crisis data use
27:05 — Community science: legitimacy and risk of new disparities
29:00 — Denise Ross’s current mission and advocacy call

Summary

For local officials and data leaders, this episode is a clear roadmap for:

Leveraging federal data to enhance local impact,
Understanding the urgent need to maintain, advocate for, and improve the national data infrastructure,
Navigating the practical, ethical, and equity challenges in data and AI,
And most importantly, ensuring that public engagement and transparency are at the heart of all data-driven decision making.

Denise Ross’s blend of real stories, actionable advice, and big-picture warnings offers both inspiration and practical steps for cities aiming to turn national data assets into local improvements.

Loading summary

Transcript53 lines

[00:03]
A
From DataSmart City Solutions the Bloomberg center for Cities, this is the DataSmart CityPod.
[00:12]
B
Thank you and welcome back. This is Stephen Goldsmith from the Bloomberg center for Cities with another podcast. We have an illustrious guest today, Denise Ross, who was all sorts of things. Director of Enterprise Information in New Orleans, Presidential Innovation Fellow Deputy US Chief Technology Officer, US Chief Data Scientist, now Director of Federal Data Policy at the American Federation of Scientists, just to name a few. Welcome, Denise.
[00:40]
A
Thank you, Stephen. It's so great to be here.
[00:43]
B
Did I leave out any one of your 25 previous illustrious titles?
[00:48]
A
Well, my favorite job was in City Hall.
[00:51]
B
Well, I'm glad you have your priorities correct. Yeah, I occasionally meet a governor who was a mayor, and I express my sorrow for their demotions.
[01:00]
A
Exactly.
[01:01]
B
You worked in a great city with a great mayor. Let's just talk for a second to get to know you. What's a common theme, a common thread through all those jobs you've had? And how do you describe your accumulating expertise in data and government?
[01:16]
A
I'm glad you asked that, because I've just recently started to make sense of my career path and realized that when I was in New Orleans, I was working for a nonprofit data intermediary. And I moved to New Orleans in 2000 to publish census data. It was the first time the Internet was a thing and the census data were being released. And it was this opportunity to democratize the data so it could be used by communities and nonprofits to sort of chart their path toward their own destiny. And we analyzed the data and made it really easy for neighborhood folks to use. And then when Katrina happened, all of those data became instantly historical. And I found myself shifting from becoming a data analyst and a data user to also becoming an advocate for for the data that we needed in order to create a more equitable recovery for the city of New Orleans. And what I've been doing since is really about public data policy and making sure that those data supply chains are robust and that they're meeting the needs of the American people.
[02:18]
B
That's ambitious. Let's deal with that for a second. So let's just start with a framework. Many of us who have spent all our time in local government recognize that most of the data we need is local, but not all of it. So how should a mayor or a chief data officer think about the importance of federal data? Both what is possible today and what's missing? How do we think about that hierarchy of data?
[02:46]
A
There's a very complementary relationship between the data work at the local level. And that at the federal level, one nice thing that federal data bring to the picture is standards. So a local government doesn't have to invent how it's collecting crime data, for example. And because of those standards, and because of the voluntary or mandatory reporting requirements for some federal statistics that local governments have, that enables local governments to compare across time because you have continuity of data collections and across space so you can benchmark how your city is doing compared to another city. And that's a huge gift. And in order for the federal government to be able to provide that sort of data stability, there's a tension, because then the federal data aren't quite as adaptive when things change rapidly, whether it's with a disaster or a disruption to our economy, like AI is causing, for better and for worse.
[03:46]
B
I have to tell you a story that's like 30 years old. You've just kind of provoked me. I'm so old that when I was a district attorney, I was on an FBI local committee to revise uniform crime Reporting. Right?
[04:01]
A
Yeah.
[04:01]
B
And I remember being in the room when the top FBI person assigned to us said, in response to a question about, I think guns, said we couldn't possibly let local governments and even private individuals use our data because there's too many errors in our data. And it felt to me like the reason there were errors in the data was because people weren't using it. And, you know, if you're FBI agent, a local police officer, you just assume that maybe it was right, maybe it was wrong, and you just went ahead and acted. So let's talk for a second now that you've kind of provoked my story about data quality and data usage. Like what federal data and even the local data, when do you use it, how do you think about AI's access to it, and when is quality enough?
[04:53]
A
Well, I'm so glad you brought up the Uniform Crime Reporting data, because that's a perfect data set to talk about these issues with. And it's the data set that got me involved in some of the federal statistics when I was a Presidential Innovation Fellow. There are lots of types of federal collections that come from local governments. That includes the crime reporting from law enforcement agencies, public health reporting on disease surveillance, wastewater tracking and whatnot. Also school performance data that comes up from school districts and states. And each of those represents an opportunity both for that consistency across time and space when the data go to the federal government, but also for the data to start to be used immediately by the local community in order to drive better outcomes. And I remember When I was working on the police Data Initiative, it was just a few years after, a couple of years after Ferguson and I was talking to a police chief and he was describing. He's like, well, we've got this data here that we actually use to understand what's happening in our communities. And this data over here is what we report to the federal government. And so that was like, that was just extra work for him. Right. And so you definitely do want a reality where the data that's being reported to the federal government is also the same type of data structures that you need to make those local decisions. And we have an example from recent history where local communities were innovating on that crime data now NIBRS data rather than ucr, and there was a gap. So it turns out that the standard did not require that police departments report non fatal shootings, which is sort of an oversight. If you're trying to understand gun violence, you don't want to just analyze the gun violence that resulted in deaths. You want to analyze all shootings. So some local jurisdictions started just adding a question in their form about non fatal shootings. And then the FBI, through its deliberative standard setting process just recently, like in the last six months, added the non fatal shootings category to the national standard. And now it's a challenge of propagating that new standard across the nation's 18,000 law enforcement agencies. So the innovation started at the local level, got adopted at the national level, and now propagates across the nation.
[07:14]
B
It's a good example. It's also an example of a practical change that makes the data more useful. The difference between two categories is how good the the shooter was.
[07:25]
A
Totally. And how close the hospital was.
[07:27]
B
Yeah, yeah, right. Let's stay with this for a second. On the comparison side, one way to think about comparisons is housing versus housing. Another is to think about it demographically. What federal data is available and how should we think about disaggregating the data so we can understand conditions relative to the opportunities that individuals have in those neighborhoods? If you take a neighborhood in Indianapolis, you might want to compare it to other neighborhoods in Indianapolis, but you also might want to compare it to similar neighborhoods in Cleveland. Right. Where the demographics are the same. So you've got both inside the city comparisons and comparisons from city to city.
[08:09]
A
That's where the magic happens, I think with small area data. The challenge, of course, with small area data is that because of privacy concerns and also just the nature of statistical sampling, it's relatively rare that we get neighborhood level Data at the city level, and census data is an exception to that. Obviously, the decennial is a full count. And the American Community Survey, you get that neighborhood level geography, but it's a rolling average over five years. So it doesn't give you that sort of year by year comparison as cleanly as one might hope. There is a civic infrastructure that some cities have that can be really helpful in making those comparisons. And that is the Urban Institute has something called the National Neighborhood Indicator Partnership. And when I was in New Orleans, we were one of those data intermediaries. So we took publicly available data, organized it into neighborhoods, and made it really useful and relevant for local decision makers. And what was really transformative is then once or twice a year we meet up in one of the cities. I think there's 30 or 40 cities now that have this civic capacity. And we would compare notes with each other and compare methodologies. And so this was really useful after, for example, the housing crisis of 2008. You know, Youngstown and Detroit were struggling with declining populations and lots of blighted properties. And New Orleans, of course, was just three years out from Katrina. And we similarly had a blight problem, but for different reasons. We all honed in on a specific type of data set that was really useful for that type of benchmarking. It was a little bit surprising. It was the company that sends you junk mail. It turns out that they had the best data on addresses actively receiving mail, because there's a business purpose for them to not want to deliver mail to an empty house. All of these different cities were using the junk mail data to track repopulation after the population declines. Then there was also a complimentary data set that an analyst in HUD had produced. He had gotten permission from the Postal Service to aggregate addresses with the status of nostat, which meant that the mail was not deliverable to that house. It's a good proxy for blight. And he was able to aggregate that, I think, into census block groups. Between those two data sets, we had a pretty good comparable data data set across time and across place to get a sense for how is the recovery from these housing crises going in these different places and what interventions might be making a difference.
[10:49]
B
These are great answers. Really. These are terrific answers. I love the examples.
[10:53]
A
Oh, thank you. I try to be specific.
[10:57]
B
Where would a local chief data officer or chief of staff or a person who's interested in data access, how would they find out what's available at the federal level that could assist them? That's still available at the federal level? Obviously we talked about crime, but you know, we have some projects involving environment. Some of the data is there, some of the data is not. So how do you figure that out? If you're working in a city,
[11:23]
A
when you're working in a city, you're heads down on what your community needs and what your, you know, what your mayor's office is looking to do. And, and I remember when I was in City Hall, I had very little articulation with the federal data apparat. It was just something I took for granted. Right? Like we've got the census data, we've got the American Community Survey data, we've got different transportation data sets and whatnot. And when I talk to people who use federal data, one common theme is that they can't live without it. For example, the American Community Survey is going to be your, your city's best way of understanding where are the neighborhoods where there's a high proportion of households without access to a vehicle. And then if you're making an evacuation plan, you would want to make sure that you had enough buses and you bring those buses to the right neighborhoods so that people can get on the buses and safely evacuate. And there's no replacement for the American Community Survey for that data set. And it wouldn't make sense for a local jurisdiction to try to collect that data on their own. And so that's a data set that's used broadly by emergency managers in local governments that needs to keep flowing. There are also lots of data sets that are underutilized that would be useful, like that obscure postal data set at HUD that we just happen to hear about. But most city GIS departments, for example, probably don't know about that data on the nostat addresses that are a proxy for blighted properties. And that's part of the challenge, is that we might lose some of these federal data due to diminishing capacity before we even started to use it in our city government operations. And there are some proactive things that you can do around making sure that your city is well represented as we develop our nation's denominator for Census 2030. And I can talk about that in a minute.
[13:21]
B
And there's no easy inventory of what's there that would be useful and what's not.
[13:27]
A
Right?
[13:27]
B
You just have to keep probing and looking.
[13:30]
A
There is no easy inventory. There are ways that city data leaders can find data that are useful. One is thinking about using data as sort of like having a recipe if you're trying to solve the problem of blight or emergency preparedness or workforce development. There are different federal data sets combined with local data sets that are going to be really powerful. And networks like the Civic Analytics Network that I know you were instrumental in setting up, those types of networks can be great for sharing those recipes so that you can meet those pressing use cases. And then there's some software packages that also sort of compile those templates and will do the work once to find the data that's relevant to that issue and then sort of bring it to your doorstep.
[14:18]
B
Thanks for mentioning the American Community Survey. Every city should use that on a whole host of kind of access areas. How should a city today be thinking about the next census?
[14:29]
A
It's 2026, so we've got four years until the next decennial census. But now is the time to start getting your data ducks in a row. And this is important because census data determine federal funding. It determines whether your jurisdiction is eligible for different types of funding if you're a metropolitan or a rural community. And the Project on Government Oversight recently identified almost 350 federal programs that rely on census data to direct more than $2.1 trillion in federal funds to state and local communities. That was in 2020. So the stakes are high, and the basis of the decennial is the addresses and knowing where people are. And that is a local government responsibility. There is a program called Local Update of Census Addresses, affectionately known as luca. And now's the time. Ask your GIS person to tap into the LUCA process at the Census Bureau and start making sure that all of your addresses in your city are fully accounted for, including your group housing.
[15:38]
B
Great advice. You've spoken a little bit on the American Data Index, and I saw you reference that as a weather forecast for government data sets. What does Weather Forecast for America data sets mean?
[15:53]
A
Well, you would hope that you could just take the data that's being produced by the federal government, assume it'll keep flowing, assume it will be high quality and fully supported, and go about doing your business of running your city. But right now, what we're seeing is an unprecedented degradation of federal data capacity. And it takes a few different forms. One is just data elements or entire data sets that are not aligned with the administration's priorities, like on DEI or gender or climate. And the other is really just a reduction in overall government capacity, with staffing losses, contracts being cut, the loss of scientific advisory committees that used to help keep federal data sets sort of up with the times, and then an increase in red tape where data collection processes that used to be routine now have to go through multiple levels of political review before the agency can move forward. So because of all of that, what we're starting to see, and I think we'll see more of in this year, is some data sets be terminated completely. For example, the food security supplement, which is the best way that we understand childhood hunger in America and food insecurity at the state level. There's also been data sets where gender identity, for example, has been taken out, especially data sets where it's particularly relevant, like the 988 crisis hotline or the National Crime Victimization Survey had some of those fields taken out as well. There have been some explicit losses. And also I think what we're going to see more of is data sets that used to be published in a certain month might end up being late because there just aren't the staff and the people in place to produce those in a timely way. We might see less detail of a thinner report coming out. We definitely are seeing reductions in access. So it's more work now to get access to the data because those, those front end tools that made it easier to access the data in many cases are not being supported anymore. So it requires more technical expertise to, to derive value from federal data than it used to. And that shifts, unfortunately, that shifts the burden to local governments.
[18:09]
B
So what is essential data that you work on?
[18:13]
A
One of my big regrets from my time in the White House most recently is that we didn't do a great job of telling the story about why federal data are so important for running a modern society and how federal data sets benefit everyday Americans. The example I gave you about everyone in a neighborhood has a seat on the evacuation bus because their city used American community survey data to plan ahead. That's an example of everyday American who's benefiting from federal data. So what we're doing is really telling the story about how federal data touches every corner of American lives and livelihoods. One example that I really, I like to tell is a few years ago I got an email from the online grocer that I get my groceries from? And they told me that the peanut butter in my pantry might have salmonella. And I was sort of creeped out by it. Like, how did they know that my peanut butter might have salmonella? And a few years later I realized that it was because of the Consumer Product Safety Commission's data set on product recalls. That's an open API. And then that online grocer, like any company, can just tap into it. They know I bought the peanut butter. And now the federal government is telling them that it might have salmonella and I know to throw the peanut butter away. We're telling the story about through all of these data sets why it matters. And there's a particularly compelling angle, I think, for local data leaders to play a role in that. And what we're doing at dataindex us is we're monitoring for opportunities for public input. If a local cdo, for example, speaks up about how they need the food Security supplement or the American Community Survey or some DOT data set, these policy windows mean that there will be an outsized impact by them giving their feedback at that time because the agency is open for public input and local data leaders have a real outsized role to play because they're closest to the citizenry. Right. And if they don't weigh in, then what happens is federal agencies are career colleagues on the inside. They assume that the changes they're making to a data set are fine or they might think that nobody cares about the data set.
[20:33]
B
I will remember this as the peanut butter story.
[20:37]
A
I have a P.S. to that. A colleague of mine bought his peanut butter in the store and so he didn't have that chain of custody digitally and he actually got the salmonella poisoning.
[20:48]
B
Certainly proves essentiality, I'll say that.
[20:52]
A
Yeah.
[20:53]
B
So how do you think about the quality of data that underlies the use of AI? It's almost back to my crime story right now. With AI, more people can use the data. So therefore that's good and that's bad. So if you were today chief Data officer in New Orleans or fill in the blank, how would you go about accessibility of the data so that it can be used for good purposes?
[21:20]
A
I would prioritize meaningful public participation in any AI tools that you're building on your data. And also be really thoughtful about fitness for purpose because often data might be collected for one purpose. For example, SNAP data states collect Social Security numbers just as like a rough sort of identification measure, but it's never rigorously used. But if those Social Security numbers then go up to the federal level and are used for fraud detection, the quality of those socials isn't sufficient for that high stakes use case at the federal level where people might get kicked off of benefits. For every data set that we're thinking about putting putting into an AI model, we need to think about what is the use of that model and what are the stakes and is that data high enough quality for that purpose and those stakes and that's something that public participation, like public input can also help really sharpen your Thinking about the match between data quality and data purpose and the AI applications.
[22:33]
B
Well, that was a great example. Let me repeat it back to you with my words, just to see if I have it right. So as the significance of the use escalates, the quality of the data needs to escalate accordingly. Is that a fair summary?
[22:51]
A
Yes. Often in city government we produce data that's good enough. An example is permitting data. I know after Katrina we took out a couple of permits and I must have had a few false starts because there must have been three or four records from my address, from me trying to get permits for one thing or another. And I could always find ultimately the right permit that I needed to continue acting on. But there was this debris that I had created in the process, and that was fine because it was good enough for me to find the actual permit. And it was good enough if I went into the Office of Safety and Permits for the city employee to find my address and the right permit. But if I'm an analyst and I'm trying to understand permit activity in a neighborhood and I'm going to be making resource allocation decisions based on that, if three of the four permits attached to my address were false starts, that data is not high enough quality to inform that analysis.
[23:52]
B
You know too much, really. This is a very difficult interview because you know the exceptions to every rule. This is very difficult to be simplistic with you. Looking forward, it feels to me like we're at a point in time where, thanks to data and generative AI, we could see cities operate at a much higher level of performance. Understanding causation, anticipating problems, identifying needs. If you were speaking to others that work, that work in the local area, maybe aspire to do the things you've done in life, what would be the one or two pieces of advice you would give them about how to make their cities better? Insofar as the use of data to improve the quality of life,
[24:38]
A
I would say engaging the public at every step of the way is going to be the most impactful thing you can do. And I'll give a cautionary tale of something that happened in Flint, Michigan after the water crisis there. A group of technologists from Georgia came in and they had a machine learning model that could predict where the lead pipes were most likely to be. You know, Flint's an old city. There wasn't really any reliable data on where the lead pipes were. And with this model, when the city was digging up pipes to see if they were lead, they had a 70% hit rate. They were more Efficient than they would have been if they were just randomly digging up pipes. But because these technologists had not engaged the public, the public wasn't really bought into how important this model was for prioritizing, getting clean water to the areas that needed it the most. You can imagine a city council member looking at the map of where the digs were happening and saying, why aren't you digging? In my neighborhood there was a contractor change and the decision was made to dig equally across neighborhoods rather than predictively. Using that AI model, the hit rate went down to 15%. So the wealthier, whiter neighborhoods were being reassured that their water was clean, while the lower income black neighborhoods were continuing to be poisoned by lead. And I think if there had been a participatory process where the neighborhood residents were involved in understanding the data collection, testing the water quality before and after, I think it would have been politically untenable to change the method of prioritizing the digit.
[26:20]
B
Yeah, it's a very troubling example for many reasons. We've been spending some time, we have a Robert Wood Johnson foundation grant to look at hyperlocal environmental data and its effect on public health in various areas. And as federal subsidies for lead pipe remediation have gone down, the intensity of local remediation has gone down. But if you looked at it the way you just said it, you would still be engaged in your most high risk neighborhoods, right?
[26:52]
A
Yeah.
[26:53]
B
The homes that are where kids are getting poisoned every day. Better data equals better resource allocation, equals higher quality of life for residents of your city.
[27:04]
A
I agree.
[27:06]
B
So whether it's lead paint or abandoned houses or other similar local issues, how do you think about the community? Not just as the user of the data, but as a producer of important data to augment what the federal, state and local governments have.
[27:25]
A
I think community science is one of the best ways to establish legitimacy of data. It's tricky though after Hurricane Katrina with the federal data being instantly historical because 80% of the city flooded neighborhoods collected data about housing condition on their own and that created its own data disparity on top of the pre existing demographic and economic disparities that the neighborhood suffered from. So for example, a neighborhood that was highly resourced could get a copy of Salesforce and walk the streets with notepad and enter it into the database and had very good visibility on the status of different properties and then could advocate with with city hall around remediation, whereas a lower resource neighborhood did not have the capacity to walk the streets and collect that or purchase the software tools for it. So what we saw was this data disparity that was on top of the preexisting disparities, and that's just something to be cautious of. Same thing happens with 311 data, right? We've heard stories like, for example, I just was talking to a Fairfax county representative, and he was saying that in the Hispanic neighborhoods in Fairfax that their calls have gone down. And that's not because they don't have problems. They just didn't feel comfortable calling the government because of the current climate.
[28:53]
B
Let's just close. I know you've got multiple things you're doing now, but just tell us a little bit about what you're doing and what you're trying to accomplish.
[29:00]
A
I am collaborating with a group of federal data watchers to monitor what's happening with the federal data ecosystem, identify opportunities for public input, and tell the story about why it matters. We would love to collaborate with any of our colleagues in local government who, if you've got a favorite data set, please let us know. Let us tell the story about it. We can monitor it. We can let you know. If you sign up for our newsletter at dataindex Us, we'll let you know when opportunities for public input do arise. Because it is literally written into the law, into the Evidence act, that the chief data officers in federal agencies need to engage with state and local data leaders around the value of federal data and how it can be improved. So they're required to get your input on how to do data better. And admittedly, those feedback loops are a little rusty right now, but we're trying to to make them more effective so that folks who need the data can advocate for its continued investment.
[30:06]
B
Well, thank you. This is Steve Goldsmith from the Bloomberg center for Cities. I'm with Denise Ross, who is one of the nation's preeminent data experts and apparently knows more obscure anecdotes than almost anybody I've met. So thank you for your contributions in multiple ways.
[30:22]
A
Thank you, Steven. If you like this podcast, please Visit us at datasmartcities.org and find us on itunes, Spotify, or wherever you get your podcasts. This podcast was hosted by Stephen Goldsmith and produced by me, Betsy Gardner. Thanks for listening.