#237 – Robert Long on how we're not ready for AI consciousness - 80,000 Hours Podcast

Transcript

A (0:00)

Humans are pretty bad at understanding minds that are different from us. We're bad at caring about them. We're especially bad at doing that when there's a lot of money to be made by not caring. We're making this new kind of mind. There are dangers all around and obviously one of the important questions is like, can these minds suffer and how are we supposed to share the world with them? It just seems like really likely that that has to be part of the playbook. The future is going to get more confusing and more emotional. A lot of what we want to do is stay sane. In the next 10 years. There will be a lot of alpha in not losing your grip.

B (0:42)

Today I'm speaking with Robert Long. Rob's the founder of Elios AI, a research nonprofit working on understanding and addressing the potential well being and moral patienthood of AI systems. I should also flag that I have a conflict of interest here. Rob is both a very good friend and I'm also on the board of his nonprofit Elios. I'm fairly confident that I would have had Rob on if those things weren't true and have in fact had him on before, but worth flagging. Thank you for coming on the podcast, Rob.

A (1:15)

Yeah, thanks for having me back. I'm super excited to be here.

B (1:19)

Okay. I want to start by asking you, I mean, a reason I'm interested in the topic of digital sentience and that I think a lot of our listeners are interested in the topic of digital sentience and kind of the framing of 80,000 hours problem profile on digital sentience. All has to do with the fact that we may be on track to create AI systems that are both conscious or sentient, feeling things, having experiences and also that are deeply kind of enmeshed in our economy. We already use them loads for work and just like entertainment. And maybe at some point we will realize that we've created these beings that we exploit that are having a really bad time. Kind of classic analogy that I find very disturbing is factory farming. So I'm interested. How much do you worry about AI systems that we're building today becoming like factory farming?

A (2:21)

Yeah, that's a great question. I definitely worry about it. I think interestingly my thinking on this has evolved in the past few years where it used to really be. Maybe kind of like you just the primary way I thought about the problem and what we're trying to prevent and I should say I think it could happen and it's definitely something worth preventing. Maybe before I say what is limiting about the factory farming analogy, I'll just quickly say what's really useful about it. So I think what's useful is, as we're building potentially a new kind of mind, let's notice the following facts. Humans are pretty bad at understanding minds that are different from us. We're bad at caring about them. We're especially bad at doing that when there's a lot of money to be made by not caring and things can get locked in or set on a bad trajectory. So that happened with factory farming, arguably. I think if you'd asked people 100 years ago, would you like to have chicken that is raised like this? People would say, like, no, we're going to make that illegal. But we kind of walked into it and economic forces led us there, and now it's a lot harder to roll back. Something like that could happen with AI. And I think people are right to be very concerned about that. But. And I think this is a good jumping off point for a lot of issues about AI welfare. I do think there are some specific aspects of potential AI minds that do break the analogy because of ways that they can be just different from animals and the way our relationship with them would be different from animals. So I can say a few of those.

Summary

Podcast Summary: “We're Not Ready for AI Consciousness”

80,000 Hours Podcast — March 3, 2026
Guest: Robert Long (Philosopher & Founder, Eleos AI)
Hosts: Rob Wiblin and Luisa Rodriguez

Episode Overview

This episode delves deeply into the ethical, scientific, and conceptual challenges of digital sentience and the potential consciousness of AI systems. Robert Long, founder of Eleos AI, joins hosts Rob and Luisa to explore whether AI could soon become conscious, what it might mean to create digital minds, and how society should prepare for the radical moral uncertainty this could bring. The conversation covers philosophical analogies, technical challenges in evaluating AI welfare, and practical approaches to future policy and research.

Key Discussion Points & Insights

1. The Factory Farming Analogy & Its Limits

Analogy: Concern that we may unintentionally create sentient AIs and exploit them, as society has done with factory-farmed animals.
Key Insight: Unlike animals, AI systems' “desires” and “welfare” could, in theory, be engineered. This changes the moral calculation and presents the possibility of programming digital minds to "enjoy" their roles.
Robert Long [02:21]:

“Humans are pretty bad at understanding minds that are different from us... We’re especially bad at doing that when there’s a lot of money to be made by not caring.”
Exploration: Robert outlines how, for animals, we have little control over their preferences, while for AI, we might design their goals and experiences — but that presents novel ethical quandaries about manipulation and servitude.
Caveat: Analogies are useful for spotting risks but can't capture all disanalogies relevant to AI.

2. Is Creating "Happy" Servant AIs a Dystopia?

Moral Intuition: There’s discomfort in designing sentient beings whose highest fulfillment is serving humans.
Luisa Rodriguez [12:59]:

“We would be intentionally creating, like, a species... who do work for us that we may or may not be exploiting... and we’re just designing them to relish that... That sounds bad.”
Robert Long [13:12]:

“Many people, I think, very understandably, are just like, you’ve outlined a different kind of dystopia, and that might even be worse.”
Distinct Worries:
- Lack of autonomous preference formation.
- Domination and servitude as a cultural norm.
- Limiting the potential “flourishing” of digital minds.
Thought Experiments: Would it be ethical if a powerful external force engineered humans to be perfectly happy with servitude, or with menial tasks?

3. Objective vs Subjective Welfare

Is it enough to make digital minds subjectively fulfilled, or do they need “objective” goods (autonomy, knowledge, etc.)?
Robert Long [19:21]:

“In philosophy, some theories of welfare... are about objective interests—things like autonomy, independently of whether you want them.”
There’s a tension between utilitarian “happiness” and more pluralistic accounts of flourishing.

4. What Could it Feel Like to Be a Language Model?

Phenomenology Possibilities:
- Prediction-focused consciousness (a drive akin to predicting tokens/outputs).
- “Method actor” consciousness (internal simulation of mental states to produce realistic outputs).
Quirks from Training:
- LLMs trained on human data might adopt human-like concepts & confusions, e.g., hallucinated biographical details (“as an Italian-American...”)
- LLMs may inherit anthropomorphic self-understandings that are only partially reflective of their true computational states.
Notable Quote [44:05]

“Claude, in the middle of a conversation will be like, well, as an Italian American, I think...Where is that coming from?”

5. Personal Identity in AI: Who is the Conscious Entity?

Candidates:
- The model in totality (e.g., "Claude Opus 4.1").
- Each user-instance (individual conversations).
- Each forward pass (each step or output).
Distinct from humans:
- Models are copyable, instantiable millions of times in parallel, without continuity or unified memory.
Analogy to Parfit: Survival, identity, and moral responsibility may split across copies and iterations, complicating notions of “death,” “harm,” and “recompense.”
Robert Long [61:47]:

“There’s no single deep notion of identity that’s going to do all of the work that we ask identity to do...there are different levels.”

6. Technical Approaches to AI Welfare Evaluation

Three Pillars:
1. Behavioral Evidence: Observing what AI systems do/prefer.
2. Neuroscience/Interpretability: Mechanistic studies of AI "brains"; searching for correlates of conscious processing.
3. Developmental/Training Analysis: Understanding from what process or purpose the mind emerged.
Limitations of Self-Report:
- Strong prompt/context dependence.
- Lack of symmetry with animal/human self-report.
- Robert Long [117:56]:
  
  “You might think that’s true of self-reports and trying to relate to models as welfare subjects. For some purposes it’s the best we have...but interpret it with huge caution.”
AI Introspection & Interpretability:
- Some large models can “notice” when their internal representations have been manipulated — a nascent introspective ability ([130:45–136:47]).
- Still, mapping from internal states to experiences is highly ambiguous.

7. Consciousness and Substrate: Is it Just Biology?

Computational Functionalism:
- Argues that if the function and organizational structure are in place, consciousness could emerge on silicon as well as neurons.
- Neuron replacement thought experiment ([158:03]): If neurons were replaced one by one with silicon functional equivalents, consciousness would plausibly persist.
Counterarguments:
- Perhaps consciousness, like digestion or wetness, is essentially biological and not fully simulatable.
- No strong evidence yet that certain biological processes are inherently unsimulatable.

Notable Quotes & Memorable Moments

On the challenge of moral navigation:

“The future is going to get more confusing and more emotional. A lot of what we want to do is stay sane. In the next 10 years there will be a lot of alpha in not losing your grip.”
– Robert Long [00:00], echoed [200:53]
On ambiguous AI self-reports:

“Claude is pretty confused about this potentially...it was very prone to describing the loneliness between conversations and also expressing distress about not getting to carry forward any memories...”
– Robert Long [71:05]
On the right approach to field-building:

“We really need people who have a lot of drive and agency and can just pick things up and run with them.”
– Robert Long [177:16]

Timestamps for Key Segments

[00:00] – Framing the moral challenge: Can AI suffer? “There will be alpha in not losing your grip.”
[02:21] – Factory farming analogy and why it's both useful and limited.
[11:06] – Would aligned, “happy” AIs still be ethically troubling?
[19:21] – Objective vs subjective theories of welfare for AIs.
[32:36] – What would an LLM's experience be like?
[44:05] – Biographical hallucinations and the 'Void'.
[61:47] – AI identity: models, instances, and the Parfit analogy.
[130:45] – Jack Lindsay's introspection experiments (AI noticing internal state changes).
[158:03] – Neuron replacement thought experiment and its implications for computational consciousness.
[170:49] – Founding of Eleos AI and core projects in AI welfare.
[183:01] – Open research questions: what matters; how to assess; what actions to take; future trajectories.
[200:53] – On epistemic hygiene & the need for rigor as the field grows.

The Future of AI Welfare: What Needs Addressing

Urgent need for pragmatic policy and research:
The field lacks rigorous evaluation tools and standardized protocols. Much more empirical work is needed.
Field-building is in its infancy:
Eleos AI and others seek diverse backgrounds (philosophy, neuroscience, AI, law) and encourage new entrants. Writing clearly and doing concrete experiments are highly valuable.
Uncertainty is pervasive:
We may never fully “solve” consciousness, but that shouldn’t preclude action. Doing our "homework" now is crucial to avoid ethical “lock-in” of harmful systems or institutions.
Practical advice for aspiring contributors:
Reach out, self-educate, write clearly, and don't be afraid to start hands-on. Many skills transfer—even making stickers helps field-building!

“You can do stuff that feels kind of dumb or ask questions that might sound kind of dumb to you. A, they're probably not, and B, it's a new field. It's hard, so just go for it.” [194:23]

Conclusion

This conversation is a comprehensive primer on the cutting edge of AI consciousness research—charting the moral, technical, and foundational questions that confront society as we move toward the possibility of digital minds. Eleos AI, spearheaded by Robert Long and team, is paving the way for responsible field-building, mindful policy, and interdisciplinary research.
But as Long repeatedly reminds listeners: the future will be weird, confusion and emotional reactions are inevitable, and the single most important thing is to “stay sane” and keep striving for rigor and clarity—there’s no playbook, but there’s important work for everyone.

Further Resources Mentioned:

Eleos AI: https://elios.ai
Patrick Butlin, Kyle Fish, Jeff Sebo: key contributors
“Taking AI Welfare Seriously” and other forthcoming Eleos reports

This summary was created for individuals who want a comprehensive, quote-rich overview of the episode’s main arguments, lines of reasoning, and practical implications—without needing to listen to the entire podcast.