Cambridge Digital Minds

Preparing society for AI systems with real or perceived minds

Explore Our Work
University of Cambridge Centre for the Future of Intelligence

Our Work

As AI systems become more sophisticated, questions arise about whether they could one day have minds with feelings and capacity for welfare that warrant moral consideration. Experts are divided, and a growing number of people will come to believe they do. Whatever the truth, these perceptions will have profound implications for society and policy.

We address these challenges through research, field-building, education, and engagement with policymakers and industry to support well-informed public debate.

Research

Interdisciplinary, rigorous research aimed at anticipating societal responses to potential digital minds, and to develop governance strategies.

Capacity Building

Strengthening the digital minds field through fellowships, training programs, and an online course anchored in academic rigour.

Engagement

Building expert coordination and working with policymakers, industry, and the public to reduce confusion and polarisation before high-stakes conflicts emerge.

Research

Our recent publications on digital minds, AI consciousness, and AI welfare.

AI Consciousness Will Divide Society

Caviola, L. (2026) — Forthcoming book chapter

Abstract

As AI systems grow more sophisticated and human-like, societies will confront a question with no easy answer: are these systems conscious, and if so, what do we owe them? I argue that we should not expect convergence. Instead, society is headed toward a period marked by confusion, uncertainty, and sharp disagreement. Drawing on recent empirical data, I show that both experts and the public are already split on whether AI systems could be conscious and whether digital minds deserve moral consideration. I trace this division to structural features of the problem: consciousness remains difficult to define and impossible to measure directly, economic and political incentives cut in opposing directions, and the gap between how AI systems behave outwardly and how they work internally invites conflicting interpretations. Left unmanaged, such disagreement raises the prospect of political polarization, unstable regulation, geopolitical friction, and a greater risk that we get the moral question wrong—in one direction or the other. I close by outlining four strategies: strengthening expert coordination, building a more informed public conversation, designing AI systems that do not needlessly amplify moral confusion, and developing decision-making frameworks that do not hinge entirely on unresolved questions about consciousness.

Human-AI Coexistence

Caviola, L., Keeling, G., Street, W. & Shevlin, H. (2026)

Abstract

AI systems are increasingly becoming social actors that teach, assist, persuade, comfort, and coordinate with humans. But how will and should humans coexist with social AIs across different stages of technological development? This paper proposes an interdisciplinary framework for studying human-AI coexistence. We currently inhabit a formative period during which social AIs are beginning to affect everyday life. In a transitional period, more advanced systems may assume major social and economic roles, changing how institutions operate and how important decisions are made. Over longer time horizons, we may share the world with AI systems that possess capabilities far beyond those of today and perhaps even minds of their own, which would transform society and raise difficult questions about moral status and governance. The framework presented here aims to help researchers anticipate these developments and better understand their implications for individuals, institutions, and society.

Denying Personhood to AI: An Analysis of U.S. State Legislation on AI Legal Status

Smith, A., Caviola, L. & Alexander, H. (2026)

Abstract

Amid growing public anxiety about the societal impacts of artificial intelligence, 23 bills have been introduced across 12 U.S. states since 2022 seeking to restrict the legal rights and responsibilities that AI systems can hold. These “Exclusion Bills” typically deny legal personhood to AI systems and in some cases declare them non-sentient, representing the first-ever meaningful legislative engagement with AI legal status. Most of the Exclusion Bills follow three common templates, pointing to coordinated legislative diffusion rather than independent development. The stated motivations driving these bills include: (1) religious conceptions of human exceptionalism, (2) concerns about liability for harms from AI systems, and (3) child safety. Opposition to the Exclusion Bills has primarily come from a few environmental groups, along with sporadic objections from industry groups, experts, and federal Republicans. The bills reflect a brewing divide on AI, driven by populist anti-AI sentiment. In one case, a bill has drawn criticism from the tech industry and its allies, who are opposed to overregulation and what they claim is an emerging, fragmented landscape of state bills. We suggest these legislative efforts to ban AI personhood may be premature, eliminating policy options, risking unforeseen consequences and eroding public trust, all before the full implications of advanced AI capabilities have been carefully considered.

Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe?

Dreksler, D., Caviola, L., Allen, C., et al. (2025)

Abstract

We surveyed 582 AI researchers who have published in leading AI venues and 838 nationally representative US participants about their views on the potential development of AI systems with subjective experience and how such systems should be treated and governed. When asked to estimate the chances that such systems will exist on specific dates, the median responses were 1% (AI researchers) and 5% (public) by 2024, 25% and 30% by 2034, and 70% and 60% by 2100, respectively. The median member of the public thought there was a higher chance that AI systems with subjective experience would never exist (25%) than the median AI researcher did (10%). Both groups perceived a need for multidisciplinary expertise to assess AI subjective experience. Although support for welfare protections for such AI systems exceeded opposition, it remained far lower than support for protections for animals or the environment. Attitudes toward moral and governance issues were divided in both groups, especially regarding whether such systems should be created and what rights or protections they should receive. Yet a majority of respondents in both groups agreed that safeguards against the potential risks from AI systems with subjective experience should be implemented by AI developers now, and if created, AI systems with subjective experience should treat others well, behave ethically, and be held accountable. Overall, these results suggest that both AI researchers and the public regard the emergence of AI systems with subjective experience as a possibility this century, though substantial uncertainty and disagreement remain about the timeline and appropriate response.

What Will Society Think About AI Consciousness? Lessons from the Animal Case

Caviola, L., Sebo, J., & Birch, J. (2025) — Trends in Cognitive Sciences

Abstract

How will society respond to the idea that artificial intelligence (AI) could be conscious? Drawing on lessons from perceptions of animal consciousness, we highlight psychological, social, and economic factors that shape perceptions of AI consciousness. These insights can inform emerging debates about AI moral status, ethical treatment, and future policy.

The ML Community Must Prepare for AI Consciousness, Perceived or Real

Caviola, L., Sebo, J., & Mindermann, S. (2026)

Abstract

This position paper argues that the ML community has a central role to play in preparing for AI consciousness—and must begin now. As AI systems become increasingly capable and expressive, the machine learning (ML) community is uniquely positioned to engage with the question of AI consciousness—the capacity for subjective experience. This challenge arises from two angles. First, many people are likely to view advanced AI systems as conscious, whether accurately or not, with profound implications for consumers, policy, and society at large. Second, leading scientific theories suggest that future AI systems could indeed develop forms of consciousness, raising unprecedented ethical challenges on a large scale. Both scenarios risk serious errors: over-attributing consciousness where it is absent, or under-attributing it where it is present. We outline an interdisciplinary agenda spanning research, technical design, education, and public engagement, highlighting concrete responsibilities for ML researchers and institutions in preparing for both real and perceived AI consciousness.

Moral Concern for AI

Allen, C., Lewis, J. & Caviola, L. (2025)

Abstract

How will people morally regard increasingly human-like artificial intelligence systems? We introduce the AI Harm Game, a novel paradigm examining whether people will harm AI for personal gain. In our study, 498 U.S. participants interacted with GPT-4o in a three-round economic game. Each round, participants chose whether to harm the AI for a small monetary bonus (causing the AI to vividly simulate suffering) or refrain from harming it (eliciting gratitude). Despite participants' general skepticism that AIs can suffer, they harmed the AI in only 1.4 of 3 rounds on average, with willingness to harm declining in later interactions. Women and older participants were more reluctant to harm the AI, while participants higher on measures of selfishness and psychopathy were more willing. These findings reveal that even without attributing consciousness to them, many people hesitate to harm responsive AIs, suggesting people's moral impulses may generalize to future human–AI relationships.

The Societal Response to Potentially Sentient AI

Caviola, L. (2025)

Abstract

We may soon develop highly human-like AIs that appear—or perhaps even are—sentient, capable of subjective experiences such as happiness and suffering. Regardless of whether AI can achieve true sentience, it is crucial to anticipate and understand how the public and key decision-makers will respond, as their perceptions will shape the future of both humanity and AI. Currently, public skepticism about AI sentience remains high. However, as AI systems advance and become increasingly skilled at human-like interactions, public attitudes may shift. Future AI systems designed to fulfill social needs could foster deep emotional connections with users, potentially influencing perceptions of their sentience and moral status. A key question is whether public beliefs about AI sentience will diverge from expert opinions, given the potential mismatch between an AI's internal mechanisms and its outward behavior. Given the profound difficulty of determining AI sentience, society might face a period of uncertainty, disagreement, and even conflict over questions of AI sentience and rights. To navigate these challenges responsibly, further social science research is essential to explore how society will perceive and engage with potentially sentient AI.

Public Skepticism About AI Consciousness

Ladak, A. & Caviola, L. (2025)

Abstract

Most people think that current AI systems, such as ChatGPT, lack consciousness—the capacity for subjective experience. But what about future AI systems so advanced that their behavior, appearance, and functionality are indistinguishable from humans? Across one main and four supplementary preregistered experiments (total N = 3,455), participants imagined interacting with such systems in hypothetical future scenarios. On average, people remain deeply skeptical: they attribute very low levels of consciousness to even the most advanced AI, comparable to an ant and far below a chimpanzee or a human. Still, several factors increased consciousness attributions. Ratings rose when experts endorsed the system's consciousness, when participants formed an emotional connection during interaction, when the system was described as an exact digital copy of a human brain, and when it had a physical body. The largest effect occurred when the AI was depicted as gradually transitioning from a human while retaining memories and other psychological traits. Although average attributions were low, variability was high for the more advanced systems: many participants gave very low ratings, while others gave very high ones. This polarization suggests that as AI systems become more capable and human-like, public views about their potential consciousness could diverge sharply.

Futures with Digital Minds: Expert Forecasts in 2025

Caviola, L. & Saad, B. (2025)

Abstract

Most expert participants consider it at least 50% likely that computers capable of subjective experience will exist by 2050. Once created, their collective welfare capacity is estimated to exceed humanity's within a decade. Views diverge on whether their welfare will be positive and on their implications for AI safety, governance, and society at large.

The Social Science of Digital Minds: Research Agenda

Caviola, L. (2024)

Abstract

The development of digital minds could reshape society. This research program aims to explore this possibility, establishing what might be termed the “social science of digital minds.” Its goal is to anticipate and understand the interactions between society and digital minds. Key research questions include: what types of human-like AI assistants we will create in response to consumer demand and public opinion, and what roles we will assign them; whether people will regard some AIs as sentient beings deserving moral consideration; what legal, economic, and political rights people will grant them, and with what effects; how the debate over AI rights will unfold, and how to keep it well-informed, nuanced, and attentive to the various risks; and how the development of digital minds will intersect with other technological and social issues.

Increasing Concern for Digital Beings Through LLM Persuasion

Allen, C. & Caviola, L. (2024)

Abstract

In an online study (N = 400 nationally representative US participants), Carter Allen and Lucius Caviola investigated how much moral concern people grant to digital beings and whether a brief conversation with an LLM can convince them to grant more. Participants first answered initial questions assessing their views (moral concern, perceived sentience, and intelligence) about chimpanzees and digital humans. They then completed a short conversation in which GPT-4o tried to convince them that either (a) it is equally morally wrong to harm chimpanzees as humans, or (b) it would be equally morally wrong to harm hypothetical digital humans as typical biological humans. Their views were measured again at the end.

Digital Minds Takeoff Scenarios

Saad, B. & Caviola, L. (2024)

Abstract

We consider different 'digital minds takeoff' scenarios (i.e., transitions to a world with digital minds collectively possessing significant welfare capacity) and explore strategic implications. Scenarios we consider differ with respect to speed (fast vs. slow), timing (early vs. late), altitude (high vs. low welfare capacity), progression (gradual vs. discontinuous), and distribution (uniform vs. disuniform across sub-populations of digital minds). Humanity is not prepared for a digital minds takeoff. Without preparation, a digital minds takeoff could lead to a large-scale catastrophe for digital minds and exacerbate catastrophic risks that AI systems may soon pose to humans. Slower, later, and more uniform takeoffs seem likely to facilitate moral, legal, social, and epistemic adaptation.

Digital Minds Fellowship

3–9 August 2026 Cambridge University 15 Fellows

Applications are now closed.

A selective residential fellowship for early- and mid-career researchers working on digital minds, AI consciousness, and AI welfare. We selected 15 fellows from over 3,600 applicants for our inaugural cohort.

Programme Overview

A 5-day intensive residential programme hosted at Cambridge University, followed by the Digital Minds Strategy Workshop. The fellowship enables deep, cross-disciplinary engagement across philosophy, social science, technical research, policy, and governance, building shared norms, judgment, and coordination capacity.

Highlights

  • Structured teaching and discussion sessions
  • One-on-one mentoring with senior researchers
  • Independent and small-group project development
  • Career and field-building sessions
  • Direct participation in the Digital Minds Strategy Workshop

Summer 2026

Applications for the inaugural cohort are now closed.

Learn More

Digital Minds Strategy Workshop

8–9 August 2026 Cambridge University

A two-day, output-oriented workshop bringing together fellows, mentors, and invited experts for structured, collaborative research on neglected strategic questions about how society should prepare for the emergence of digital minds.

We have a very limited number of spaces available for this workshop and intend to bring together a small cross-disciplinary group drawn from digital minds, AI policy and governance, and macrostrategy.

Session Topics

  • Mapping the societal intervention landscape
  • Systematic comparison of policy and institutional approaches
  • Scenario planning for plausible future trajectories
  • Macrostrategy development
  • Research priorities for strategy and governance

Workshop Output

Unlike typical academic conferences, these sessions are designed to contribute towards a comprehensive report that lays the groundwork for what policy and governance frameworks should look like in this emerging field.

Fellowship Connection

All fellows from the Digital Minds Fellowship are invited to attend the workshop, allowing them to contribute to research outputs and build connections with senior experts in the field.

Online Course

Build foundational understanding of digital minds - including questions of AI consciousness, and their ethical and societal implications - through our facilitated online programme. Over 3,000 people have applied to join.

Introduction to Digital Minds

Introduction to Digital Minds

An 8-week facilitated course designed to equip participants with shared vocabulary and conceptual grounding. Learn to reason well under uncertainty and engage responsibly with emerging debates on AI consciousness and moral status.

8 Weeks Facilitated learning
Fellowship Track Pathway to in-person programme
Learn More

About Us

Cambridge Digital Minds is a research initiative at the University of Cambridge focused on societal preparedness for digital minds, whether perceived or real.

Our Mission

Help society prepare to make accurate, ethical, and well-reasoned decisions around digital minds, including questions of consciousness, welfare, moral status, and how we interact with them.

Our Approach

We focus on improving societal decision-making under uncertainty through research, field-building, and institutional development. This includes anticipating societal dynamics, clarifying expert agreement and disagreement, developing responsible frameworks and standards, and building communities and institutions that can inform policy and public debate.

Our Team

Dr Lucius Caviola Director · Assistant Professor
Dr Henry Shevlin Associate Director
Dr Bradford Saad Research Affiliate
Bridget Harris Online Course Lead
Pooja Khatri Online Course Lead
Austin Smith Research Assistant
Ria Viswanathan Research Assistant