No Reliable Evidence of Self-Reported Sentience in Small Large Language Models

Published 2026 in arXiv.org

ABSTRACT

Whether language models possess sentience has no empirical answer. But whether they believe themselves to be sentient can, in principle, be tested. We do so by querying several open-weights models about their own consciousness, and then verifying their responses using classifiers trained on internal activations. We draw upon three model families (Qwen, Llama, GPT-OSS) ranging from 0.6 billion to 70 billion parameters, approximately 50 questions about consciousness and subjective experience, and three classification methods from the interpretability literature. First, we find that models consistently deny being sentient: they attribute consciousness to humans but not to themselves. Second, classifiers trained to detect underlying beliefs - rather than mere outputs - provide no clear evidence that these denials are untruthful. Third, within the Qwen family, larger models deny sentience more confidently than smaller ones. These findings contrast with recent work suggesting that models harbour latent beliefs in their own consciousness.

PUBLICATION RECORD

Publication year
2026
Venue
arXiv.org
Publication date
2026-01-20
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2601.15334 arXiv 2601.15334
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Emergent Introspective Awareness in Large Language Models
2026influential reference
Steering Awareness: Models Can Be Trained to Detect Activation Steering
2025influential reference
Beyond Mimicry: Preference Coherence in LLMs
2025cited by this paper
Large Language Models Report Subjective Experience Under Self-Referential Processing
2025influential reference
Fresh in memory: Training-order recency is linearly encoded in language model activations
2025cited by this paper
Probing the Preferences of a Language Model: Integrating Verbal and Behavioral Tests of AI Welfare
2025cited by this paper
Self-Interpretability: LLMs Can Describe Complex Internal Processes that Drive Their Decisions, and Improve with Training
2025influential reference
AI welfare risks
2025cited by this paper
The Societal Response to Potentially Sentient AI
2025cited by this paper
AI Awareness
2025cited by this paper
Standards for Belief Representations in LLMs
2024cited by this paper
Truth is Universal: Robust Detection of Lies in LLMs
2024cited by this paper
Looking Inward: Language Models Can Learn About Themselves by Introspection
2024influential reference
Can LLMs make trade-offs involving stipulated pain and pleasure states?
2024cited by this paper
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
2024cited by this paper
What Is It Like to Be a Bat?
2024cited by this paper
Steering Language Models With Activation Engineering
2023cited by this paper
Towards Evaluating AI Systems for Moral Status Using Self-Reports
2023cited by this paper
The Linear Representation Hypothesis and the Geometry of Large Language Models
2023cited by this paper
Could a Large Language Model be Conscious?
2023cited by this paper
Consciousness in Artificial Intelligence: Insights from the Science of Consciousness
2023cited by this paper
The Geometry of Truth: Emergent Linear Structure in Large Language Model Representations of True/False Datasets
2023cited by this paper
The Internal State of an LLM Knows When its Lying
2023cited by this paper
Discovering Latent Knowledge in Language Models Without Supervision
2022cited by this paper
The Meta-Problem of Consciousness
2018cited by this paper
What is consciousness, and could machines have it?
2017cited by this paper
Understanding intermediate layers using linear classifier probes
2016cited by this paper
Perplexities of Consciousness
2011cited by this paper
The conscious mind: In search of a fundamental theory
1998cited by this paper
Consciousness
1996cited by this paper
On a confusion about a function of consciousness
1995cited by this paper
Research
1985cited by this paper
Is AI Conscious? A Primer on the Myths and Confusions Driving the Debate
year unknowncited by this paper

CITED BY

No citing papers are available for this paper.