CellARC: Measuring Intelligence with Cellular Automata

Published 2025 in arXiv.org

ABSTRACT

We introduce CellARC, a synthetic benchmark for abstraction and reasoning built from multicolor 1D cellular automata (CA). Each episode has five support pairs and one query serialized in 256 tokens, enabling rapid iteration with small models while exposing a controllable task space with explicit knobs for alphabet size k, radius r, rule family, Langton's lambda, query coverage, and cell entropy. We release 95k training episodes plus two 1k test splits (interpolation/extrapolation) and evaluate symbolic, recurrent, convolutional, transformer, recursive, and LLM baselines. CellARC decouples generalization from anthropomorphic priors, supports unlimited difficulty-controlled sampling, and enables reproducible studies of how quickly models infer new rules under tight budgets. Our strongest small-model baseline (a 10M-parameter vanilla transformer) outperforms recent recursive models (TRM, HRM), reaching 58.0%/32.4% per-token accuracy on the interpolation/extrapolation splits, while a large closed model (GPT-5 High) attains 62.3%/48.1% on subsets of 100 test tasks. An ensemble that chooses per episode between the Transformer and the best symbolic baseline reaches 65.4%/35.5%, highlighting neuro-symbolic complementarity. Leaderboard: https://cellarc.mireklzicar.com

PUBLICATION RECORD

Publication year
2025
Venue
arXiv.org
Publication date
2025-11-11
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2511.07908 arXiv 2511.07908
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

ARC-AGI-2: A New Challenge for Frontier AI Reasoning Systems
2025cited by this paper
Addressing the Abstraction and Reasoning Corpus via Procedural Example Generation
2024cited by this paper
CAX: Cellular Automata Accelerated in JAX
2024cited by this paper
Chain of Thought Prompting Elicits Reasoning in Large Language Models
2022cited by this paper
RoFormer: Enhanced Transformer with Rotary Position Embedding
2021cited by this paper
Language Models are Few-Shot Learners
2020cited by this paper
GLU Variants Improve Transformer
2020cited by this paper
Root Mean Square Layer Normalization
2019cited by this paper
On First-Order Meta-Learning Algorithms
2018cited by this paper
Universal Transformers
2018cited by this paper
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
2017cited by this paper
Attention is All you Need
2017influential reference
Adaptive Computation Time for Recurrent Neural Networks
2016cited by this paper
Cellular Automata A Discrete Universe
2016cited by this paper
Hybrid computing using a neural network with dynamic external memory
2016cited by this paper
Neural GPUs Learn Algorithms
2015cited by this paper
Neural Turing Machines
2014cited by this paper
A measure of intelligence
2012cited by this paper
Growth and Decay in Life-Like Cellular Automata
2009cited by this paper
A New Kind of Science
2003cited by this paper
Finite-State Transducers in Language and Speech Processing
1997cited by this paper
Mollusc Shell Pigmentation: Cellular Automaton Simulations and Evidence for Undecidability
1996cited by this paper
De Bruijn Graphs and Linear Cellular Automata
1991influential reference
Computation at the edge of chaos: Phase transitions and emergent computation
1990influential reference
Algebraic properties of cellular automata
1984cited by this paper
Statistical mechanics of cellular automata
1983cited by this paper
Endomorphisms and automorphisms of the shift dynamical system
1969cited by this paper
Theory Of Self Reproducing Automata
1967cited by this paper
A method for synthesizing sequential circuits
1955cited by this paper
Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences is currently published by The Royal
1951cited by this paper
A combinatorial problem
1946cited by this paper
On Computable Numbers, with an Application to the Entscheidungsproblem.
1937cited by this paper

CITED BY

No citing papers are available for this paper.