Research Papers research paper arxiv ai artificial-intelligence

From Human Cognition to Neural Activations: Probing the Computational Primitives of Spatial Reasoning in LLMs

arXivby [Submitted on 27 Mar 2026]March 30, 20262 min read1 views

arXiv:2603.26323v1 Announce Type: cross Abstract: As spatial intelligence becomes an increasingly important capability for foundation models, it remains unclear whether large language models' (LLMs) performance on spatial reasoning benchmarks reflects structured internal spatial representations or reliance on linguistic heuristics. We address this question from a mechanistic perspective by examining how spatial information is internally represented and used. Drawing on computational theories of human spatial cognition, we decompose spatial reasoning into three primitives, relational compositio — Jiyuan An, Liner Yang, Mengyan Wang, Luming Lu, Weihua An, Erhong Yang

View PDF HTML (experimental)

Abstract:As spatial intelligence becomes an increasingly important capability for foundation models, it remains unclear whether large language models' (LLMs) performance on spatial reasoning benchmarks reflects structured internal spatial representations or reliance on linguistic heuristics. We address this question from a mechanistic perspective by examining how spatial information is internally represented and used. Drawing on computational theories of human spatial cognition, we decompose spatial reasoning into three primitives, relational composition, representational transformation, and stateful spatial updating, and design controlled task families for each. We evaluate multilingual LLMs in English, Chinese, and Arabic under single pass inference, and analyze internal representations using linear probing, sparse autoencoder based feature analysis, and causal interventions. We find that task relevant spatial information is encoded in intermediate layers and can causally influence behavior, but these representations are transient, fragmented across task families, and weakly integrated into final predictions. Cross linguistic analysis further reveals mechanistic degeneracy, where similar behavioral performance arises from distinct internal pathways. Overall, our results suggest that current LLMs exhibit limited and context dependent spatial representations rather than robust, general purpose spatial reasoning, highlighting the need for mechanistic evaluation beyond benchmark accuracy.

Subjects:

Computation and Language (cs.CL); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.26323 [cs.CL]

(or arXiv:2603.26323v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.26323

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Jiyuan An [view email] [v1] Fri, 27 Mar 2026 11:42:36 UTC (3,424 KB)

Original source

arXiv

https://arxiv.org/abs/2603.26323

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ProductsRecent

Uniformed Services University Introduces Web App for Ethical AI Use in Medical Research - Uniformed Services University

Uniformed Services University Introduces Web App for Ethical AI Use in Medical Research Uniformed Services University

GNews AI healthcare

1m1 day ago

Research PapersFresh

New research could empower people without AI expertise to help create trustworthy AI applications

Involving people without AI expertise in the development and evaluation of artificial intelligence applications could help create better, fairer, and more trustworthy automated decision-making systems, new research suggests. After enlisting members of the public to evaluate the potential impacts of two real-world applications, researchers from UK universities will present a paper at a major international computing conference which suggests how "participatory AI auditing" could improve AI decision-making in the future.

TechXplore AI

1mabout 3 hours ago

Models

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - wsj.com

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models wsj.com

Google News: LLM

1m2 days ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 140 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersFresh

New research could empower people without AI expertise to help create trustworthy AI applications

TechXplore AI

1mabout 3 hours ago

Research PapersFresh

Google Research touts memory-compression breakthrough for AI processing - Network World

Google Research touts memory-compression breakthrough for AI processing Network World

GNews AI Google

1mabout 4 hours ago

Research Papers

Consistency Amplifies: How Behavioral Variance Shapes Agent Accuracy

Analysis of behavioral consistency in large language model agents reveals that while consistent performance correlates with higher accuracy, consistency can amplify both correct and incorrect interpretations, emphasizing that accurate interpretation is more crucial than execution consistency for production deployment. (2 upvotes on HuggingFace)

HuggingFace Papers

2m8 days ago

Research PapersRecent

A Survey of On-Policy Distillation for Large Language Models

On-Policy Distillation for large language models unifies diverse approaches through an f-divergence framework organized by feedback signals, teacher access, and loss granularity. (4 upvotes on HuggingFace)

HuggingFace Papers

2m1 day ago