
Limits of Imagery Reasoning in Frontier LLM Models

arXiv, March 31, 2026

arXiv:2603.26779v1 (Announce Type: cross)

Sergio Y. Hayashi, Nina S. T. Hirata


Abstract: Large Language Models (LLMs) have demonstrated impressive reasoning capabilities, yet they struggle with spatial tasks that require mental simulation, such as mental rotation. This paper investigates whether equipping an LLM with an external "Imagery Module," a tool capable of rendering and rotating 3D models, can bridge this gap, functioning as a "cognitive prosthetic." We conducted experiments using a dual-module architecture in which a reasoning module (an MLLM) interacts with an imagery module on 3D model rotation tasks. Performance was lower than expected, with accuracy reaching at most 62.5%. Further investigation suggests that even when the burden of maintaining and manipulating a holistic 3D state is outsourced, the system still fails. This reveals that current frontier models lack the foundational visual-spatial primitives required to interface with imagery. Specifically, they lack: (1) the low-level sensitivity to extract spatial signals such as (a) depth, (b) motion, and (c) short-horizon dynamic prediction; and (2) the capacity to reason contemplatively over images, dynamically shifting visual focus and balancing imagery with symbolic and associative information.
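To make the dual-module idea concrete, here is a minimal sketch of what an imagery tool for a mental rotation task might look like. It is not the authors' implementation: the `ImageryModule` class, its `rotate` and `render` methods, the `same_object` helper, and the example shape are all hypothetical names chosen for illustration. The reasoning module is an MLLM in the paper; here it is replaced by a brute-force search over 90-degree rotations, and the "rendering" is just a sorted list of rounded coordinates rather than an image.

```python
import math
from itertools import product


class ImageryModule:
    """Hypothetical stand-in for an imagery tool: stores 3D points and rotates them."""

    def __init__(self, vertices):
        self.vertices = [tuple(float(c) for c in v) for v in vertices]

    def rotate(self, axis, degrees):
        """Rotate every vertex about a coordinate axis ('x', 'y', or 'z') through the origin."""
        t = math.radians(degrees)
        c, s = math.cos(t), math.sin(t)
        rotated = []
        for x, y, z in self.vertices:
            if axis == "x":
                rotated.append((x, c * y - s * z, s * y + c * z))
            elif axis == "y":
                rotated.append((c * x + s * z, y, -s * x + c * z))
            elif axis == "z":
                rotated.append((c * x - s * y, s * x + c * y, z))
            else:
                raise ValueError(f"unknown axis: {axis!r}")
        self.vertices = rotated

    def render(self):
        """Toy 'image': the current points, rounded to absorb float error, in sorted order."""
        return sorted(tuple(round(c, 6) for c in v) for v in self.vertices)


def same_object(shape_a, shape_b, step=90):
    """Brute-force stand-in for the reasoning module (assumption, not an MLLM).

    Tries every combination of x, y, z rotations in `step`-degree increments
    and reports whether any of them makes shape_a coincide with shape_b.
    """
    target = ImageryModule(shape_b).render()
    for rx, ry, rz in product(range(0, 360, step), repeat=3):
        probe = ImageryModule(shape_a)
        probe.rotate("x", rx)
        probe.rotate("y", ry)
        probe.rotate("z", rz)
        if probe.render() == target:
            return True
    return False


# A small chiral figure made of unit steps along x, then y, then z (illustrative only).
SHAPE = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (2, 1, 0), (2, 1, 1)]

# A rotated copy of the figure, and a mirrored copy (z negated).
_copy = ImageryModule(SHAPE)
_copy.rotate("x", 90)
_copy.rotate("z", 180)
ROTATED = _copy.vertices
MIRRORED = [(x, y, -z) for x, y, z in SHAPE]

print(same_object(SHAPE, ROTATED))
print(same_object(SHAPE, MIRRORED))
```

In this toy setting the rotated copy is recognized as the same object and the mirrored copy is not, because the search has exact access to the 3D state. The paper's finding is that a frontier model driving such a tool through rendered images still fails often, which points to the perception and visual reasoning steps rather than the rotation itself.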

Comments: 25 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.26779 [cs.CV]

(or arXiv:2603.26779v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26779

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Sergio Hayashi Y

[v1] Wed, 25 Mar 2026 01:17:13 UTC (965 KB)

Original source

arXiv

https://arxiv.org/abs/2603.26779
