Beyond the Answer: Decoding the Behavior of LLMs as Scientific Reasoners
arXiv:2603.28038v1 Announce Type: new Abstract: As Large Language Models (LLMs) achieve increasingly sophisticated performance on complex reasoning tasks, current architectures serve as critical proxies for the internal heuristics of frontier models. Characterizing emergent reasoning is vital for long-term interpretability and safety. Furthermore, understanding how prompting modulates these processes is essential, as natural language will likely be the primary interface for interacting with AGI systems. In this work, we use a custom variant of Genetic Pareto (GEPA) to systematically optimize p — Rohan Pandey, Eric Ye, Michael Li
View PDF HTML (experimental)
Abstract:As Large Language Models (LLMs) achieve increasingly sophisticated performance on complex reasoning tasks, current architectures serve as critical proxies for the internal heuristics of frontier models. Characterizing emergent reasoning is vital for long-term interpretability and safety. Furthermore, understanding how prompting modulates these processes is essential, as natural language will likely be the primary interface for interacting with AGI systems. In this work, we use a custom variant of Genetic Pareto (GEPA) to systematically optimize prompts for scientific reasoning tasks, and analyze how prompting can affect reasoning behavior. We investigate the structural patterns and logical heuristics inherent in GEPA-optimized prompts, and evaluate their transferability and brittleness. Our findings reveal that gains in scientific reasoning often correspond to model-specific heuristics that fail to generalize across systems, which we call "local" logic. By framing prompt optimization as a tool for model interpretability, we argue that mapping these preferred reasoning structures for LLMs is an important prerequisite for effectively collaborating with superhuman intelligence.
Comments: Accepted at the Post-AGI Science and Society Workshop at ICLR 2026
Subjects:
Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2603.28038 [cs.AI]
(or arXiv:2603.28038v1 [cs.AI] for this version)
https://doi.org/10.48550/arXiv.2603.28038
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Michael Li [view email] [v1] Mon, 30 Mar 2026 05:01:07 UTC (256 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers

Brain-inspired chip could make some AI tasks up to 2,000 times more energy efficient
A new type of computer chip that uses the physics of materials to process information could make some artificial intelligence (AI) systems far more energy efficient, researchers have found. Loughborough University physicists have developed a device that can process data that changes over time directly in hardware, rather than relying on software running on conventional computers.


Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!