
Your Models Have Thought Enough: Training Large Reasoning Models to Stop Overthinking

arXiv, March 31, 2026


Authors: Jinyi Han, Ying Huang, Ying Liao, Zishang Jiang, Xikun Lu, Haiquan Zhao, Xinyi Wang, Guanghao Zhou, Sihang Jiang, Jiaqing Liang, Weikang Zhou, Zeye Sun, Fei Yu, Yanghua Xiao


Abstract: Large Reasoning Models (LRMs) have achieved impressive performance on challenging tasks, yet their deep reasoning often incurs substantial computational costs. Existing reinforcement learning methods for efficient reasoning still struggle to construct short reasoning paths during the rollout stage, which limits effective learning. Inspired by Evidence Accumulation Models, we find that LRMs have accumulated sufficient information early in reasoning, making further reasoning steps redundant. Based on this insight, we propose Just-Enough Thinking (JET), which trains models to proactively terminate unnecessary reasoning. JET performs trajectory truncation during rollout to expose the model to short, distributionally consistent reasoning paths. In addition, it uses a quality-controlled length reward to better encourage concise reasoning while maintaining correctness. Extensive experiments demonstrate that JET significantly improves reasoning efficiency without sacrificing accuracy. Notably, DeepSeek-Distill-Qwen-1.5B achieves a 4.6% accuracy gain while reducing output length by 46.3% on the Olympiad benchmark. Our code is available on GitHub.
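The two ingredients named in the abstract, trajectory truncation during rollout and a length reward conditioned on answer quality, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the truncation schedule or the reward formula, so the `Rollout` type, the fixed truncation fractions, the `bonus` weight, and the linear brevity term below are all assumptions chosen only to make the idea concrete.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Rollout:
    """Hypothetical container for one sampled reasoning trace."""
    tokens: List[str]  # reasoning tokens produced by the model
    correct: bool      # whether the final answer was judged correct


def truncate_prefixes(tokens: List[str], fractions: List[float]) -> List[List[str]]:
    """Cut a full trace into prefixes at the given fractions of its length.

    Assumption: prefixes of the model's own trace stand in for the
    "short, distributionally consistent reasoning paths" in the abstract.
    """
    prefixes = []
    for fraction in fractions:
        cut = max(1, int(len(tokens) * fraction))  # keep at least one token
        prefixes.append(tokens[:cut])
    return prefixes


def length_reward(rollouts: List[Rollout], bonus: float = 0.5) -> List[float]:
    """Correctness-gated length reward (assumed form, not the paper's).

    Incorrect rollouts get 0. Correct rollouts get 1 plus a brevity bonus
    that grows linearly as the trace gets shorter relative to the other
    correct rollouts in the same group.
    """
    correct_lengths = [len(r.tokens) for r in rollouts if r.correct]
    if not correct_lengths:
        return [0.0] * len(rollouts)
    shortest, longest = min(correct_lengths), max(correct_lengths)
    span = max(1, longest - shortest)  # avoid division by zero
    rewards = []
    for r in rollouts:
        if not r.correct:
            rewards.append(0.0)  # brevity never pays for a wrong answer
        else:
            brevity = (longest - len(r.tokens)) / span
            rewards.append(1.0 + bonus * brevity)
    return rewards


if __name__ == "__main__":
    trace = ["step"] * 10
    print([len(p) for p in truncate_prefixes(trace, [0.25, 0.5, 1.0])])
    group = [
        Rollout(["step"] * 10, True),
        Rollout(["step"] * 5, True),
        Rollout(["step"] * 2, False),
    ]
    print(length_reward(group))
```

Gating the brevity bonus on correctness is the design choice that matters in this sketch: a truncated trace is only rewarded for being short if it still reaches the right answer, which mirrors the abstract's stated aim of encouraging concise reasoning while maintaining correctness.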

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Cite as: arXiv:2509.23392 [cs.AI]

(or arXiv:2509.23392v3 [cs.AI] for this version)

https://doi.org/10.48550/arXiv.2509.23392

arXiv-issued DOI via DataCite

Submission history

From: Jinyi Han
[v1] Sat, 27 Sep 2025 16:25:06 UTC (579 KB)
[v2] Sun, 5 Oct 2025 13:54:32 UTC (579 KB)
[v3] Mon, 30 Mar 2026 15:21:37 UTC (780 KB)

Original source

arXiv

https://arxiv.org/abs/2509.23392
