Research Papers research paper arxiv computer-vision image-recognition

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

arXivMarch 30, 202610 min read0 views

arXiv:2603.26126v1 Announce Type: new Abstract: Recent advances in Reinforcement Learning with Verifiable Rewards (RLVR) for multimodal large language models (MLLMs) have mainly focused on improving final answer correctness and strengthening visual grounding. However, a critical bottleneck remains: although models can attend to relevant visual regions, they often fail to effectively incorporate visual evidence into subsequent reasoning, leading to reasoning chains that are weakly grounded in visual facts. To address this issue, we propose Trajectory-Guided Reinforcement Learning (TGRL), which — Jinda Lu, Junkang Wu, Jinghan Li, Kexin Huang, Shuo Yang, Mingzhu Chen, Jiancan Wu, Kuien Liu, Xiang Wang

View PDF HTML (experimental)

Abstract:Recent advances in Reinforcement Learning with Verifiable Rewards (RLVR) for multimodal large language models (MLLMs) have mainly focused on improving final answer correctness and strengthening visual grounding. However, a critical bottleneck remains: although models can attend to relevant visual regions, they often fail to effectively incorporate visual evidence into subsequent reasoning, leading to reasoning chains that are weakly grounded in visual facts. To address this issue, we propose Trajectory-Guided Reinforcement Learning (TGRL), which guides the policy model to integrate visual evidence into fine-grained reasoning processes using expert reasoning trajectories from stronger models. We further introduce token-level reweighting and trajectory filtering to ensure stable and effective policy optimization. Extensive experiments on multiple multimodal reasoning benchmarks demonstrate that TGRL consistently improves reasoning performance and effectively bridges the gap between visual perception and logical reasoning.

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26126 [cs.CV]

(or arXiv:2603.26126v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26126

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Jinghan Li [view email] [v1] Fri, 27 Mar 2026 07:18:18 UTC (510 KB)

Original source

arXiv

https://arxiv.org/abs/2603.26126

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Countries

From climate storytelling to AI innovation: Rice researchers take on global challenges at SXSW - Rice University

From climate storytelling to AI innovation: Rice researchers take on global challenges at SXSW Rice University

GNews AI climate

1m16 days ago

Research PapersLive

🔮 Autoresearch and the experimental society - exponentialview.co

🔮 Autoresearch and the experimental society exponentialview.co

Google News: Machine Learning

1mabout 1 hour ago

Models

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models WSJ

Google News: LLM

1m2 days ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 179 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersLive

🔮 Autoresearch and the experimental society - exponentialview.co

🔮 Autoresearch and the experimental society exponentialview.co

Google News: Machine Learning

1mabout 1 hour ago

Research PapersLive

Springing into AI: PyTorch Conference Europe and ICLR 2026

Article URL: https://www.collabora.com/news-and-blog/news-and-events/springing-into-ai-pytorch-conference-europe-and-iclr-2026.html Comments URL: https://news.ycombinator.com/item?id=47619120 Points: 2 # Comments: 0

Hacker News AI Top

1mabout 1 hour ago

Research Papers

Vector researchers presenting more than 98 papers at NeurIPS 2024

Leading researchers from Vector are presenting groundbreaking research at this year s Conference on Neural Information Processing Systems (NeurIPS). The conference, taking place December 10-15 in Vancouver and online, showcases innovative [ ] The post Vector researchers presenting more than 98 papers at NeurIPS 2024 appeared first on Vector Institute for Artificial Intelligence .

Vector Institute

1mover 1 year ago

Research Papers

Enterprise AI vs. Consumer AI: What’s the Difference? - Oracle

Enterprise AI vs. Consumer AI: What’s the Difference? Oracle

GNews AI UK

1m24 days ago