Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessWHY use OBIX?DEV CommunityReact Server Components: What Actually Changes in Your ArchitectureDEV CommunityWhat is Base UI and why are Developers switching to it?DEV Community250 Clones in 4 Days: A Student's Journey Building an AI Security ToolDEV CommunityAdobe CEO Shantanu Narayen Steps Down: The Subscription Empire He BuiltDEV CommunityHow to Choose the Best Crypto Exchange for Bot Trading in 2026DEV CommunityOpenClaw Dreaming Guide 2026: Background Memory Consolidation for AI AgentsDEV CommunityStop Managing Browser Sessions Yourself. Use Steel and ConvexDEV CommunityHow I replaced 200 lines of Zod refinements with 12DEV CommunityI Tracked Every AI Suggestion for a Week — Here's What I Actually ShippedDEV Community10 LLM Engineering Concepts Explained in 10 MinutesKDnuggetsGoogle is updating Gemini to add a UI that triggers support hotline referrals and a "help is available" module when chats indicate potential crises like suicide (Mark Bergen/Bloomberg)TechmemeBlack Hat USADark ReadingBlack Hat AsiaAI BusinessWHY use OBIX?DEV CommunityReact Server Components: What Actually Changes in Your ArchitectureDEV CommunityWhat is Base UI and why are Developers switching to it?DEV Community250 Clones in 4 Days: A Student's Journey Building an AI Security ToolDEV CommunityAdobe CEO Shantanu Narayen Steps Down: The Subscription Empire He BuiltDEV CommunityHow to Choose the Best Crypto Exchange for Bot Trading in 2026DEV CommunityOpenClaw Dreaming Guide 2026: Background Memory Consolidation for AI AgentsDEV CommunityStop Managing Browser Sessions Yourself. Use Steel and ConvexDEV CommunityHow I replaced 200 lines of Zod refinements with 12DEV CommunityI Tracked Every AI Suggestion for a Week — Here's What I Actually ShippedDEV Community10 LLM Engineering Concepts Explained in 10 MinutesKDnuggetsGoogle is updating Gemini to add a UI that triggers support hotline referrals and a "help is available" module when chats indicate potential crises like suicide (Mark Bergen/Bloomberg)Techmeme
AI NEWS HUBbyEIGENVECTOREigenvector

Beyond Localization: Recoverable Headroom and Residual Frontier in Repository-Level RAG-APR

arXiv cs.SEby Pengtao Zhao, Boyang Yang, Bach Le, Feng Liu, Haoye TianApril 1, 20261 min read0 views
Source Quiz

arXiv:2603.29067v1 Announce Type: new Abstract: Repository-level automated program repair (APR) increasingly treats stronger localization as the main path to better repair. We ask a more targeted question: once localization is strengthened, which post-localization levers still provide recoverable gains, which are bounded within our protocol, and what residual frontier remains? We study this question on SWE-bench Lite with three representative repository-level RAG-APR paradigms, Agentless, KGCompass, and ExpeRepair. Our protocol combines Oracle Localization, within-pool Best-of-K, fixed-interface added context probes with per-condition same-token filler controls and same-repository hard negatives, and a common-wrapper oracle check. Oracle Localization improves all three systems, but Oracle

View PDF

Abstract:Repository-level automated program repair (APR) increasingly treats stronger localization as the main path to better repair. We ask a more targeted question: once localization is strengthened, which post-localization levers still provide recoverable gains, which are bounded within our protocol, and what residual frontier remains? We study this question on SWE-bench Lite with three representative repository-level RAG-APR paradigms, Agentless, KGCompass, and ExpeRepair. Our protocol combines Oracle Localization, within-pool Best-of-K, fixed-interface added context probes with per-condition same-token filler controls and same-repository hard negatives, and a common-wrapper oracle check. Oracle Localization improves all three systems, but Oracle success still stays below 50%. Extra candidate diversity still helps inside the sampled 10-patch pools, but that headroom saturates quickly. Under the two fixed interfaces, most informative added context conditions still outperform their own matched controls. The common-wrapper check shows different system responses: under a common wrapper, gains remain large for KGCompass and ExpeRepair, while Agentless changes more with builder choice. Prompt-level fusion still leaves a large residual frontier: the best fixed probe adds only 6 solved instances beyond the native three-system Solved@10 union. Overall, stronger localization, bounded search, evidence quality, and interface design all shape repository-level repair outcomes.

Subjects:

Software Engineering (cs.SE)

Cite as: arXiv:2603.29067 [cs.SE]

(or arXiv:2603.29067v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2603.29067

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Pengtao Zhao [view email] [v1] Mon, 30 Mar 2026 23:10:19 UTC (621 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

announcestudyinterface

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Beyond Loca…announcestudyinterfaceagentarxivrepositoryarXiv cs.SE

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 234 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers