Learning to Commit: Generating Organic Pull Requests via Online Repository Memory
The Learning to Commit framework enhances LLM coding agents by using historical commit patterns to improve code organicity and adherence to project-specific conventions.
Published on Mar 27 · Submitted by Mo Li on Mar 30
Abstract
Large language model (LLM)-based coding agents achieve impressive results on controlled benchmarks yet routinely produce pull requests that real maintainers reject. The root cause is not functional incorrectness but a lack of organicity: generated code ignores project-specific conventions, duplicates functionality already provided by internal APIs, and violates implicit architectural constraints accumulated over years of development. Simply exposing an agent to the latest repository snapshot is not enough: the snapshot reveals the final state of the codebase, but not the repository-specific change patterns by which that state was reached. We introduce Learning to Commit, a framework that closes this gap through Online Repository Memory. Given a repository with a strict chronological split, the agent performs supervised contrastive reflection on earlier commits: it blindly attempts to resolve each historical issue, compares its prediction against the oracle diff, and distils the gap into a continuously growing set of skills: reusable patterns capturing coding style, internal API usage, and architectural invariants. When a new PR description arrives, the agent conditions its generation on these accumulated skills, producing changes grounded in the project's own evolution rather than generic pretraining priors. Evaluation is conducted on genuinely future, merged pull requests that could not have been seen during the skill-building phase, and spans multiple dimensions including functional correctness, code-style consistency, internal API reuse rate, and modified-region plausibility. Experiments on an expert-maintained repository with rich commit history show that Online Repository Memory effectively improves organicity scores on held-out future tasks.
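The skill-building loop described above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: every name here (the agent's `attempt`, `compare`, `distil`, and `generate` methods, and the `Skill`/`RepositoryMemory` containers) is a hypothetical interface invented for clarity.

```python
"""Hedged sketch of Online Repository Memory: contrastive reflection on
earlier commits, then skill-conditioned generation for future PRs.
All class and method names are illustrative assumptions."""
from dataclasses import dataclass, field


@dataclass
class Skill:
    """A distilled, reusable pattern: a style rule, an internal API
    usage, or an architectural invariant."""
    description: str


@dataclass
class RepositoryMemory:
    """Continuously growing set of skills accumulated during reflection."""
    skills: list = field(default_factory=list)

    def add(self, skill: Skill) -> None:
        self.skills.append(skill)


def build_memory(agent, history):
    """Supervised contrastive reflection over the earlier side of a
    strict chronological split. `history` yields (issue, oracle_diff)
    pairs from past commits."""
    memory = RepositoryMemory()
    for issue, oracle_diff in history:
        predicted = agent.attempt(issue)          # blind attempt, no oracle
        gap = agent.compare(predicted, oracle_diff)
        for pattern in agent.distil(gap):         # gap -> reusable skills
            memory.add(Skill(pattern))
    return memory


def resolve_new_pr(agent, memory, pr_description):
    """Condition generation on accumulated skills for a genuinely
    future task, grounding output in the project's own evolution."""
    return agent.generate(pr_description, skills=memory.skills)
```

The key design point the abstract emphasizes is the strict chronological split: `build_memory` only ever sees commits that predate the evaluation tasks, so skills cannot leak information about the held-out future PRs.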
Get this paper in your agent:
    hf papers read 2603.26664
Don't have the latest CLI? Install it with:
    curl -LsSf https://hf.co/cli/install.sh | bash
