Self-Improvement of Large Language Models: A Technical Overview and Future Outlook
As large language models (LLMs) continue to advance, improving them solely through human supervision is becoming increasingly costly and limited in scalability. As models approach human-level capabilities in certain domains, human feedback may no longer provide sufficiently informative signals for further improvement. At the same time, the growing ability of models to make autonomous decisions and execute complex actions naturally enables abstractions in which components of the model development process can be progressively automated. Together, these challenges and opportunities have driven in — Haoyan Yang, Mario Xerri, Solha Park
View PDF
Abstract:As large language models (LLMs) continue to advance, improving them solely through human supervision is becoming increasingly costly and limited in scalability. As models approach human-level capabilities in certain domains, human feedback may no longer provide sufficiently informative signals for further improvement. At the same time, the growing ability of models to make autonomous decisions and execute complex actions naturally enables abstractions in which components of the model development process can be progressively automated. Together, these challenges and opportunities have driven increasing interest in self-improvement, where models autonomously generate data, evaluate outputs, and iteratively refine their own capabilities. In this paper, we present a system-level perspective on self-improving language models and introduce a unified framework that organizes existing techniques. We conceptualize the self-improvement system as a closed-loop lifecycle, consisting of four tightly coupled processes: data acquisition, data selection, model optimization, and inference refinement, along with an autonomous evaluation layer. Within this framework, the model itself plays a central role in driving each stage: collecting or generating data, selecting informative signals, updating its parameters, and refining outputs, while the autonomous evaluation layer continuously monitors progress and guides the improvement cycle across stages. Following this lifecycle perspective, we systematically review and analyze representative methods for each component from a technical standpoint. We further discuss current limitations and outline our vision for future research toward fully self-improving LLMs.
Subjects:
Computation and Language (cs.CL)
Cite as: arXiv:2603.25681 [cs.CL]
(or arXiv:2603.25681v1 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2603.25681
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Haoyan Yang [view email] [v1] Thu, 26 Mar 2026 17:32:37 UTC (3,179 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxiv
Stop Writing Boilerplate. Start Building: Introducing app-generator-cli
Last Updated on April 2, 2026 by Editorial Team Author(s): Rajendra Kumar Yadav, M.Sc (CS) Originally published on Towards AI. Scaffold production-ready FastAPI, LangChain, and full-stack Python projects in seconds — powered by uv. You have a great idea. You open your terminal, create a new folder, and then… you spend the next 60–90 minutes doing the same thing you always do. ai generate imageThe article introduces app-generator-cli, a command-line tool designed to eliminate the repetitive boilerplate tax experienced by Python developers, streamlining the setup of common backend projects like FastAPI and LangChain. It discusses the tool s ability to scaffold production-ready templates for different use cases, its ease of installation via pip, and optional flags for customization. Additiona
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers

Quantum computers might crack today's encryption far sooner than we thought
According to a study by engineers at Caltech and the UC Department of Physics, quantum computers do not need to be nearly as powerful as previously believed to crack the most advanced cryptographic technologies. The research claims that Shor's algorithm could break RSA public-key encryption using quantum computers with just... Read Entire Article


.jpg)


Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!