Models model language model benchmark announce product analysis

Memory in the LLM Era: Modular Architectures and Strategies in a Unified Framework

arXiv cs.DBby Yanchen Wu, Tenghui Lin, Yingli Zhou, Fangyuan Zhang, Qintian Guo, Xun Zhou, Sibo Wang, Xilin Liu, Yuchi Ma, Yixiang FangApril 3, 20261 min read0 views

Source Quiz

arXiv:2604.01707v1 Announce Type: cross Abstract: Memory emerges as the core module in the large language model (LLM)-based agents for long-horizon complex tasks (e.g., multi-turn dialogue, game playing, scientific discovery), where memory can enable knowledge accumulation, iterative reasoning and self-evolution. A number of memory methods have been proposed in the literature. However, these methods have not been systematically and comprehensively compared under the same experimental settings. In this paper, we first summarize a unified framework that incorporates all the existing agent memory methods from a high-level perspective. We then extensively compare representative agent memory methods on two well-known benchmarks and examine the effectiveness of all methods, providing a thorough

View PDF HTML (experimental)

Abstract:Memory emerges as the core module in the large language model (LLM)-based agents for long-horizon complex tasks (e.g., multi-turn dialogue, game playing, scientific discovery), where memory can enable knowledge accumulation, iterative reasoning and self-evolution. A number of memory methods have been proposed in the literature. However, these methods have not been systematically and comprehensively compared under the same experimental settings. In this paper, we first summarize a unified framework that incorporates all the existing agent memory methods from a high-level perspective. We then extensively compare representative agent memory methods on two well-known benchmarks and examine the effectiveness of all methods, providing a thorough analysis of those methods. As a byproduct of our experimental analysis, we also design a new memory method by exploiting modules in the existing methods, which outperforms the state-of-the-art methods. Finally, based on these findings, we offer promising future research opportunities. We believe that a deeper understanding of the behavior of existing methods can provide valuable new insights for future research.

Subjects:

Computation and Language (cs.CL); Databases (cs.DB)

Cite as: arXiv:2604.01707 [cs.CL]

(or arXiv:2604.01707v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2604.01707

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yanchen Wu [view email] [v1] Thu, 2 Apr 2026 07:19:20 UTC (1,246 KB)

Original source

arXiv cs.DB

https://arxiv.org/abs/2604.01707

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modellanguage modelbenchmark

ModelsLive

Google launches Gemma 4 with a broad licensing model - Techzine Global

Google launches Gemma 4 with a broad licensing model Techzine Global

Google News: DeepMind

1m41 minutes ago

ModelsLive

Google DeepMind unveils Gemma 4: Next-Gen AI models for advanced reasoning - financialexpress.com

Google DeepMind unveils Gemma 4: Next-Gen AI models for advanced reasoning financialexpress.com

Google News: DeepMind

1mabout 1 hour ago

ModelsFresh

The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration

arXiv:2603.22862v2 Announce Type: replace Abstract: Tool use enables large language models (LLMs) to access external information, invoke software systems, and act in digital environments beyond what can be solved from model parameters alone. Early research mainly studied whether a model could select and execute a correct single tool call. As agent systems evolve, however, the central problem has shifted from isolated invocation to multi-tool orchestration over long trajectories with intermediate state, execution feedback, changing environments, and practical constraints such as safety, cost, and verifiability. We comprehensively review recent progress in multi-tool LLM agents and analyzes the state of the art in this rapidly developing area. First, we unify task formulations and distinguis

arXiv cs.SE

1mabout 4 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 197 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

Memory in the LLM Era: Modular Architectures and Strategies in a Unified Framework

Submission history

Daily AI Digest

More about

Google launches Gemma 4 with a broad licensing model - Techzine Global

Google DeepMind unveils Gemma 4: Next-Gen AI models for advanced reasoning - financialexpress.com

The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Models

Google launches Gemma 4 with a broad licensing model - Techzine Global

Anthropic Races to Contain Leak of Code Behind Claude AI Agent - WSJ

Google DeepMind unveils Gemma 4: Next-Gen AI models for advanced reasoning - financialexpress.com

The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration