
Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP

arXiv, March 31, 2026

arXiv:2603.27277v1 (announce type: cross)

Authors: Martin Vogel, Falk Meyer-Eschenbach, Severin Kohler, Elias Grünewald, Felix Balzer


Abstract: Large Language Model (LLM) coding agents typically explore codebases through repeated file-reading and grep-searching, consuming thousands of tokens per query without structural understanding. We present Codebase-Memory, an open-source system that constructs a persistent, Tree-Sitter-based knowledge graph via the Model Context Protocol (MCP), parsing 66 languages through a multi-phase pipeline with parallel worker pools, call-graph traversal, impact analysis, and community discovery. Evaluated across 31 real-world repositories, Codebase-Memory achieves 83% answer quality versus 92% for a file-exploration agent, at ten times fewer tokens and 2.1 times fewer tool calls. For graph-native queries such as hub detection and caller ranking, it matches or exceeds the explorer on 19 of 31 languages.
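To make the graph-native queries named in the abstract concrete, here is a minimal Python sketch of caller ranking and impact analysis over a toy call graph. It is not the Codebase-Memory implementation: the edge list `CALLS` and the helpers `build_reverse_index`, `rank_hubs`, and `impact_of` are hypothetical names chosen for illustration, and the edges are written by hand rather than extracted with Tree-Sitter.

```python
from collections import defaultdict, deque

# Hypothetical (caller, callee) edges. A real system would extract these
# by parsing source files; here they are hand-written for illustration.
CALLS = [
    ("main", "load_config"),
    ("main", "run"),
    ("run", "parse"),
    ("run", "log"),
    ("parse", "log"),
    ("load_config", "log"),
]


def build_reverse_index(calls):
    """Map each callee to the set of functions that call it directly."""
    callers = defaultdict(set)
    for caller, callee in calls:
        callers[callee].add(caller)
    return callers


def rank_hubs(calls):
    """Rank functions by number of distinct direct callers (in-degree)."""
    callers = build_reverse_index(calls)
    ranked = [(name, len(who)) for name, who in callers.items()]
    # Highest in-degree first; ties broken alphabetically for stable output.
    return sorted(ranked, key=lambda item: (-item[1], item[0]))


def impact_of(calls, changed):
    """Return every function that reaches `changed` through the call graph,
    i.e. the transitive callers that might be affected by editing it."""
    callers = build_reverse_index(calls)
    seen = set()
    queue = deque([changed])
    while queue:
        node = queue.popleft()
        for caller in callers.get(node, ()):
            if caller not in seen:
                seen.add(caller)
                queue.append(caller)
    return seen


if __name__ == "__main__":
    print(rank_hubs(CALLS)[0])              # ('log', 3)
    print(sorted(impact_of(CALLS, "parse")))  # ['main', 'run']
```

Each query here is a single lookup or breadth-first traversal over an index built once, which is the general reason a persistent graph can answer such questions with fewer tool calls than repeated file reads and grep searches.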

Comments: 10 pages, 5 authors, preprint

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)

ACM classes: D.2.3; D.2.7; D.3.4; H.3.3; I.2.2

Cite as: arXiv:2603.27277 [cs.SE]

(or arXiv:2603.27277v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2603.27277

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Martin Vogel
[v1] Sat, 28 Mar 2026 14:18:12 UTC (25 KB)

Original source: arXiv, https://arxiv.org/abs/2603.27277



