Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP

arXiv · March 31, 2026 · 10 min read

arXiv:2603.27277v1 · Announce Type: cross

Authors: Martin Vogel, Falk Meyer-Eschenbach, Severin Kohler, Elias Grünewald, Felix Balzer


Abstract: Large Language Model (LLM) coding agents typically explore codebases through repeated file-reading and grep-searching, consuming thousands of tokens per query without structural understanding. We present Codebase-Memory, an open-source system that constructs a persistent, Tree-Sitter-based knowledge graph via the Model Context Protocol (MCP), parsing 66 languages through a multi-phase pipeline with parallel worker pools, call-graph traversal, impact analysis, and community discovery. Evaluated across 31 real-world repositories, Codebase-Memory achieves 83% answer quality versus 92% for a file-exploration agent, at ten times fewer tokens and 2.1 times fewer tool calls. For graph-native queries such as hub detection and caller ranking, it matches or exceeds the explorer on 19 of 31 languages.
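To make the graph-native queries named in the abstract concrete, here is a minimal sketch of impact analysis and hub detection over a call graph. It is not the paper's implementation: the edge list, the function names (`build_graph`, `impact_of`, `rank_hubs`), and the example symbols are all hypothetical, and the edges are assumed to have already been extracted by a parser such as Tree-sitter.

```python
from collections import defaultdict, deque

# Hypothetical (caller, callee) edges, standing in for parser output.
CALL_EDGES = [
    ("main", "load_config"),
    ("main", "run_server"),
    ("run_server", "handle_request"),
    ("handle_request", "parse_json"),
    ("load_config", "parse_json"),
    ("handle_request", "log_event"),
    ("run_server", "log_event"),
    ("main", "log_event"),
]


def build_graph(edges):
    """Index edges in both directions: forward (callees) and reverse (callers)."""
    callees = defaultdict(set)
    callers = defaultdict(set)
    for caller, callee in edges:
        callees[caller].add(callee)
        callers[callee].add(caller)
    return callees, callers


def impact_of(symbol, callers):
    """Impact analysis: every function that transitively calls `symbol`."""
    seen = set()
    queue = deque([symbol])
    while queue:
        current = queue.popleft()
        for caller in callers.get(current, ()):
            if caller not in seen:
                seen.add(caller)
                queue.append(caller)
    return seen


def rank_hubs(callers, top_n=3):
    """Hub detection: rank functions by number of distinct direct callers."""
    ranked = sorted(callers.items(), key=lambda item: (-len(item[1]), item[0]))
    return [(name, len(who)) for name, who in ranked[:top_n]]


callees, callers = build_graph(CALL_EDGES)
impacted = impact_of("parse_json", callers)
hubs = rank_hubs(callers)
print(sorted(impacted))
print(hubs)
```

Each query here is answered by a single lookup or traversal over a prebuilt index rather than by re-reading source files, which is the kind of saving in tokens and tool calls the abstract reports for a persistent graph.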

Comments: 10 pages, 5 authors, preprint

Subjects: Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Programming Languages (cs.PL)

ACM classes: D.2.3; D.2.7; D.3.4; H.3.3; I.2.2

Cite as: arXiv:2603.27277 [cs.SE]

(or arXiv:2603.27277v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2603.27277

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Martin Vogel
[v1] Sat, 28 Mar 2026 14:18:12 UTC (25 KB)
