Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessI Built an MCP Server That Understands Your MSBuild Project Graph — Before You BuildDEV CommunityAgent Middleware in Microsoft Agent Framework 1.0DEV Communityکود کشاورزی — Complete GuideDEV CommunityHow I Track My AI Spending as a Solo Dev (Without Going Broke)DEV CommunityWe Shipped an AI Song Generator. The Hardest Part Wasn't the AI.DEV CommunityPOTS explained: The disorder that forced OpenAI exec Fidji Simo to take medical leaveBusiness InsiderWhat is POTS, the disorder that forced OpenAI exec Fidji Simo to take medical leave - Business InsiderGoogle News: OpenAI"I Built a Web Browser from Scratch in 42 Days — No Libraries, Just Node.js"DEV CommunityWhy 80% of AI Projects Fail — And the 4-Layer Readiness Framework That Changes the OddsTowards AIWhy Your Data Governance is Already ObsoleteAI YouTube Channel 35How to Clean Up Xcode and Free 30-50GB on Your MacDEV CommunityNo Fooling, Spaceballs 2 Will Hit Theaters April 2027GizmodoBlack Hat USADark ReadingBlack Hat AsiaAI BusinessI Built an MCP Server That Understands Your MSBuild Project Graph — Before You BuildDEV CommunityAgent Middleware in Microsoft Agent Framework 1.0DEV Communityکود کشاورزی — Complete GuideDEV CommunityHow I Track My AI Spending as a Solo Dev (Without Going Broke)DEV CommunityWe Shipped an AI Song Generator. The Hardest Part Wasn't the AI.DEV CommunityPOTS explained: The disorder that forced OpenAI exec Fidji Simo to take medical leaveBusiness InsiderWhat is POTS, the disorder that forced OpenAI exec Fidji Simo to take medical leave - Business InsiderGoogle News: OpenAI"I Built a Web Browser from Scratch in 42 Days — No Libraries, Just Node.js"DEV CommunityWhy 80% of AI Projects Fail — And the 4-Layer Readiness Framework That Changes the OddsTowards AIWhy Your Data Governance is Already ObsoleteAI YouTube Channel 35How to Clean Up Xcode and Free 30-50GB on Your MacDEV CommunityNo Fooling, Spaceballs 2 Will Hit Theaters April 2027Gizmodo
AI NEWS HUBbyEIGENVECTOREigenvector

Friends and Grandmothers in Silico: Localizing Entity Cells in Language Models

arXiv cs.CLby [Submitted on 1 Apr 2026]April 4, 20262 min read1 views
Source Quiz

arXiv:2604.01404v1 Announce Type: new Abstract: Language models can answer many entity-centric factual questions, but it remains unclear which internal mechanisms are involved in this process. We study this question across multiple language models. We localize entity-selective MLP neurons using templated prompts about each entity, and then validate them with causal interventions on PopQA-based QA examples. On a curated set of 200 entities drawn from PopQA, localized neurons concentrate in early layers. Negative ablation produces entity-specific amnesia, while controlled injection at a placeholder token improves answer retrieval relative to mean-entity and wrong-cell controls. For many entities, activating a single localized neuron is sufficient to recover entity-consistent predictions once

View PDF HTML (experimental)

Abstract:Language models can answer many entity-centric factual questions, but it remains unclear which internal mechanisms are involved in this process. We study this question across multiple language models. We localize entity-selective MLP neurons using templated prompts about each entity, and then validate them with causal interventions on PopQA-based QA examples. On a curated set of 200 entities drawn from PopQA, localized neurons concentrate in early layers. Negative ablation produces entity-specific amnesia, while controlled injection at a placeholder token improves answer retrieval relative to mean-entity and wrong-cell controls. For many entities, activating a single localized neuron is sufficient to recover entity-consistent predictions once the context is initialized, consistent with compact entity retrieval rather than purely gradual enrichment across depth. Robustness to aliases, acronyms, misspellings, and multilingual forms supports a canonicalization interpretation. The effect is strong but not universal: not every entity admits a reliable single-neuron handle, and coverage is higher for popular entities. Overall, these results identify sparse, causally actionable access points for analyzing and modulating entity-conditioned factual behavior.

Subjects:

Computation and Language (cs.CL); Artificial Intelligence (cs.AI)

Cite as: arXiv:2604.01404 [cs.CL]

(or arXiv:2604.01404v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2604.01404

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Itay Yona [view email] [v1] Wed, 1 Apr 2026 21:09:06 UTC (1,732 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modellanguage modelannounce

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Friends and…modellanguage mo…announcepredictionstudyarxivarXiv cs.CL

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 121 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models