
Structural Graph Probing of Vision-Language Models

arXiv, March 31, 2026

Authors: Haoyu He, Yue Zhuo, Yu Zheng, Qi R. Wang


Abstract: Vision-language models (VLMs) achieve strong multimodal performance, yet how computation is organized across populations of neurons remains poorly understood. In this work, we study VLMs through the lens of neural topology, representing each layer as a within-layer correlation graph derived from neuron-neuron co-activations. This view allows us to ask whether population-level structure is behaviorally meaningful, how it changes across modalities and depth, and whether it identifies causally influential internal components under intervention. We show that correlation topology carries recoverable behavioral signal; moreover, cross-modal structure progressively consolidates with depth around a compact set of recurrent hub neurons, whose targeted perturbation substantially alters model output. Neural topology thus emerges as a meaningful intermediate scale for VLM interpretability: richer than local attribution, more tractable than full circuit recovery, and empirically tied to multimodal behavior. Code is publicly available at this https URL.
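To make the idea of a within-layer correlation graph concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation: the abstract does not specify how edges are defined, how hubs are ranked, or how perturbation is applied. The absolute-correlation threshold, the use of node degree as the hub score, zero-ablation as the intervention, and all function names (`correlation_graph`, `hub_neurons`, `ablate`) are assumptions chosen for illustration, and the activations are synthetic rather than taken from a VLM.

```python
import numpy as np


def correlation_graph(activations, threshold=0.5):
    """Build a boolean adjacency matrix from neuron-neuron co-activations.

    activations: array of shape (n_samples, n_neurons) for one layer.
    threshold:   assumed cutoff on |Pearson correlation| for drawing an edge.
    """
    corr = np.corrcoef(activations, rowvar=False)  # (n_neurons, n_neurons)
    corr = np.nan_to_num(corr)                     # constant neurons give NaN
    adj = np.abs(corr) >= threshold
    np.fill_diagonal(adj, False)                   # no self-loops
    return adj


def hub_neurons(adj, k=5):
    """Rank neurons by degree (an assumed hub score) and return the top k."""
    degree = adj.sum(axis=1)
    order = np.argsort(-degree, kind="stable")     # ties broken by index
    return order[:k], degree


def ablate(activations, neurons):
    """Zero out the selected neurons; one simple form of targeted perturbation."""
    out = activations.copy()
    out[:, neurons] = 0.0
    return out


# Synthetic layer: 20 neurons, the first 5 driven by a shared latent signal.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 1))
acts = rng.normal(size=(500, 20))
acts[:, :5] += 3.0 * latent

adj = correlation_graph(acts, threshold=0.5)
hubs, degree = hub_neurons(adj, k=5)
perturbed = ablate(acts, hubs)

print("hub neurons:", hubs.tolist())
print("degrees:", degree.tolist())
```

In a real study the rows of `activations` would be hidden states collected from one layer over a set of image-text inputs, and the effect of `ablate` would be measured on the model's output rather than on the activations themselves.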

Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.27070 [cs.CV]

(or arXiv:2603.27070v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.27070

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Haoyu He
[v1] Sat, 28 Mar 2026 01:14:40 UTC (167 KB)
