Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessDeveloper’s Guide to Building ADK Agents with SkillsGoogle Developers BlogCargill Wins 2026 BIG Artificial Intelligence Excellence Award - foodmarket.comGoogle News: AIMeet the Agentic AI Design-to-Source Workspace for PLM: From CAD to Confident Sourcing Decisions - Oracle BlogsGNews AI agenticYouTube blasted by hundreds of experts over ‘AI slop’ videos served up to kidsFast Company TechZenity Emphasizes Security Controls for Expanding Enterprise AI Agent Ecosystems - TipRanksGoogle News: AI SafetyApono Uses Gamified AI Security Exercise to Engage Cloud Security Community - TipRanksGoogle News: AI SafetyUniversity of Colorado delays student rollout of ChatGPT Edu - Boulder Daily CameraGoogle News: ChatGPTSpaceX finally files for IPO, targets $1.75 trillion valuationArs TechnicaMeta’s natural gas binge could power South DakotaTechCrunch AIYour AI Vendor's Worst Enemy Is Its Own Development Pipeline - GovInfoSecurityGoogle News: Machine LearningLegal AI startup Legora hits $100 million in annual recurring revenueBusiness InsiderAnthropic's leaked AI coding tool has been cloned over 8,000 times on GitHub despite mass takedownsThe DecoderBlack Hat USADark ReadingBlack Hat AsiaAI BusinessDeveloper’s Guide to Building ADK Agents with SkillsGoogle Developers BlogCargill Wins 2026 BIG Artificial Intelligence Excellence Award - foodmarket.comGoogle News: AIMeet the Agentic AI Design-to-Source Workspace for PLM: From CAD to Confident Sourcing Decisions - Oracle BlogsGNews AI agenticYouTube blasted by hundreds of experts over ‘AI slop’ videos served up to kidsFast Company TechZenity Emphasizes Security Controls for Expanding Enterprise AI Agent Ecosystems - TipRanksGoogle News: AI SafetyApono Uses Gamified AI Security Exercise to Engage Cloud Security Community - TipRanksGoogle News: AI SafetyUniversity of Colorado delays student rollout of ChatGPT Edu - Boulder Daily CameraGoogle News: ChatGPTSpaceX finally files for IPO, targets $1.75 trillion valuationArs TechnicaMeta’s natural gas binge could power South DakotaTechCrunch AIYour AI Vendor's Worst Enemy Is Its Own Development Pipeline - GovInfoSecurityGoogle News: Machine LearningLegal AI startup Legora hits $100 million in annual recurring revenueBusiness InsiderAnthropic's leaked AI coding tool has been cloned over 8,000 times on GitHub despite mass takedownsThe Decoder

Steering Sparse Autoencoder Latents to Control Dynamic Head Pruning in Vision Transformers (Student Abstract)

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.26743v1 Announce Type: cross Abstract: Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads, but existing pruning policies are often difficult to interpret and control. In this work, we propose a novel framework by integrating Sparse Autoencoders (SAEs) with dynamic pruning, leveraging their ability to disentangle dense embeddings into interpretable and controllable sparse latents. Specifically, we train an SAE on the final-layer residual embedding of the ViT and amplify the sparse latents with different strategies to alter pru — Yousung Lee, Dongsoo Har

View PDF HTML (experimental)

Abstract:Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads, but existing pruning policies are often difficult to interpret and control. In this work, we propose a novel framework by integrating Sparse Autoencoders (SAEs) with dynamic pruning, leveraging their ability to disentangle dense embeddings into interpretable and controllable sparse latents. Specifically, we train an SAE on the final-layer residual embedding of the ViT and amplify the sparse latents with different strategies to alter pruning decisions. Among them, per-class steering reveals compact, class-specific head subsets that preserve accuracy. For example, bowl improves accuracy (76% to 82%) while reducing head usage (0.72 to 0.33) via heads h2 and h5. These results show that sparse latent features enable class-specific control of dynamic pruning, effectively bridging pruning efficiency and mechanistic interpretability in ViTs.

Comments: 3 pages, 5 figures. Accepted as AAAI 2026 Student Abstract. Includes additional appendix with extended analysis

Subjects:

Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Cite as: arXiv:2603.26743 [cs.CV]

(or arXiv:2603.26743v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26743

arXiv-issued DOI via DataCite (pending registration)

Journal reference: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2026), Vol. 40, No. 48, pp. 41263-41265

Related DOI:

https://doi.org/10.1609/aaai.v40i48.42236

DOI(s) linking to related resources

Submission history

From: Yousung Lee [view email] [v1] Mon, 23 Mar 2026 07:08:19 UTC (3,632 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Steering Sp…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 200 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers