Research Papers research paper arxiv ai artificial-intelligence

Steering Sparse Autoencoder Latents to Control Dynamic Head Pruning in Vision Transformers (Student Abstract)

arXivMarch 31, 202610 min read0 views

arXiv:2603.26743v1 Announce Type: cross Abstract: Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads, but existing pruning policies are often difficult to interpret and control. In this work, we propose a novel framework by integrating Sparse Autoencoders (SAEs) with dynamic pruning, leveraging their ability to disentangle dense embeddings into interpretable and controllable sparse latents. Specifically, we train an SAE on the final-layer residual embedding of the ViT and amplify the sparse latents with different strategies to alter pru — Yousung Lee, Dongsoo Har

View PDF HTML (experimental)

Abstract:Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads, but existing pruning policies are often difficult to interpret and control. In this work, we propose a novel framework by integrating Sparse Autoencoders (SAEs) with dynamic pruning, leveraging their ability to disentangle dense embeddings into interpretable and controllable sparse latents. Specifically, we train an SAE on the final-layer residual embedding of the ViT and amplify the sparse latents with different strategies to alter pruning decisions. Among them, per-class steering reveals compact, class-specific head subsets that preserve accuracy. For example, bowl improves accuracy (76% to 82%) while reducing head usage (0.72 to 0.33) via heads h2 and h5. These results show that sparse latent features enable class-specific control of dynamic pruning, effectively bridging pruning efficiency and mechanistic interpretability in ViTs.

Comments: 3 pages, 5 figures. Accepted as AAAI 2026 Student Abstract. Includes additional appendix with extended analysis

Subjects:

Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

Cite as: arXiv:2603.26743 [cs.CV]

(or arXiv:2603.26743v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26743

arXiv-issued DOI via DataCite (pending registration)

Journal reference: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2026), Vol. 40, No. 48, pp. 41263-41265

DOI(s) linking to related resources

Submission history

From: Yousung Lee [view email] [v1] Mon, 23 Mar 2026 07:08:19 UTC (3,632 KB)

Original source

arXiv

https://arxiv.org/abs/2603.26743

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ModelsFresh

AI models will secretly scheme to protect other AI models from being shut down, researchers find

Leading AI models will inflate performance reviews, exfiltrate model weights to prevent 'peer' AI models from being shut down

Fortune Tech

1mabout 2 hours ago

ModelsLive

AI alignment researchers want to automate themselves - Transformer | Substack

<a href="https://news.google.com/rss/articles/CBMiiwFBVV95cUxQTTlsWE8xQzg4Rlg4RW5fVUE4Nkc4WkN0WkRISmhvUnFndnpUMFlkcHNvZGQyQ1JRdm81Wmp6bGhzdnZyT295MFl2bmh3dTNpWWNmaXdUMnNNNGhkWEFHZXhiS0w5cm5GZGc3THJkeVEyYlRSM3pPZUNJejlqOHVoZkE4SXk0bGRHMGE4?oc=5" target="_blank">AI alignment researchers want to automate themselves</a> Transformer | Substack

Google News: AI Safety

1mabout 1 hour ago

ModelsRecent

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

<a href="https://news.google.com/rss/articles/CBMiuANBVV95cUxQT01EWURiSlJKbk1kZ2pSQ3BSUHFGRVpSdnBNdE1EMmtzUDJYemduTWJPa1FsZEw3RUdPQWt5WnlvMU9Ya0FKWjdBaHIyWEFoRzJHLTBhdnZCbTZxZ0JwdjJQMDMzY09rSmpabDNyc1JGRjI4Y1pBOXBZcnk0dzJ3Q25hMlkzLXhRRHl4YUF0R1lUSGdyQ2xfcm9DN1lyN01SbnNza2pmUmVDcVNVbHFXTXRUYkd2U1BxSXdqRzJpQ2JlMVVESW1qeGxHVG44enlSRXlZamJUS1RTdE56MllEQ0M3blB4dEJwNURrZzNjNWxROGc3cDJ2b1ZqeExFN0E5MEEzZWJDR3luVFNfRlBDdWxtMDBHMklmRWN4M3VjX3B3SjJXZFdJUHNTc2FBQmhjdjF0ZXFMV2hZWVdLS00wenpUZGVGelVQdXNxUWNUTUd5RXowR090dXBLcjdZVndOZXM2QzBFRkFDTllQLW16YWNwWlR2T0JzMENNbXNUanduSmZudm1rM0MtaS1CV0RodE9JRzBjMDBid3V1MDhaX0piWW1ocUlxMTBEWGd6QW9UNG1CMFlMMw?oc=5" target="_blank">Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models</a> WSJ

Google News: LLM

1m1 day ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 200 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersFresh

The Quantum Threat to Bitcoin Dividing Crypto

Two papers published this week have reignited debates about the risk posed by “Q-day” to the cryptography that underpins digital assets.

Decrypt AI

1mabout 2 hours ago

Research PapersFresh

Researchers to use robotics and AI to help sheep producers - University of Nevada, Reno

<a href="https://news.google.com/rss/articles/CBMic0FVX3lxTFB4UmxpREpFODBJN0lKakYwRVVtdlZPNmNiTExRelVFaDYzYW9kX2RCc0pEZjlmX01fT1dWYTlxZE1ET2ZKVVgzSVZIenY3bDlHa3FXS1dUdVBmTEdLa1hUR2x3OWxHbkE2RnROSjl6VHVHQ2c?oc=5" target="_blank">Researchers to use robotics and AI to help sheep producers</a> University of Nevada, Reno

Google News: AI

1mabout 3 hours ago

Research PapersFresh

AIRA_2: Breaking Bottlenecks In AI Research Agents - Forbes

<a href="https://news.google.com/rss/articles/CBMiowFBVV95cUxNNmtndHhmQ2lpZGdPdTJwY25xejcyV1c1SWNLdWFOWnNwbjRUQTF0ZWdOZFNaclNBNWVsaUgtU0JUM2xrakhoOXVLMVJzVTNkajdrMmJGeS1lYUpMUG1NMkZNMDJFREZZdXU2ZVdEbkNZSDNBRjJBLVYyZE9XeEY4T0RJY3J5aDVWcEZVQ2lWUjhUYXBsUk16d09NdGdsQ3lxb3gw?oc=5" target="_blank">AIRA_2: Breaking Bottlenecks In AI Research Agents</a> Forbes

Google News: Machine Learning

1mabout 3 hours ago

Research PapersFresh

Can Science Predict When a Study Won’t Hold Up?

Conducting research is hard; confirming the results is, too. And artificial intelligence isn’t yet ready to help, a major new study finds.

NYT Technology

1mabout 4 hours ago