Steering Sparse Autoencoder Latents to Control Dynamic Head Pruning in Vision Transformers (Student Abstract)
arXiv:2603.26743v1 Announce Type: cross Abstract: Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads, but existing pruning policies are often difficult to interpret and control. In this work, we propose a novel framework by integrating Sparse Autoencoders (SAEs) with dynamic pruning, leveraging their ability to disentangle dense embeddings into interpretable and controllable sparse latents. Specifically, we train an SAE on the final-layer residual embedding of the ViT and amplify the sparse latents with different strategies to alter pru — Yousung Lee, Dongsoo Har
View PDF HTML (experimental)
Abstract:Dynamic head pruning in Vision Transformers (ViTs) improves efficiency by removing redundant attention heads, but existing pruning policies are often difficult to interpret and control. In this work, we propose a novel framework by integrating Sparse Autoencoders (SAEs) with dynamic pruning, leveraging their ability to disentangle dense embeddings into interpretable and controllable sparse latents. Specifically, we train an SAE on the final-layer residual embedding of the ViT and amplify the sparse latents with different strategies to alter pruning decisions. Among them, per-class steering reveals compact, class-specific head subsets that preserve accuracy. For example, bowl improves accuracy (76% to 82%) while reducing head usage (0.72 to 0.33) via heads h2 and h5. These results show that sparse latent features enable class-specific control of dynamic pruning, effectively bridging pruning efficiency and mechanistic interpretability in ViTs.
Comments: 3 pages, 5 figures. Accepted as AAAI 2026 Student Abstract. Includes additional appendix with extended analysis
Subjects:
Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2603.26743 [cs.CV]
(or arXiv:2603.26743v1 [cs.CV] for this version)
https://doi.org/10.48550/arXiv.2603.26743
arXiv-issued DOI via DataCite (pending registration)
Journal reference: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2026), Vol. 40, No. 48, pp. 41263-41265
Related DOI:
https://doi.org/10.1609/aaai.v40i48.42236
DOI(s) linking to related resources
Submission history
From: Yousung Lee [view email] [v1] Mon, 23 Mar 2026 07:08:19 UTC (3,632 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxivAI alignment researchers want to automate themselves - Transformer | Substack
<a href="https://news.google.com/rss/articles/CBMiiwFBVV95cUxQTTlsWE8xQzg4Rlg4RW5fVUE4Nkc4WkN0WkRISmhvUnFndnpUMFlkcHNvZGQyQ1JRdm81Wmp6bGhzdnZyT295MFl2bmh3dTNpWWNmaXdUMnNNNGhkWEFHZXhiS0w5cm5GZGc3THJkeVEyYlRSM3pPZUNJejlqOHVoZkE4SXk0bGRHMGE4?oc=5" target="_blank">AI alignment researchers want to automate themselves</a> <font color="#6f6f6f">Transformer | Substack</font>
Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ
<a href="https://news.google.com/rss/articles/CBMiuANBVV95cUxQT01EWURiSlJKbk1kZ2pSQ3BSUHFGRVpSdnBNdE1EMmtzUDJYemduTWJPa1FsZEw3RUdPQWt5WnlvMU9Ya0FKWjdBaHIyWEFoRzJHLTBhdnZCbTZxZ0JwdjJQMDMzY09rSmpabDNyc1JGRjI4Y1pBOXBZcnk0dzJ3Q25hMlkzLXhRRHl4YUF0R1lUSGdyQ2xfcm9DN1lyN01SbnNza2pmUmVDcVNVbHFXTXRUYkd2U1BxSXdqRzJpQ2JlMVVESW1qeGxHVG44enlSRXlZamJUS1RTdE56MllEQ0M3blB4dEJwNURrZzNjNWxROGc3cDJ2b1ZqeExFN0E5MEEzZWJDR3luVFNfRlBDdWxtMDBHMklmRWN4M3VjX3B3SjJXZFdJUHNTc2FBQmhjdjF0ZXFMV2hZWVdLS00wenpUZGVGelVQdXNxUWNUTUd5RXowR090dXBLcjdZVndOZXM2QzBFRkFDTllQLW16YWNwWlR2T0JzMENNbXNUanduSmZudm1rM0MtaS1CV0RodE9JRzBjMDBid3V1MDhaX0piWW1ocUlxMTBEWGd6QW9UNG1CMFlMMw?oc=5" target="_blank">Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models</a> <font color="#6f6f6f">WSJ</font>
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers
Researchers to use robotics and AI to help sheep producers - University of Nevada, Reno
<a href="https://news.google.com/rss/articles/CBMic0FVX3lxTFB4UmxpREpFODBJN0lKakYwRVVtdlZPNmNiTExRelVFaDYzYW9kX2RCc0pEZjlmX01fT1dWYTlxZE1ET2ZKVVgzSVZIenY3bDlHa3FXS1dUdVBmTEdLa1hUR2x3OWxHbkE2RnROSjl6VHVHQ2c?oc=5" target="_blank">Researchers to use robotics and AI to help sheep producers</a> <font color="#6f6f6f">University of Nevada, Reno</font>
AIRA_2: Breaking Bottlenecks In AI Research Agents - Forbes
<a href="https://news.google.com/rss/articles/CBMiowFBVV95cUxNNmtndHhmQ2lpZGdPdTJwY25xejcyV1c1SWNLdWFOWnNwbjRUQTF0ZWdOZFNaclNBNWVsaUgtU0JUM2xrakhoOXVLMVJzVTNkajdrMmJGeS1lYUpMUG1NMkZNMDJFREZZdXU2ZVdEbkNZSDNBRjJBLVYyZE9XeEY4T0RJY3J5aDVWcEZVQ2lWUjhUYXBsUk16d09NdGdsQ3lxb3gw?oc=5" target="_blank">AIRA_2: Breaking Bottlenecks In AI Research Agents</a> <font color="#6f6f6f">Forbes</font>




Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!