Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechFair decisions, clear reasons: Creating fuzzy AI with fairness built in from the startPhys.org AIICE says it bought Paragon s spyware to use in drug trafficking casesTechCrunchMitigating collusive self-preference by redaction and paraphrasinglesswrong.comOpenClaw Unlocks China’s AI Token Export Business - Bloomberg.comGNews AI ChinaDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIM-Files and Microsoft deepen strategic AI partnership - FinTech GlobalGNews AI CopilotGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPTBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechFair decisions, clear reasons: Creating fuzzy AI with fairness built in from the startPhys.org AIICE says it bought Paragon s spyware to use in drug trafficking casesTechCrunchMitigating collusive self-preference by redaction and paraphrasinglesswrong.comOpenClaw Unlocks China’s AI Token Export Business - Bloomberg.comGNews AI ChinaDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIM-Files and Microsoft deepen strategic AI partnership - FinTech GlobalGNews AI CopilotGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPT
AI NEWS HUBbyEIGENVECTOREigenvector

A Provable Energy-Guided Test-Time Defense Boosting Adversarial Robustness of Large Vision-Language Models

arXivMarch 31, 20262 min read0 views
Source Quiz

arXiv:2603.26984v1 Announce Type: new Abstract: Despite the rapid progress in multimodal models and Large Visual-Language Models (LVLM), they remain highly susceptible to adversarial perturbations, raising serious concerns about their reliability in real-world use. While adversarial training has become the leading paradigm for building models that are robust to adversarial attacks, Test-Time Transformations (TTT) have emerged as a promising strategy to boost robustness at inference.In light of this, we propose Energy-Guided Test-Time Transformation (ET3), a lightweight, training-free defense t — Mujtaba Hussain Mirza, Antonio D'Orazio, Odelia Melamed, Iacopo Masi

View PDF

Abstract:Despite the rapid progress in multimodal models and Large Visual-Language Models (LVLM), they remain highly susceptible to adversarial perturbations, raising serious concerns about their reliability in real-world use. While adversarial training has become the leading paradigm for building models that are robust to adversarial attacks, Test-Time Transformations (TTT) have emerged as a promising strategy to boost robustness at this http URL light of this, we propose Energy-Guided Test-Time Transformation (ET3), a lightweight, training-free defense that enhances the robustness by minimizing the energy of the input this http URL method is grounded in a theory that proves our transformation succeeds in classification under reasonable assumptions. We present extensive experiments demonstrating that ET3 provides a strong defense for classifiers, zero-shot classification with CLIP, and also for boosting the robustness of LVLMs in tasks such as Image Captioning and Visual Question Answering. Code is available at this http URL .

Comments: Accepted at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026, Main Conference

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26984 [cs.CV]

(or arXiv:2603.26984v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26984

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Mujtaba Hussain Mirza [view email] [v1] Fri, 27 Mar 2026 20:53:04 UTC (18,805 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
A Provable …researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 167 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!