ClipTTT: CLIP-Guided Test-Time Training Helps LVLMs See Better

arXiv, March 30, 2026

Authors: Mriganka Nath, Anurag Das, Jiahao Xie, Bernt Schiele


Abstract: Large vision-language models (LVLMs) tend to hallucinate, especially when visual inputs are corrupted at test time. We show that such corruptions act as additional distribution shifts, significantly amplifying hallucination rates in real-world applications. To address this, we propose CLIP-guided Test-Time Training (ClipTTT), a method to adapt LVLMs under degraded conditions on the fly with a single test sample. Specifically, we leverage the image-text alignment strength of a pre-trained CLIP model as a stable guidance signal to identify reliable self-supervision targets, enabling rapid adaptation without altering the base LVLMs. Extensive experiments on standard hallucination benchmarks, with 15 common corruptions, demonstrate that ClipTTT effectively mitigates hallucinations and improves descriptive faithfulness under visual corruptions.
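The abstract does not spell out the algorithm, so the following is only a toy illustration of the general idea it names: scoring candidate descriptions by image-text alignment and keeping the best-aligned one as a self-supervision target. Everything here is an assumption made for illustration: the function names `cosine` and `pick_target` are hypothetical, the three-dimensional vectors stand in for real CLIP embeddings, and the argmax selection rule is a simplification, not the authors' method.

```python
import math
from typing import Dict, List, Tuple


def cosine(u: List[float], v: List[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def pick_target(
    image_emb: List[float], caption_embs: Dict[str, List[float]]
) -> Tuple[str, float]:
    """Return the candidate caption whose embedding aligns best with the image.

    Hypothetical selection rule: a plain argmax over cosine similarity.
    """
    scored = {text: cosine(image_emb, emb) for text, emb in caption_embs.items()}
    best = max(scored, key=scored.get)
    return best, scored[best]


# Toy stand-ins for embeddings (assumed values, not real CLIP outputs).
image_emb = [0.9, 0.1, 0.0]
candidates = {
    "a dog on a beach": [0.8, 0.2, 0.1],
    "a cat on a sofa": [0.1, 0.9, 0.2],
}

target, score = pick_target(image_emb, candidates)
print(target, round(score, 3))
```

In a real pipeline the embeddings would come from a pre-trained CLIP image and text encoder, and the selected description would serve as the adaptation signal for the single test sample; how ClipTTT actually does this is described in the paper itself.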

Comments: 30 pages, 12 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26486 [cs.CV]

(or arXiv:2603.26486v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26486

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Mriganka Nath
[v1] Fri, 27 Mar 2026 14:47:35 UTC (8,566 KB)
