
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

arXiv · March 30, 2026 · 10 min read

Authors: Haonan Han, Jiancheng Huang, Xiaopeng Sun, Junyan He, Rui Yang, Jie Hu, Xiaojiang Peng, Lin Ma, Xiaoming Wei, Xiu Li


Abstract: Beneath the stunning visual fidelity of modern AIGC models lies a "logical desert", where systems fail tasks that require physical, causal, or complex spatial reasoning. Current evaluations largely rely on superficial metrics or fragmented benchmarks, creating a "performance mirage" that overlooks the generative process. To address this, we introduce ViGoR (Vision-Generative Reasoning-centric Benchmark), a unified framework designed to dismantle this mirage. ViGoR distinguishes itself through four key innovations: 1) holistic cross-modal coverage bridging Image-to-Image and Video tasks; 2) a dual-track mechanism evaluating both intermediate processes and final results; 3) an evidence-grounded automated judge ensuring high human alignment; and 4) granular diagnostic analysis that decomposes performance into fine-grained cognitive dimensions. Experiments on over 20 leading models reveal that even state-of-the-art systems harbor significant reasoning deficits, establishing ViGoR as a critical "stress test" for the next generation of intelligent vision models. The demo is available at this https URL
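As a rough illustration of how a dual-track evaluation could be broken down by cognitive dimension, the sketch below averages process and result scores separately for each dimension. It is not the ViGoR implementation: the JudgedSample fields, the dimension names, the [0, 1] score range, and the diagnose helper are all assumptions made for this example.

```python
from collections import defaultdict
from dataclasses import dataclass
from statistics import mean


@dataclass
class JudgedSample:
    """One judged model output. All fields are hypothetical, for illustration."""
    dimension: str        # assumed label, e.g. "physical", "causal", "spatial"
    process_score: float  # assumed judge score for the intermediate process, in [0, 1]
    result_score: float   # assumed judge score for the final result, in [0, 1]


def diagnose(samples):
    """Average the process and result tracks separately for each dimension."""
    grouped = defaultdict(list)
    for sample in samples:
        grouped[sample.dimension].append(sample)
    return {
        dim: {
            "process": mean(s.process_score for s in group),
            "result": mean(s.result_score for s in group),
        }
        for dim, group in grouped.items()
    }


# Made-up scores, chosen only to show the two tracks diverging.
samples = [
    JudgedSample("physical", 0.5, 1.0),
    JudgedSample("physical", 0.0, 1.0),
    JudgedSample("spatial", 1.0, 0.0),
]
report = diagnose(samples)
for dim, scores in sorted(report.items()):
    print(dim, scores)
```

Keeping the two tracks separate, rather than collapsing them into one number, makes it visible when a model reaches a correct final result through a flawed intermediate process, or the reverse.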

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.25823 [cs.CV]

(or arXiv:2603.25823v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.25823

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Haonan Han
[v1] Thu, 26 Mar 2026 18:40:09 UTC (5,687 KB)
