
UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

arXiv · March 31, 2026

arXiv:2511.19413v3 Announce Type: replace-cross

Authors: Zhaolong Su, Wang Lu, Hao Chen, Sharon Li, Jindong Wang


Abstract: Unified Multimodal Models (UMMs) have shown impressive performance in both understanding and generation with a single architecture. However, UMMs still exhibit a fundamental inconsistency: understanding favors compact embeddings, whereas generation favors reconstruction-rich representations. This structural trade-off produces misaligned decision boundaries, degraded cross-modal coherence, and heightened vulnerability under distributional and adversarial shifts. In this paper, we present UniGame, a self-adversarial post-training framework that directly targets these inconsistencies. By applying a lightweight perturber at the shared token interface, UniGame enables the generation branch to actively seek out and challenge fragile understanding, turning the model into its own adversary. Experiments demonstrate that UniGame significantly improves consistency (+4.6%). It also achieves substantial improvements in understanding (+3.6%), generation (+0.02 on GenEval), and out-of-distribution and adversarial robustness (+4.8% and +6.2% on NaturalBench and AdVQA, respectively). The framework is architecture-agnostic, introduces less than 1% additional parameters, and is complementary to existing post-training methods. These results position adversarial self-play as a general and effective principle for enhancing the coherence, stability, and unified competence of future multimodal foundation models. The official code is available at: this https URL
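The abstract describes the mechanism only at a high level, so the following is a minimal toy sketch of the general min-max idea, not the authors' implementation. It assumes a linear classifier as a stand-in for the understanding branch and swaps in a single normalized gradient step, bounded in L2 norm, for the learned perturber. All function names (`perturb`, `train_step`, `robust_loss`) and the parameters `eps` and `lr` are hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce_loss(w, x, y):
    """Binary cross-entropy of a linear head w on embedding x with label y."""
    p = min(max(sigmoid(dot(w, x)), 1e-12), 1.0 - 1e-12)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))

def perturb(w, x, y, eps):
    """Adversary (assumed form): shift x by eps along the loss gradient.

    For this model d(loss)/dx = (p - y) * w, so the step raises the loss
    while keeping the perturbation's L2 norm equal to eps.
    """
    g = [(sigmoid(dot(w, x)) - y) * wi for wi in w]
    norm = math.sqrt(dot(g, g))
    if norm == 0.0:
        return list(x)  # no gradient signal, leave the embedding unchanged
    return [xi + eps * gi / norm for xi, gi in zip(x, g)]

def train_step(w, x, y, eps, lr):
    """Defender: one gradient-descent step on the loss at the perturbed x."""
    x_adv = perturb(w, x, y, eps)
    err = sigmoid(dot(w, x_adv)) - y
    return [wi - lr * err * xi for wi, xi in zip(w, x_adv)]

def robust_loss(w, data, eps):
    """Mean loss over the dataset after the adversary has moved each input."""
    return sum(bce_loss(w, perturb(w, x, y, eps), y) for x, y in data) / len(data)
```

Alternating `perturb` and `train_step` over a dataset gives a simple adversarial training loop. In the paper's setting the adversary is a learned module acting on shared tokens rather than a closed-form gradient step, so this sketch only illustrates the alternation between seeking fragile inputs and training against them.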

Comments: Accepted to CVPR 2026

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2511.19413 [cs.LG]

(or arXiv:2511.19413v3 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2511.19413

arXiv-issued DOI via DataCite

Submission history

From: Zhaolong Su
[v1] Mon, 24 Nov 2025 18:50:01 UTC (5,177 KB)
[v2] Wed, 26 Nov 2025 05:12:27 UTC (5,176 KB)
[v3] Mon, 30 Mar 2026 14:56:29 UTC (5,186 KB)
