Live
🔥 google-research/timesfmGitHub Trending🔥 aliasrobotics/caiGitHub Trending🔥 ComposioHQ/awesome-claude-skillsGitHub Trending🔥 SkyworkAI/Matrix-GameGitHub Trending🔥 sponsors/vas3kGitHub Trending🔥 sponsors/khoj-aiGitHub Trending🔥 PaddlePaddle/PaddleOCRGitHub TrendingTest: 15% of Americans say they would work for AI bossTechCrunch AIAutoMS: Multi-Agent Evolutionary Search for Cross-Physics Inverse Microstructure DesignarXivMultiverse: Language-Conditioned Multi-Game Level Blending via Shared RepresentationarXivMediHive: A Decentralized Agent Collective for Medical ReasoningarXivBitboard version of Tetris AIarXivThe Price of Meaning: Why Every Semantic Memory System ForgetsarXivWhen Verification Hurts: Asymmetric Effects of Multi-Agent Feedback in Logic Proof TutoringarXivQuantification of Credal Uncertainty: A Distance-Based ApproacharXiv🔥 google-research/timesfmGitHub Trending🔥 aliasrobotics/caiGitHub Trending🔥 ComposioHQ/awesome-claude-skillsGitHub Trending🔥 SkyworkAI/Matrix-GameGitHub Trending🔥 sponsors/vas3kGitHub Trending🔥 sponsors/khoj-aiGitHub Trending🔥 PaddlePaddle/PaddleOCRGitHub TrendingTest: 15% of Americans say they would work for AI bossTechCrunch AIAutoMS: Multi-Agent Evolutionary Search for Cross-Physics Inverse Microstructure DesignarXivMultiverse: Language-Conditioned Multi-Game Level Blending via Shared RepresentationarXivMediHive: A Decentralized Agent Collective for Medical ReasoningarXivBitboard version of Tetris AIarXivThe Price of Meaning: Why Every Semantic Memory System ForgetsarXivWhen Verification Hurts: Asymmetric Effects of Multi-Agent Feedback in Logic Proof TutoringarXivQuantification of Credal Uncertainty: A Distance-Based ApproacharXiv

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.25716v2 Announce Type: replace-cross Abstract: Video world models have shown immense potential in simulating the physical world, yet existing memory mechanisms primarily treat environments as static canvases. When dynamic subjects hide out of sight and later re-emerge, current methods often struggle, leading to frozen, distorted, or vanishing subjects. To address this, we introduce Hybrid Memory, a novel paradigm requiring models to simultaneously act as precise archivists for static backgrounds and vigilant trackers for dynamic subjects, ensuring motion continuity during out-of-vie — Kaijin Chen, Dingkang Liang, Xin Zhou, Yikang Ding, Xiaoqiang Liu, Pengfei Wan, Xiang Bai

View PDF HTML (experimental)

Abstract:Video world models have shown immense potential in simulating the physical world, yet existing memory mechanisms primarily treat environments as static canvases. When dynamic subjects hide out of sight and later re-emerge, current methods often struggle, leading to frozen, distorted, or vanishing subjects. To address this, we introduce Hybrid Memory, a novel paradigm requiring models to simultaneously act as precise archivists for static backgrounds and vigilant trackers for dynamic subjects, ensuring motion continuity during out-of-view intervals. To facilitate research in this direction, we construct HM-World, the first large-scale video dataset dedicated to hybrid memory. It features 59K high-fidelity clips with decoupled camera and subject trajectories, encompassing 17 diverse scenes, 49 distinct subjects, and meticulously designed exit-entry events to rigorously evaluate hybrid coherence. Furthermore, we propose HyDRA, a specialized memory architecture that compresses memory into tokens and utilizes a spatiotemporal relevance-driven retrieval mechanism. By selectively attending to relevant motion cues, HyDRA effectively preserves the identity and motion of hidden subjects. Extensive experiments on HM-World demonstrate that our method significantly outperforms state-of-the-art approaches in both dynamic subject consistency and overall generation quality. Code is publicly available at this https URL.

Comments: Project Page: this https URL Code: this https URL

Subjects:

Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.25716 [cs.CV]

(or arXiv:2603.25716v2 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.25716

arXiv-issued DOI via DataCite

Submission history

From: Kaijin Chen [view email] [v1] Thu, 26 Mar 2026 17:56:01 UTC (32,301 KB) [v2] Sat, 28 Mar 2026 08:29:52 UTC (32,301 KB)

Original source

arXiv

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Out of Sigh…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 336 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers