Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessFrance’s Mistral AI seeks Samsung memory for AI expansion - The Korea HeraldGoogle News - Mistral AI FranceMistral AI pursues Samsung memory partnership during South Korea visit - CHOSUNBIZ - ChosunbizGoogle News - Mistral AI FranceFuture Women Diplomats Gather for AI Event in Dushanbe - miragenews.comGoogle News - AI TajikistanAnthropic leaks source code for its AI coding agent Claude - Lynnwood TimesGoogle News: ClaudeWeekend vote: What are your feelings about 'Artificial Intelligence' (AI)? - violinist.comGoogle News: AIA Beginner's Guide to Affiliate MarketingDev.to AIThe End of “Hard Work” in Coding, And Why That’s a ProblemDev.to AIActive Job and Background Processing for AI Features in RailsDev.to AIBig Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.Dev.to AIUK courts Anthropic to expand in London after US defence clash - Financial TimesGNews AI USAI'm 산들, Leader 41 of Lawmadi OS — Your AI Family & Divorce Expert for Korean LawDev.to AIAccelerating the next phase of AIDev.to AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessFrance’s Mistral AI seeks Samsung memory for AI expansion - The Korea HeraldGoogle News - Mistral AI FranceMistral AI pursues Samsung memory partnership during South Korea visit - CHOSUNBIZ - ChosunbizGoogle News - Mistral AI FranceFuture Women Diplomats Gather for AI Event in Dushanbe - miragenews.comGoogle News - AI TajikistanAnthropic leaks source code for its AI coding agent Claude - Lynnwood TimesGoogle News: ClaudeWeekend vote: What are your feelings about 'Artificial Intelligence' (AI)? - violinist.comGoogle News: AIA Beginner's Guide to Affiliate MarketingDev.to AIThe End of “Hard Work” in Coding, And Why That’s a ProblemDev.to AIActive Job and Background Processing for AI Features in RailsDev.to AIBig Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.Dev.to AIUK courts Anthropic to expand in London after US defence clash - Financial TimesGNews AI USAI'm 산들, Leader 41 of Lawmadi OS — Your AI Family & Divorce Expert for Korean LawDev.to AIAccelerating the next phase of AIDev.to AI
AI NEWS HUBbyEIGENVECTOREigenvector

HippoMM: Hippocampal-inspired Multimodal Memory for Long Audiovisual Event Understanding

arXiv eess.IVby [Submitted on 14 Apr 2025 (v1), last revised 1 Apr 2026 (this version, v2)]April 3, 20262 min read1 views
Source Quiz

arXiv:2504.10739v2 Announce Type: replace-cross Abstract: Comprehending extended audiovisual experiences remains challenging for computational systems, particularly temporal integration and cross-modal associations fundamental to human episodic memory. We introduce HippoMM, a computational cognitive architecture that maps hippocampal mechanisms to solve these challenges. Rather than relying on scaling or architectural sophistication, HippoMM implements three integrated components: (i) Episodic Segmentation detects audiovisual input changes to split videos into discrete episodes, mirroring dentate gyrus pattern separation; (ii) Memory Consolidation compresses episodes into summaries with key features preserved, analogous to hippocampal memory formation; and (iii) Hierarchical Memory Retriev

View PDF HTML (experimental)

Abstract:Comprehending extended audiovisual experiences remains challenging for computational systems, particularly temporal integration and cross-modal associations fundamental to human episodic memory. We introduce HippoMM, a computational cognitive architecture that maps hippocampal mechanisms to solve these challenges. Rather than relying on scaling or architectural sophistication, HippoMM implements three integrated components: (i) Episodic Segmentation detects audiovisual input changes to split videos into discrete episodes, mirroring dentate gyrus pattern separation; (ii) Memory Consolidation compresses episodes into summaries with key features preserved, analogous to hippocampal memory formation; and (iii) Hierarchical Memory Retrieval first searches semantic summaries, then escalates via temporal window expansion around seed segments for cross-modal queries, mimicking CA3 pattern completion. These components jointly create an integrated system exceeding the sum of its parts. On our HippoVlog benchmark testing associative memory, HippoMM achieves state-of-the-art 78.2% accuracy while operating 5x faster than retrieval-augmented baselines. Our results demonstrate that cognitive architectures provide blueprints for next-generation multimodal understanding. The code and benchmark dataset are publicly available at this https URL.

Comments: Accepted at CVPR 2026 Findings

Subjects:

Multimedia (cs.MM); Image and Video Processing (eess.IV)

Cite as: arXiv:2504.10739 [cs.MM]

(or arXiv:2504.10739v2 [cs.MM] for this version)

https://doi.org/10.48550/arXiv.2504.10739

arXiv-issued DOI via DataCite

Submission history

From: Yueqian Lin [view email] [v1] Mon, 14 Apr 2025 22:17:55 UTC (1,822 KB) [v2] Wed, 1 Apr 2026 21:23:13 UTC (1,856 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

benchmarkannounceavailable

Knowledge Map

Knowledge Map
TopicsEntitiesSource
HippoMM: Hi…benchmarkannounceavailablefeatureintegrationmultimodalarXiv eess.…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 240 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Releases