CogPic: A Multimodal Dataset for Early Cognitive Impairment Assessment via Picture Description Tasks
arXiv:2604.01626v1 Announce Type: new Abstract: The automated evaluation of cognitive status utilizing multimedia technologies presents a promising frontier in early dementia diagnosis. However, the development of robust machine learning models for cognitive impairment detection is frequently hindered by the scarcity of large-scale, strictly synchronized, and clinically validated multimodal datasets. To bridge this critical gap, we introduce the CogPic database, a comprehensive multimodal benchmark meticulously designed for fine-grained cognitive impairment detection. The dataset comprises strictly synchronized audio, visual, and linguistic data continuously collected from 574 participants during a naturalistic picture description task. To establish highly reliable diagnostic ground truth,
View PDF HTML (experimental)
Abstract:The automated evaluation of cognitive status utilizing multimedia technologies presents a promising frontier in early dementia diagnosis. However, the development of robust machine learning models for cognitive impairment detection is frequently hindered by the scarcity of large-scale, strictly synchronized, and clinically validated multimodal datasets. To bridge this critical gap, we introduce the CogPic database, a comprehensive multimodal benchmark meticulously designed for fine-grained cognitive impairment detection. The dataset comprises strictly synchronized audio, visual, and linguistic data continuously collected from 574 participants during a naturalistic picture description task. To establish highly reliable diagnostic ground truth, expert clinical neuropsychologists conducted exhaustive evaluations, stratifying participants into distinct cognitive groups through a comprehensive clinical consensus. Consequently, CogPic stands as the largest, most modality-rich, and most meticulously evaluated dataset of its kind to date. By conducting extensive benchmark experiments on the CogPic dataset, we establish an exceptionally robust, unbiased, and clinically generalizable foundation to propel future multimedia research in automated cognitive health assessment. Detailed information and access application procedures for our CogPic database are available at this https URL.
Comments: 10 pages, 3 figures, 5 tables
Subjects:
Databases (cs.DB)
Cite as: arXiv:2604.01626 [cs.DB]
(or arXiv:2604.01626v1 [cs.DB] for this version)
https://doi.org/10.48550/arXiv.2604.01626
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Feng Rui [view email] [v1] Thu, 2 Apr 2026 05:04:31 UTC (1,497 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
modelbenchmarkannounce
Generalist introduces GEN-1 general-purpose model for physical AI
Generalist AI said its latest model marks a significant step toward its mission of creating general intelligence for the physical world. The post Generalist introduces GEN-1 general-purpose model for physical AI appeared first on The Robot Report .
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Models

Generalist introduces GEN-1 general-purpose model for physical AI
Generalist AI said its latest model marks a significant step toward its mission of creating general intelligence for the physical world. The post Generalist introduces GEN-1 general-purpose model for physical AI appeared first on The Robot Report .




Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!