
MA-Bench: Towards Fine-grained Micro-Action Understanding

arXiv, March 30, 2026

Authors: Kun Li, Jihao Gu, Fei Wang, Zhiliang Wu, Hehe Fan, Dan Guo


Abstract: Despite the rapid development of Multimodal Large Language Models (MLLMs), their potential for micro-action understanding, which plays a vital role in human emotion analysis, remains unexplored due to the absence of specialized benchmarks. To tackle this issue, we present MA-Bench, a benchmark comprising 1,000 videos and a three-tier evaluation architecture that progressively examines micro-action perception, relational comprehension, and interpretive reasoning. MA-Bench contains 12,000 structured question-answer pairs, enabling systematic assessment of both recognition accuracy and action interpretation. Results for 23 representative MLLMs reveal significant challenges in capturing motion granularity and fine-grained body-part dynamics. To address these challenges, we further construct MA-Bench-Train, a large-scale training corpus of 20.5K videos annotated with structured micro-action captions for fine-tuning MLLMs. Qwen3-VL-8B fine-tuned on MA-Bench-Train shows clear performance improvements across micro-action reasoning and explanation tasks. Our work aims to establish a foundational benchmark for advancing MLLMs in understanding subtle micro-actions and human-related behaviors. Project Page: this https URL
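As a rough illustration of how a three-tier benchmark of this kind might be scored, the sketch below computes accuracy separately for each tier over a list of question-answer items. It is not the authors' evaluation code: the QAItem record, its field names, the tier labels, and the exact-match comparison are all assumptions made for illustration, since the abstract does not specify the data schema or the scoring rule.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical tier labels, named after the three levels in the abstract.
TIERS = ("perception", "relational", "reasoning")


@dataclass(frozen=True)
class QAItem:
    """One question-answer pair (assumed schema, not from the paper)."""
    video_id: str
    tier: str
    question: str
    answer: str


def per_tier_accuracy(items, predictions):
    """Return {tier: accuracy} using case-insensitive exact match.

    Exact match is an assumption; a real benchmark may score
    open-ended answers differently.
    """
    if len(items) != len(predictions):
        raise ValueError("items and predictions must have the same length")
    correct = defaultdict(int)
    total = defaultdict(int)
    for item, pred in zip(items, predictions):
        if item.tier not in TIERS:
            raise ValueError(f"unknown tier: {item.tier!r}")
        total[item.tier] += 1
        if pred.strip().casefold() == item.answer.strip().casefold():
            correct[item.tier] += 1
    # Report only tiers that actually have items, in a fixed order.
    return {t: correct[t] / total[t] for t in TIERS if total[t] > 0}


if __name__ == "__main__":
    items = [
        QAItem("v1", "perception", "Which body part moves?", "A"),
        QAItem("v1", "perception", "How long is the motion?", "B"),
        QAItem("v1", "relational", "Which action follows?", "C"),
        QAItem("v2", "reasoning", "What does the action suggest?", "D"),
    ]
    predictions = ["a ", "C", "C", "A"]
    print(per_tier_accuracy(items, predictions))
```

Reporting each tier separately, rather than one pooled score, keeps a model that does well on perception from masking weak interpretive reasoning. With the figures in the abstract, 12,000 pairs over 1,000 videos averages 12 questions per video.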

Comments: Accepted by CVPR 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26586 [cs.CV] (or arXiv:2603.26586v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26586

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Kun Li
[v1] Fri, 27 Mar 2026 16:49:19 UTC (5,082 KB)
