Research Papers research paper arxiv computer-vision image-recognition

HighlightBench: Benchmarking Markup-Driven Table Reasoning in Scientific Documents

arXivby [Submitted on 25 Mar 2026]March 31, 20261 min read0 views

arXiv:2603.26784v1 Announce Type: new Abstract: Visual markups such as highlights, underlines, and bold text are common in table-centric documents. Although multimodal large language models (MLLMs) have made substantial progress in document understanding, their ability to treat such cues as explicit logical directives remains under-explored. More importantly, existing evaluations cannot distinguish whether a model fails to see the markup or fails to reason with it. This creates a key blind spot in assessing markup-conditioned behavior over tables. To address this gap, we introduce HighlightBen — Lexin Wang, Shenghua Liu, Yiwei Wang, Yujun Cai, Yuyao Ge, Jiayu Yao, Jiafeng Guo, Xueqi Cheng

View PDF HTML (experimental)

Abstract:Visual markups such as highlights, underlines, and bold text are common in table-centric documents. Although multimodal large language models (MLLMs) have made substantial progress in document understanding, their ability to treat such cues as explicit logical directives remains under-explored. More importantly, existing evaluations cannot distinguish whether a model fails to see the markup or fails to reason with it. This creates a key blind spot in assessing markup-conditioned behavior over tables. To address this gap, we introduce HighlightBench, a diagnostic benchmark for markup-driven table understanding that decomposes evaluation into five task families: Markup Grounding, Constrained Retrieval, Local Relations, Aggregation & Comparison, and Consistency & Missingness. We further provide a reference pipeline that makes intermediate decisions explicit, enabling reproducible baselines and finer-grained attribution of errors along the perception-to-execution chain. Experiments show that even strong models remain unstable when visual cues must be consistently aligned with symbolic reasoning under structured output constraints.

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26784 [cs.CV]

(or arXiv:2603.26784v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26784

arXiv-issued DOI via DataCite

Submission history

From: Lexin Wang [view email] [v1] Wed, 25 Mar 2026 06:15:40 UTC (6,553 KB)

Original source

arXiv

https://arxiv.org/abs/2603.26784

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Research Papers

AI is quick but risky for updating old software, researchers warn - Tech Xplore

AI is quick but risky for updating old software, researchers warn Tech Xplore

GNews AI coding

1m3 months ago

Products

Trailer: The Shape of Things to Come

Microsoft research lead Doug Burger introduces his new podcast series, "The Shape of Things to Come", an exploration into the fundamental truths about AI and how the technology will reshape the future. The post Trailer: The Shape of Things to Come appeared first on Microsoft Research .

Microsoft Research Blog

2mabout 1 month ago

Models

Will machines ever be intelligent?

Are machines truly intelligent? AI researchers Subutai Ahmad and Nicolò Fusi join Doug Burger to compare transformer-based AI with the human brain, exploring continual learning, efficiency, and whether today’s models are on a path toward human intelligence. The post Will machines ever be intelligent? appeared first on Microsoft Research .

Microsoft Research Blog

1m11 days ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 111 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

HighlightBench: Benchmarking Markup-Driven Table Reasoning in Scientific Documents

Submission history

Daily AI Digest

More about

AI is quick but risky for updating old software, researchers warn - Tech Xplore

Trailer: The Shape of Things to Come

Will machines ever be intelligent?

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Research Papers

AI is quick but risky for updating old software, researchers warn - Tech Xplore

When the server crashes the soul

Automatic Textbook Formalization

Exclusive | OpenAI’s Former Research Chief Aims to Automate Manufacturing With AI - WSJ