Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessAI Agents Are Coming for Your Waiting Room. That’s Just the Start. - CDOTrendsGNews AI agenticAI Has Already Decided: First-Party Data Will Define Advertising’s Agentic Era - AdExchangerGNews AI agenticDefending Habit Streakslesswrong.comChinese-made drone shot in Iran, Australia’s diplomacy pivot: 5 weekend reads you missedSCMP Tech (Asia AI)How NLP Actually Understands Text?Medium AIXENONOSTRA RESEARCH NOTES ALGEBROS: An Algebraic Meta-Language for Code Structure Extraction and…Medium AI18 Specific Tutorial Ideas for AI Voice Integration Using Vapi and TwilioDev.to AIUI DESIGNERS IN TROUBLEMedium AIEstimates of the expected utility gain of AI Safety Researchlesswrong.comMastering Python for Machine Learning: A Practical, No-Nonsense RoadmapMedium AII Audited 13 AI Agent Platforms for Security Misconfigurations — Here's the Open-Source Scanner I BuiltDev.to AIFrom Reality to Writing: Why I Explore Technology, Identity and Human BehaviorMedium AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessAI Agents Are Coming for Your Waiting Room. That’s Just the Start. - CDOTrendsGNews AI agenticAI Has Already Decided: First-Party Data Will Define Advertising’s Agentic Era - AdExchangerGNews AI agenticDefending Habit Streakslesswrong.comChinese-made drone shot in Iran, Australia’s diplomacy pivot: 5 weekend reads you missedSCMP Tech (Asia AI)How NLP Actually Understands Text?Medium AIXENONOSTRA RESEARCH NOTES ALGEBROS: An Algebraic Meta-Language for Code Structure Extraction and…Medium AI18 Specific Tutorial Ideas for AI Voice Integration Using Vapi and TwilioDev.to AIUI DESIGNERS IN TROUBLEMedium AIEstimates of the expected utility gain of AI Safety Researchlesswrong.comMastering Python for Machine Learning: A Practical, No-Nonsense RoadmapMedium AII Audited 13 AI Agent Platforms for Security Misconfigurations — Here's the Open-Source Scanner I BuiltDev.to AIFrom Reality to Writing: Why I Explore Technology, Identity and Human BehaviorMedium AI
AI NEWS HUBbyEIGENVECTOREigenvector

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

HuggingFace Papersby Jiajun Zhang ,March 26, 20262 min read2 views
Source Quiz
🧒Explain Like I'm 5Simple language

Hey there, little explorer! 🚀

Imagine you have a super smart robot friend! This robot can look at a picture, like a drawing of how many cookies you ate each day. 🍪

Now, these smart robots are learning to draw their own pictures, like charts, from real numbers. But guess what? Sometimes it's tricky!

A new game was made to test them. It's like giving them a big puzzle with lots of numbers and asking them to draw a super fancy chart, like a big castle made of blocks. 🏰

Some robots are really good at it, like the ones from big companies. But other robots, the ones anyone can play with, find it a bit harder. They sometimes make mistakes on the tricky parts of the castle.

So, scientists are learning how to help all the robot friends draw even better pictures! Yay for smart robots! 🎉

Vision-Language Models face significant challenges in generating complex multi-panel charts from real-world data, as demonstrated by a new large-scale benchmark that reveals performance gaps between proprietary and open-weight models. (8 upvotes on HuggingFace)

Published on Mar 26

·

Submitted by

zjj

on Mar 30

Authors:

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Abstract

Vision-Language Models face significant challenges in generating complex multi-panel charts from real-world data, as demonstrated by a new large-scale benchmark that reveals performance gaps between proprietary and open-weight models.

AI-generated summary

Vision-Language Models (VLMs) have demonstrated impressive capabilities in code generation across various domains. However, their ability to replicate complex, multi-panel visualizations from real-world data remains largely unassessed. To address this gap, we introduce \texttt{RealChart2Code}, a new large-scale benchmark with over 2,800 instances grounded in authentic datasets and featuring tasks with clear analytical intent. Crucially, it is the first benchmark to systematically evaluate chart generation from large-scale raw data and assess iterative code refinement in a multi-turn conversational setting. Our comprehensive evaluation of 14 leading VLMs on RealChart2Code reveals significant performance degradation compared to simpler benchmarks, highlighting their struggles with complex plot structures and authentic data. Our analysis uncovers a substantial performance gap between proprietary and open-weight models and confirms that even state-of-the-art VLMs often fail to accurately replicate intricate, multi-panel charts. These findings provide valuable insights into the current limitations of VLMs and guide future research directions. We release the benchmark and code at https://github.com/Speakn0w/RealChart2Code.

View arXiv page View PDF Project page GitHub 8 Add to collection

Get this paper in your agent:

hf papers read 2603.25804

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2603.25804 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2603.25804 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2603.25804 in a Space README.md to link it from this page.

Collections including this paper 1

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
RealChart2C…researchpaperarxivVision-Lang…chart gener…multi-panel…HuggingFace…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 275 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers