Live
Black Hat USAAI BusinessBlack Hat AsiaAI Business'AI-pilled' engineers are working harder and burning out faster, Django co-creator saysBusiness InsiderK-pop has an AI problem - dazeddigital.comGoogle News: Generative AIOpenAI’s new ChatGPT base model ‘Spud’: All you need to know - Storyboard18Google News: ChatGPTGoogle DeepMind Launches Gemma 4 Amid Competition from Chinese Open Models - Analytics India MagazineGoogle News: DeepMindMicrosoft releases foundational AI models targeting enterprisesSilicon RepublicCan AI chatbots effectively support cancer patients during treatments? - ESMO Daily ReporterGoogle News: AIAccelerating drug discovery with “paradigm shifting” AI model - BioTechniquesGoogle News: Machine LearningStep by Step Guide to Build an End-to-End Model Optimization Pipeline with NVIDIA Model Optimizer Using FastNAS Pruning and Fine-TuningMarkTechPostSeeking arXiv cs.AI endorsement — neuroscience-inspired memory architecture for AI agentsdiscuss.huggingface.coGenerative AI: A Legal Framework in Development - group.bnpparibasGoogle News: Generative AIMicrosoft announces US$10B AI investment plan in Japan - MSNGNews AI USAS. Korea, France Bolster Ties in AI, Quantum Computing - KBS WORLD RadioGNews AI KoreaBlack Hat USAAI BusinessBlack Hat AsiaAI Business'AI-pilled' engineers are working harder and burning out faster, Django co-creator saysBusiness InsiderK-pop has an AI problem - dazeddigital.comGoogle News: Generative AIOpenAI’s new ChatGPT base model ‘Spud’: All you need to know - Storyboard18Google News: ChatGPTGoogle DeepMind Launches Gemma 4 Amid Competition from Chinese Open Models - Analytics India MagazineGoogle News: DeepMindMicrosoft releases foundational AI models targeting enterprisesSilicon RepublicCan AI chatbots effectively support cancer patients during treatments? - ESMO Daily ReporterGoogle News: AIAccelerating drug discovery with “paradigm shifting” AI model - BioTechniquesGoogle News: Machine LearningStep by Step Guide to Build an End-to-End Model Optimization Pipeline with NVIDIA Model Optimizer Using FastNAS Pruning and Fine-TuningMarkTechPostSeeking arXiv cs.AI endorsement — neuroscience-inspired memory architecture for AI agentsdiscuss.huggingface.coGenerative AI: A Legal Framework in Development - group.bnpparibasGoogle News: Generative AIMicrosoft announces US$10B AI investment plan in Japan - MSNGNews AI USAS. Korea, France Bolster Ties in AI, Quantum Computing - KBS WORLD RadioGNews AI Korea
AI NEWS HUBbyEIGENVECTOREigenvector

TailNLG: A Multilingual Benchmark Addressing Verbalization of Long-Tail Entities

arXivMarch 31, 20262 min read1 views
Source Quiz

arXiv:2603.27768v1 Announce Type: new Abstract: The automatic verbalization of structured knowledge is a key task for making knowledge graphs accessible to non-expert users and supporting retrieval-augmented generation systems. Although recent advances in Data-to-Text generation have improved multilingual coverage, little attention has been paid to potential biases in the verbalization of rare entities, frequently known as long-tail entities. In this work, we present the first systematic study of long-tail entities in Data-to-Text generation. We introduce TailNLG, a new multilingual benchmark — Lia Draetta, Michael Oliverio, Virginia Ram\'on-Ferrer, Pier Felice Balestrucci, Flaviana Corallo, Carlos Badenes-Olmedo, Alessandro Mazzei, Marco Antonio Stranisci, Rossana Damiano

View PDF HTML (experimental)

Abstract:The automatic verbalization of structured knowledge is a key task for making knowledge graphs accessible to non-expert users and supporting retrieval-augmented generation systems. Although recent advances in Data-to-Text generation have improved multilingual coverage, little attention has been paid to potential biases in the verbalization of rare entities, frequently known as long-tail entities. In this work, we present the first systematic study of long-tail entities in Data-to-Text generation. We introduce TailNLG, a new multilingual benchmark in English, Italian, and Spanish, built from Wikidata and covering entities with varying levels of popularity. We evaluate three different families of large language models in zero-shot settings and compare their performance on rare versus common entities, as well as against the established WebNLG benchmark. Our results reveal a consistent bias against long-tail entities: embedding-based scores are lower, and model uncertainty is higher for rare entities. We further show that the impact of long-tail entities varies across models and languages, and that existing evaluation metrics do not consistently capture these differences, highlighting the need for more reliable evaluation frameworks.

Subjects:

Computation and Language (cs.CL)

Cite as: arXiv:2603.27768 [cs.CL]

(or arXiv:2603.27768v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.27768

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Marco Antonio Stranisci [view email] [v1] Sun, 29 Mar 2026 17:01:54 UTC (463 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
TailNLG: A …researchpaperarxivnlplanguage-mo…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 182 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers