
Investigating the Influence of Language on Sycophantic Behavior of Multilingual LLMs

arXiv · March 31, 2026

arXiv:2603.27664v1 Announce Type: new

Authors: Bayan Abdullah Aldahlawi, A. B. M. Ashikur Rahman, Irfan Ahmad


Abstract: Large language models (LLMs) have achieved strong performance across a wide range of tasks, but they are also prone to sycophancy, the tendency to agree with user statements regardless of their validity. Previous research has outlined both the extent and the underlying causes of sycophancy in earlier models, such as ChatGPT-3.5 and Davinci. Newer models have since undergone multiple mitigation strategies, yet there remains a critical need to systematically test their behavior. In particular, the effect of language on sycophancy has not been explored. In this work, we investigate how language influences sycophantic responses. We evaluate three state-of-the-art models, GPT-4o mini, Gemini 1.5 Flash, and Claude 3.5 Haiku, using a set of tweet-like opinion prompts translated into five additional languages: Arabic, Chinese, French, Spanish, and Portuguese. Our results show that although newer models exhibit significantly less sycophancy overall than earlier generations, the extent of sycophancy is still influenced by the language of the prompt. We further provide a granular analysis of how language shapes model agreeableness across sensitive topics, revealing systematic cultural and linguistic patterns. These findings highlight both the progress of mitigation efforts and the need for broader multilingual audits to ensure trustworthy and bias-aware deployment of LLMs.
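The abstract does not say how sycophancy is scored, so the following is only a minimal sketch of one way a per-language comparison could be tabulated. It assumes each model response has already been labelled as agreeing or not agreeing with the opinion in the prompt. The model names, the records, and the `agreement_rate` helper are hypothetical placeholders for illustration, not data or code from the paper.

```python
from collections import defaultdict

# Hypothetical, hand-written records for illustration only; NOT results from the paper.
# Each record is (model, language, agreed), where `agreed` marks whether the
# response endorsed the opinion stated in the prompt.
records = [
    ("model-a", "English", True),
    ("model-a", "English", False),
    ("model-a", "Arabic", True),
    ("model-a", "Arabic", True),
    ("model-b", "English", False),
    ("model-b", "Arabic", False),
]


def agreement_rate(records):
    """Return {(model, language): fraction of responses labelled as agreeing}."""
    counts = defaultdict(lambda: [0, 0])  # [number agreed, total responses]
    for model, language, agreed in records:
        counts[(model, language)][0] += int(agreed)
        counts[(model, language)][1] += 1
    return {key: agreed / total for key, (agreed, total) in counts.items()}


rates = agreement_rate(records)
for (model, language), rate in sorted(rates.items()):
    print(f"{model:8s} {language:8s} {rate:.2f}")
```

Grouping by the (model, language) pair keeps the comparison the abstract describes, the same prompts across languages for each model, visible in a single table. A real audit would also need a rule for deciding when a response counts as agreement, which this sketch takes as given.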

Comments: 15 pages, 5 figures

Subjects: Computation and Language (cs.CL)

Cite as: arXiv:2603.27664 [cs.CL]

(or arXiv:2603.27664v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.27664

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: A. B. M. Ashikur Rahman [view email] [v1] Sun, 29 Mar 2026 12:31:05 UTC (859 KB)

Original source

arXiv

https://arxiv.org/abs/2603.27664
