Research Papers research paper arxiv nlp language-models

Understanding the Anchoring Effect of LLM with Synthetic Data: Existence, Mechanism, and Potential Mitigations

arXivMarch 31, 20261 min read0 views

arXiv:2505.15392v2 Announce Type: replace Abstract: The rise of Large Language Models (LLMs) like ChatGPT has advanced natural language processing, yet concerns about cognitive biases are growing. In this paper, we investigate the anchoring effect, a cognitive bias where the mind relies heavily on the first information as anchors to make affected judgments. We explore whether LLMs are affected by anchoring, the underlying mechanisms, and potential mitigation strategies. To facilitate studies at scale on the anchoring effect, we introduce a new dataset, SynAnchors (https://huggingface.co/datase — Yiming Huang, Biquan Bie, Zuqiu Na, Weilin Ruan, Songxin Lei, Yutao Yue, Xinlei He

View PDF HTML (experimental)

Abstract:The rise of Large Language Models (LLMs) like ChatGPT has advanced natural language processing, yet concerns about cognitive biases are growing. In this paper, we investigate the anchoring effect, a cognitive bias where the mind relies heavily on the first information as anchors to make affected judgments. We explore whether LLMs are affected by anchoring, the underlying mechanisms, and potential mitigation strategies. To facilitate studies at scale on the anchoring effect, we introduce a new dataset, SynAnchors (this https URL). Combining refined evaluation metrics, we benchmark current widely used LLMs. Our findings show that LLMs' anchoring bias exists commonly with shallow-layer acting and can not be eliminated by conventional strategies, while reasoning can offer some mitigation.

Comments: Accepted by the HCAIR workshop of ICLR 2026

Subjects:

Computation and Language (cs.CL)

Cite as: arXiv:2505.15392 [cs.CL]

(or arXiv:2505.15392v2 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2505.15392

arXiv-issued DOI via DataCite

Submission history

From: Yiming Huang [view email] [v1] Wed, 21 May 2025 11:33:54 UTC (2,433 KB) [v2] Sun, 29 Mar 2026 02:46:50 UTC (2,446 KB)

Original source

arXiv

https://arxiv.org/abs/2505.15392

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Open Source AIFresh

Gemma 4 - 31b abliterated quants

Got inspired to try and crack this egg without using heretic. FP16, Q8_0 and Q4_K_M quants, plus the abliteration script for modification/use is here: https://huggingface.co/paperscarecrow/Gemma-4-31B-it-abliterated-gguf based off of mlabonne's Orthogonalized Representation Intervention method , because I loved his ablits of gemma3 so much. Edit: Overestimated my internet speeds, still uploading the models. submitted by /u/Polymorphic-X [link] [comments]

Reddit r/LocalLLaMA

1mabout 2 hours ago

Research PapersFresh

Google Research touts memory-compression breakthrough for AI processing - Network World

Google Research touts memory-compression breakthrough for AI processing Network World

GNews AI Google

1mabout 3 hours ago

Self-Evolving AIFresh

Google Researchers Reveal Every Way Hackers Can Trap, Hijack AI Agents - Decrypt

Google Researchers Reveal Every Way Hackers Can Trap, Hijack AI Agents Decrypt

GNews AI Google

1mabout 4 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 185 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersFresh

Google Research touts memory-compression breakthrough for AI processing - Network World

Google Research touts memory-compression breakthrough for AI processing Network World

GNews AI Google

1mabout 3 hours ago

Research Papers

Consistency Amplifies: How Behavioral Variance Shapes Agent Accuracy

Analysis of behavioral consistency in large language model agents reveals that while consistent performance correlates with higher accuracy, consistency can amplify both correct and incorrect interpretations, emphasizing that accurate interpretation is more crucial than execution consistency for production deployment. (2 upvotes on HuggingFace)

HuggingFace Papers

2m8 days ago

Research PapersRecent

A Survey of On-Policy Distillation for Large Language Models

On-Policy Distillation for large language models unifies diverse approaches through an f-divergence framework organized by feedback signals, teacher access, and loss granularity. (4 upvotes on HuggingFace)

HuggingFace Papers

2m1 day ago

Research Papers

Brevity Constraints Reverse Performance Hierarchies in Language Models

Large language models can underperform smaller ones due to verbose responses that introduce errors, but constraining output length reveals their superior capabilities and improves performance across benchmarks. (16 upvotes on HuggingFace)

HuggingFace Papers

2m23 days ago