Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessSadly, The Whispering EarringLessWrong AIAnthropic Responsible Scaling Policy v3: Dive Into The Detailslesswrong.comI tried ChatGPT's new CarPlay integration: It's my new go-to for the questions Siri can't answerZDNet AIAvast Premium Isn’t Flashy — But It Might Be the Smartest Cheap Antivirus Right NowGizmodoScaling AI's Promise in Healthcare: The Time is Now - Pharmaceutical ExecutiveGNews AI healthcareChina issues guideline for AI ethics governance - China DailyGNews AI ChinaLiving brain cells enable machine learning computations - Tech XploreGoogle News: Machine LearningLiving brain cells enable machine learning computationsPhys.org AIARC Raiders Publisher Nexon Calls The Game A "Trojan Horse" To Normalize Generative AI - gameranx.comGoogle News: Generative AIInvestors Chasing AI Hardware Gains May Want to Rethink ARTY Before Adding More Exposure - 24/7 Wall St.GNews AI NVIDIAAI for investors - MLQ.aiGNews AI NVIDIABlack Hat USADark ReadingBlack Hat AsiaAI BusinessSadly, The Whispering EarringLessWrong AIAnthropic Responsible Scaling Policy v3: Dive Into The Detailslesswrong.comI tried ChatGPT's new CarPlay integration: It's my new go-to for the questions Siri can't answerZDNet AIAvast Premium Isn’t Flashy — But It Might Be the Smartest Cheap Antivirus Right NowGizmodoScaling AI's Promise in Healthcare: The Time is Now - Pharmaceutical ExecutiveGNews AI healthcareChina issues guideline for AI ethics governance - China DailyGNews AI ChinaLiving brain cells enable machine learning computations - Tech XploreGoogle News: Machine LearningLiving brain cells enable machine learning computationsPhys.org AIARC Raiders Publisher Nexon Calls The Game A "Trojan Horse" To Normalize Generative AI - gameranx.comGoogle News: Generative AIInvestors Chasing AI Hardware Gains May Want to Rethink ARTY Before Adding More Exposure - 24/7 Wall St.GNews AI NVIDIAAI for investors - MLQ.aiGNews AI NVIDIA
AI NEWS HUBbyEIGENVECTOREigenvector

Exploring Cultural Variations in Moral Judgments with Large Language Models

arXivby [Submitted on 14 Jun 2025 (v1), last revised 4 Jan 2026 (this version, v2)]March 31, 20262 min read1 views
Source Quiz

arXiv:2506.12433v2 Announce Type: cross Abstract: Large Language Models (LLMs) have shown strong performance across many tasks, but their ability to capture culturally diverse moral values remains unclear. In this paper, we examine whether LLMs mirror variations in moral attitudes reported by the World Values Survey (WVS) and the Pew Research Center's Global Attitudes Survey (PEW). We compare smaller monolingual and multilingual models (GPT-2, OPT, BLOOMZ, and Qwen) with recent instruction-tuned models (GPT-4o, GPT-4o-mini, Gemma-2-9b-it, and Llama-3.3-70B-Instruct). Using log-probability-base — Hadi Mohammadi, Ayoub Bagheri

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have shown strong performance across many tasks, but their ability to capture culturally diverse moral values remains unclear. In this paper, we examine whether LLMs mirror variations in moral attitudes reported by the World Values Survey (WVS) and the Pew Research Center's Global Attitudes Survey (PEW). We compare smaller monolingual and multilingual models (GPT-2, OPT, BLOOMZ, and Qwen) with recent instruction-tuned models (GPT-4o, GPT-4o-mini, Gemma-2-9b-it, and Llama-3.3-70B-Instruct). Using log-probability-based \emph{moral justifiability} scores, we correlate each model's outputs with survey data covering a broad set of ethical topics. Our results show that many earlier or smaller models often produce near-zero or negative correlations with human judgments. In contrast, advanced instruction-tuned models achieve substantially higher positive correlations, suggesting they better reflect real-world moral attitudes. We provide a detailed regional analysis revealing that models align better with Western, Educated, Industrialized, Rich, and Democratic (W.E.I.R.D.) nations than with other regions. While scaling model size and using instruction tuning improves alignment with cross-cultural moral norms, challenges remain for certain topics and regions. We discuss these findings in relation to bias analysis, training data diversity, information retrieval implications, and strategies for improving the cultural sensitivity of LLMs.

Subjects:

Computation and Language (cs.CL); Artificial Intelligence (cs.AI)

Cite as: arXiv:2506.12433 [cs.CL]

(or arXiv:2506.12433v2 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2506.12433

arXiv-issued DOI via DataCite

Submission history

From: Hadi Mohammadi [view email] [v1] Sat, 14 Jun 2025 10:16:48 UTC (3,125 KB) [v2] Sun, 4 Jan 2026 17:02:40 UTC (728 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Exploring C…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 155 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers