Research Papers research paper arxiv ai artificial-intelligence

Exploring Cultural Variations in Moral Judgments with Large Language Models

arXivby [Submitted on 14 Jun 2025 (v1), last revised 4 Jan 2026 (this version, v2)]March 31, 20262 min read1 views

arXiv:2506.12433v2 Announce Type: cross Abstract: Large Language Models (LLMs) have shown strong performance across many tasks, but their ability to capture culturally diverse moral values remains unclear. In this paper, we examine whether LLMs mirror variations in moral attitudes reported by the World Values Survey (WVS) and the Pew Research Center's Global Attitudes Survey (PEW). We compare smaller monolingual and multilingual models (GPT-2, OPT, BLOOMZ, and Qwen) with recent instruction-tuned models (GPT-4o, GPT-4o-mini, Gemma-2-9b-it, and Llama-3.3-70B-Instruct). Using log-probability-base — Hadi Mohammadi, Ayoub Bagheri

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have shown strong performance across many tasks, but their ability to capture culturally diverse moral values remains unclear. In this paper, we examine whether LLMs mirror variations in moral attitudes reported by the World Values Survey (WVS) and the Pew Research Center's Global Attitudes Survey (PEW). We compare smaller monolingual and multilingual models (GPT-2, OPT, BLOOMZ, and Qwen) with recent instruction-tuned models (GPT-4o, GPT-4o-mini, Gemma-2-9b-it, and Llama-3.3-70B-Instruct). Using log-probability-based \emph{moral justifiability} scores, we correlate each model's outputs with survey data covering a broad set of ethical topics. Our results show that many earlier or smaller models often produce near-zero or negative correlations with human judgments. In contrast, advanced instruction-tuned models achieve substantially higher positive correlations, suggesting they better reflect real-world moral attitudes. We provide a detailed regional analysis revealing that models align better with Western, Educated, Industrialized, Rich, and Democratic (W.E.I.R.D.) nations than with other regions. While scaling model size and using instruction tuning improves alignment with cross-cultural moral norms, challenges remain for certain topics and regions. We discuss these findings in relation to bias analysis, training data diversity, information retrieval implications, and strategies for improving the cultural sensitivity of LLMs.

Subjects:

Computation and Language (cs.CL); Artificial Intelligence (cs.AI)

Cite as: arXiv:2506.12433 [cs.CL]

(or arXiv:2506.12433v2 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2506.12433

arXiv-issued DOI via DataCite

Submission history

From: Hadi Mohammadi [view email] [v1] Sat, 14 Jun 2025 10:16:48 UTC (3,125 KB) [v2] Sun, 4 Jan 2026 17:02:40 UTC (728 KB)

Original source

arXiv

https://arxiv.org/abs/2506.12433

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

CountriesFresh

Paper AI robot offerings for tomb-sweeping festival - South China Morning Post

Paper AI robot offerings for tomb-sweeping festival South China Morning Post

GNews AI China

1mabout 5 hours ago

Research PapersLive

New memristor design uses built-in oxygen gradient to bring stability to reinforcement learning

In a recent study published in Nature Communications, researchers created a memristor that uses a built-in oxygen gradient to produce slow, stable conductance changes, enabling a reinforcement learning (RL) algorithm to learn faster and more stably than conventional approaches.

Phys.org AI

1mabout 1 hour ago

Research PapersLive

Living brain cells enable machine learning computations

A research team at Tohoku University and Future University Hakodate has demonstrated that living biological neurons can be trained to perform a supervised temporal pattern learning task previously carried out by artificial systems. By integrating cultured neuronal networks into a machine learning framework, the team showed that these biological systems can generate complex time-series signals, marking a significant step forward in both neuroscience and bio-inspired computing.

Phys.org AI

1mabout 1 hour ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 155 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

Exploring Cultural Variations in Moral Judgments with Large Language Models

Submission history

Daily AI Digest

More about

Paper AI robot offerings for tomb-sweeping festival - South China Morning Post

New memristor design uses built-in oxygen gradient to bring stability to reinforcement learning

Living brain cells enable machine learning computations

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Research Papers

New memristor design uses built-in oxygen gradient to bring stability to reinforcement learning

Living brain cells enable machine learning computations

Innovations in Medical Education Conference Confronts the AI Tipping Point - University of Miami

Judiciary Ready To Go Paperless, Rolls Out AI and Digital Systems - Uganda Radionetwork