CounterMoral: Editing Morals in Language Models

arXiv, March 31, 2026
Authors: Michael Ripa, Jim Davies

Abstract: Recent advancements in language model technology have significantly enhanced the ability to edit factual information. Yet, the modification of moral judgments, a crucial aspect of aligning models with human values, has garnered less attention. In this work, we introduce CounterMoral, a benchmark dataset crafted to assess how well current model editing techniques modify moral judgments across diverse ethical frameworks. We apply various editing techniques to multiple language models and evaluate their performance. Our findings contribute to the evaluation of language models designed to be ethical.

Comments: 17 pages (10 + 1 reference + 6 appendix). Honors thesis completed in June 2024; write-up completed in 2025

Subjects: Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.27338 [cs.AI]

(or arXiv:2603.27338v1 [cs.AI] for this version)

https://doi.org/10.48550/arXiv.2603.27338

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Michael Ripa
[v1] Sat, 28 Mar 2026 17:13:30 UTC (192 KB)
