AI RESEARCH
Emotional Cost Functions for AI Safety: Teaching Agents to Feel the Weight of Irreversible Consequences
arXiv CS.AI
•
ArXi:2603.14531v1 Announce Type: new Humans learn from catastrophic mistakes not through numerical penalties, but through qualitative suffering that reshapes who they are. Current AI safety approaches replicate none of this. Reward shaping captures magnitude, not meaning. Rule-based alignment constrains behaviour, but does not change it. We propose Emotional Cost Functions, a framework in which agents develop Qualitative Suffering States, rich narrative representations of irreversible consequences that persist forward and actively reshape character.