AI RESEARCH
Understanding Moral Reasoning Trajectories in Large Language Models: Toward Probing-Based Explainability
arXiv CS.AI
•
ArXi:2603.16017v1 Announce Type: cross Large language models (LLMs) increasingly participate in morally sensitive decision-making, yet how they organize ethical frameworks across reasoning steps remains underexplored. We