Understanding Moral Reasoning Trajectories in Large Language Models: Toward Probing-Based Explainability

arXiv:2603.16017v1 Announce Type: cross Large language models (LLMs) increasingly participate in morally sensitive decision-making, yet how they organize ethical frameworks across reasoning steps remains underexplored. We