How Value Induction Reshapes LLM Behaviour
arXiv CS.CL
arXiv:2605.07925v1
Conversational Large Language Models are post-trained on language that expresses specific behavioural traits (such as curiosity, open-mindedness, and empathy) and values (such as helpfulness, harmlessness, and honesty). This is done to increase utility, ensure safety, and improve the experience of the people interacting with the model. However, values are complex and inter-related: inducing one could also change the model's behaviour with respect to another.