AI RESEARCH

Controllability in preference-conditioned multi-objective reinforcement learning

arXiv CS.LG

ArXi:2605.10585v1 Announce Type: new Multi-objective reinforcement learning (MORL) allows a user to express preference over outcomes in terms of the relative importance of the objectives, but standard metrics cannot capture whether changes in preference reliably change the agent's behavior in the intended way, a property termed controllability. As a result, preference-conditioned agents can score well on standard MORL metrics while being insensitive to the preference input.