Shaping the exploration of the motivation-space matters for AI safety

Summary: We argue that shaping RL exploration, and especially exploration of the motivation-space, is understudied in AI safety and could meaningfully help mitigate risks.