AI SAFETY & ETHICS

Shaping the exploration of the motivation-space matters for AI safety

LessWrong AI

Summary We argue that shaping RL exploration, and especially the exploration of the motivation-space, is understudied in AI safety and could be influential in mitigating risks.