AI SAFETY & ETHICS
Shaping the exploration of the motivation-space matters for AI safety
LessWrong AI
•
Summary We argue that shaping RL exploration, and especially the exploration of the motivation-space, is understudied in AI safety and could be influential in mitigating risks.