Shaping the exploration of the motivation-space matters for AI safety (17 minute read)
TLDR AI
AI Safety
Reinforcement Learning
Shaping motivation-space exploration in RL could enhance AI safety by preventing harmful misalignment.