RL agents go from face-planting to parkour when researchers keep adding network layers
The Decoder
•
Machine Learning
AI Research
Reinforcement Learning
While most reinforcement learning algorithms use two to five network layers, a research team achieved 2x to 50x performance gains by scaling network depth up to 1,024 layers in a self-supervised agent and saw entirely new behaviors emerge in the process.