AI RESEARCH
SPAARS: Safer RL Policy Alignment through Abstract Exploration and Refined Exploitation of Action Space
arXiv CS.AI
•
ArXi:2603.09378v1 Announce Type: cross Offline-to-online reinforcement learning (RL) offers a promising paradigm for robotics by pre-