AI RESEARCH

Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning

arXiv CS.LG

ArXi:2602.07441v2 Announce Type: replace Offline reinforcement learning (RL), which optimizes policies using a previously collected static dataset, is an important branch of RL.