Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning

arXiv CS.LG • May 15, 2026

arXiv:2602.07441v2. Offline reinforcement learning (RL), which optimizes policies using a previously collected static dataset, is an important branch of RL.
