Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning

arXiv:2402.02017v3 Announce Type: replace Offline reinforcement learning (RL) has progressed with return-conditioned supervised learning (RCSL), but its lack of stitching ability remains a limitation. We