AI RESEARCH
Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning
arXiv CS.LG
•
ArXi:2402.02017v3 Announce Type: replace Offline reinforcement learning (RL) has progressed with return-conditioned supervised learning (RCSL), but its lack of stitching ability remains a limitation. We