AI RESEARCH
On the Hardness of Reinforcement Learning with Transition Look-Ahead
arXiv CS.LG
arXiv:2510.19372v2 Announce Type: replace-cross We study reinforcement learning (RL) with transition look-ahead, where the agent may observe which states would be visited upon playing any sequence of $\ell$ actions before deciding its