On the Hardness of Reinforcement Learning with Transition Look-Ahead

ArXi:2510.19372v2 Announce Type: replace-cross We study reinforcement learning (RL) with transition look-ahead, where the agent may observe which states would be visited upon playing any sequence of $\ell$ actions before deciding its