Quantum spatial best-arm identification via quantum walks

ArXi:2509.05890v3 Announce Type: replace-cross Quantum reinforcement learning has emerged as a framework combining quantum computation with sequential decision-making, and applications to the multi-armed bandit (MAB) problem have been reported. The graph bandit problem extends the MAB setting by