AI RESEARCH

Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning

arXiv CS.AI

arXiv:2305.09840v4 Announce Type: replace

Balancing exploration and exploitation has been an important problem in both game tree search and automated planning. However, while the problem has been extensively analyzed within the Multi-Armed Bandit (MAB) literature, the planning community has had limited success when attempting to apply those results. We show that a detailed theoretical understanding of the MAB literature helps improve existing planning algorithms that are based on Monte Carlo Tree Search (MCTS) / Trial-Based Heuristic Tree Search