AI RESEARCH

$\varepsilon$-Good Action Identification in Fixed-Budget Monte Carlo Tree Search

arXiv CS.LG

ArXi:2605.11324v1 Announce Type: new We study the fixed-budget max-min action identification problem in depth-2 max-min trees, an important special case of Monte Carlo Tree Search. A learner sequentially allocates $T$ samples to leaves and then recommends a subtree whose minimum leaf value is largest. Motivated by approximate planning, we focus on $\varepsilon$-good subtree identification, where any subtree whose min value is within $\varepsilon$ of the optimal maximin value is acceptable.