AI RESEARCH
Rate-optimal Design for Anytime Best Arm Identification
arXiv cs.LG
arXiv:2510.23199v3 Announce Type: replace-cross We consider the best arm identification problem, where the goal is to identify the arm with the highest mean reward among $K$ arms under a limited sampling budget. This problem models many practical scenarios, such as A/B testing. We consider a class of algorithms for this problem that is provably minimax optimal up to a constant factor. This approach generalizes existing work on fixed-budget best arm identification, which is limited to a particular choice of risk measures.
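To make the problem setting concrete, the sketch below simulates fixed-budget best arm identification with a plain round-robin (uniform allocation) baseline. It is not the class of algorithms studied in the paper. The Bernoulli reward model, the function names `uniform_allocation_bai` and `error_probability`, and all parameter values are assumptions chosen for illustration only.

```python
import random


def uniform_allocation_bai(means, budget, seed=0):
    """Illustrative baseline (not the paper's method).

    Pulls the K arms in round-robin order until the budget is spent,
    then recommends the arm with the highest empirical mean.
    Rewards are assumed to be Bernoulli with the given means.
    """
    rng = random.Random(seed)
    k = len(means)
    counts = [0] * k
    sums = [0.0] * k
    for t in range(budget):
        arm = t % k  # round-robin: every arm gets an equal share of the budget
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
    # Arms that were never pulled (budget < K) default to an estimate of 0.
    estimates = [sums[i] / counts[i] if counts[i] else 0.0 for i in range(k)]
    return max(range(k), key=lambda i: estimates[i])


def error_probability(means, budget, trials=2000):
    """Monte Carlo estimate of the probability of recommending a suboptimal arm."""
    best = max(range(len(means)), key=lambda i: means[i])
    errors = sum(
        uniform_allocation_bai(means, budget, seed=s) != best
        for s in range(trials)
    )
    return errors / trials
```

Calling `error_probability([0.5, 0.6, 0.7], budget=300)` gives a rough estimate of how often this baseline misidentifies the best arm. The probability of misidentification is one possible risk measure; the simple regret of the recommended arm is another.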