Closing the Gap on the Sample Complexity of 1-Identification
arXiv cs.LG
arXiv:2601.15620v2 Announce Type: replace

The 1-identification problem is a fundamental pure-exploration problem in multi-armed bandits. An agent aims to determine whether there exists an arm whose mean reward exceeds a known threshold $\mu_0$, and to output \textsf{None} if no such arm exists. The agent must guarantee correctness with probability at least $1-\delta$ while minimizing the expected number of arm pulls $\mathbb{E}[\tau]$. We study the 1-identification problem and make two main contributions.
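To make the problem setup concrete, the following is a minimal sketch of a naive baseline for 1-identification. It is not the algorithm studied in the paper. It assumes rewards bounded in $[0, 1]$, pulls the arms in round-robin order, and stops once a Hoeffding confidence interval (with a union bound over arms and pull counts) separates the best empirical mean from $\mu_0$. The names `confidence_radius`, `one_identification_baseline`, `pull`, and `max_rounds` are hypothetical and chosen only for illustration.

```python
import math
import random


def confidence_radius(n_pulls, n_arms, delta):
    """Hoeffding radius valid simultaneously for all arms and pull counts.

    Assumes rewards in [0, 1]; the n_pulls**2 term pays for a union bound
    over every possible stopping round.
    """
    return math.sqrt(math.log(4.0 * n_arms * n_pulls ** 2 / delta) / (2.0 * n_pulls))


def one_identification_baseline(pull, n_arms, mu_0, delta, max_rounds=100_000):
    """Naive round-robin baseline (an illustration, not the paper's method).

    pull(arm) returns one reward in [0, 1] for the given arm index.
    Returns (answer, total_pulls), where answer is an arm index whose mean
    is declared to exceed mu_0, or None if no arm is declared to exceed it.
    """
    sums = [0.0] * n_arms
    total_pulls = 0
    for n in range(1, max_rounds + 1):
        # Pull every arm once, so each arm has exactly n samples.
        for arm in range(n_arms):
            sums[arm] += pull(arm)
            total_pulls += 1
        radius = confidence_radius(n, n_arms, delta)
        means = [s / n for s in sums]
        best = max(range(n_arms), key=lambda a: means[a])
        if means[best] - radius > mu_0:
            # Lower confidence bound of the best arm clears the threshold.
            return best, total_pulls
        if means[best] + radius < mu_0:
            # Upper confidence bound of every arm is below the threshold.
            return None, total_pulls
    # An arm with mean exactly mu_0 can prevent stopping; guard with a budget.
    raise RuntimeError("pull budget exhausted before a decision was reached")


if __name__ == "__main__":
    rng = random.Random(0)
    true_means = [0.2, 0.3, 0.8]  # hypothetical Bernoulli instance
    answer, tau = one_identification_baseline(
        lambda arm: 1.0 if rng.random() < true_means[arm] else 0.0,
        n_arms=len(true_means), mu_0=0.5, delta=0.05,
    )
    print("answer:", answer, "pulls used:", tau)
```

Uniform sampling is simple to analyze but wasteful: it spends as many pulls on clearly poor arms as on the arm that decides the answer, which is why the expected stopping time $\mathbb{E}[\tau]$ of adaptive strategies is the quantity of interest.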