AI RESEARCH

Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention

arXiv CS.LG

ArXi:2402.15127v2 Announce Type: replace