AI RESEARCH
Delightful Exploration
arXiv CS.AI
arXiv:2605.13287v1 Announce Type: cross Most exploration algorithms search broadly until uncertainty is resolved. When the action space is too large to resolve within budget, practitioners default to $\varepsilon$-greedy, which bounds disruption but spends its override blindly. We