AI RESEARCH

Delightful Exploration

arXiv CS.AI

ArXi:2605.13287v1 Announce Type: cross Most exploration algorithms search broadly until uncertainty is resolved. When the action space is too large to resolve within budget, practitioners default to $\varepsilon$-greedy, which bounds disruption but spends its override blindly. We