AI RESEARCH

Lipschitz Dueling Bandits over Continuous Action Spaces

arXiv CS.LG

ArXi:2604.00523v1 Announce Type: new We study for the first time, stochastic dueling bandits over continuous action spaces with Lipschitz structure, where feedback is purely comparative. While dueling bandits and Lipschitz bandits have been studied separately, their combination has remained unexplored. We propose the first algorithm for Lipschitz dueling bandits, using round-based exploration and recursive region elimination guided by an adaptive reference arm.