Optimal Control of Fluid Restless Multi-armed Bandits: A Machine Learning Approach

arXiv:2502.03725v2

We present a novel machine learning framework for the optimal control of fluid restless multi-armed bandit problems (FRMABPs) with state equations that are either affine or quadratic in the state variables. By establishing fundamental properties of FRMABPs, we develop an efficient numerical algorithm that generates a comprehensive