AI RESEARCH

MARBLE: Multi-Armed Restless Bandits in Latent Markovian Environment

arXiv CS.LG

ArXi:2511.09324v2 Announce Type: replace Restless Multi-Armed Bandits (RMABs) are powerful models for decision-making under uncertainty, yet classical formulations typically assume fixed dynamics, an assumption often violated in nonstationary environments. We