AI RESEARCH

An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs

arXiv CS.LG

ArXi:2510.08882v2 Announce Type: replace We study decision making with structured observation (DMSO). Previous work (Foster, 2021b, 2023a) has characterized the complexity of DMSO via the decision-estimation coefficient (DEC), but left a gap between the regret upper and lower bounds that scales with the size of the model class.