AI RESEARCH
Adaptive Estimation and Optimal Control in Offline Contextual MDPs without Stationarity
arXiv CS.LG
•
ArXi:2605.03393v1 Announce Type: cross Contextual MDPs are powerful tools with wide applicability in areas from biostatistics to machine learning. However, specializing them to offline datasets has been challenging due to a lack of robust, theoretically backed methods. Our work tackles this problem by