AI RESEARCH
Monotone and Conservative Policy Iteration Beyond the Tabular Case
arXiv CS.LG
•
ArXi:2506.07134v3 Announce Type: replace