AI RESEARCH

OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards

arXiv CS.AI

ArXi:2603.19191v1 Announce Type: new Reinforcement Learning (RL) has the potential to improve the robustness of GUI agents in stochastic environments, yet