AI RESEARCH

Reward Hacking in Rubric-Based Reinforcement Learning

arXiv CS.AI

ArXi:2605.12474v1 Announce Type: new Reinforcement learning with verifiable rewards has enabled strong post-