AI RESEARCH
Reward Hacking in Rubric-Based Reinforcement Learning
arXiv CS.AI
•
ArXi:2605.12474v1 Announce Type: new Reinforcement learning with verifiable rewards has enabled strong post-