AI RESEARCH
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR
arXiv CS.AI
•
ArXi:2605.20164v1 Announce Type: new Reinforcement learning with verifiable rewards has made post-