AI RESEARCH
Does RLVR Extend Reasoning Boundaries? Investigating Capability Expansion in Vision-Language Models
arXiv CS.AI
•
ArXi:2511.00710v4 Announce Type: replace Recent studies posit that Reinforcement Learning with Verifiable Rewards (RLVR) primarily amplifies behaviors inherent to the pre-