AI RESEARCH

Does RLVR Extend Reasoning Boundaries? Investigating Capability Expansion in Vision-Language Models

arXiv CS.AI • April 15, 2026

arXiv:2511.00710v4 Announce Type: replace Recent studies posit that Reinforcement Learning with Verifiable Rewards (RLVR) primarily amplifies behaviors inherent to the pre-
