AI RESEARCH

S-GRPO: Unified Post-Training for Large Vision-Language Models

arXiv cs.LG

arXiv:2604.16557v1 Announce Type: new Current post-