AI RESEARCH
S-GRPO: Unified Post-Training for Large Vision-Language Models
arXiv cs.LG
arXiv:2604.16557v1 Announce Type: new Current post-