AI RESEARCH

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

arXiv CS.CV

ArXi:2605.15980v1 Announce Type: new Group Relative Policy Optimization has emerged as essential for aligning video diffusion models with human preferences, but faces a critical computational bottleneck