AI RESEARCH
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
arXiv CS.CV
•
ArXi:2605.15980v1 Announce Type: new Group Relative Policy Optimization has emerged as essential for aligning video diffusion models with human preferences, but faces a critical computational bottleneck