Stitched Value Model for Diffusion Alignment

ArXi:2605.19804v1 Announce Type: cross For practical use, diffusion- or flow-based generative models must be aligned with task-specific rewards, such as prompt fidelity or aesthetic preference. That alignment is challenging because the reward is defined for clean output images, but the alignment procedure requires value function estimates at noisy intermediate latents.