AI RESEARCH
When Test-Time Guidance Is Enough: Fast Image and Video Editing with Diffusion Guidance
arXiv CS.AI
arXiv:2602.14157v2 Announce Type: replace-cross Text-driven image and video editing can be naturally cast as inpainting problems, where masked regions are reconstructed to remain consistent with both the observed content and the editing prompt. Recent advances in test-time guidance for diffusion and flow models provide a principled framework for this task; however, existing methods rely on costly vector-Jacobian product (VJP) computations to approximate the intractable guidance term, limiting their practical applicability.
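
To make the cost mentioned above concrete, here is a minimal sketch of the kind of VJP-based guidance term that inpainting-style guidance typically involves. It is an illustration under stated assumptions, not the method of this paper: `toy_denoiser`, the weight matrix `W`, the binary `observed` array, and the Gaussian data-fit term are all hypothetical stand-ins. In a real model the Jacobian is never formed explicitly and the VJP requires a backward pass through the full denoiser network at every sampling step, which is where the expense comes from.

```python
import numpy as np

def toy_denoiser(x, W):
    """Hypothetical stand-in for a diffusion denoiser: x0_hat = tanh(W @ x)."""
    return np.tanh(W @ x)

def data_fit(x, y, observed, W):
    """Assumed Gaussian data-fit term: -0.5 * ||observed * (y - x0_hat)||^2."""
    return -0.5 * np.sum((observed * (y - toy_denoiser(x, W))) ** 2)

def guidance_vjp(x, y, observed, W):
    """Gradient of data_fit w.r.t. x, written as an explicit VJP J^T v.

    J = diag(1 - tanh^2(W @ x)) @ W is the denoiser Jacobian and
    v = observed * (y - x0_hat) is the residual on the observed entries.
    `observed` must be binary (1 = keep this entry, 0 = region to edit).
    """
    x0_hat = toy_denoiser(x, W)
    v = observed * (y - x0_hat)
    # J^T v = W^T (diag(1 - tanh^2) v); for this toy model it is analytic.
    return W.T @ ((1.0 - x0_hat ** 2) * v)

# Toy usage: 4 "pixels", the first two observed, the last two to be edited.
rng = np.random.default_rng(0)
W = 0.5 * rng.normal(size=(4, 4))
x = rng.normal(size=4)                      # current noisy sample
y = rng.normal(size=4)                      # reference content
observed = np.array([1.0, 1.0, 0.0, 0.0])
grad = guidance_vjp(x, y, observed, W)      # direction added to the sampler update
```

The toy model has a closed-form Jacobian, so the VJP is a single matrix product here; the sketch only shows where the term enters, not how expensive it is in practice.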