AI RESEARCH
Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models
arXiv CS.AI
•
ArXi:2603.14504v1 Announce Type: cross Optimizing the noise samples of diffusion and flow models is an increasingly popular approach to align these models to target rewards at inference time. However, we observe that these approaches are usually restricted to differentiable or cheap reward models, the formulation of the underlying pretrained generative model, or are memory/compute inefficient. We instead propose a simple trust-region based search algorithm (TRS) which treats the pre-trained generative and reward models as a black-box and only optimizes the source noise.