AI RESEARCH

Sampling-Aware Quantization for Diffusion Models

arXiv CS.CV

ArXi:2505.02242v2 Announce Type: replace Diffusion models have recently emerged as the dominant approach in visual generation tasks. However, the lengthy denoising chains and the computationally intensive noise estimation networks hinder their applicability in low-latency and resource-limited environments. Previous research has endeavored to address these limitations in a decoupled manner, utilizing either advanced samplers or efficient model quantization techniques.