AI RESEARCH

PermuQuant: Lowering Per-Group Quantization Error by Reordering Channels for Diffusion Models

arXiv cs.CV

arXiv:2605.09503v1 Announce Type: new Large-scale visual generative models have achieved remarkable performance. However, their high computational and memory costs make deployment challenging in resource-constrained scenarios, such as interactive applications and personal single-GPU usage. Post-