AI RESEARCH
PermuQuant: Lowering Per-Group Quantization Error by Reordering Channels for Diffusion Models
arXiv CS.CV
•
ArXi:2605.09503v1 Announce Type: new Large-scale visual generative models have achieved remarkable performance. However, their high computational and memory costs make deployment challenging in resource-constrained scenarios, such as interactive applications and personal single-GPU usage. Post-