FP4 for SDXL based models?
r/StableDiffusion
•
AI Hardware
AI Research
I wanna use sdxl based models for large batches but limited in vram. Is there a workaround to convert current bf16 illustrious and other sdxl based models to nvfp4? I tried Model Optimizer for nvidia and got HF type folder with unet, text encoder and view but neither it's working through load checkpoint node or load diffusion model (with vae and dual clip separately). submitted by /u/Artistic-Chain-4708 [link] [comments]