Meituan open sources LongCat-Image-Edit-Turbo, a distilled image editing model that hits open source SOTA in only 8 inference steps

r/singularity
Generative AI AI Research

Meituan's LongCat team just dropped another one. LongCat-Image-Edit-Turbo is the distilled version of their LongCat-Image-Edit model, and it achieves high quality instruction based image editing with only 8 NFEs (number of function evaluations), roughly a 10x speedup over the base editing model. The whole thing runs on about 18GB VRAM with CPU offloading enabled. For context, the LongCat-Image family is built on a foundation model with a compact 6B parameter diffusion core for text to image generation, which already outperforms numerous open source models several times its size.