Nvidia SANA Video 2B

r/StableDiffusion

Efficient-Large-Model/SANA-Video_2B_720p · Hugging Face

SANA-Video is a small, ultra-efficient diffusion model designed for rapid generation of high-quality, minute-long videos at resolutions up to 720×1280. Key innovations and efficiency drivers include:

(1) Linear DiT: Uses linear attention as the core operation, which is significantly more efficient than vanilla attention when processing the massive number of tokens required for video generation.
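
For anyone wondering why linear attention scales better with token count, here is a rough NumPy sketch of the general idea. It is not SANA-Video's actual code: the ReLU feature map, the function names, and the toy sizes are all assumptions for illustration. The trick is to multiply keys and values first, so the N×N attention matrix is never built.

```python
import numpy as np

def relu_feature_map(x):
    # Assumed kernel feature map for this sketch; ReLU is one common choice.
    return np.maximum(x, 0.0)

def linear_attention(q, k, v, eps=1e-6):
    """Non-causal linear attention: cost grows linearly with token count N."""
    q, k = relu_feature_map(q), relu_feature_map(k)
    kv = k.T @ v              # (d, d_v) summary whose size does not depend on N
    k_sum = k.sum(axis=0)     # (d,) used for normalization
    num = q @ kv              # (N, d_v)
    den = q @ k_sum + eps     # (N,)
    return num / den[:, None]

def quadratic_reference(q, k, v, eps=1e-6):
    """Same math, but builds the full N x N score matrix first."""
    q, k = relu_feature_map(q), relu_feature_map(k)
    scores = q @ k.T          # (N, N), the part that blows up for video
    return (scores @ v) / (scores.sum(axis=1, keepdims=True) + eps)

# Toy sizes (hypothetical); real video models deal with far more tokens.
rng = np.random.default_rng(0)
n_tokens, dim = 256, 16
q, k, v = (rng.standard_normal((n_tokens, dim)) for _ in range(3))

out_linear = linear_attention(q, k, v)
out_quadratic = quadratic_reference(q, k, v)
print(np.allclose(out_linear, out_quadratic))
```

Both functions compute the same output; the difference is only the order of the matrix products. The linear version costs roughly O(N·d²) instead of O(N²·d), which matters when N is the number of tokens in a minute-long 720p video.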