Nvidia SANA Video 2B

Efficient-Large-Model/SANA-Video_2B_720p · Hugging Face SANA-Video is a small, ultra-efficient diffusion model designed for rapid generation of high-quality, minute-long videos at resolutions up to 720×1280. Key innovations and efficiency drivers include: (1) Linear DiT: Leverages linear attention as the core operation, offering significantly efficiency than vanilla attention when processing the massive number of tokens required for video generation.