DiffusionLLM - Inception Mercury 2 - 11,000 tokens per second on NVIDIA H100 GPUs.

r/LocalLLaMA • April 20, 2026

AI Hardware

Submitted by /u/Revolutionary_Ask154 [link] [comments]

Read Full Article

Back to AI News Leader