GATED_DELTA_NET for vulkan merged in llama.cpp

r/LocalLLaMA
Generative AI Open Source AI

It would be already in the latest release. There is a performance boost in my AMD RX7800XT setup (Fedora Linux). For Qwen 3.5 27B, token generation was ~28t/s. It is now ~36t/s. submitted by /u/FancyImagination880 [link] [comments]