NCCL-Free Tensor Parallelism on Dual Blackwell PCIe llama.cpp b9095 released!
r/LocalLLaMA
•
Generative AI
Open Source AI
B9095 finally makes -sm tensor work on dual consumer Blackwell PCIe GPUs without NCCL If youre on dual Blackwell gpus this look like it could be big. I'll have my own results for 2x5060ti asap submitted by /u/Bulky-Priority6824 [link] [comments]