3090 NVLink testing w/ Q3.5 27B
r/LocalLLaMA
•
AI Hardware
Was playing around with NVLink and was somewhat surprised it made a meaningful difference, even for generation speeds. If you are confused why same PLX chip is the slowest, with stock drivers, consumer gpu's can't communicate directly with each other over pcie, they are fighting over the same x16 link back to the