Am I running this llama-bench of Qwen3.6-27B on these V100s right?

r/LocalLLaMA

Basically, what I'm trying to do here is validate whether it's a reasonable idea in the first place to get a couple of V100s (either SXM modules with PCIe adapters or straight-up PCIe cards) to run this model, or models like it, for codegen and other mostly-text applications. A pair of these is around $1200 for 64GB of VRAM, compared to $1100 for 24GB of VRAM from a 3090.
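To put the price comparison in per-GB terms, here's a quick sketch using only the rough prices quoted above. The `cost_per_gb` helper and the option labels are just names I made up for illustration, and this ignores everything except memory capacity (speed, power draw, adapters, software support).

```python
def cost_per_gb(price_usd: float, vram_gb: float) -> float:
    """Return dollars per GB of GPU memory."""
    return price_usd / vram_gb


# Approximate prices and capacities as quoted in the post.
options = {
    "2x V100 (64GB total)": (1200, 64),
    "1x 3090 (24GB)": (1100, 24),
}

for name, (price, vram) in options.items():
    print(f"{name}: ${cost_per_gb(price, vram):.2f} per GB")
```

By that measure the V100 pair comes out to well under half the cost per GB of the 3090, which is the whole reason I'm considering them.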