Do you think there is room for optimization? llama.cpp/qwen3.6 27b on two 6000 Blackwell
r/LocalLLaMA
•
Generative AI
Open Source AI
AI Research
Hi, i run llama.cpp inside LXC on a Proxmox server. The hardware is a recent AMD Epyc with two 6000 Blackwell MaxQ.