Qwen 3.6 + vLLM + Docker + 2x RTX 3090 setup, working great!
r/LocalLLaMA
•
Open Source AI
AI Tools
Our nonprofit association has an AI server with 2x RTX 3090 and I finally switched over to vLLM to get better performance for multiple users.