Pure-attention 70B for agentic C#/.NET coding: what are you running?

r/LocalLLaMA
Generative AI Open Source AI

I'm putting together a WRX80 build (TR PRO 3975WX + RTX PRO 6000 96GB) and trying to figure out what model to target for my main workload. I have a VS extension that acts as an agentic coding assistant - it reads files, patches code, runs builds, fixes errors, and loops autonomously through 5-15 iterations. All C#/. NET 10. Right now I'm on Qwen 3.5 27B Q4_K_M via ik_llama.cpp at 65K context, and it honestly works pretty well for the agentic stuff. The reasoning quality at 27B is solid for this kind of structured task.