Is there actually something meaningfully better for coding stepping up from 12GB -> 16GB?

r/LocalLLaMA
AI Hardware Open Source AI

Right now I'm running a 12GB GPU with models Qwen3-30B-A3B and Omnicoder, I'm looking at a 16GB new card and yet I don't see what better model I could run on that: QWEN 27B would take at least ~24GB. Pretty much I would run the same 30B A3B with a slight better quantization, little context. Am I missing some cool model? Can you recommend some LMs for coding in the zones of: * 12GB * 16GB * 12 + 16GB:P (If I was to keep both) Note: If I had to tell: context size 40-120k. submitted by /u/ea_man [link] [comments.