Why can't we have small SOTA-like models for coding?

Maybe a dumb question but, i'm wondering why can't we have a specialized model just for a specific programming language like python, that can perform on par with opus 4.6? or to frame my question better, we have coder Qwen3-Coder-480B-A35B-Instruct, does it make sense to train Qwen3-Coder-30B-A3B-Instruct-Python that's as good as 480B-A35B or opus, in python de? submitted by /u/itsArmanJr [link] [comments]