Why is it that Flux2K is so good at image editing but Z image Turbo isn't when they both use Qwen text encoders??

So I've been trying to wrap my head around this because on paper they should behave similarly - both Flux 2 Klein and Z Image Turbo use Qwen as the text encoder so the language understanding side is basically the same. But in practice Flux 2 Klein is dramatically better at image editing tasks and I genuinely couldn't figure out why. I ended up watching a video by this guy.