Questions about usage of Intel GPUs for small 4gpu cluster

r/LocalLLaMA
Generative AI AI Hardware Open Source AI

Hey guys! I’m currently in the position where I should make a recommendation for buying hardware for a company of about 30 people. It is supposed to be used primarily for code review of git commits. As well as agentic coding for some of those people. I was currently testing with my two 5070ti gpus, when it comes to qwen-3-coder-30b they give me 50 tokens a second. I was now wondering how intel gpus would compare to that. How much of a performance difference can I actually expect between Nvidia and intel gpus? I’m currently looking at the intel arc b60.