M5 32GB LM Studio, double checking my speeds
r/LocalLLaMA
I have an M5 MBP with 32GB running macOS 26.4, using LM Studio, and I suspect my speeds are low:

- 8 t/s: Gemma3 27B 4-bit MLX
- 32 t/s: Nemotron 3 Nano 4B GGUF
- 39 t/s: GPT OSS 20B MLX

All models were loaded with the default context settings. Runtime versions used: MLX v1.4.0 M5 Metal Llama v2.8.0

Can anyone tell me whether they get the same speeds with a similar configuration? Numbers from a MacBook Air instead of a Pro are welcome too.