Gemma 4 26b a4b - MacBook Pro M5 Max. Averaging around 81 tok/sec

r/LocalLLaMA
Open Source AI

Pretty fast! It draws around 114 watts at its peak, but only in short bursts, since responses usually finish quickly. Submitted by /u/Bderken
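For anyone curious what those two numbers imply together, here is a rough back-of-the-envelope sketch. It assumes the 114 W peak is drawn for the whole generation (which overstates real usage, since the peak only comes in short bursts) and treats 81 tok/sec as steady throughput. The function name `joules_per_token` is made up for illustration and is not from any tool mentioned in the post.

```python
def joules_per_token(power_watts: float, tokens_per_sec: float) -> float:
    """Energy per generated token in joules (1 watt = 1 joule per second)."""
    if tokens_per_sec <= 0:
        raise ValueError("tokens_per_sec must be positive")
    return power_watts / tokens_per_sec


# Figures from the post: peak power draw and average throughput.
# Assumption: peak power is used as a pessimistic stand-in for average power.
PEAK_WATTS = 114.0
AVG_TOK_PER_SEC = 81.0

energy = joules_per_token(PEAK_WATTS, AVG_TOK_PER_SEC)
tokens_per_wh = 3600.0 / energy  # 1 Wh = 3600 J

print(f"{energy:.2f} J/token (upper bound), {tokens_per_wh:.0f} tokens per Wh")
```

So under those assumptions the ceiling is roughly 1.4 J per token. The real figure should be lower, because average draw during generation sits below the peak.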