Comparing Qwen3.5 27B vs Gemma 4 31B for agentic stuff
r/LocalLLaMA
•
Generative AI
AI Hardware
Open Source AI
Models compared: Qwen3.5-27B-UD-Q5_K_XL gemma-4-31B-it-UD-Q5_K_XL Main flags for boths --flash-attn on \ --n-gpu-layers 99 \ --no-mmap \ -c 150000 \ --temp 1 --top-p 0.9 --min-p 0.1 --top-k 20 \ --ctx-checkpoints 1 \ --jinja \ -np 1 \ --reasoning on \ --mmproj 'mmproj-BF16.gguf' \ --image-min-tokens 300 --image-max-tokens 512 I know they may not be the best and I still need experiments (thank you u/Sadman782 ) I find these tests fun and interesting.