Performance Benchmark - Qwen3.5 & Gemma4 on dual GPU setup (RTX 4070 + RTX 3060)

r/LocalLLaMA
Generative AI AI Hardware AI Research

Hi everyone, Been following a lot of local LLM talk in this forum lately - learned quite a bit from you all! This is my first post, hopefully not my last. I wanted to share some interesting benchmarks I did in my free time testing out a dual-GPU setup. Hardware Specs: CPU: 7700x (slightly undervolted to save temps, but performance is like stock) RAM: 32 GB DDR5 @ 6000 MHz Motherboard: MSI B650 Tomahawk Wifi GPU Setup: Primary: RTX 4070 (12 GB) at PCI 4.0 x16 Secondary: RTX 3060 (12 GB) at PCI 4.0 x2 (Note: This is a new addition.