Run Qwen3.5-4B on AMD NPU
r/LocalLLaMA
•
AI Hardware
AI Research
Tested on Ryzen AI 7 350 (XDNA2 NPU), 32GB RAM, using Lemonade v10.0.1 and FastFlowLM v0.9.36. Features Low-power Well below 50°C without screen recording Tool-calling Up to 256k tokens (not on this 32GB machine) VLMEvalKit score: 85.6% FLM s all XDNA 2 NPUs. Some links: Perf. benchmark: Computer (ASUS) under test: 🍋Lemonade server: FastFlowLM: submitted by /u/BandEnvironmental834 [link] [comments]