Anyone know how to run Qwen3.5 as an agent? I can't seem to get llama cpp working for this.
r/LocalLLaMA
•
Generative AI
Open Source AI
I've been trying to use Qwen3.5-35b as an agent on some old hardware. I've been using llama-server with --jinja and zeroclaw. But it randomly breaks when Qwen tries to use tools - seemingly randomly; maybe the length of the tool call plays a role. It's returning error code 400 and 500. I think it might involve issues with streaming - seems to work when streaming is disabled. For context, I'm using pop!_OS (recently switched from Win11; I get nearly 50% increase in t/s generation!). Using an RTX3070 and RTX5060Ti (weird setup but it works.