do GLM-4.7 Flash Q4_K_M have problem with claude or agent?

I'm brand new to local LLMs and started with GLM-4.7 Flash q4_K_M. When I run it directly: ollama run glm-4.7-flash:q4_K_M it works pretty decently - nothing amazing, but usable and responsive. The problem starts when I switch to the Claude interface with: ollama launch claude --model glm-4.7-flash:q4_K_M Suddenly the model feels super dumb. It has basically zero memory between messages, can't create/save files, and forgets everything from the previous turn. Concrete example: I asked it to “build a CLI Snake game in Python”. It gave me clean, working code.