[Help] Severe Latency during Prompt Ingestion - OpenClaw/Ollama on AMD Minisforum (AVX-512) & 64GB RAM (No GPU)

r/LocalLLaMA
Generative AI AI Hardware

Hi everyone! I posted this message in a few other subjects last days, and I didn't found any answer for the moment. Sorry if you already saw this topic anywhere else, I think it's maybe my last chance here! I’m seeking some technical insight regarding a performance bottleneck I’m hitting with a local AI agent setup. Despite having a fairly capable "mini-server" and applying several optimizations, my response times are extremely slow.