Wild Experience - Titan X Pascal

r/LocalLLaMA

I wanted to see how older GPUs hold up for AI tasks today. Seven months ago I posted about the AMD 9070 XT I had for gaming, which I also wanted to use for AI. Recently, I added an old Titan X Pascal card to my server just to see what it could do; it was just collecting dust anyway. Even if it only ran a small LLM agent that reviews code while I sleep, I thought it would be a fun experiment. After some tweaking with OpenCode and llama.cpp, I’m seeing around 500 tokens/sec for prompt processing and 25 tokens/sec for generation.
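
For a rough sense of what those two speeds mean for an overnight review agent, here is a back-of-the-envelope sketch. Only the 500 and 25 tokens/sec figures come from the numbers above; the job size (8,000 prompt tokens, 500 output tokens) and the 8-hour window are made-up assumptions for illustration, and real runs will vary with model, quantization, and context length.

```python
# Throughput figures reported in the post (tokens per second).
PROMPT_TOKENS_PER_SEC = 500
GEN_TOKENS_PER_SEC = 25


def estimate_seconds(prompt_tokens: int, output_tokens: int) -> float:
    """Estimate wall time for one request: prompt processing plus generation."""
    return prompt_tokens / PROMPT_TOKENS_PER_SEC + output_tokens / GEN_TOKENS_PER_SEC


# Hypothetical review job (assumed sizes, not measured):
# an 8,000-token prompt and a 500-token reply.
per_review = estimate_seconds(8_000, 500)

# Hypothetical 8-hour overnight window.
overnight_seconds = 8 * 3600
reviews_per_night = overnight_seconds / per_review

print(f"{per_review:.0f} s per review, about {reviews_per_night:.0f} reviews overnight")
```

With those assumed sizes, generation (20 s) costs more than prompt processing (16 s) even though the reply is far shorter than the prompt, so keeping the agent's output short matters more than trimming the input.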