club-5060ti follow-up: cleaner RTX 5060 Ti local LLM recipes, benchmark explorer, and CUDA GPU compatibility notes

r/LocalLLaMA
Generative AI AI Hardware Open Source AI AI Research AI Tools

I posted earlier about RTX 5060 Ti local LLM testing, and I have cleaned the repo up quite a bit since then. The project is now a structured benchmark/recipe repo rather than scattered notes. It has a static results explorer, schema-validated benchmark JSON, clearer llama.cpp/vLLM notes, single-card and dual-card RTX 5060 Ti recipes, a model-agnostic download helper, and better labels for generation speed, prompt eval speed, MTP/no-MTP, and thinking mode. Repo: Results explorer: The tested baseline is still RTX 5060 Ti 16GB, especially 2x 5060 Ti for the larger Qwen3.6 runs.