Which framework will give me the best performance and utilize both a 5060 Ti and a 4060?
r/LocalLLaMA
I'm currently using llama.cpp, and it meets all my LLM needs, but I wonder whether I can improve performance and get faster token generation with another framework. Submitted by /u/ResponsibleTruck4717