which framework will give me best performance and utilize both 5060ti and 4060

Currently I'm using llama.cpp it's answer all my needs from llm, but I wonder can I improve the performance, get faster tokens using other frameworks? submitted by /u/ResponsibleTruck4717 [link] [comments]