I Tried Running a 70B Model on a Gaming GPU… It Actually Worked
Towards AI
•
Generative AI
AI Hardware
The strange engineering trick that breaks the rules of LLM deployment