Trained a GPT transformer from scratch on a $300 CPU — 39 minutes, 0.82M params, no GPU needed
r/LocalLLaMA
•
Generative AI
NLP
AI Hardware
AI Tools
Character-level GPT transformer built in PyTorch from scratch - pure architecture and