Trained a GPT transformer from scratch on a $300 CPU — 39 minutes, 0.82M params, no GPU needed

r/LocalLLaMA
Generative AI NLP AI Hardware AI Tools

Character-level GPT transformer built in PyTorch from scratch - pure architecture and