Video of how my LLM's decoder blocks changed while training
r/LocalLLaMA
•
Generative AI
AI Research
This is in response to my popular post: It was requested that I make a video of this data, so here it is. Enjoy! Edit: I see that reddit nuked it with compression. Let me know if my X post is any better: submitted by /u/1ncehost [link] [comments]