Video of how my LLM's decoder blocks changed while training

r/LocalLLaMA
Generative AI AI Research

This is in response to my popular post: It was requested that I make a video of this data, so here it is. Enjoy! Edit: I see that reddit nuked it with compression. Let me know if my X post is any better: submitted by /u/1ncehost [link] [comments]