Quantization from the ground up
Simon Willison Blog
•
Generative AI
Quantization from the ground up Sam Rose continues his streak of publishing spectacularly informative interactive essays, this time explaining how quantization of Large Language Models works. Also included is the best visual explanation I've ever seen of how floating point numbers are represented using binary digits.