Quantization from the ground up

Simon Willison Blog
Generative AI

Quantization from the ground up Sam Rose continues his streak of publishing spectacularly informative interactive essays, this time explaining how quantization of Large Language Models works. Also included is the best visual explanation I've ever seen of how floating point numbers are represented using binary digits.