GPT-4 Has 1.8 Trillion Parameters. It Uses 2% of Them Per Token.

Towards AI

DeepSeek-R1: 671B parameters. 37B active per token.
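
To put those two numbers on the same scale as the headline figure, here is a minimal sketch of the arithmetic. The helper name `active_fraction` is made up for illustration; the only inputs are the 671B and 37B figures quoted above.

```python
def active_fraction(total_params: float, active_params: float) -> float:
    """Share of a sparse model's parameters that are used for a single token."""
    return active_params / total_params


# Figures quoted above for DeepSeek-R1: 671B total, 37B active per token.
deepseek_r1 = active_fraction(total_params=671e9, active_params=37e9)
print(f"DeepSeek-R1 active share per token: {deepseek_r1:.1%}")  # prints 5.5%
```

So roughly one parameter in eighteen does any work on a given token; the rest sit idle until a different token routes to them.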