The Crystallization of Transformer Architectures (2017-2025)

Lobste.rs AI
Generative AI NLP AI Research

A dataset-driven analysis of transformer architecture choices and their convergence over eight years.