AI RESEARCH
An evolutionary perspective on modes of learning in Transformers
arXiv CS.AI
•
ArXi:2505.09855v2 Announce Type: replace-cross The success of Transformers lies in their ability to improve inference through two complementary strategies: the permanent refinement of model parameters via in-weight learning (IWL), and the ephemeral modulation of inferences via in-context learning (ICL), which leverages contextual information maintained in the model's activations. Evolutionary biology tells us that the predictability of the environment across timescales predicts the extent to which analogous strategies should be preferred.