AI RESEARCH
VORT: Adaptive Power-Law Memory for NLP Transformers
arXiv CS.LG
•
ArXi:2605.08966v1 Announce Type: new Standard Transformers impose near-exponential decay on the influence of distant tokens, conflicting with the power-law structure of long-range dependencies in natural language. We