AI RESEARCH
Token Sample Complexity of Attention
arXiv cs.LG
arXiv:2512.10656v2 Announce Type: replace
As context windows in large language models continue to expand, it is essential to characterize how attention behaves at extreme sequence lengths. We