AI RESEARCH

Remember to Forget: Gated Adaptive Positional Encoding

arXiv CS.LG

arXiv:2605.10414v1 Announce Type: new Rotary Positional Encoding (RoPE) is widely used in modern large language models. However, when sequences are extended beyond the range seen during