Rethinking positional encoding as a geometric constraint rather than a signal injection

r/LocalLLaMA
Generative AI

We've been exploring an alternative framing of positional encoding where instead of additively injecting position signals into token embeddings, you treat position as a geometric constraint on the manifold the embeddings are allowed to occupy.