AI RESEARCH

Turbulence-like 5/3 spectral scaling in contextual representations of language as a complex system

arXiv CS.AI

ArXi:2604.05536v1 Announce Type: cross Natural language is a complex system that exhibits robust statistical regularities. Here, we represent text as a trajectory in a high-dimensional embedding space generated by transformer-based language models, and quantify scale-dependent fluctuations along the token sequence using an embedding-step signal. Across multiple languages and corpora, the resulting power spectrum exhibits a robust power law with an exponent close to $5/3$ over an extended frequency range.