AI RESEARCH
HoloByte: Continuous Hyperspherical Distillation for Tokenizer-Free Modeling
arXiv CS.LG
•
ArXi:2603.16917v1 Announce Type: new Sequence modeling universally relies on discrete subword tokenization to circumvent the $\mathcal{O}(N^2)$ computational intractability of native byte-level attention. However, this heuristic quantization imposes artificial morphological boundaries, enforces vocabulary dependence, and fractures the continuity of the optimization landscape. To resolve this dichotomy, we