AI RESEARCH
Agnostic Language Identification and Generation
arXiv CS.LG
•
ArXi:2601.23258v2 Announce Type: replace Recent works on language identification and generation have established tight statistical rates at which these tasks can be achieved. These works typically operate under a strong realizability assumption: that the input data is drawn from an unknown distribution necessarily ed on some language in a given collection. In this work, we relax this assumption of realizability entirely, and impose no restrictions on the distribution of the input data.