AI RESEARCH

An Empirical Recipe for Universal Phone Recognition

arXiv CS.CL

ArXi:2603.29042v1 Announce Type: new recognition (PR) is a key enabler of multilingual and low-resource speech processing tasks, yet robust performance remains elusive. Highly performant English-focused models do not generalize across languages, while multilingual models underutilize pretrained representations. It also remains unclear how data scale, architecture, and