AI RESEARCH
A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR
arXiv CS.CL
•
ArXi:2605.14427v1 Announce Type: new In hybrid automatic speech recognition (ASR) systems, the vocabulary size is unambiguous, typically determined by the number of phones, bi-phones, or tri-phones present in the language. In contrast, end-to-end ASR systems derive their vocabulary, often referred to as tokens from the text corpus used for