AI RESEARCH

A Calculus-Based Framework for Determining Vocabulary Size in End-to-End ASR

arXiv CS.CL

ArXi:2605.14427v1 Announce Type: new In hybrid automatic speech recognition (ASR) systems, the vocabulary size is unambiguous, typically determined by the number of phones, bi-phones, or tri-phones present in the language. In contrast, end-to-end ASR systems derive their vocabulary, often referred to as tokens from the text corpus used for