AI RESEARCH
Speech-to-LaTeX: New Models and Datasets for Converting Spoken Equations and Sentences
arXiv CS.CV
•
ArXi:2508.03542v2 Announce Type: replace Conversion of spoken mathematical expressions is a challenging task that involves transcribing speech into a strictly structured symbolic representation while addressing the ambiguity inherent in the pronunciation of equations. Although significant progress has been achieved in automatic speech recognition (ASR) and language models (LM), the problem of converting spoken mathematics into LaTeX remains underexplored. This task directly applies to educational and research domains, such as lecture transcription or note creation.