AI RESEARCH
Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning
arXiv CS.CL
•
ArXi:2603.06505v1 Announce Type: new Automatic speech recognition (ASR) has benefited from advances in pretrained speech and language models, yet most systems remain constrained to monolingual settings and short, isolated utterances. While recent efforts in context-aware ASR show promise, two key challenges persist: limited multilingual and the absence of principled alignment between speech and contextual representations. In this paper, we