Cross-lingual Matryoshka Representation Learning across Speech and Text

arXiv cs.CL

arXiv:2602.19991v2

Speakers of under-represented languages face both a language barrier, as most online knowledge is available in only a few dominant languages, and a modality barrier, since information is largely text-based while many languages are primarily oral. We address this for French-Wolof by