AI RESEARCH

SEA-BED: How Do Embedding Models Represent Southeast Asian Languages?

arXiv CS.CL

ArXi:2508.12243v3 Announce Type: replace Multilingual text embeddings are often assumed to encode meaning in a perspective-independent semantic space, yielding stable similarity judgments across tasks and languages. Our results show that this assumption does not hold in practice. We