AI RESEARCH

ORPHEAS: A Cross-Lingual Greek-English Embedding Model for Retrieval-Augmented Generation

arXiv cs.CL

arXiv:2604.20666v1 Announce Type: new Effective retrieval-augmented generation across bilingual Greek-English applications requires embedding models capable of capturing both domain-specific semantic relationships and cross-lingual semantic alignment. Existing multilingual embedding models distribute their representational capacity across numerous languages, limiting their optimization for Greek and failing to encode the morphological complexity and domain-specific terminological structures inherent in Greek text.