Cross-Lingual LLM-Judge Transfer via Evaluation Decomposition

arXiv:2603.18557v1 Announce Type: new As large language models are increasingly deployed across diverse real-world applications, extending automated evaluation beyond English has become a critical challenge. Existing evaluation approaches are predominantly English-focused, and adapting them to other languages is hindered by the scarcity and cost of human-annotated judgments in most languages. We