Automated evaluation of LLMs for effective machine translation of Mandarin Chinese to English

ArXi:2603.09998v1 Announce Type: cross Although Large Language Models (LLMs) have exceptional performance in machine translation, only a limited systematic assessment of translation quality has been done. The challenge lies in automated frameworks, as human-expert-based evaluations can be time-consuming, given the fast-evolving LLMs and the need for a diverse set of texts to ensure fair assessments of translation quality.