AI RESEARCH
SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation
arXiv CS.LG
arXiv:2505.16637v4 Announce Type: replace-cross Large language models (LLMs) have recently demonstrated remarkable capabilities in machine translation (MT). However, most advanced MT-specific LLMs heavily rely on external supervision signals during