AI RESEARCH

MultiwayPAM: Multiway Partitioning Around Medoids for LLM-as-a-Judge Score Analysis

arXiv CS.LG

ArXi:2603.10287v1 Announce Type: cross LLM-as-a-Judge is a flexible framework for text evaluation, which allows us to obtain scores for the quality of a given text from various perspectives by changing the prompt template. Two main challenges in using LLM-as-a-Judge are computational cost of LLM inference, especially when evaluating a large number of texts, and inherent bias of an LLM evaluator.