HumorRank: A Tournament-Based Leaderboard for Evaluating Humor Generation in Large Language Models

ArXi:2604.19786v1 Announce Type: new Evaluating humor in large language models (LLMs) is an open challenge because existing approaches yield isolated, incomparable metrics rather than unified model rankings, making it difficult to track progress across systems. We