AI RESEARCH
HumorRank: A Tournament-Based Leaderboard for Evaluating Humor Generation in Large Language Models
arXiv CS.CL
•
ArXi:2604.19786v1 Announce Type: new Evaluating humor in large language models (LLMs) is an open challenge because existing approaches yield isolated, incomparable metrics rather than unified model rankings, making it difficult to track progress across systems. We