AI RESEARCH

LLMbench: A Comparative Close Reading Workbench for Large Language Models

arXiv CS.AI

ArXi:2604.15508v1 Announce Type: cross LLMbench is a browser-based workbench for the comparative close reading of large language model (LLM) outputs. Where existing tools for LLM comparison, such as Google PAIR's LLM Comparator are engineered for quantitative evaluation and user-rating metrics, LLMbench is oriented towards the hermeneutic practices of the digital humanities.