AI RESEARCH

HealthAdminBench: Evaluating Computer-Use Agents on Healthcare Administration Tasks

arXiv cs.AI

arXiv:2604.09937v1 Announce Type: new Healthcare administration accounts for over $1 trillion in annual spending, making it a promising target for LLM-based computer-use agents (CUAs). While clinical applications of LLMs have received significant attention, no benchmark exists for evaluating CUAs on end-to-end administrative workflows. To address this gap, we