AI RESEARCH

CheeseBench: Evaluating Large Language Models on Rodent Behavioral Neuroscience Paradigms

arXiv CS.AI

ArXi:2604.10825v1 Announce Type: new