AI RESEARCH
CheeseBench: Evaluating Large Language Models on Rodent Behavioral Neuroscience Paradigms
arXiv CS.AI
•
ArXi:2604.10825v1 Announce Type: new