AI RESEARCH

Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement

arXiv CS.AI

ArXi:2505.08245v3 Announce Type: replace-cross The advancement of large language models (LLMs) has outpaced traditional evaluation methodologies. This progress presents novel challenges, such as measuring human-like psychological constructs, moving beyond static and task-specific benchmarks, and establishing human-centered evaluation. These challenges intersect with psychometrics, the science of quantifying the intangible aspects of human psychology, such as personality, values, and intelligence. This review paper.