AI RESEARCH
Reinforcement Learning Measurement Model
arXiv CS.LG
•
ArXi:2605.09305v1 Announce Type: cross Interactive assessments generate sequential process data that are not well handled by conventional item response models. Existing MDP-based measurement approaches, such as the Marko decision process measurement model (MDP-MM, LaMar, 2018), link action choices to state-action values, but their reliance on person-specific tabular value functions makes them difficult to scale beyond small, fully enumerated tasks.