AI RESEARCH
Beyond State Consistency: Behavior Consistency in Text-Based World Models
arXiv CS.LG
•
ArXi:2604.13824v1 Announce Type: new World models have been emerging as critical components for assessing the consequences of actions generated by interactive agents in online planning and offline evaluation. In text-based environments, world models are typically evaluated and trained with single-step metrics such as Exact Match, aiming to improve the similarity between predicted and real-world states, but such metrics have been shown to be insufficient for capturing actual agent behavior. To address this issue, we.