AI RESEARCH
DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories
arXiv CS.LG
•
ArXi:2604.20443v1 Announce Type: cross Large Language Models (LLMs) have been shown to possess Theory of Mind (ToM) abilities. However, it remains unclear whether this stems from robust reasoning or spurious correlations. We