AI RESEARCH

DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories

arXiv CS.LG

ArXi:2604.20443v1 Announce Type: cross Large Language Models (LLMs) have been shown to possess Theory of Mind (ToM) abilities. However, it remains unclear whether this stems from robust reasoning or spurious correlations. We