KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions

ArXi:2601.04745v2 Announce Type: replace Existing long-horizon memory benchmarks mostly use multi-turn dialogues or synthetic user histories, which makes retrieval performance an imperfect proxy for person understanding. We present \BenchName, a publicly releasable benchmark built from long-form autobiographical narratives, where actions, context, and inner thoughts provide dense evidence for inferring stable motivations and decision principles.