AI RESEARCH

MARCH: Evaluating the Intersection of Ambiguity Interpretation and Multi-hop Inference

arXiv cs.CL

arXiv:2509.22750v3

Real-world multi-hop QA is naturally linked with ambiguity: a single query can trigger multiple reasoning paths, each of which requires independent resolution. Since ambiguity can occur at any stage, models must navigate layered uncertainty throughout the entire reasoning chain. Despite the prevalence of such ambiguity in real-world user queries, previous benchmarks have primarily focused on single-hop ambiguity, leaving the complex interaction between multi-step inference and layered ambiguity underexplored. In this paper, we ...