AI RESEARCH

Dependence on Early and Late Reverberation of Single-Channel Speaker Distance Estimation

arXiv CS.AI

arXiv:2605.07694v1 Announce Type: cross Single-channel speaker distance estimation has recently achieved centimeter-level accuracy in simulated environments, yet it remains unclear which components of the room impulse response (RIR) the model exploits and how performance depends on the recording conditions. In this work, we decompose simulated RIRs into four variants (full, direct-only, no-late, and no-early), using the mixing time estimated from the echo density function as the boundary between early reflections and late reverberation.
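The four-way decomposition can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: the direct path is taken to be the strongest peak of the RIR, the direct sound is given a hypothetical 2.5 ms window (`direct_window_s`), the mixing time is measured from the direct arrival and passed in as `mixing_time_s` rather than estimated from the echo density function, and the no-early variant is assumed to keep the direct sound plus the late tail. The function name `decompose_rir` is likewise illustrative.

```python
import numpy as np


def decompose_rir(rir, fs, mixing_time_s, direct_window_s=0.0025):
    """Split an RIR into full, direct-only, no-late and no-early variants.

    Assumptions (not taken from the paper): the direct path is the largest
    absolute peak, and the mixing time is counted from that peak.
    """
    rir = np.asarray(rir, dtype=float)
    n = rir.size

    # Locate the direct-path arrival as the strongest peak (assumption).
    onset = int(np.argmax(np.abs(rir)))

    # End of the direct-sound segment (exclusive index).
    direct_end = min(n, onset + int(round(direct_window_s * fs)) + 1)

    # Boundary between early reflections and late reverberation.
    mix = min(n, onset + int(round(mixing_time_s * fs)))
    mix = max(mix, direct_end)  # keep the segments ordered

    # Three non-overlapping segments that sum to the full RIR.
    direct = np.zeros(n)
    direct[:direct_end] = rir[:direct_end]
    early = np.zeros(n)
    early[direct_end:mix] = rir[direct_end:mix]
    late = np.zeros(n)
    late[mix:] = rir[mix:]

    return {
        "full": rir.copy(),
        "direct_only": direct,
        "no_late": direct + early,   # direct sound + early reflections
        "no_early": direct + late,   # direct sound + late reverberation
    }
```

Each variant keeps the original length, so convolving dry speech with any of them yields time-aligned signals that differ only in which part of the reverberation is present.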