AI RESEARCH
Audio Spatially-Guided Fusion for Audio-Visual Navigation
arXiv CS.AI
arXiv:2604.02389v1 Announce Type: cross Audio-visual navigation refers to an agent using visual and auditory information in complex 3D environments to localize a target and plan a path to it, thereby navigating autonomously. The core challenge of this task is how the agent can break free from the dependence on