Multi-Domain Audio Question Answering Benchmark Toward Acoustic Content Reasoning

ArXi:2505.07365v2 Announce Type: replace-cross We present Task 5 of the DCASE 2025 Challenge: an Audio Question Answering (AQA) benchmark spanning multiple domains of sound understanding. This task defines three QA subsets (Bioacoustics, Temporal Soundscapes, and Complex QA) to test audio-language models on interactive question-answering over diverse acoustic scenes.