MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio

arXiv CS.CL

arXiv:2605.00969v1 Announce Type: cross

We present MedMosaic, a medical audio question-answering dataset designed to benchmark language and audio reasoning models under realistic clinical constraints. Medical audio data is difficult to collect because of privacy regulations and the high cost of annotation, which requires domain expertise. As a result, existing benchmarks tend to underrepresent complex medical audio scenarios.