Closing the Modality Reasoning Gap for Speech Large Language Models

arXiv:2601.05543v2 Announce Type: replace

Although Speech Large Language Models have achieved notable progress, a substantial modality reasoning gap remains: their reasoning performance on speech inputs is markedly weaker than on text inputs. This gap could be associated with representational drift across Transformer layers and behavior deviations in long-chain reasoning. To address this issue, we