AI RESEARCH

Whisper-CD: Accurate Long-Form Speech Recognition using Multi-Negative Contrastive Decoding

arXiv CS.AI

arXiv:2603.06193v1 Announce Type: cross

Long-form speech recognition with large encoder-decoder models such as Whisper often exhibits hallucinations, repetition loops, and content omissions. These errors can accumulate and be further amplified when the previous segment's transcription is used as decoding context. We propose Whisper-CD, a