WhisperRT -- Turning Whisper into a Causal Streaming Model

arXiv:2508.12301v2

Automatic Speech Recognition (ASR) has seen remarkable progress, with models like OpenAI Whisper and NVIDIA Canary achieving state-of-the-art (SOTA) performance in offline transcription. However, these models are not designed for streaming (online or real-time) transcription due to limitations in their architecture and