Any better local alternative to Whisper?

r/StableDiffusion
Generative AI Open Source AI

I'm running 4 Whisper instances (installable via pip install -U openai-whisper) in parallel to transcribe lyrics for 500+ songs, and I see inaccurate captions from time to time. Is there a better alternative? I have also captioned these songs using Qwen-2.5 in Side-Step, but since these are oldies, it fails to capture the themes: it said there is a "bass drop" in a Bobby Darin song, lol. How can I fix this?

submitted by /u/Fdx_dy
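
For reference, here is a minimal sketch of the kind of setup described above: four worker processes, each loading its own Whisper model and transcribing a share of the files. The `songs` folder, the `small` model size, and the helper names (`shard`, `transcribe_shard`, `transcribe_all`) are assumptions for illustration, not the actual script. It also collects segments whose `avg_logprob` falls below -1.0, which may help spot the inaccurate captions for manual review; that cutoff is a guess and would need tuning.

```python
from concurrent.futures import ProcessPoolExecutor
from pathlib import Path


def shard(paths, n_workers):
    """Split paths round-robin into n_workers lists of near-equal size."""
    return [paths[i::n_workers] for i in range(n_workers)]


def transcribe_shard(paths, model_name="small"):
    """Load one Whisper model in this process and transcribe every path in the shard."""
    import whisper  # imported here so each worker process loads its own model

    model = whisper.load_model(model_name)
    results = {}
    for path in paths:
        out = model.transcribe(str(path))
        # Each segment carries an avg_logprob; low values often mark shaky text.
        # The -1.0 cutoff is an assumed starting point, not a recommendation.
        suspect = [s["text"] for s in out["segments"] if s["avg_logprob"] < -1.0]
        results[str(path)] = {"text": out["text"], "suspect_segments": suspect}
    return results


def transcribe_all(paths, n_workers=4, model_name="small"):
    """Run one transcribe_shard call per worker and merge the results."""
    merged = {}
    with ProcessPoolExecutor(max_workers=n_workers) as pool:
        shards = shard(list(paths), n_workers)
        for part in pool.map(transcribe_shard, shards, [model_name] * n_workers):
            merged.update(part)
    return merged


if __name__ == "__main__":
    songs = sorted(Path("songs").glob("*.mp3"))  # hypothetical folder of audio files
    lyrics = transcribe_all(songs)
    for name, item in lyrics.items():
        if item["suspect_segments"]:
            print(name, "has", len(item["suspect_segments"]), "low-confidence segments")
```

Sharding up front keeps each model loaded once per process instead of once per song. Songs with many flagged segments are the ones worth re-running with a larger model or checking by hand.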