Cohere Transcribe WebGPU: state-of-the-art multilingual speech recognition in your browser
r/LocalLLaMA
•
NLP
Yesterday, Cohere released their first speech-to-text model, which now tops the OpenASR leaderboard (for English, but the model does 14 different languages). So, I decided to build a WebGPU for it: running the model entirely locally in the browser with Transformers.js. I hope you like it! Link to (+ source code): submitted by /u/xenovatech [link] [comments]