Building a fully local PDF-to-audiobook workflow with Kokoro 82M, Qwen and llama.cpp

Hey everyone, I’ve been building a local-first desktop PDF reader that can read technical books aloud and keep the spoken text highlighted while reading. The original motivation was pretty practical: I read a lot of programming and technical books, but many publishers either don’t offer audio versions or charge extra for AI-generated audio. I wanted to see how far I could get with a completely local setup instead. The app is built with Tauri 2.0 and runs locally on my Mac. For TTS I’m using Kokoro 82M.