Got local Qwen 3.5/3.6 generating meeting summaries entirely offline on an M4 Max. Demo with Wi-Fi off. This is the future.

r/LocalLLaMA
Generative AI NLP Open Source AI

I'm the founder behind Hedy, an AI meeting app. I'm a huge er of Local AI, and we've been working on making it "consumer friendly". Speech recognition in Hedy has always run on-device (whisper.cpp and now also parakeet). What just shipped is that the rest of the AI pipeline (summaries, detailed notes, chat with the meeting, live coaching) can now run on-device too using llama.cpp. Wi-Fi off, nothing leaves the laptop. Video above shows the full flow. A few technical specifics: Models ed out of the box. Qwen 3.6, Qwen 3.5, and Gemma 4 families.