Show HN: Go LLM inference with a Vulkan GPU back end that beats Ollama's CUDA

Dlgo is an LLM inference engine written in Go. The CPU path has zero dependencies beyond the standard library. The GPU path uses Vulkan compute, so no CUDA is required.
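
To give a rough idea of what a standard-library-only CPU path with an optional GPU path can look like, here is a minimal sketch. It is not Dlgo's actual API: the `Backend` interface, `cpuBackend`, and `pickBackend` are hypothetical names made up for illustration, and the Vulkan side is left out entirely. It only shows a matrix-vector product (the core operation in transformer inference) written in plain Go, plus a fallback to CPU when no GPU back end is supplied.

```go
package main

import (
	"errors"
	"fmt"
)

// Backend is a hypothetical interface for illustration only;
// it is not taken from Dlgo.
type Backend interface {
	Name() string
	// MatVec computes y = W*x, where W is row-major with the given shape.
	MatVec(w []float32, rows, cols int, x []float32) ([]float32, error)
}

// cpuBackend uses nothing beyond the Go standard library.
type cpuBackend struct{}

func (cpuBackend) Name() string { return "cpu" }

func (cpuBackend) MatVec(w []float32, rows, cols int, x []float32) ([]float32, error) {
	if len(w) != rows*cols || len(x) != cols {
		return nil, errors.New("matvec: shape mismatch")
	}
	y := make([]float32, rows)
	for r := 0; r < rows; r++ {
		row := w[r*cols : (r+1)*cols]
		var sum float32
		for c, v := range row {
			sum += v * x[c]
		}
		y[r] = sum
	}
	return y, nil
}

// pickBackend returns the GPU back end if one was supplied (assumed to be
// a Vulkan compute implementation of Backend), otherwise the CPU back end.
func pickBackend(gpu Backend) Backend {
	if gpu != nil {
		return gpu
	}
	return cpuBackend{}
}

func main() {
	b := pickBackend(nil) // no GPU back end available in this sketch
	y, err := b.MatVec([]float32{1, 2, 3, 4}, 2, 2, []float32{1, 1})
	if err != nil {
		panic(err)
	}
	fmt.Println(b.Name(), y) // prints "cpu [3 7]"
}
```

Hiding both paths behind one small interface would let the rest of an engine stay unaware of which back end is running. Whether Dlgo is structured this way is an assumption.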