Trace your LLM API and MCP calls with zero code changes (eBPF, Linux)

r/LocalLLaMA
Generative AI

Built an eBPF-based tracer that captures LLM API and MCP traffic from any process on your machine - no SDK changes, no proxy, no code instrumentation. It intercepts TLS via OpenSSL uprobes and parses Anthropic, OpenAI, and Gemini API calls in real time. Extracts model, tokens, latency, TTFT, tool names, streaming status, and full request/response bodies. Also traces MCP calls over stdio/socketpairs and HTTP (so Claude Code tool use shows up too). Outputs JSONL, exports to OpenTelemetry and Prometheus. Linux only, needs root for eBPF probes.