Cut Claude usage by ~85% in a job search pipeline (16k → 900 tokens/app) — here’s what worked

r/artificial
Generative AI

Like many here, I kept running into Claude usage limits when building anything non-trivial. I was working with a job search automation pipeline (based on the Career-Ops project), and the naive flow was burning ~16k tokens per application - completely unsustainable. So I spent some time reworking it with a focus on token efficiency as a first-class concern, not an afterthought. 🚀 Results ~85% reduction in token usage ~900 tokens per application Most repeated context calls eliminated Much stable under usage limits ⚡ What actually helped (practical takeaways) 1.