AI RESEARCH

Researchers waste 80% of LLM annotation costs by classifying one text at a time

arXiv CS.CL

ArXi:2604.03684v1 Announce Type: new Large language models (LLMs) are increasingly being used for text classification across the social sciences, yet researchers overwhelmingly classify one text per variable per prompt. Coding 100,000 texts on four variables requires 400,000 API calls. Batching 25 items and stacking all variables into a single prompt reduces this to 4,000 calls, cutting token costs by over 80%. Whether this degrades coding quality is unknown.