AI RESEARCH
Information Extraction from Electricity Invoices with General-Purpose Large Language Models
arXiv CS.CL
arXiv:2604.25927v1 Announce Type: new Information extraction from semi-structured business documents remains a critical challenge for enterprise management. This study evaluates the capability of general-purpose Large Language Models to extract structured information from Spanish electricity invoices without task-specific fine-tuning. Using a subset of the IDSEM dataset, we benchmark two architecturally distinct models, Gemini 1.5 Pro and Mistral-small, across 19 parameter configurations and 6 prompting strategies.
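To make the benchmarking idea concrete, here is a minimal sketch of how a model's extracted fields could be scored against a ground-truth record. The abstract does not state the metric used in the study, so the exact-match scoring, the function name `field_accuracy`, and the field names in the usage note are all hypothetical and serve only as an illustration.

```python
def field_accuracy(predicted: dict, expected: dict) -> float:
    """Return the fraction of expected fields that the model reproduced.

    Assumption: a field counts as correct when the predicted value equals
    the expected value after trimming whitespace and ignoring case.
    This is an illustrative metric, not the one reported in the paper.
    """
    if not expected:
        return 0.0

    def normalize(value) -> str:
        # Compare values as text so numbers and strings are treated alike.
        return str(value).strip().casefold()

    hits = sum(
        1
        for key, value in expected.items()
        if normalize(predicted.get(key, "")) == normalize(value)
    )
    return hits / len(expected)
```

As a usage example, with hypothetical field names, `field_accuracy({"invoice_number": "A-17 ", "total": "80.10"}, {"invoice_number": "a-17", "total": "80.15"})` scores one of two fields as correct. Averaging such scores over invoices would give one number per model, parameter configuration, and prompting strategy.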