AI RESEARCH
GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics
arXiv CS.AI
arXiv:2603.11442v1 Announce Type: new Can humans detect AI-generated financial documents better than machines? We present GPT4o-Receipt, a benchmark of 1,235 receipt images pairing GPT-4o-generated receipts with authentic ones from established datasets, evaluated by five state-of-the-art multimodal LLMs and a 30-annotator crowdsourced perceptual study. Our findings reveal a striking paradox: humans are better at seeing AI artifacts, yet worse at detecting AI documents.