Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy

ArXi:2603.23146v1 Announce Type: cross The widespread adoption of Large Language Models (LLMs) has made the detection of AI-Generated text a pressing and complex challenge. Although many detection systems report high benchmark accuracy, their reliability in real-world settings remains uncertain, and their interpretability is often unexplored. In this work, we investigate whether contemporary detectors genuinely identify machine authorship or merely exploit dataset-specific artefacts.