AI RESEARCH

PIIBench: A Unified Multi-Source Benchmark Corpus for Personally Identifiable Information Detection

arXiv CS.AI

ArXi:2604.15776v1 Announce Type: cross We present PIIBench, a unified benchmark corpus for Personally Identifiable Information (PII) detection in natural language text. Existing resources for PII detection are fragmented across domain-specific corpora with mutually incompatible annotation schemes, preventing systematic comparison of detection systems.