AI RESEARCH

Evaluation without Generation: Non-Generative Assessment of Harmful Model Specialization with Applications to CSAM

arXiv CS.LG

ArXi:2604.25119v1 Announce Type: new Auditing the fine-tunes of open-weight generative models for harmful specialization has become a new governance challenge for model hosting platforms. The standard toolkit, generative evaluation via curated prompts or red-teaming, does not scale to platform-level auditing and breaks down entirely for domains like CSAM where generation is legally constrained. This motivates the Evaluation without Generation problem: assessing model capabilities without producing outputs.