AI RESEARCH

Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks

arXiv CS.AI

ArXi:2605.19147v1 Announce Type: cross Large language models (LLMs) are highly susceptible to backdoor attacks (BAs), wherein