A Systematic Study of Training-Free Methods for Trustworthy Large Language Models

arXiv:2604.15789v1 Announce Type: new As Large Language Models (LLMs) attract growing interest and are deployed across various domains, their potential risks, including generating harmful or biased content, producing unfounded claims, and exhibiting vulnerabilities to adversarial attacks, have drawn significant attention. To enable quick and low-cost adaptation