AI RESEARCH
Green Shielding: A User-Centric Approach Towards Trustworthy AI
arXiv CS.AI
•
ArXi:2604.24700v1 Announce Type: cross Large language models (LLMs) are increasingly deployed, yet their outputs can be highly sensitive to routine, non-adversarial variation in how users phrase queries, a gap not well addressed by existing red-teaming efforts. We propose Green Shielding, a user-centric agenda for building evidence-backed deployment guidance by characterizing how benign input variation shifts model behavior.