AI RESEARCH
Guardian-as-an-Advisor: Advancing Next-Generation Guardian Models for Trustworthy LLMs
arXiv CS.CL
arXiv:2604.07655v1 Announce Type: cross Hard-gated safety checkers often over-refuse and misalign with a vendor's model spec; prevailing taxonomies also neglect robustness and honesty, yielding safer-on-paper yet less useful systems. This work