AI RESEARCH
Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism
arXiv CS.AI
arXiv:2604.09544v1 Announce Type: cross Large language models (LLMs) undergo alignment