BlindGuard: Safeguarding LLM-based Multi-Agent Systems under Unknown Attacks

arXiv:2508.08127v2. The security of LLM-based multi-agent systems (MAS) is critically threatened by propagation vulnerability, where malicious agents can distort collective decision-making through inter-agent message interactions. While existing supervised defense methods demonstrate promising performance, they may be impractical in real-world scenarios because they rely heavily on labeled malicious agents to train a supervised malicious-agent detection model.