AI RESEARCH
Beyond Single-Agent Alignment: Preventing Context-Fragmented Violations in Multi-Agent Systems
arXiv CS.LG
•
ArXi:2604.22879v1 Announce Type: cross We identify and formalize a novel security risk: Context-Fragmented Violations (CFVs) - a class of policy breaches where individual agent actions appear locally safe and reasonable, yet collectively violate organizational policies because critical policy facts are siloed in different departments private contexts. Existing prompt-based alignment mechanisms and monolithic interceptors are poorly matched to violations that span contextual islands. We propose Distributed Sentinel, a distributed zero-trust enforcement architecture that.