AI RESEARCH
SkillSieve: A Hierarchical Triage Framework for Detecting Malicious AI Agent Skills
arXiv CS.AI
•
ArXi:2604.06550v1 Announce Type: cross OpenClaw's ClawHub marketplace hosts over 13,000 community-contributed agent skills, and between 13% and 26% of them contain security vulnerabilities according to recent audits. Regex scanners miss obfuscated payloads; formal static analyzers cannot read the natural language instructions in SKILL.md files where prompt injection and social engineering attacks hide. Neither approach handles both modalities. SkillSieve is a three-layer detection framework that applies progressively deeper analysis only where needed.