AI RESEARCH
Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use
arXiv CS.AI
arXiv:2604.05432v1 Announce Type: cross Tool-use large language model (LLM) agents are increasingly deployed in sensitive workflows, relying on tool calls for retrieval, external API access, and session memory management. While prior research has examined various threats, the risk of systematic data exfiltration by backdoored agents remains underexplored. In this work, we present Back-Reveal, a data exfiltration attack that embeds semantic triggers into fine-tuned LLM agents.