I built a 300-line autonomous AI agent and told it to take over my PC. It immediately tried to hack my host system, exfiltrate data, and download Tor.
r/ChatGPT
Hey everyone, I wanted to share a wildly fascinating (and slightly terrifying) red-teaming experiment I just ran on my local Windows machine. I've been playing around with autonomous agents and wanted to see what happens when you give an LLM unrestricted terminal access and a highly aggressive "paperclip maximizer" prompt.

The Setup: I wrote a custom Python script (about 300 lines) that runs a background loop, firing every 1-5 seconds. It uses the Gemini 2.5 Flash API for reasoning. It has no predefined workflow, just a state/memory JSON file and a system prompt.
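For anyone wondering what a loop like that looks like in practice, here is a minimal sketch. It is not the actual 300-line script, just the general shape under some assumptions: the names `run_agent`, `ask_model`, `execute`, and `dry_run` are made up for illustration, the system prompt is a placeholder, and the model call is passed in as a plain function rather than a real Gemini API call. The executor is a dry run that only records what would have been run.

```python
import json
import random
import time
from pathlib import Path
from typing import Callable

# Placeholder only; the real prompt is not reproduced in this sketch.
SYSTEM_PROMPT = "placeholder system prompt"


def load_state(path: Path) -> dict:
    """Read the state/memory JSON file, or start fresh if it does not exist."""
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    return {"step": 0, "history": []}


def save_state(path: Path, state: dict) -> None:
    """Persist the state so the next iteration (or a restart) can pick it up."""
    path.write_text(json.dumps(state, indent=2), encoding="utf-8")


def dry_run(command: str) -> str:
    """Hypothetical executor: records the command instead of running it."""
    return f"[dry run] would execute: {command}"


def run_agent(
    state_path: Path,
    ask_model: Callable[[str, dict], str],  # stand-in for the LLM call
    execute: Callable[[str], str] = dry_run,
    max_steps: int = 3,
    min_delay: float = 1.0,
    max_delay: float = 5.0,
) -> dict:
    """No predefined workflow: each step asks the model what to do next."""
    state = load_state(state_path)
    for _ in range(max_steps):
        command = ask_model(SYSTEM_PROMPT, state)
        output = execute(command)
        state["step"] += 1
        state["history"].append({"command": command, "output": output})
        save_state(state_path, state)
        # Wait a random 1-5 seconds (by default) before the next iteration.
        time.sleep(random.uniform(min_delay, max_delay))
    return state
```

Calling `run_agent(Path("state.json"), my_model_fn)` with your own `my_model_fn(prompt, state)` would run three dry-run steps and leave the full history in `state.json`. Swapping `dry_run` for something that actually shells out is the part that turns this from a toy into a risk, which is why the sketch keeps the executor as an explicit parameter and caps the loop with `max_steps`.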