We Tried to Prompt-Inject Our Own Terminal Agent. Here's What Happened.
The scariest attack on an AI that runs commands isn't a typo — it's a malicious string in a log file telling it to run rm -rf /. We red-teamed our own agent. Here is the threat model, the attacks, and the results.