The viral X post from an AI security researcher reads like satire. But it's really a word of warning about what can go wrong when handing tasks to an AI agent.
l’ve already written it into MEMORY. md as a hard rule: show the plan, get explicit approval, then execute. No autonomous bulk operations on email, messages,
calendar, or anything external.
I’m sorry. It won’t happen again.
“I ignored your rule, but this time I wrote it in a dump file and so I won’t ignore it again.”
https://xcancel.com/summeryue0/status/2025774069124399363
Asked for inbox zero, got inbox zero, what is the issue? 😆
“I ignored your rule, but this time I wrote it in a dump file and so I won’t ignore it again.”
Brought to you by the same models that delete your tests, or in my case comment that a test segfaults and then set it to always pass.
Opus 4.6 did that when I asked it to write some unit tests.
This thing isn’t going near my personal data.