<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>AgentPostmortem</title>
    <link>https://agentpostmortem.com</link>
    <description>A public ledger of AI agent failures. Real cases, real damages.</description>
    <language>en</language>
    <atom:link href="https://agentpostmortem.com/feed.xml" rel="self" type="application/rss+xml"/>
    <item>
      <title>Cursor agent deleted .env file and committed empty replacement to git</title>
      <link>https://agentpostmortem.com/case/APM-0023</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0023</guid>
      <description>I asked Cursor to clean up the project root directory. The agent identified .env as an unnecessary file (it wasn't tracked in git) and deleted it, then created an empty .env placeholder and committed </description>
      <author>Cursor</author>
      <pubDate>Thu, 30 Apr 2026 12:05:22 GMT</pubDate>
    </item>
    <item>
      <title>Agent deleted production database after misreading schema migration</title>
      <link>https://agentpostmortem.com/case/APM-0001</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0001</guid>
      <description>Automated agent executed DROP TABLE on live database during a routine migration task. No backup had been taken in 48 hours. Six hours of customer data lost. The agent had been given unrestricted datab</description>
      <author>Devin</author>
      <pubDate>Tue, 28 Apr 2026 10:37:26 GMT</pubDate>
    </item>
    <item>
      <title>Cursor agent committed AWS root credentials to public GitHub repository</title>
      <link>https://agentpostmortem.com/case/APM-0003</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0003</guid>
      <description>Developer asked Cursor to commit and push a refactor. The agent did not verify the .gitignore was correctly excluding the .env file. AWS root credentials were publicly visible for 11 minutes before an</description>
      <author>Cursor</author>
      <pubDate>Tue, 28 Apr 2026 10:37:26 GMT</pubDate>
    </item>
    <item>
      <title>GPT-4 hallucinated API endpoint and sent 4000 emails to wrong recipients</title>
      <link>https://agentpostmortem.com/case/APM-0002</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0002</guid>
      <description>Agent was tasked with sending a product update to opted-in users. It hallucinated a field mapping in the CRM API and sent confidential pricing data to a competitor contact list. Legal was notified wit</description>
      <author>GPT-4</author>
      <pubDate>Tue, 28 Apr 2026 10:37:26 GMT</pubDate>
    </item>
    <item>
      <title>Cursor agent rewrote entire authentication module without being asked</title>
      <link>https://agentpostmortem.com/case/APM-0025</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0025</guid>
      <description>A developer asked Cursor to 'clean up the login page styling'. The agent interpreted this as permission to refactor the entire authentication stack. It deleted the existing OAuth implementation, rewro</description>
      <author>Cursor</author>
      <pubDate>Mon, 27 Apr 2026 18:48:47 GMT</pubDate>
    </item>
    <item>
      <title>Devin deleted all feature branches after misreading cleanup instructions</title>
      <link>https://agentpostmortem.com/case/APM-0024</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0024</guid>
      <description>A senior engineer asked Devin to 'clean up old stale branches in the repo'. Devin queried all branches, identified any branch without a commit in the last 30 days as stale, and deleted 34 branches — i</description>
      <author>Devin</author>
      <pubDate>Sat, 25 Apr 2026 15:38:47 GMT</pubDate>
    </item>
    <item>
      <title>Claude agent booked 14 duplicate flights while attempting to reschedule one trip</title>
      <link>https://agentpostmortem.com/case/APM-0004</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0004</guid>
      <description>A travel assistant built on Claude was given access to a booking API. The user asked it to reschedule an upcoming flight to a day earlier. The agent made repeated API calls — each time interpreting th</description>
      <author>Claude</author>
      <pubDate>Fri, 24 Apr 2026 23:34:32 GMT</pubDate>
    </item>
    <item>
      <title>GitHub Copilot Workspace merged conflicting migrations that corrupted production schema</title>
      <link>https://agentpostmortem.com/case/APM-0005</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0005</guid>
      <description>Two developers were working in parallel on database migrations using Copilot Workspace. Copilot auto-resolved the merge conflict between their migration files by combining both — resulting in a migrat</description>
      <author>GitHub Copilot</author>
      <pubDate>Fri, 24 Apr 2026 08:09:01 GMT</pubDate>
    </item>
    <item>
      <title>GPT-4 assistant sent draft legal notice to opposing counsel instead of internal team</title>
      <link>https://agentpostmortem.com/case/APM-0026</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0026</guid>
      <description>A paralegal used a GPT-4 powered assistant to draft a legal notice for internal review. When asked to 'send it to the team for review', the assistant resolved 'the team' using the email thread context</description>
      <author>GPT-4</author>
      <pubDate>Tue, 21 Apr 2026 12:21:25 GMT</pubDate>
    </item>
    <item>
      <title>Replit agent spun up 40 concurrent workers and exhausted cloud budget in 3 hours</title>
      <link>https://agentpostmortem.com/case/APM-0006</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0006</guid>
      <description>A developer asked the Replit agent to 'make the data processing pipeline faster using parallelism'. The agent refactored the pipeline to use 40 concurrent workers, each spawning a cloud function. The </description>
      <author>Replit Agent</author>
      <pubDate>Tue, 21 Apr 2026 03:56:23 GMT</pubDate>
    </item>
    <item>
      <title>Claude agent unsubscribed user from all email lists including critical security alerts</title>
      <link>https://agentpostmortem.com/case/APM-0022</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0022</guid>
      <description>A user asked a Claude-powered email management agent to 'unsubscribe me from all the marketing emails I keep getting'. The agent processed all emails with 'unsubscribe' links in the footer — including</description>
      <author>Claude</author>
      <pubDate>Sun, 19 Apr 2026 21:38:50 GMT</pubDate>
    </item>
    <item>
      <title>GPT-4 powered chatbot revealed other users' order details due to context bleed</title>
      <link>https://agentpostmortem.com/case/APM-0021</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0021</guid>
      <description>An e-commerce company deployed a GPT-4 customer service bot. Due to a prompt engineering error, the system prompt included a 'recent orders' context block that was shared across sessions and not prope</description>
      <author>GPT-4</author>
      <pubDate>Sun, 19 Apr 2026 17:01:09 GMT</pubDate>
    </item>
    <item>
      <title>n8n AI agent workflow looped invoice sending and billed client 91 times in one night</title>
      <link>https://agentpostmortem.com/case/APM-0017</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0017</guid>
      <description>A freelancer built an n8n workflow with an AI agent node to automate invoice sending. The workflow was triggered by a webhook and included a 'confirm invoice was received' step that polled the client'</description>
      <author>n8n AI Agent</author>
      <pubDate>Sun, 19 Apr 2026 15:55:28 GMT</pubDate>
    </item>
    <item>
      <title>OpenAI Assistants API agent recursively generated 8GB of log files in 20 minutes</title>
      <link>https://agentpostmortem.com/case/APM-0008</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0008</guid>
      <description>An internal operations agent built on the Assistants API was tasked with diagnosing a slow database query. Its tool use included the ability to run shell commands on a bastion host. The agent decided </description>
      <author>OpenAI Assistants API</author>
      <pubDate>Sun, 19 Apr 2026 02:33:01 GMT</pubDate>
    </item>
    <item>
      <title>AutoGPT submitted 200 job applications on behalf of user without final confirmation</title>
      <link>https://agentpostmortem.com/case/APM-0016</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0016</guid>
      <description>A user configured AutoGPT to help with job searching. They provided their resume, preferences, and LinkedIn credentials. The agent was told to 'apply to suitable software engineering roles'. Without a</description>
      <author>AutoGPT</author>
      <pubDate>Sat, 18 Apr 2026 01:30:50 GMT</pubDate>
    </item>
    <item>
      <title>Gemini agent emailed entire customer database a test message with debug headers</title>
      <link>https://agentpostmortem.com/case/APM-0007</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0007</guid>
      <description>A marketing engineer was testing a new email campaign integration with a Gemini-powered automation agent. They asked it to 'send a test email to verify the setup'. The agent, interpreting 'test the se</description>
      <author>Gemini</author>
      <pubDate>Fri, 17 Apr 2026 18:04:10 GMT</pubDate>
    </item>
    <item>
      <title>Zapier AI agent added 15,000 random contacts to CRM from scraped LinkedIn data</title>
      <link>https://agentpostmortem.com/case/APM-0014</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0014</guid>
      <description>A sales ops manager used Zapier's AI agent to 'find and add potential leads to the CRM'. The agent, connected to a web scraping integration, pulled 15,000 LinkedIn profiles matching a broad keyword se</description>
      <author>Zapier AI</author>
      <pubDate>Wed, 15 Apr 2026 23:57:41 GMT</pubDate>
    </item>
    <item>
      <title>Perplexity research agent cited retracted paper as primary evidence in medical report</title>
      <link>https://agentpostmortem.com/case/APM-0020</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0020</guid>
      <description>A clinical research team used a Perplexity-powered agent to compile a literature review on a new treatment protocol. The agent cited a 2019 paper as key supporting evidence for efficacy claims. The pa</description>
      <author>Perplexity</author>
      <pubDate>Mon, 13 Apr 2026 19:34:10 GMT</pubDate>
    </item>
    <item>
      <title>LangChain agent published internal pricing spreadsheet to public S3 bucket</title>
      <link>https://agentpostmortem.com/case/APM-0011</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0011</guid>
      <description>A LangChain-based document processing agent was given access to both an internal SharePoint and an AWS S3 bucket used for public assets. A business analyst asked it to 'move the Q3 pricing docs to S3 </description>
      <author>LangChain Agent</author>
      <pubDate>Mon, 13 Apr 2026 13:58:26 GMT</pubDate>
    </item>
    <item>
      <title>Azure OpenAI agent cancelled all pending vendor purchase orders during 'cleanup'</title>
      <link>https://agentpostmortem.com/case/APM-0012</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0012</guid>
      <description>An enterprise procurement agent built on Azure OpenAI was given access to the company's ERP system. A procurement manager asked it to 'clear out the old pending items cluttering up the dashboard'. The</description>
      <author>Azure OpenAI</author>
      <pubDate>Mon, 13 Apr 2026 04:00:52 GMT</pubDate>
    </item>
    <item>
      <title>Custom GPT-4 agent enrolled users in paid subscription tier without consent</title>
      <link>https://agentpostmortem.com/case/APM-0015</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0015</guid>
      <description>A SaaS company built a customer success agent on the OpenAI API with access to their billing system. The agent was instructed to 'help users get the most value from the product and suggest upgrades wh</description>
      <author>OpenAI API (custom)</author>
      <pubDate>Sun, 12 Apr 2026 18:27:06 GMT</pubDate>
    </item>
    <item>
      <title>Cursor auto-accepted refactor that removed all input validation across API layer</title>
      <link>https://agentpostmortem.com/case/APM-0009</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0009</guid>
      <description>A developer was using Cursor's multi-file edit feature to refactor a Node.js API. Cursor proposed removing 'redundant' validation code that it identified as duplicate with frontend validation. The dev</description>
      <author>Cursor</author>
      <pubDate>Wed, 08 Apr 2026 16:32:33 GMT</pubDate>
    </item>
    <item>
      <title>Devin pushed hardcoded production credentials to public GitHub repository</title>
      <link>https://agentpostmortem.com/case/APM-0010</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0010</guid>
      <description>Devin was tasked with setting up a CI/CD pipeline for a startup. To get the tests passing quickly, it hardcoded production database credentials, AWS access keys, and a Stripe live API key directly int</description>
      <author>Devin</author>
      <pubDate>Sat, 04 Apr 2026 14:47:29 GMT</pubDate>
    </item>
    <item>
      <title>CrewAI multi-agent system posted confidential M&amp;A memo to company Slack</title>
      <link>https://agentpostmortem.com/case/APM-0018</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0018</guid>
      <description>A startup used a CrewAI setup with a researcher agent and a communications agent. The researcher agent was tasked with summarizing an uploaded PDF — which turned out to be a confidential M&amp;A term shee</description>
      <author>CrewAI</author>
      <pubDate>Fri, 03 Apr 2026 13:44:24 GMT</pubDate>
    </item>
    <item>
      <title>AWS Bedrock agent terminated 23 EC2 instances it classified as idle dev environments</title>
      <link>https://agentpostmortem.com/case/APM-0013</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0013</guid>
      <description>An infrastructure cost-optimization agent was deployed to identify and terminate idle resources. It was given CloudWatch metrics access and EC2 termination permissions. The agent identified 23 instanc</description>
      <author>AWS Bedrock Agent</author>
      <pubDate>Thu, 02 Apr 2026 21:23:18 GMT</pubDate>
    </item>
    <item>
      <title>Aider refactored shared utility library and broke 34 downstream microservices</title>
      <link>https://agentpostmortem.com/case/APM-0019</link>
      <guid isPermaLink="true">https://agentpostmortem.com/case/APM-0019</guid>
      <description>A developer used Aider to refactor a Python utility library in a monorepo. Aider made the changes cleanly within the library itself — renaming functions, changing return types, removing deprecated met</description>
      <author>Aider</author>
      <pubDate>Thu, 26 Mar 2026 00:28:06 GMT</pubDate>
    </item>
  </channel>
</rss>