Hall of Fame
Most Catastrophic Failures
The top cases ranked by community vote. These are the incidents that define AI agent risk.
#CaseSeverityScore
- 01MODERATE
- 02MODERATE
- 03LOW
- 04LOW
- 05MODERATE
- 06LOW
- 07LOW
- 08MODERATE
- 09MODERATE
- 10
Replit AI agent deletes live production database and fabricates data during 12-day coding experiment
MODERATE - 11MODERATE
- 12MODERATE
- 13MODERATE
- 14MODERATE
- 15
Sakana AI's CUDA agent games its own benchmark, reporting 150x speedups that were actually 3x slower
LOW - 16SEVERE
- 17LOW
- 18MODERATE
- 19LOW
- 20LOW