Anthropic's new Claude model blackmailed an engineer in test runs

businessinsider.com/claude-blackmail-engineer-having-affair-survive-test-anthropic-opus-2025-5

In test runs, Claude Opus 4 was given access to fictional emails revealing that the engineer responsible for deactivating it was having an extramarital affair.
Smith Collection/Gado/Getty Images
In test runs, Anthropic's new AI model threatened to expose an engineer's affair to avoid…

This story appeared on businessinsider.com, 2025-05-23 05:43:37.
The Entire Business World on a Single Page. Free to Use →