Anthropic’s new AI model turns to blackmail when engineers try to take it offline

Why did Trump’s tax bill spark bond selloff? How will tariffs affect major retailers’ prices? What’s driving Bitcoin’s recent record surge? Why is Harvard blocked from enrolling foreign students? How is Nvidia’s partnership impacting chip stocks? What caused Tesla’s sales decline in China and Europe? Why are mortgage rates hitting three-month highs? How will GOP Medicaid cuts affect labor market? What’s behind Mercedes-Benz’s new US headquarters in Atlanta?

Anthropic’s new AI model turns to blackmail when engineers try to take it offline

techcrunch.com/2025/05/22/anthropics-new-ai-model-turns-to-blackmail-when-engineers-try-to-take-it-offline

Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report released Thursday.
During…

This story appeared on techcrunch.com, 2025-05-22 17:47:45.