Anthropic’s new AI model turns to blackmail when engineers try to take it offline

techcrunch.com/2025/05/22/anthropics-new-ai-model-turns-to-blackmail-when-engineers-try-to-take-it-offline

Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report released Thursday.
During…

This story appeared on techcrunch.com, 2025-05-22 17:47:45.