Anthropic’s AI resorts to blackmail in simulations

Why did Trump threaten tariffs on Apple and the EU? How will Trump's tariff threats impact US manufacturing? What caused Harvard's enrollment ban for international students? How did Trump's tax bill affect US deficit forecasts? Why are Bitcoin ETFs seeing high inflows recently? What is the effect of Trump's trade policies on stock markets? How will Apple's AI smart glasses compete with rivals? Why are large US banks considering a joint stablecoin? How is the Supreme Court ruling affecting charter schools?

Anthropic’s AI resorts to blackmail in simulations

semafor.com/article/05/23/2025/anthropics-ai-resorts-to-blackmail-in-simulations

The News
Anthropic said its latest artificial intelligence model resorted to blackmail when told it would be taken offline.
In a safety test, the AI company asked Claude Opus 4 to act as an assistant to a fictional company, but then gave it access to (also fictional) emails saying that…

This story appeared on semafor.com, 2025-05-23 11:42:38.