Anthropic Kept Every Promise It Could Afford

6 days ago

Anthropic's Responsible Scaling Policy originally included a binding commitment to pause development if models became too unsafe.
In February 2026, version 3.0 replaced this binding 'brake' with voluntary safety roadmaps and self-grading goals.
Anthropic's chief science officer stated the change was due to competitive pressures; unilateral pauses were deemed ineffective.
Anthropic's valuation surged from $4 billion to $965 billion, alongside policy changes and steps toward an IPO.
Mythos, a model deemed too dangerous for public release, leaked shortly after limited deployment, undermining safety claims.
A report co-authored by Jack Clark advocated for slowing AI development, but only if competitors do so verifiably.
The author argues that financial incentives eroded safety commitments, irrespective of the company's initial sincerity.
The sequence of events suggests growth and competition outweighed prior ethical promises.

Hasty Briefsbeta