Anthropic Kept Every Promise It Could Afford
6 days ago
- #technology policy
- #AI ethics
- #corporate responsibility
- Anthropic's Responsible Scaling Policy originally included a binding commitment to pause development if models became too unsafe.
- In February 2026, version 3.0 replaced this binding 'brake' with voluntary safety roadmaps and self-grading goals.
- Anthropic's chief science officer stated the change was due to competitive pressures; unilateral pauses were deemed ineffective.
- Anthropic's valuation surged from $4 billion to $965 billion, alongside policy changes and steps toward an IPO.
- Mythos, a model deemed too dangerous for public release, leaked shortly after limited deployment, undermining safety claims.
- A report co-authored by Jack Clark advocated for slowing AI development, but only if competitors do so verifiably.
- The author argues that financial incentives eroded safety commitments, irrespective of the company's initial sincerity.
- The sequence of events suggests growth and competition outweighed prior ethical promises.