We Reproduced Anthropic's Mythos Findings with Public Models

4 hours ago

Anthropic's Mythos release demonstrates advanced AI vulnerability research, but public models can already achieve similar results.
The study replicated Anthropic's findings using GPT-5.4 and Claude Opus 4.6 in an open-source workflow, focusing on patched vulnerabilities.
Public models exactly reproduced the FreeBSD and Botan issues, and Claude Opus 4.6 also reproduced the OpenBSD issue, but the models achieved only partial success on FFmpeg and wolfSSL.
The key takeaway is that AI-assisted vulnerability research is no longer exclusive to those with access to restricted models; the challenge now lies in validation, prioritization, and operationalization.
Defenders should shift focus from model access to building effective workflows for discovery and remediation within their security practices.
