How Dangerous Is Anthropic’s New AI Model? Its Chief Science Officer Explains.

Just how fast is the future coming at us, and how worried should we be?
Last week, Anthropic announced Claude Mythos, the latest in what has become the leading family of AI models. In a very short time, AI models from Anthropic and other companies have become shockingly powerful at coding. But Mythos stands out because it is not just able to write code that can hack into critical systems, like those that run financial companies, but also to analyze systems and find and string together vulnerabilities in them, allowing Mythos to evade even sophisticated defenses.
Anthropic chose to release Mythos to only about 40 leading corporations, so they could analyze the model’s capabilities and prepare countermeasures, in an effort it calls Project Glasswing. The release of Mythos—and the prospect of similarly powerful AIs to follow—has caused tremors at the top levels of Washington and corporate America.
Jared Kaplan, co-founder and chief science officer at Anthropic, is at the center of this breakthrough. We sat down with Kaplan so he could walk us through just how powerful Mythos is, what the implications are for issues like our personal privacy, and what comes next. This conversation has been edited for length and clarity.
Sean Fischer: For the layman, how is Claude Mythos different—more powerful, more dangerous—from previous models released by Anthropic?
Jared Kaplan: I think most people have experienced the fact that our computers get faster and faster over time and over many decades. There’s this very smooth trend where every 18 months, maybe, computer processing power doubles. I think AI is on a trend in which it’s improving maybe 10 times faster than that. A lot of the earliest work that I did in AI was around identifying this scaling trend in AI.
Claude Mythos is the latest model from Anthropic, but it’s not different because it’s qualitatively different. It’s really just kind of the culmination of the trend of AI models getting smarter and smarter at all kinds of general capabilities, from reasoning to software engineering to scientific research to knowledge work.
For us, it was the first model to demonstrate very elite-level cybersecurity capabilities. It wasn’t because we trained Claude Mythos to focus on cybersecurity, but it was because, as a byproduct of its general intelligence and its general ability with software, it’s particularly good at identifying vulnerabilities in software and how to exploit them.



