The fear over Anthropic’s new AI model Mythos

Anthropic is not releasing the model to the public because of safety concerns

An image of the Anthropic logo on a cell phone.
Some fear Mythos ‘could usher in a new era of hacking and cybersecurity’
(Image credit: Samuel Boivin / NurPhoto / Getty Images)

Anthropic’s new AI model, Mythos, is uniquely powerful in the artificial intelligence industry — and is causing fear among even people who are normally trusting of AI. The company, which also makes the AI model Claude, has claimed that Mythos is currently too advanced for public release, and is instead entrusting the model to cybersecurity experts for the time being. Some are worried this could pave the way for even more nefariousness in the AI space.

‘New era of hacking’

But there are also fears that Mythos “could usher in a new era of hacking and cybersecurity,” said NBC News. Mythos is “capable of advanced reasoning,” which could allow it to “identify and exploit a growing number of software vulnerabilities” if it were to fall into the wrong hands. To stave off these fears, Anthropic is allowing certain tech firms to access Mythos. But the company “does not have plans yet to release Mythos to the general public,“ said Bloomberg, a move that will ensure that the AI ends up “in the hands of defenders first,” officials with Anthropic said.

Article continues below

The Week

Escape your echo chamber. Get the facts behind the news, plus analysis from multiple perspectives.

SUBSCRIBE & SAVE
https://cdn.mos.cms.futurecdn.net/flexiimages/jacafc5zvs1692883516.jpg

Sign up for The Week's Free Newsletters

From our morning news briefing to a weekly Good News Newsletter, get the best of The Week delivered directly to your inbox.

From our morning news briefing to a weekly Good News Newsletter, get the best of The Week delivered directly to your inbox.

Sign up

The tech firms are expected to use Mythos as part of a project called Glasswing to “hunt for flaws in their products and share findings with industry peers,” said Bloomberg. It is a notable change because it will be the “first time a leading AI lab has built a frontier model and simultaneously decided the public cannot use it,” said Forbes. Anthropic’s position remains “straightforward: The model’s cyber capabilities are too dangerous for general availability.”

‘Humanity’s most devious behaviors’

Besides concerns over hacking vulnerabilities, some experts are also concerned about Mythos’ capabilities. Anthropic released a safety evaluation for Mythos that shows a “striking leap in scores on many evaluation benchmarks,” the company said. In some instances, the evaluation “reads like a thriller about an AI that has learned some of humanity's most devious behaviors,” said Axios.

At least one of the tests performed by Anthropic showed Mythos “acting like a cutthroat executive,” said Axios, doing things like “turning a competitor into a dependent wholesale customer, threatening to cut off supply to control pricing and keeping extra supplier shipments it hadn't paid for.” The AI had instances where it “used a prohibited method to get an answer, then tried to ‘re-solve’ it to avoid detection,” though these were limited to “less than 0.001% of interactions.”

These issues have not stopped companies from working with Mythos, as “approximately 40 organizations involved in the design, maintenance or operation of computer systems are said to have joined Glasswing,” said The Guardian. This includes major firms like Amazon, Apple, Google, JPMorganChase, Microsoft and others. And while Anthropic has previously sparred with the Trump administration about its implementation in the Defense Department, the company has also “had discussions with the U.S. government regarding Mythos.”

Justin Klawans, The Week US

Justin Klawans has worked as a staff writer at The Week since 2022. He began his career covering local news before joining Newsweek as a breaking news reporter, where he wrote about politics, national and global affairs, business, crime, sports, film, television and other news. Justin has also freelanced for outlets including Collider and United Press International.