Following leaked revelations on the finish of March that Anthropic had developed a strong new Claude mannequin, the corporate formally announced Mythos Preview on Tuesday together with information of an {industry} consortium it has convened, generally known as Mission Glasswing, to grapple with the cybersecurity implications of the brand new mannequin and advancing capabilities extra typically throughout the AI area.
The group contains Microsoft, Apple, and Google in addition to Amazon Web Services, the Linux Basis, Cisco, Nvidia, Broadcom, and greater than 40 different tech, cybersecurity, important infrastructure, and monetary organizations that may have non-public entry to the mannequin, which isn’t but being typically launched. The concept, partly, is just to present the builders of the world’s foundational tech platforms time to show Mythos Preview on their very own methods to allow them to mitigate vulnerabilities and exploit chains that the mannequin develops in simulated assaults. Extra broadly, Anthropic emphasizes that the aim of convening the hassle is to kickstart pressing exploration of how AI capabilities throughout the {industry} are on the precipice, the corporate says, of upending present software program safety and digital protection practices around the globe.
“The true message is that this isn’t concerning the mannequin or Anthropic,” Logan Graham, the corporate’s frontier pink crew lead, tells WIRED. “We have to put together now for a world the place these capabilities are broadly accessible in 6, 12, 24 months. Many issues can be completely different about safety. Lots of the assumptions that we’ve constructed the fashionable safety paradigms on may break.”
Fashions developed and skilled by multiple companies have more and more been capable of finding vulnerabilities in code and propose mitigations—or strategies for exploitation. This creates a subsequent era of safety’s basic cat-and-mouse sport wherein a instrument can help defenders however also can gas unhealthy actors and make it simpler to hold out assaults that had been as soon as too costly or complicated to be sensible.
“Claude Mythos preview is a very massive leap,” Anthropic CEO Dario Amodei stated on Tuesday in a Mission Glasswing launch video. “We’ve not skilled it particularly to be good at cyber. We skilled it to be good at code, however as a aspect impact of being good at code, it is also good at cyber.” He provides within the video that “extra highly effective fashions are going to return from us and from others. And so we do want a plan to answer this.”
Anthropic’s Graham notes that along with vulnerability discovery—together with producing potential assault chains and proofs of idea—Mythos Preview is able to extra superior exploit growth, penetration testing, endpoint safety evaluation, trying to find system misconfigurations, and evaluating software program binaries with out entry to its supply code.
In finishing up a staggered launch of Mythos Preview, starting with an {industry} collaboration section, Graham says that Anthropic sought to attract on tenets of coordinated vulnerability disclosure, the method of giving builders time to patch a bug earlier than it’s publicly mentioned.
“We have seen Mythos Preview accomplish issues {that a} senior safety researcher would be capable to accomplish,” Graham says. “This has very massive implications then for a way capabilities like this ought to be launched. Executed not fastidiously, this may very well be a meaningfully accelerant for attackers.”
Mission Glasswing companions, together with a few of Anthropic’s rivals, struck a collaborative tone in statements as a part of the launch.
“Google is happy to see this cross-industry cybersecurity initiative coming collectively,” Heather Adkins, Google’s vice chairman of safety engineering, says in an announcement. “Now we have lengthy believed that AI poses new challenges and opens new alternatives in cyber protection.”

