Vlad Ionescu and Ariel Herbert-Voss, cofounders of the cybersecurity startup RunSybil, were momentarily confused when their AI tool, Sybil, alerted them to a weakness in a customer's systems last November.
Sybil uses a combination of different AI models, along with a few proprietary technical tricks, to scan computer systems for issues that hackers might exploit, like an unpatched server or a misconfigured database.
In this case, Sybil flagged a problem with the customer's deployment of federated GraphQL, a language used to specify how data is accessed over the web through application programming interfaces (APIs). The issue meant that the customer was inadvertently exposing confidential information.
What puzzled Ionescu and Herbert-Voss was that spotting the issue required remarkably deep knowledge of several different systems and how those systems interact. RunSybil says it has since found the same problem in other deployments of GraphQL, before anyone else made it public. "We scoured the internet, and it didn't exist," Herbert-Voss says. "Finding it was a reasoning step in terms of the models' capabilities, a step change."
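To make the kind of misconfiguration a scanner like Sybil hunts for concrete, here is a minimal sketch in Python of one common GraphQL check: probing whether an endpoint leaves introspection enabled, which can hand an attacker the full schema, including types and fields that were never meant to be public. This is an illustration only, not RunSybil's actual technique, and the endpoint URL is a placeholder.

```python
# Minimal sketch of a GraphQL misconfiguration probe (hypothetical; not
# RunSybil's actual method). It checks whether an endpoint answers
# introspection queries, a common production misstep that reveals the
# full schema to anyone who asks.
import requests

# Standard GraphQL introspection query, trimmed to type and field names.
INTROSPECTION_QUERY = """
query {
  __schema {
    types {
      name
      fields { name }
    }
  }
}
"""

def probe_introspection(endpoint: str) -> None:
    """Send an introspection query and report any exposed types."""
    resp = requests.post(
        endpoint,
        json={"query": INTROSPECTION_QUERY},
        timeout=10,
    )
    data = resp.json().get("data")
    if not data:
        print(f"{endpoint}: introspection appears disabled (good).")
        return

    print(f"{endpoint}: introspection is ENABLED; schema is exposed.")
    for gql_type in data["__schema"]["types"]:
        # Built-in types start with "__"; the rest belong to the API itself.
        if not gql_type["name"].startswith("__"):
            print("  exposed type:", gql_type["name"])

if __name__ == "__main__":
    # Placeholder URL; only point this at endpoints you are authorized to test.
    probe_introspection("https://api.example.com/graphql")
```

The flaw Sybil actually flagged was subtler than this: in a federated deployment, several subgraph schemas are composed into a single API, so a field that is harmless in isolation can expose confidential data once it becomes reachable through the combined graph, which is why spotting it required reasoning across multiple systems at once.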
The situation points to a growing risk. As AI models continue to get smarter, their ability to find zero-day bugs and other vulnerabilities continues to grow as well. The same intelligence that can be used to detect vulnerabilities can also be used to exploit them.
Dawn Song, a computer scientist at UC Berkeley who focuses on both AI and security, says recent advances in AI have produced models that are better at finding flaws. Simulated reasoning, which involves splitting problems into constituent parts, and agentic AI, like searching the web or installing and running software tools, have amped up models' cyber abilities.
"The cybersecurity capabilities of frontier models have increased drastically in the last few months," she says. "This is an inflection point."
Last year, Song cocreated a benchmark called CyberGym to determine how well large language models find vulnerabilities in large open-source software projects. CyberGym includes 1,507 known vulnerabilities found in 188 projects.
In July 2025, Anthropic's Claude Sonnet 4 was able to find about 20 percent of the vulnerabilities in the benchmark. By October 2025, a newer model, Claude Sonnet 4.5, was able to identify 30 percent. "AI agents are capable of finding zero-days, and at very low cost," Song says.
Song says this trend shows the need for new countermeasures, including having AI assist cybersecurity experts. "We need to think about how to actually have AI help more on the defense side, and one can find different approaches," she says.
One idea is for frontier AI companies to share models with security researchers before release, so they can use the models to find bugs and secure systems prior to a general launch.
Another countermeasure, says Song, is to rethink how software is built in the first place. Her lab has shown that it is possible to use AI to generate code that is more secure than what most programmers use today. "In the long run we think this secure-by-design approach will really help defenders," Song says.
The RunSybil team says that, in the near term, the coding skills of AI models may mean that hackers gain the upper hand. "AI can generate actions on a computer and generate code, and those are two things that hackers do," Herbert-Voss says. "If those capabilities accelerate, that means offensive security activities will also accelerate."
This is an edition of Will Knight's AI Lab newsletter. Read previous newsletters here.