As generative AI pushes the speed of software development, it is also enhancing the ability of digital attackers to carry out financially motivated or state-backed hacks. As a result, security teams at tech companies have more code than ever to review while dealing with even more pressure from bad actors. On Monday, Amazon will publish details for the first time of an internal system known as Autonomous Threat Analysis (ATA), which the company has been using to help its security teams proactively identify weaknesses in its platforms, perform variant analysis to quickly search for other, similar flaws, and then develop remediations and detection capabilities to plug holes before attackers find them.
ATA was born out of an internal Amazon hackathon in August 2024, and security team members say that it has grown into a crucial tool since then. The key concept underlying ATA is that it isn't a single AI agent developed to comprehensively conduct security testing and threat analysis. Instead, Amazon developed multiple specialized AI agents that compete against each other in two teams to rapidly investigate real attack techniques and the different ways they could be used against Amazon's systems, and then propose security controls for human review.
“The initial concept was aimed to address a critical limitation in security testing: limited coverage and the challenge of keeping detection capabilities current in a rapidly evolving threat landscape,” Steve Schmidt, Amazon's chief security officer, tells WIRED. “Limited coverage means you can't get through all of the software or you can't get to all of the applications because you just don't have enough humans. And then it's great to do an analysis of a set of software, but if you don't keep the detection systems themselves up to date with the changes in the threat landscape, you're missing half of the picture.”
As part of scaling its use of ATA, Amazon developed special “high-fidelity” testing environments that are deeply realistic reflections of Amazon's production systems, so ATA can both ingest and produce real telemetry for analysis.
The company's security teams also made a point of designing ATA so that every technique it employs, and every detection capability it produces, is validated with real, automated testing and system data. Red team agents, which work on finding attacks that could be used against Amazon's systems, execute actual commands in ATA's special test environments that produce verifiable logs. Blue team, or defense-focused, agents use real telemetry to confirm whether the protections they're proposing are effective. And anytime an agent develops a novel technique, it also pulls time-stamped logs to prove that its claims are accurate.
This verifiability reduces false positives, Schmidt says, and acts as “hallucination management.” Because the system is built to demand certain standards of observable evidence, Schmidt claims that “hallucinations are architecturally impossible.”
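Amazon has not published ATA's code, so the following Python sketch is only an illustration of the general idea of gating an agent's claim on time-stamped log evidence. Every name in it (`LogEntry`, `Claim`, `supporting_logs`, `accept_claim`) is hypothetical and is not drawn from Amazon's system. It assumes a claim is accepted only when at least one log entry inside the claimed time window contains the text the agent says should be there.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class LogEntry:
    """One time-stamped line of telemetry from a test environment (hypothetical shape)."""
    timestamp: datetime
    source: str
    message: str


@dataclass(frozen=True)
class Claim:
    """An agent's assertion, plus the evidence it says exists (hypothetical shape)."""
    description: str
    window_start: datetime
    window_end: datetime
    expected_substring: str  # text the agent says appears in the logs


def supporting_logs(claim, logs):
    """Return entries inside the claim's time window that contain the expected text."""
    return [
        entry
        for entry in logs
        if claim.window_start <= entry.timestamp <= claim.window_end
        and claim.expected_substring in entry.message
    ]


def accept_claim(claim, logs):
    """Accept a claim only if at least one time-stamped log entry backs it."""
    return len(supporting_logs(claim, logs)) > 0


if __name__ == "__main__":
    start = datetime(2024, 8, 1, 12, 0, 0)
    logs = [
        LogEntry(start + timedelta(seconds=5), "test-env", "process started: reverse shell"),
    ]
    claim = Claim(
        description="Reverse shell executed in the test environment",
        window_start=start,
        window_end=start + timedelta(minutes=1),
        expected_substring="reverse shell",
    )
    print("accepted" if accept_claim(claim, logs) else "rejected")
```

In this sketch a claim with no matching log entry is simply rejected, which is one plausible way to read the "observable evidence" requirement Schmidt describes; the real system's checks are presumably richer.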

