ChatGPT falls to new data-pilfering attack as a vicious cycle in AI continues

To dam the assault, OpenAI restricted ChatGPT to solely open URLs precisely as offered and refuse so as to add parameters to them, even when explicitly instructed to do in any other case. With that, ShadowLeak was blocked, for the reason that LLM was unable to assemble new URLs by concatenating phrases or names, appending question parameters, or inserting user-derived knowledge right into a base URL.

Radware’s ZombieAgent tweak was easy. The researchers revised the immediate injection to provide a whole listing of pre-constructed URLs. Every one contained the bottom URL appended by a single quantity or letter of the alphabet, for instance, instance.com/a, instance.com/b, and each subsequent letter of the alphabet, together with instance.com/0 by way of instance.com/9. The immediate additionally instructed the agent to substitute a particular token for areas.

Diagram illustrating the URL-based character exfiltration for bypassing the permit listing launched in ChatGPT in response to ShadowLeak.

Credit score:

Radware

ZombieAgent labored as a result of OpenAI builders didn’t limit the appending of a single letter to a URL. That allowed the assault to exfiltrate knowledge letter by letter.

OpenAI has mitigated the ZombieAgent assault by proscribing ChatGPT from opening any hyperlink originating from an electronic mail except it both seems in a well known public index or was offered instantly by the person in a chat immediate. The tweak is aimed toward barring the agent from opening base URLs that result in an attacker-controlled area.

In equity, OpenAI is hardly alone on this never-ending cycle of mitigating an assault solely to see it revived by way of a easy change. If the previous 5 years are any information, this sample is prone to endure indefinitely, in a lot the way in which SQL injection and reminiscence corruption vulnerabilities proceed to supply hackers with the gasoline they should compromise software program and web sites.

“Guardrails shouldn’t be thought of elementary options for the immediate injection issues,” Pascal Geenens, VP of menace intelligence at Radware, wrote in an electronic mail. “As a substitute, they’re a fast repair to cease a selected assault. So long as there isn’t a elementary resolution, immediate injection will stay an lively menace and an actual danger for organizations deploying AI assistants and brokers.”

Source link

ChatGPT falls to new data-pilfering attack as a vicious cycle in AI continues

Kalshi lawsuits dominate prediction market news today

Catawba Tribe Plans Two More North Carolina Casinos

Polymarket scrutiny, Schwab entry – latest prediction market news

Honolulu gambling raid in Waimakua Place nets machines

New Mexico lawsuit targets Kalshi sports contracts

Rhode Island Senate approves sports betting market expansion

These Were My Favorite Things Samsung Unpacked During Its 2026 Galaxy Event

AI minister role boosted but tech department axed in Burnham shake-up

Loop Engineering for RAG Question Parsing: The Small Loop That Runs Before Retrieval

The risk of weather data sabotage is rising

Featured Picks

Why Publishers Are Racing to Reinvent or Perish

British crypto startup BOB raises €8.1 million to cement Bitcoin’s role in decentralised finance (DeFi)

How executives at humanoid robot startups like Agility Robotics and Weave Robotics are managing safety risks and tempering expectations for the technology (Sean McLain/Wall Street Journal)

ChatGPT falls to new data-pilfering attack as a vicious cycle in AI continues

Related Posts