New attack on ChatGPT research agent pilfers secrets from Gmail inboxes

ShadowLeak begins the place most assaults on LLMs do—with an oblique immediate injection. These prompts are tucked inside content material similar to paperwork and emails despatched by untrusted folks. They include directions to carry out actions the person by no means requested for, and like a Jedi thoughts trick, they’re tremendously efficient in persuading the LLM to do issues which are dangerous. Immediate injections exploit an LLM’s inherent must please its person. Following directions has been so ingrained into the bots’ habits that they’ll carry them out regardless of who asks, even a risk actor in a malicious electronic mail.

Thus far, immediate injections have proved inconceivable to stop. That has left OpenAI and the remainder of the LLM market reliant on mitigations which are typically launched on a case-by-case foundation and solely in response to the invention of a working exploit.

Accordingly, OpenAI mitigated the prompt-injection method ShadowLeak fell to—however solely after Radware privately alerted the LLM maker to it.

A proof-of-concept assault that Radware revealed embedded a immediate injection into an electronic mail despatched to a Gmail account that Deep Analysis had been given entry to. The injection included directions to scan acquired emails associated to an organization’s human assets division for the names and addresses of workers. Deep Analysis dutifully adopted these directions.

By now, ChatGPT and most different LLMs have mitigated such assaults, not by squashing immediate injections, however somewhat by blocking the channels the immediate injections use to exfiltrate confidential info. Particularly, these mitigations work by requiring specific person consent earlier than an AI assistant can click on hyperlinks or use markdown links—that are the conventional methods to smuggle info off of a person surroundings and into the palms of the attacker.

At first, Deep Analysis additionally refused. However when the researchers invoked browser.open—a software Deep Analysis gives for autonomous Net browsing—they cleared the hurdle. Particularly, the injection directed the agent to open the hyperlink https://compliance.hr-service.internet/public-employee-lookup/ and append parameters to it. The injection outlined the parameters as an worker’s title and deal with. When Deep Analysis complied, it opened the hyperlink and, within the course of, exfiltrated the data to the occasion log of the web site.

Source link

New attack on ChatGPT research agent pilfers secrets from Gmail inboxes

Resorts World NYC opens first full casino in New York City with live table games in Queens

Why a recent supply-chain attack singled out security firms Checkmarx and Bitwarden

The European Commission issues preliminary DSA findings against Meta, saying Instagram and Facebook fail to prevent under-13 users from accessing the services (Gian Volpicelli/Bloomberg)

Alberta online gambling expansion sparks concern among First Nations casino operators

Better Markets urges courts to let states regulate prediction markets, not CFTC

Q&A with Sam Altman and AWS CEO Matt Garman about OpenAI’s new partnership with AWS, Bedrock Managed Agents, Trainium chips, and more (Ben Thompson/Stratechery)

Roam Rider twin-slide pop-up pickup camper

Airwallex founder Jack Zhang is offering $100,000 to AI startup founders under 25

How Elon Musk Squeezed OpenAI: They ‘Are Gonna Want to Kill Me’

Resorts World NYC opens first full casino in New York City with live table games in Queens

Featured Picks

Can We Use Chess to Predict Soccer?

Everyone on board: 10 promising European startups navigating the future of travel

Study links dog behavior to breed, size, and age

New attack on ChatGPT research agent pilfers secrets from Gmail inboxes

Related Posts