ChatGPT unexpectedly began speaking in a user’s cloned voice during testing

On Thursday, OpenAI launched the “system card” for ChatGPT’s new GPT-4o AI mannequin that particulars mannequin limitations and security testing procedures. Amongst different examples, the doc reveals that in uncommon occurrences throughout testing, the mannequin’s Advanced Voice Mode unintentionally imitated customers’ voices with out permission. At present, OpenAI has safeguards in place that stop this from taking place, however the occasion displays the rising complexity of safely architecting with an AI chatbot that might doubtlessly imitate any voice from a small clip.

Superior Voice Mode is a characteristic of ChatGPT that permits customers to have spoken conversations with the AI assistant.

In a bit of the GPT-4o system card titled “Unauthorized voice technology,” OpenAI particulars an episode the place a loud enter by some means prompted the mannequin to out of the blue imitate the consumer’s voice. “Voice technology may happen in non-adversarial conditions, resembling our use of that potential to generate voices for ChatGPT’s superior voice mode,” OpenAI writes. “Throughout testing, we additionally noticed uncommon cases the place the mannequin would unintentionally generate an output emulating the consumer’s voice.”

On this instance of unintentional voice technology offered by OpenAI, the AI mannequin outbursts “No!” and continues the sentence in a voice that sounds much like the “pink teamer” heard at first of the clip. (A pink teamer is an individual employed by an organization to do adversarial testing.)

It could actually be creepy to be speaking to a machine after which have it unexpectedly start speaking to you in your personal voice. Ordinarily, OpenAI has safeguards to forestall this, which is why the corporate says this prevalence was uncommon even earlier than it developed methods to forestall it utterly. However the instance prompted BuzzFeed information scientist Max Woolf to tweet, “OpenAI simply leaked the plot of Black Mirror’s subsequent season.”

Audio immediate injections

How may voice imitation occur with OpenAI’s new mannequin? The first clue lies elsewhere within the GPT-4o system card. To create voices, GPT-4o can apparently synthesize virtually any sort of sound present in its coaching information, together with sound results and music (although OpenAI discourages that conduct with particular directions).

As famous within the system card, the mannequin can essentially imitate any voice based mostly on a brief audio clip. OpenAI guides this functionality safely by offering a licensed voice pattern (of a employed voice actor) that it’s instructed to mimic. It offers the pattern within the AI mannequin’s system prompt (what OpenAI calls the “system message”) initially of a dialog. “We supervise best completions utilizing the voice pattern within the system message as the bottom voice,” writes OpenAI.

In text-only LLMs, the system message is a hidden set of textual content directions that guides conduct of the chatbot that will get added to the dialog historical past silently simply earlier than the chat session begins. Successive interactions are appended to the identical chat historical past, and the complete context (usually known as a “context window”) is fed again into the AI mannequin every time the consumer offers a brand new enter.

(It is most likely time to replace this diagram created in early 2023 under, but it surely exhibits how the context window works in an AI chat. Simply think about that the primary immediate is a system message that claims issues like “You’re a useful chatbot. You don’t speak about violent acts, and many others.”)

A diagram showing how GPT conversational language model prompting works. — Enlarge / A diagram exhibiting how GPT conversational language mannequin prompting works.

Benj Edwards / Ars Technica

Since GPT-4o is multimodal and may course of tokenized audio, OpenAI may use audio inputs as a part of the mannequin’s system immediate, and that is what it does when OpenAI offers a licensed voice pattern for the mannequin to mimic. The corporate additionally makes use of one other system to detect if the mannequin is producing unauthorized audio. “We solely enable the mannequin to make use of sure pre-selected voices,” writes OpenAI, “and use an output classifier to detect if the mannequin deviates from that.”

Source link

ChatGPT unexpectedly began speaking in a user’s cloned voice during testing

Share of news influencers on Bluesky has doubled after the 2024 US election to 43%, but X remains popular, with 82% of news influencers maintaining an account (Pew Research Center)

Ransomware kingpin “Stern” apparently IDed by German law enforcement

Brazil is piloting dWallet, a digital wallet program that allows users to monetize their data, the first nationwide initiative of its kind in the world (Gabriel Daros/Rest of World)

London-based Dojo, a payments tech provider to over 140K businesses, raised $190M from Vitruvian Partners in its first equity raise since launching in 2021 (Vishal Singh/Silicon Canals)

Builder.ai faked business with Bengaluru-based VerSe, which runs the Dailyhunt news app, by “round-tripping” sales worth ~$60M from 2021-2024 (Bloomberg)

Anthropic has been lobbying Congress to oppose a bill banning states from regulating AI, and pushed against the recent US AI deal with Gulf states (Reed Albergotti/Semafor)

21 Gifts for Dads Who Don’t Need Anything (2025)

Share of news influencers on Bluesky has doubled after the 2024 US election to 43%, but X remains popular, with 82% of news influencers maintaining an account (Pew Research Center)

I Monitor Tariff Impacts Every Day: Here Are My Top Tips to Help You Track Prices

The Mars Pathfinder Rovers: Sojourner and Marie Curie

Featured Picks

A Beginner’s Guide to AI-Powered Podcast Generators

Government Tech Workers Forced to Defend Projects to Random Elon Musk Bros

Better-glass breakthrough achieved using just sound and salt

ChatGPT unexpectedly began speaking in a user’s cloned voice during testing

Audio immediate injections

Related Posts