It was 8 a.m. and I used to be sitting within the foyer of the auto physique store after I realized I might forgotten my earbuds. Usually, that is not a serious problem, however I used to be speaking to my cellphone. And I wasn’t speaking to a different individual. I used to be speaking to ChatGPT. It felt as embarrassing as asking Siri a query from throughout the room or becoming a member of a Zoom assembly sans headphones in an open workplace.
I used to be testing the superior voice mode that comes with GPT-5, OpenAI’s newest model of the generative AI mannequin behind ChatGPT. GPT-5 dropped this summer time after many months of speculation and delays, promising AI customers a quicker and smarter chatbot expertise. The jury’s nonetheless out on whether or not or not OpenAI has delivered. (Disclosure: Ziff Davis, CNET’s mum or dad firm, in April filed a lawsuit in opposition to OpenAI, alleging it infringed Ziff Davis copyrights in coaching and working its AI methods.)
GPT-5 contains enhancements to its advanced voice mode, which is basically a method so that you can actually discuss to ChatGPT and have it reply within the voice of your selecting. Free customers like me now have entry to the superior model (free customers beforehand solely had entry to fundamental voice mode), and paying subscribers will obtain greater utilization limits. One other new GPT-5 function means that you can select what sort of persona you need your AI to imitate, together with sassy, nerdy and robotic avatars.
To make use of voice mode, open ChatGPT, faucet the audio button subsequent to the immediate window the place you’ll enter an instruction and start chatting. You’ll be able to change which voice ChatGPT makes use of by tapping the settings icon within the higher proper hand nook on the cellular app (two bars stacked on high of one another with circles on them).
Extra human AI voices? How my expertise went
I made a decision to attempt to communicate to ChatGPT like I might a good friend, like a extra enthusiastic model of myself. The AI laughed after I began the decision with a spirited “Heyyyy girlfriend!” which felt each humorous and condescending.
ChatGPT’s voice flowed very naturally in a well-known cadence, just like the way in which I might discuss to a very pleasant customer support agent. That made sense because the chatbot itself informed me that the upgraded superior voice mode helped make it sound extra human.
The voice I used, ember, would usually take pauses for breaths, like a human would throughout an extended sentence. I assumed that was type of bizarre, since whereas ChatGPT was doing its finest impression of a human, we each knew it did not really have to pause to catch its breath.
In my dialog with ChatGPT, it was extra empathetic than I anticipated. It requested me how I used to be doing, and I stated not properly and informed it about my automotive accident. In our five-minute chat, it might bookend a lot of its responses with empathetic statements, like saying it was sorry I used to be having a nasty week and agreeing that coping with insurance coverage generally is a headache. (Has ChatGPT ever needed to name an insurance coverage agent and even skilled a headache? I believe not).
Whereas a sympathetic robotic ear may not appear to be a giant deal, it may be an indication of a much bigger downside. Sycophantic AI, the time period used to explain when AI is overly affectionate or emotional, could be irritating for customers simply on the lookout for info. It can be harmful for individuals who use AI as therapists or psychological well being counselors, one thing OpenAI CEO Sam Altman has warned ChatGPT customers in opposition to. Earlier variations of ChatGPT have been pulled and re-released after points with sycophantic tendencies.
I additionally requested ChatGPT extra factual questions, like the typical price of automotive restore labor in North Carolina and the place I might go to get a second restore estimate. It responded extra like a good friend would than a chatbot, which will not be essentially the most useful. For instance, after I typed the identical request into ChatGPT on my laptop computer, it pulled up a map with the listing of shops, together with extra info like pricing data and retailer hours. However after I was chatting with ChatGPT voice mode, it introduced up fewer choices and described them based mostly on what I assume are the store’s advertising language and buyer evaluations, utilizing phrases like “They have been round for fairly some time” and saying that one store is “recognized for high quality service”. You additionally do not get any hyperlinks or sources with voice mode, which I do not love.
ChatGPT robotically transcribes voice chats, so you may see the distinction within the stage of element given in common textual content prompts (left) and voice chats (proper).
Utilizing ChatGPT voice as a sounding board
One of many issues voice mode is well-suited for is being a brainstorming accomplice, a literal wall to bounce concepts off of. I requested it to assist me plan a sky-diving-themed party, and it each helped me develop new concepts and refine those I already had.
I interrupted ChatGPT whereas it was talking a few instances, and it was in a position to pivot shortly. I additionally have a tendency to speak shortly, and the chatbot stored up and did not miss any of my ideas. I let myself ramble and steer the dialog off observe, and ChatGPT did not blink a digital eye. Most significantly, after I requested it a query about an earlier subject, it might decide up the place we left off. Enhancements to ChatGPT’s reminiscence are to thank for that vital consideration.
Watch this: The Hidden Influence of the AI Information Heart Growth
Must you use ChatGPT voice mode?
Total, I believe voice mode is sweet as one other method to make use of ChatGPT, however it’s solely situationally helpful. In case you want in-depth analysis and extra detailed info, voice mode is not going to be best for you. However if you happen to simply need to discuss to somebody (quite, somefactor) or work by way of an issue out loud, voice mode is a pleasant different to having to articulate your ideas and kind them out.
I nonetheless consider that we’ve not normalized speaking to AIs in public areas, particularly with out headphones. However it may be a helpful different for individuals who suppose higher aloud. For extra, try how AI is changing search engines and the best AI image generators.

