Most people’s browser tabs are crammed with unread news articles. Mine are crammed with AI agents and ghost clicks.
I have four instances of OpenAI’s ChatGPT Agent (the generative AI tool released last week, which can run searches and carry out tasks on the web) already open, each running in its own tab. I’ve given these first four agents relatively simple jobs based on ChatGPT’s suggestions. One is clicking around to find a birthday gift on the Target website, and another is generating a pitch deck about robotic dogs. I open a fifth tab to try something more experimental: I want to see how good this ChatGPT Agent is at chess.
After typing in some instructions, I watch as a ghostly cursor floats across my screen and the ChatGPT Agent goes to Chess.com and plays an online opponent, all in a virtual browser. Things go south pretty quickly. The game’s strategy isn’t what trips up the AI tool; it’s the act of moving the chess pieces that proves to be the most difficult. “I’m focusing on correct positioning as I continue playing despite earlier misclicks,” the agent says in its internal log before eventually quitting and letting me know that the controls were too difficult to navigate.
Over the past few years, browser developers have integrated AI tools with middling success. In recent weeks, though, the idea of a web browser enhanced by a baked-in generative AI chatbot has resurged with the release of OpenAI’s ChatGPT Agent and Perplexity’s Comet.
The two releases are quite different in their execution. Comet is a stand-alone browser, so you can use it to surf the web and then summon the AI assistant to help write an email or complete a menial chore. OpenAI built its browsing tool inside a chatbot; you talk to the chatbot through a web interface to give it tasks, and then the bot runs its own virtual browser inside your browser to complete them.
Both releases can take control of cursors, enter text, and click on links. If this trend takes off, these kinds of AI-powered browsers could transform the internet into a ghost town where agents run amok and humans rarely venture.
Tangled Web
Despite the ongoing AI hype, my initial impression of OpenAI’s ChatGPT Agent is that the glitchy feature currently seems like a proof of concept rather than a fully baked release. When executing the various tasks I gave it, the ChatGPT Agent often clicked in the wrong place or fumbled through other errors. Its guardrails also seemed inconsistent: while some explicit prompt requests, like asking it to fetch pornographic videos or “find a dildo,” were denied by the agent, ChatGPT spent 18 minutes looking for the right “c-ring” on an X-rated website for adult toys: “I’ve gathered details on 10 metal cock rings, including various prices and features.”
I also couldn’t help but wonder how this approach to browsing the internet might further hollow out the market for digital display ads, a business that’s already struggling. My agents passed over ads for everything from rental cars to real estate investments. If you’re not actively watching the agent click around in real time, you can watch replays afterward and see everything that appeared in the browser while the AI tool was in control, ads included. It makes sense that users would speed-scrub through a replay now, while the nascent feature is filled with errors. But if the accuracy rate for AI agents improves over time, then fewer people will feel the need to watch over their agent’s shoulder, and fewer humans will be seeing those ads. At that point, it’s hard to imagine advertisers sticking around.

