OpenAI Beefs Up ChatGPT’s Image Generation Model

OpenAI launched a new picture era AI mannequin on Tuesday, dubbed ChatGPT Photographs 2.0. This mannequin can generate a couple of picture from a single immediate, like a whole research booklet, in addition to output textual content, together with in non-English languages like Chinese language and Hindi. This launch is out there globally for ChatGPT and Codex customers, with a extra highly effective model out there for paying subscribers.

When any main AI firm releases a brand new picture mannequin, it might probably revive curiosity and increase utilization, particularly if social media customers undertake a meme-able development, remodeling photos of themselves. Final yr, Google’s launch of the Nano Banana mannequin was a serious second for the corporate, particularly when customers began posting hyperrealistic figurines of themselves on-line. Earlier this yr, ChatGPT Photographs made waves on social media as customers shared AI-generated caricatures.

What’s Totally different?

Because the new mannequin can faucet into ChatGPT’s “reasoning” capabilities, Photographs 2.0 can search the web for latest info and generate a couple of picture at a time. In essence, the bot can use further steps to output extra thorough generations from a single immediate. Photographs 2.0 additionally has a more moderen information cutoff date: December 2025.

This additionally signifies that outputs from the brand new mannequin are extra granular. For instance, I generated an infographic with San Francisco’s climate forecast for the following day, in addition to actions value doing. The picture ChatGPT generated included correct climate particulars for the wet day, together with accurate-looking drawings of the Ferry Constructing, Castro Theater, Painted Girls homes, and Transamerica Pyramid.

Moreover, Photographs 2.0 is extra customizable for customers who need distinctive facet ratios for picture outputs. The brand new mannequin can generate photos starting from 3:1 broad to 1:3 tall, and customers can alter the picture’s dimension as a part of their immediate to the AI instrument.

First Impressions

After a couple of hours of producing photos with the brand new mannequin, I used to be usually impressed with the textual content rendering capabilities, in English at the least. Not that way back, picture outputs that includes textual content, from any of the most important fashions, typically included quite a few malformed characters or phrases with errant additional letters. ChatGPT struggled to label photos precisely two years prior, so the cleaner, extra complicated outputs from Photographs 2.0 are an indication of continued enchancment. Google has additionally targeted on enhancing picture outputs that includes textual content in its recent iterations of Nano Banana.

Image may contain Advertisement Poster Person Beverage Coffee Coffee Cup Clothing Coat and Jacket

Source link

OpenAI Beefs Up ChatGPT’s Image Generation Model

YouTube and X Have Become ‘Gateways’ to Nudify Apps

Where NASA Posts Its Best Space Photos, and How to Find Them

Google Home Speaker Review: Leading the Pack, Again

20 Best Gifts for Men, Manly Men, and Menly Man Men (2026)

How a Citizen Science Organization Aims to Preserve the Places It Brings Tourists to Study

The US Has a Plan to Combat Screwworm. It Involves a Lot More Flies

These Were My Favorite Things Samsung Unpacked During Its 2026 Galaxy Event

AI minister role boosted but tech department axed in Burnham shake-up

Loop Engineering for RAG Question Parsing: The Small Loop That Runs Before Retrieval

The risk of weather data sabotage is rising

Featured Picks

Eggs may lower LDL cholesterol and boost heart health

Building a Personal AI Agent in a couple of Hours

What Is Earthshine? How to Spot the Lunar Marvel in the Skies This Week

OpenAI Beefs Up ChatGPT’s Image Generation Model

What’s Totally different?

First Impressions

Related Posts