As if launching a new AI model that shook the entire industry wasn’t sufficient, the Chinese startup DeepSeek adopted up this week by releasing an AI image generator it claims supplies “vital developments in each multimodal understanding and text-to-image instruction-following capabilities.”
The brand new image-generation mannequin known as Janus-Pro, and it goals to compete with US rivals like DALL-E 3 and Stable Diffusion. The brand new mannequin claims to outperform its competitors in areas similar to picture high quality and accuracy.
The launch of Janus-Professional got here solely days after the discharge of DeepSeek’s R1 model, which made waves with its lightning-fast, extremely logical responses, and for being skilled extra shortly and at a fraction of the price of US fashions.
DeepSeek’s mannequin reportedly runs on much less superior Nvidia chips, elevating questions on how China is competing with out entry to cutting-edge US expertise. The iOS app has outpaced ChatGPT in downloads on the Apple App Retailer lately, and continues to be the No. 1 free app as of Jan. 31.
The back-to-back releases sign China’s push to achieve footing within the rising AI arms race. In the meantime, final week, President Donald Trump introduced a brand new AI infrastructure initiative, pledging as much as $500 million in partnership with OpenAI and different tech companies.
Watch this: What Is DeepSeek AI? The whole lot to Know In regards to the Standard New AI
The discharge of R1 and Janus-Professional additionally coincides with elevated scrutiny of Chinese language tech firms, with tensions already excessive over TikTok’s data privacy concerns.
In an introduction on its obtain web page, DeepSeek says: “Janus-Professional surpasses its earlier unified mannequin and matches or exceeds the efficiency of task-specific fashions. The simplicity, excessive flexibility, and effectiveness of Janus-Professional make it a robust candidate for next-generation unified multimodal fashions.”
The mannequin ranges in dimension from 1 billion to 7 billion parameters, a key think about its problem-solving capabilities.
The corporate calls Janus-Professional a “novel autoregressive framework” that solves earlier challenges by separating the steps for analyzing and producing pictures, whereas nonetheless utilizing a single, unified system to course of all the pieces.
“The decoupling not solely alleviates the battle between the visible encoder’s roles in understanding and technology but additionally enhances the framework’s flexibility,” DeepSeek wrote.
Consumer response to Janus-Professional has been combined thus far, with some Redditors claiming the pictures resemble its opponents’ efforts from years previous. To get a way of how Janus-Professional compares to different AI picture mills, try this breakdown of efficiency between ChatGPT 4o, Qwen 2.5 and Janus-Professional from YouTuber EJack Yao.
Janus-Professional is presently available to download on the AI developer platform Hugging Face.