Ask anybody what Nvidia makes, they usually’re prone to first say “GPUs.” For many years, the chipmaker has been outlined by superior parallel computing, and the emergence of generative AI and the ensuing surge in demand for GPUs has been a boon for the company.
However Nvidia’s current strikes sign that it’s trying to lock in additional clients on the much less compute-intensive finish of the AI market—clients who don’t essentially want the beefiest, most powerful GPUs to coach AI fashions, however as an alternative are searching for essentially the most environment friendly methods to run agentic AI software program. Nvidia just lately spent billions to license know-how from a chip startup targeted on low-latency AI computing, and it additionally began promoting stand-alone CPUs as a part of its newest superchip system.
Yesterday, Nvidia and Meta announced that the social media big had agreed to purchase billions of {dollars}’ value of Nvidia chips to offer computing energy for its huge infrastructure tasks—with Nvidia’s CPUs as a part of the deal.
The multiyear deal is an growth of a comfortable ongoing partnership between the 2 corporations. Meta beforehand estimated that by the top of 2024, it could have bought 350,000 H100 chips from Nvidia, and that by the top of 2025 the corporate would have entry to 1.3 million GPUs in total (although it wasn’t clear whether or not these would all be Nvidia chips).
As a part of the most recent announcement, Nvidia stated that Meta would “construct hyperscale information facilities optimized for each coaching and inference in help of the corporate’s long-term AI infrastructure roadmap.” This features a “large-scale deployment” of Nvidia’s CPUs and “thousands and thousands of Nvidia Blackwell and Rubin GPUs.”
Notably, Meta is the primary tech big to announce it was making a large-scale buy of Nvidia’s Grace CPU as a stand-alone chip, one thing Nvidia stated can be an possibility when it revealed the complete specs of its new Vera Rubin superchip in January. Nvidia has additionally been emphasizing that it gives know-how that connects numerous chips, as a part of its “soup-to-nuts strategy” to compute energy, as one analyst places it.
Ben Bajarin, CEO and principal analyst on the tech market analysis agency Inventive Methods, says the transfer signaled that Nvidia acknowledges {that a} rising vary of AI software program now must run on CPUs, a lot in the identical approach that standard cloud functions do. “The rationale why the trade is so bullish on CPUs inside information facilities proper now could be agentic AI, which places new calls for on general-purpose CPU architectures,” he says.
A recent report from the chip newsletter Semianalysis underscored this level. Analysts famous that CPU utilization is accelerating to help AI coaching and inference, citing one in all Microsoft’s information facilities for OpenAI for instance, the place “tens of 1000’s of CPUs are actually wanted to course of and handle the petabytes of information generated by the GPUs, a use case that wouldn’t have in any other case been required with out AI.”
Bajarin notes, although, that CPUs are nonetheless only one part of essentially the most superior AI {hardware} techniques. The variety of GPUs Meta is buying from Nvidia nonetheless outnumbers the CPUs.
“If you happen to’re one of many hyperscalers, you’re not going to be operating all of your inference computing on CPUs,” Bajarin says. “You simply want no matter software program you’re operating to be quick sufficient on the CPU to work together with the GPU structure that’s truly the driving power of that computing. In any other case, the CPU turns into a bottleneck.”
Meta declined to touch upon its expanded cope with Nvidia. Throughout a current earnings name, the social media big stated that it deliberate to dramatically enhance its spending on AI infrastructure this yr to between $115 billion and $135 billion, up from $72.2 billion final yr.

