If it feels lately as if everything in technology is about AI, that's because it is. And nowhere is that more true than in the market for computer memory. Demand, and profitability, for the kind of DRAM used to feed GPUs and other accelerators in AI data centers is so enormous that it is diverting the supply of memory away from other uses and causing prices to skyrocket. According to Counterpoint Research, DRAM prices have risen 80 to 90 percent so far this quarter.
The biggest AI hardware companies say they've secured their chips out as far as 2028, but that leaves everyone else—makers of PCs, consumer gadgets, and everything else that needs to briefly store a billion bits—scrambling to cope with scarce supply and inflated prices.
How did the electronics industry get into this mess, and more importantly, how will it get out? IEEE Spectrum asked economists and memory experts to explain. They say today's situation is the result of a collision between the DRAM industry's historic boom-and-bust cycle and an AI hardware infrastructure build-out that is without precedent in its scale. And, barring some major collapse in the AI sector, it will take years for new capacity and new technology to bring supply in line with demand. Prices might stay high even then.
To understand both ends of the story, you need to know the main culprit in the supply-and-demand swing: high-bandwidth memory, or HBM.
What’s HBM?
HBM is the DRAM industry's attempt to short-circuit the slowing pace of Moore's Law by using 3D chip packaging technology. Each HBM chip is made up of as many as 12 thinned-down DRAM chips called dies. Each die contains many vertical connections called through-silicon vias (TSVs). The dies are piled atop one another and linked by arrays of microscopic solder balls aligned to the TSVs. This DRAM tower—well, at about 750 micrometers thick, it's more of a brutalist office block than a tower—is then stacked atop what's called the base die, which shuttles bits between the memory dies and the processor.
This complex piece of technology is then set within a millimeter of a GPU or other AI accelerator, to which it's linked by as many as 2,048 micrometer-scale connections. HBMs are attached on two sides of the processor, and the GPU and memory are packaged together as a single unit.
The idea behind such a tight, highly connected squeeze with the GPU is to knock down what's called the memory wall. That's the barrier in energy and time of bringing the terabytes per second of data needed to run large language models into the GPU. Memory bandwidth is a key limiter to how fast LLMs can run.
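To see why the bandwidth numbers get so large, here is a rough back-of-the-envelope sketch (not from the article; the model size, weight precision, and token rate are assumptions chosen purely for illustration): generating each token means streaming roughly the whole set of model weights from memory into the GPU.

```python
# Back-of-the-envelope sketch of the "memory wall" for LLM inference.
# All numbers here are illustrative assumptions, not figures from the article.

def required_bandwidth_gb_per_s(params_billion, bytes_per_param, tokens_per_second):
    """Each generated token requires streaming roughly the whole set of model
    weights through the GPU once, so bandwidth ~ model size x token rate."""
    model_bytes = params_billion * 1e9 * bytes_per_param
    return model_bytes * tokens_per_second / 1e9

# Hypothetical example: a 70-billion-parameter model stored as 8-bit weights,
# generating 50 tokens per second.
demand = required_bandwidth_gb_per_s(params_billion=70, bytes_per_param=1, tokens_per_second=50)
print(f"~{demand:,.0f} GB/s of memory bandwidth needed")  # prints ~3,500 GB/s
```

Even at those assumed numbers, a single stream of output already calls for a few terabytes per second, which is why the memory sits so close to the GPU.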
As a technology, HBM has been around for more than 10 years, and DRAM makers have been busy boosting its capabilities.
As the size of AI models has grown, so has HBM's importance to the GPU. But that has come at a cost. SemiAnalysis estimates that HBM typically costs three times as much as other kinds of memory and constitutes 50 percent or more of the cost of the packaged GPU.
Origins of the memory chip shortage
Memory and storage industry watchers agree that DRAM is a highly cyclical industry with enormous booms and devastating busts. With new fabs costing US $15 billion or more, companies are extremely reluctant to expand and may only have the cash to do so during boom times, explains Thomas Coughlin, a storage and memory expert and president of Coughlin Associates. But building such a fab and getting it up and running can take 18 months or more, virtually ensuring that new capacity arrives well past the initial surge in demand, flooding the market and depressing prices.
The origins of today's cycle, says Coughlin, go all the way back to the chip supply panic surrounding the COVID-19 pandemic. To avoid supply-chain stumbles and support the rapid shift to remote work, hyperscalers—data center giants like Amazon, Google, and Microsoft—bought up huge inventories of memory and storage, boosting prices, he notes.
But then supply became more plentiful and data center expansion fell off in 2022, causing memory and storage prices to plummet. This downturn continued into 2023, and even resulted in big memory and storage companies such as Samsung cutting production by 50 percent to try to keep prices from falling below the cost of manufacturing, says Coughlin. It was a rare and fairly desperate move, because companies typically want to run plants at full capacity just to earn back their value.
After a recovery began in late 2023, "all of the memory and storage companies were very wary of increasing their manufacturing capacity again," says Coughlin. "Thus there was very little investment in new manufacturing capacity in 2024 and through most of 2025."
The AI data center boom
That lack of new investment is colliding headlong with an enormous increase in demand from new data centers. Globally, there are nearly 2,000 new data centers either planned or under construction right now, according to Data Center Map. If they're all built, it would represent a 20 percent jump in the global supply, which stands at around 9,000 facilities now.
If the current build-out continues at pace, McKinsey predicts companies will spend $7 trillion by 2030, with the bulk of that—$5.2 trillion—going to AI-focused data centers. Of that chunk, $3.3 trillion will go toward servers, data storage, and network equipment, the firm predicts.
The biggest beneficiary so far of the AI data center boom is certainly GPU maker Nvidia. Revenue for its data center business went from barely a billion dollars in the last quarter of 2019 to $51 billion in the quarter that ended in October 2025. Over this period, its server GPUs have demanded not just more and more gigabytes of DRAM but an increasing number of DRAM chips. The recently launched B300 uses eight HBM chips, each of which is a stack of 12 DRAM dies. Rivals' use of HBM has largely mirrored Nvidia's. AMD's MI350 GPU, for example, also uses eight 12-die chips.
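A quick bit of arithmetic puts those stack counts in perspective. The per-die capacity below is an assumed value, roughly in line with current-generation HBM3E parts, and is not a figure from the article:

```python
# Rough per-GPU DRAM accounting based on the stack counts in the article.
# The per-die capacity is an assumed value, not a figure from the article.

hbm_stacks_per_gpu = 8   # e.g., Nvidia's B300 or AMD's MI350, per the article
dies_per_stack = 12      # 12-high DRAM stacks
gb_per_die = 3           # assumption: 24-gigabit (3 GB) dies

total_dies = hbm_stacks_per_gpu * dies_per_stack
total_capacity_gb = total_dies * gb_per_die

print(f"{total_dies} DRAM dies per GPU, roughly {total_capacity_gb} GB of HBM")
# -> 96 DRAM dies per GPU, roughly 288 GB of HBM
```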
With so much demand, an increasing fraction of the revenue for DRAM makers comes from HBM. Micron—the number three producer behind SK Hynix and Samsung—reported that HBM and other cloud-related memory went from being 17 percent of its DRAM revenue in 2023 to nearly 50 percent in 2025.
Micron predicts the total market for HBM will grow from $35 billion in 2025 to $100 billion by 2028—a figure larger than the entire DRAM market in 2024, CEO Sanjay Mehrotra told analysts in December. It's reaching that figure two years sooner than Micron had previously expected. Across the industry, demand will outstrip supply "considerably… for the foreseeable future," he said.
Future DRAM supply and technology
"There are two ways to address supply issues with DRAM: with innovation or with building more fabs," explains Mina Kim, an economist with Mkecon Insights. "As DRAM scaling has become more difficult, the industry has turned to advanced packaging… which is just using more DRAM."
Micron, Samsung, and SK Hynix combined make up the overwhelming majority of the memory and storage markets, and all three have new fabs and facilities in the works. However, these are unlikely to contribute meaningfully to bringing down prices anytime soon.
Micron is in the process of building an HBM fab in Singapore that should be in production in 2027. And it's retooling a fab it bought from PSMC in Taiwan that will begin production in the second half of 2027. Last month, Micron broke ground on what will be a DRAM fab complex in Onondaga County, N.Y. It won't be in full production until 2030.
Samsung plans to begin production at a new plant in Pyeongtaek, South Korea, in 2028.
SK Hynix is building HBM and packaging facilities in West Lafayette, Indiana, set to begin production by the end of 2028, and an HBM fab it's building in Cheongju should be complete in 2027.
Speaking about his sense of the DRAM market, Intel CEO Lip-Bu Tan told attendees at the Cisco AI Summit last week: "There's no relief until 2028."
With these expansions unable to contribute for several years, other factors will be needed to increase supply. "Relief will come from a combination of incremental capacity expansions by existing DRAM leaders, yield improvements in advanced packaging, and a broader diversification of supply chains," says Shawn DuBravac, chief economist for the Global Electronics Association (formerly the IPC). "New fabs will help on the margin, but the faster gains will come from process learning, better [DRAM] stacking efficiency, and tighter coordination between memory suppliers and AI chip designers."
So, will prices come down once some of these new plants come on line? Don't bet on it. "In general, economists find that prices come down much more slowly and reluctantly than they go up. DRAM today is unlikely to be an exception to this general observation, especially given the insatiable demand for compute," says Kim.
In the meantime, technologies are in the works that could make HBM an even bigger consumer of silicon. The standard for HBM4 can accommodate 16 stacked DRAM dies, even though today's chips use only 12. Getting to 16 has a lot to do with the chip-stacking technology. Conducting heat through the HBM "layer cake" of silicon, solder, and support material is a key limiter both to going higher and to repositioning HBM within the package to get even more bandwidth.
SK Hynix claims a heat-conduction advantage through a manufacturing process called advanced MR-MUF (mass reflow molded underfill). Further out, an alternative chip-stacking technology called hybrid bonding could help heat conduction by reducing the die-to-die vertical distance essentially to zero. In 2024, researchers at Samsung showed they could produce a 16-high stack with hybrid bonding, and they suggested that 20 dies was not out of reach.
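A toy model helps show why those bonding layers matter so much for heat. Treating the stack as a one-dimensional series of conduction resistances, the low-conductivity solder and underfill layers dominate the total, so removing them, as hybrid bonding effectively does, sharply lowers the stack's thermal resistance. Every thickness, conductivity, and area below is an assumed value for illustration only, not a figure from the article or from any vendor.

```python
# Toy 1D series-resistance model of heat conduction through an HBM die stack.
# All material properties and dimensions are assumptions for illustration.

def stack_thermal_resistance(n_dies,
                             die_thickness_um=50.0,
                             bond_thickness_um=20.0,
                             k_silicon=150.0,   # W/(m*K), thinned silicon die
                             k_bond=2.0,        # W/(m*K), solder/underfill layer
                             area_mm2=121.0):   # assumed ~11 mm x 11 mm footprint
    """Sum the conduction resistances t / (k * A) of each die and each
    die-to-die bonding layer, returning kelvins per watt."""
    area_m2 = area_mm2 * 1e-6
    r_die = (die_thickness_um * 1e-6) / (k_silicon * area_m2)
    r_bond = (bond_thickness_um * 1e-6) / (k_bond * area_m2)
    return n_dies * r_die + (n_dies - 1) * r_bond

# Conventional microbump-bonded stacks vs. a hypothetical hybrid-bonded stack
# in which the bonding layer is effectively eliminated.
for dies in (12, 16):
    conventional = stack_thermal_resistance(dies)
    hybrid = stack_thermal_resistance(dies, bond_thickness_um=0.0)
    print(f"{dies}-high stack: ~{conventional:.2f} K/W conventional, "
          f"~{hybrid:.2f} K/W hybrid-bonded")
```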