Nvidia announces “Rubin Ultra” and “Feynman” AI chips for 2027 and 2028

On Tuesday at Nvidia’s GTC 2025 convention in San Jose, California, CEO Jensen Huang revealed a number of new AI-accelerating GPUs the corporate plans to launch over the approaching months and years. He additionally revealed extra specs about beforehand introduced chips.

The centerpiece announcement was Vera Rubin, first teased at Computex 2024 and now scheduled for launch within the second half of 2026. This GPU, named after a famous astronomer, will characteristic 288 gigabytes of reminiscence and comes with a customized Nvidia-designed CPU referred to as Vera.

In accordance with Nvidia, Vera Rubin will ship vital efficiency enhancements over its predecessor, Grace Blackwell, notably for AI coaching and inference.

Specifications for Vera Rubin, presented by Jensen Huang during his GTC 2025 keynote. — Specs for Vera Rubin, introduced by Jensen Huang throughout his GTC 2025 keynote.

Vera Rubin options two GPUs collectively on one die that ship 50 petaflops of FP4 inference efficiency per chip. When configured in a full NVL144 rack, the system delivers 3.6 exaflops of FP4 inference compute—3.3 occasions greater than Blackwell Extremely’s 1.1 exaflops in an analogous rack configuration.

The Vera CPU options 88 customized ARM cores with 176 threads linked to Rubin GPUs by way of a high-speed 1.8 TB/s NVLink interface.

Huang additionally introduced Rubin Extremely, which is able to comply with within the second half of 2027. Rubin Extremely will use the NVL576 rack configuration and have particular person GPUs with 4 reticle-sized dies, delivering 100 petaflops of FP4 precision (a 4-bit floating-point format used for representing and processing numbers inside AI fashions) per chip.

On the rack stage, Rubin Extremely will present 15 exaflops of FP4 inference compute and 5 exaflops of FP8 coaching efficiency—about 4 occasions extra highly effective than the Rubin NVL144 configuration. Every Rubin Extremely GPU will embody 1TB of HBM4e reminiscence, with the entire rack containing 365TB of quick reminiscence.

Source link

Nvidia announces “Rubin Ultra” and “Feynman” AI chips for 2027 and 2028

Microsoft’s new “passwordless by default” is great but comes at a cost

Time saved by AI offset by new work created, study suggests

iOS and Android juice jacking defenses have been trivial to bypass for years

New Android spyware is targeting Russian military personnel on the front lines

Annoyed ChatGPT users complain about bot’s relentlessly positive tone

What could possibly go wrong? DOGE to rapidly rebuild Social Security codebase.

FEMA Is Ending Door-to-Door Canvassing in Disaster Areas

Microsoft’s new “passwordless by default” is great but comes at a cost

Today’s NYT Mini Crossword Answers for May 5

Voters Approve Incorporation of SpaceX Hub as Starbase, Texas

Featured Picks

My Monthly Student Loan Payment Could Jump From $0 to $488. Here’s How I’m Preparing

DeepTech startup Astral Systems raises €5.3 million to commercialise fusion technology

Yahoo Is Still Here—and It Has Big Plans for AI

Nvidia announces “Rubin Ultra” and “Feynman” AI chips for 2027 and 2028

Related Posts