Nvidia Is Preparing a New Generation of GPUs to Support Millions of Contexts

watch 1m, 17s
views 2

13:36, 10.09.2025

Article Content
arrow

  • Disaggregated Inference Architecture
  • A Breakthrough for Business and Science
  • Focus on Inference, not Training
  • Market Launch

Nvidia has unveiled the Rubin CPX graphics processor, designed specifically for language and multimodal models that need to store and analyze huge amounts of data. The chip is optimized to process contexts of over 1 million tokens,  a figure that far exceeds the capabilities of current systems.

Disaggregated Inference Architecture

The key innovation of Rubin CPX is the use of disaggregated inference architecture. With this approach, multiple GPUs process different parts of the task and then combine the results into a single answer. This increases speed, reduces latency, and makes resource usage more efficient. This is especially useful for document analysis, multimedia content generation, and working with large code projects.

A Breakthrough for Business and Science

Nvidia notes that Rubin CPX opens up new horizons for lawyers, doctors, and developers. In law, it will help work with hundreds of pages of laws; in medicine, it will help compare large arrays of patient data; and in IT, it will help analyze entire projects instead of individual files. In the creative field, the GPU will allow you to generate long videos and complex multimedia projects.

Focus on Inference, not Training

Unlike traditional solutions, Rubin CPX is primarily aimed at optimizing inference, accelerating the performance of existing models. This makes it attractive to companies that want to implement AI into their real-world business faster while reducing costs.

Market Launch

Rubin CPX is expected to hit the market in late 2026. Experts suggest that this processor could set a new standard for the industry, where working with long contexts will no longer be a rarity but the norm.

Share

Was this article helpful to you?

VPS popular offers

-10%

CPU
CPU
10 Epyc Cores
RAM
RAM
64 GB
Space
Space
400 GB NVMe
Bandwidth
Bandwidth
Unlimited
KVM-NVMe 65536 Linux

135.49 /mo

/mo

Billed annually

-12.3%

CPU
CPU
6 Xeon Cores
RAM
RAM
16 GB
Space
Space
150 GB SSD
Bandwidth
Bandwidth
Unlimited
10Ge-wKVM-SSD 16384 Windows

237 /mo

/mo

Billed annually

-12.8%

CPU
CPU
3 Xeon Cores
RAM
RAM
1 GB
Space
Space
50 GB SSD
Bandwidth
Bandwidth
1 TB
wKVM-SSD 1024 Metered Windows

17 /mo

/mo

Billed annually

-7.9%

CPU
CPU
6 Xeon Cores
RAM
RAM
8 GB
Space
Space
200 GB HDD
Bandwidth
Bandwidth
300 Gb
wKVM-HDD HK 8192 Windows

25.95 /mo

/mo

Billed annually

-10%

CPU
CPU
4 Xeon Cores
RAM
RAM
8 GB
Space
Space
100 GB SSD
Bandwidth
Bandwidth
Unlimited
10Ge-KVM-SSD 8192 Linux

115.5 /mo

/mo

Billed annually

-10%

CPU
CPU
4 Xeon Cores
RAM
RAM
2 GB
Space
Space
75 GB SSD
Bandwidth
Bandwidth
Unlimited
wKVM-SSD 2048 Windows

10.23 /mo

/mo

Billed annually

-10%

CPU
CPU
6 Epyc Cores
RAM
RAM
8 GB
Space
Space
100 GB NVMe
Bandwidth
Bandwidth
Unlimited
KVM-NVMe 8192 Linux

26.35 /mo

/mo

Billed annually

-10%

CPU
CPU
4 Xeon Cores
RAM
RAM
2 GB
Space
Space
30 GB SSD
Bandwidth
Bandwidth
Unlimited
10Ge-KVM-SSD 2048 Linux

30.3 /mo

/mo

Billed annually

-16.2%

CPU
CPU
4 Xeon Cores
RAM
RAM
4 GB
Space
Space
50 GB SSD
Bandwidth
Bandwidth
60 Mbps
DDoS Protected SSD-KVM 4096 Linux

67 /mo

/mo

Billed annually

-10%

CPU
CPU
10 Epyc Cores
RAM
RAM
64GB
Space
Space
400 GB NVMe
Bandwidth
Bandwidth
Unlimited
Keitaro KVM 65536
OS
CentOS
Software
Software
Keitaro

149.04 /mo

/mo

Billed annually

Other articles on this topic

cookie

Accept cookies & privacy policy?

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we'll assume that you are happy to receive all cookies on the HostZealot website.