NVIDIA Fixes Blackwell: A Swift Response to the GPU Issue

watch 1m, 10s
views 2

12:57, 24.10.2024

Article Content
arrow

  • Chip Improvements and TSMC’s Role
  • Mass Production of the Updated Chips

NVIDIA CEO Jensen Huang acknowledged a design flaw in the Blackwell series GPU, which led to delays in the supply of AI chips. The issue involved a functional defect that resulted in a low yield of working chips. According to Huang, the fault was entirely on NVIDIA and not their manufacturing partner TSMC, as some sources had suggested. He emphasized that TSMC was not only uninvolved in the problem but also played an active role in helping to fix it.

Chip Improvements and TSMC’s Role

The issue was resolved by modifying the upper metal layers and silicon bumps in the GPU, which enhanced performance. The fix required significant efforts, given the need to simultaneously manufacture seven different types of chips from scratch. The main challenges were associated with the CoWoS-L packaging technology, which uses LSI silicon bridges, the RDL interposer, and GPU chiplets. Problems arose due to thermal expansion of the components, causing system deformation. Such fixes typically take around 10 cycles, but NVIDIA and TSMC managed to resolve the issue in record time.

Mass Production of the Updated Chips

The updated Blackwell B100 and B200 GPUs are set to enter mass production by the end of October, with shipments expected to begin early next year. While the production of the improved chips is ramping up, NVIDIA still anticipates some shortage of high-performance GPUs in 2024, particularly for major cloud providers such as AWS, Google, and Microsoft.

Share

Was this article helpful to you?

VPS popular offers

-10%

CPU
CPU
6 Epyc Cores
RAM
RAM
16 GB
Space
Space
150 GB NVMe
Bandwidth
Bandwidth
Unlimited
Keitaro KVM 16384
OS
CentOS
Software
Software
Keitaro

55.54 /mo

/mo

Billed annually

-10%

CPU
CPU
8 Epyc Cores
RAM
RAM
32 GB
Space
Space
200 GB NVMe
Bandwidth
Bandwidth
Unlimited
KVM-NVMe 32768 Linux

70.49 /mo

/mo

Billed annually

-20.5%

CPU
CPU
6 Xeon Cores
RAM
RAM
16 GB
Space
Space
150 GB SSD
Bandwidth
Bandwidth
10 TB
KVM-SSD 16384 Metered Linux

95 /mo

/mo

Billed annually

-10%

CPU
CPU
4 Xeon Cores
RAM
RAM
2 GB
Space
Space
75 GB SSD
Bandwidth
Bandwidth
Unlimited
wKVM-SSD 2048 Windows

10.23 /mo

/mo

Billed annually

-20.6%

CPU
CPU
6 Xeon Cores
RAM
RAM
8GB
Space
Space
100GB SSD
Bandwidth
Bandwidth
500GB
KVM-SSD 8192 HK Linux

59 /mo

/mo

Billed annually

-10.2%

CPU
CPU
6 Xeon Cores
RAM
RAM
16 GB
Space
Space
150 GB SSD
Bandwidth
Bandwidth
100 Mbps
DDoS Protected SSD-KVM 16384 Linux

123 /mo

/mo

Billed semiannually

-10%

CPU
CPU
4 Xeon Cores
RAM
RAM
4 GB
Space
Space
100 GB HDD
Bandwidth
Bandwidth
Unlimited
KVM-HDD 4096 Linux

15 /mo

/mo

Billed annually

-10%

CPU
CPU
3 Epyc Cores
RAM
RAM
2 GB
Space
Space
25 GB NVMe
Bandwidth
Bandwidth
Unlimited
wKVM-NVMe 2048 Windows

9.9 /mo

/mo

Billed annually

-10%

CPU
CPU
6 Xeon Cores
RAM
RAM
8 GB
Space
Space
200 GB HDD
Bandwidth
Bandwidth
300 Gb
KVM-HDD HK 8192 Linux

20.42 /mo

/mo

Billed annually

-10%

CPU
CPU
4 Xeon Cores
RAM
RAM
4 GB
Space
Space
50 GB SSD
Bandwidth
Bandwidth
Unlimited
KVM-SSD 4096 Linux

15.95 /mo

/mo

Billed annually

Other articles on this topic

cookie

Accept cookies & privacy policy?

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we'll assume that you are happy to receive all cookies on the HostZealot website.