Alibaba Cloud Reduces GPU Load for AI Services by Nearly Five Times

13:16, 22.10.2025

Alibaba Cloud has concluded that AI services often use computing resources inefficiently. Many AI models consume computing power unevenly, which increases the load on GPUs. This inefficient allocation hinders scaling and raises the cost of AI infrastructure.

Aegaeon: Resource Redistribution

In response, Alibaba Cloud introduced Aegaeon, a system that dynamically redistributes GPU resources among models. With it, GPUs can switch between models in real time, even in the middle of generating a response.
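To make the idea of switching between models during response generation more concrete, here is a minimal toy sketch in Python. It is not Aegaeon's actual implementation: the `Request` class, the `serve_on_one_gpu` function, the model names, and the round-robin policy are all assumptions chosen for illustration. The sketch simulates one GPU that produces a single token for one request, then yields to the next request, which may belong to a different model.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Request:
    model: str          # hypothetical model name
    tokens_needed: int  # number of tokens the response should contain
    tokens: list = field(default_factory=list)


def serve_on_one_gpu(requests):
    """Round-robin across requests, generating one token per turn.

    Returns how many times the simulated GPU changed the active model.
    """
    queue = deque(requests)
    loaded_model = None  # model currently active on the simulated GPU
    switches = 0
    while queue:
        req = queue.popleft()
        if req.model != loaded_model:
            loaded_model = req.model  # stand-in for a real model switch
            switches += 1
        # Stand-in for a single decoding step of the active model.
        req.tokens.append(f"{req.model}-tok{len(req.tokens)}")
        if len(req.tokens) < req.tokens_needed:
            queue.append(req)  # unfinished: give up the GPU and rejoin the queue
    return switches


requests = [Request("model_a", 3), Request("model_b", 2), Request("model_c", 1)]
switches = serve_on_one_gpu(requests)
print(switches, [len(r.tokens) for r in requests])
```

In this toy version a switch costs nothing. In a real system every switch has a cost, so the benefit of fine-grained sharing depends on how cheap switching can be made.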

Why Aegaeon is Profitable

The new system allows one GPU to serve up to seven models at once, compared with two or three previously. With Aegaeon, task-switching delays fell by 97%. The system is already in use on Alibaba Cloud's Bailian marketplace. It is an important step toward cheaper and more sustainable AI infrastructure.
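A rough back-of-the-envelope calculation shows what these figures could mean in practice. The fleet of 42 models and the 10-second baseline switching delay below are hypothetical numbers picked for illustration, not figures from Alibaba Cloud. The calculation also assumes that every GPU can be packed to the stated maximum, which real workloads may not allow.

```python
import math


def gpus_needed(num_models, models_per_gpu):
    """GPUs required if each GPU hosts at most `models_per_gpu` models."""
    return math.ceil(num_models / models_per_gpu)


NUM_MODELS = 42  # hypothetical number of models to serve

before_low_density = gpus_needed(NUM_MODELS, 2)   # 21 GPUs at two models per GPU
before_high_density = gpus_needed(NUM_MODELS, 3)  # 14 GPUs at three models per GPU
after = gpus_needed(NUM_MODELS, 7)                # 6 GPUs at seven models per GPU

OLD_SWITCH_DELAY_S = 10.0  # hypothetical baseline switching delay, in seconds
new_switch_delay_s = OLD_SWITCH_DELAY_S * (1 - 0.97)  # a 97% reduction leaves 3%

print(before_low_density, before_high_density, after)
print(round(new_switch_delay_s, 2))
```

Under these assumptions the same set of models fits on 6 GPUs instead of 14 to 21, and a switch that once took 10 seconds would take about 0.3 seconds.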
