New Qwen2.5-Max exceeds DeepSeek capabilities

watch 1m, 10s
views 2

11:54, 31.01.2025

After the releases of Qwen2.5, Qwen2.5-VL, a new version of Qwen2.5-Max has become available. The new version of Qwen shows top performance over the DeepSeek V3 in the following benchmarks - GPQA-Diamond, Arena-Hard, LiveCodeBench, and LiveBench.

Architecture and Model Features

The Max version is a fairly large-scale project of the Mixture of Experts model. The uniqueness of this particular model was in training on real user feedback (RLHF), using Supervised-Fine-Tuning, and of course training on 20 trillion tokens.

At the moment, the data for the new version has not yet been posted on GitHub, only access to the API and Qwen Chat is available for now. There's a good chance that the lack of data on HuggingFace and GitHub indicates a rush to unveil the new project or a planned promotion by the company to incentivize the adoption of their cloud platform.

Qwen has published results regarding the new model. According to the open data table of the new Qwen version compared to LLaMA3.1 and DeepSeek-V3, the Max version outperforms its competitors in most characteristics. When compared to Claude Sonnet and GPT, the Max version loses to GPT.

The company has invested a significant budget in training data, and the superiority over competitors exists, but it is relatively insignificant. Because of this, some experts have the theory that it is possible to extend the capabilities of language models by using computing power during testing. 

Share

Was this article helpful to you?

VPS popular offers

-10%

CPU
CPU
6 Xeon Cores
RAM
RAM
8 GB
Space
Space
200 GB HDD
Bandwidth
Bandwidth
300 Gb
KVM-HDD HK 8192 Linux

20.58 /mo

/mo

Billed annually

-26.7%

CPU
CPU
3 Xeon Cores
RAM
RAM
1 GB
Space
Space
20 GB SSD
Bandwidth
Bandwidth
1 TB
KVM-SSD 1024 Metered Linux

10 /mo

/mo

Billed annually

-20.6%

CPU
CPU
6 Xeon Cores
RAM
RAM
8GB
Space
Space
100GB SSD
Bandwidth
Bandwidth
500GB
KVM-SSD 8192 HK Linux

59 /mo

/mo

Billed annually

-10%

CPU
CPU
8 Xeon Cores
RAM
RAM
32 GB
Space
Space
200 GB SSD
Bandwidth
Bandwidth
12 TB
KVM-SSD 32768 Metered Linux

150 /mo

/mo

Billed annually

-10%

CPU
CPU
3 Epyc Cores
RAM
RAM
2 GB
Space
Space
20 GB NVMe
Bandwidth
Bandwidth
Unlimited
KVM-NVMe 2048 Linux

14.9 /mo

/mo

Billed annually

-20.2%

CPU
CPU
1 Xeon Core
RAM
RAM
1 GB
Space
Space
50 GB SSD
Bandwidth
Bandwidth
300 GB
wKVM-SSD 1024 HK Windows

19 /mo

/mo

Billed annually

-10%

CPU
CPU
4 Epyc Cores
RAM
RAM
4 GB
Space
Space
50 GB NVMe
Bandwidth
Bandwidth
Unlimited
KVM-NVMe 4096 Linux

25.9 /mo

/mo

Billed annually

-15.4%

CPU
CPU
4 Xeon Cores
RAM
RAM
4 GB
Space
Space
100 GB SSD
Bandwidth
Bandwidth
60 Mbps
DDoS Protected SSD-wKVM 4096 Windows

73 /mo

/mo

Billed annually

-10%

CPU
CPU
4 Xeon Cores
RAM
RAM
4 GB
Space
Space
100 GB HDD
Bandwidth
Bandwidth
Unlimited
KVM-HDD 4096 Linux

15 /mo

/mo

Billed annually

-24.7%

CPU
CPU
4 Xeon Cores
RAM
RAM
4 GB
Space
Space
50 GB SSD
Bandwidth
Bandwidth
4 TB
KVM-SSD 4096 Metered Linux

31 /mo

/mo

Billed annually

Other articles on this topic

cookie

Accept cookies & privacy policy?

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we'll assume that you are happy to receive all cookies on the HostZealot website.