Grok 4.1 Fast benchmark results and details about the Agent Tools API

15:00, 24.11.2025

xAI has published benchmark results in which Grok 4.1 Fast leads competing models. The developers have also shared more details about the Agent Tools API infrastructure.

Grok 4.1 Fast takes the leading position

According to updated benchmark figures from xAI, Grok 4.1 Fast leads in the following categories:

τ²-bench Telecom: the new model achieves the maximum possible score of 100%, significantly outperforming Claude Sonnet 4.5, GPT-5.1, and Grok 4.
Berkeley Function Calling v4: the new model scores 72% on accuracy at low cost, which also puts it in the lead.
Long context: the model maintains stable quality across a context window of up to 2 million tokens. On this benchmark it scores 67%, while Grok 4 scores only 22%.

Agent Tools API – tools for autonomous agents

The Agent Tools API is a set of server-side tools that give agents access to external operations and real data.

With the API, an agent can:

Chain multiple tools together automatically.
Run intelligent search over an uploaded document.
Connect to external MCP servers.
Search X data in real time.
Run Python code in a secure environment.

A distinctive feature of these tools is that they run entirely on xAI infrastructure, so there is no need to manage environments, keys, or limits. Grok decides on its own which steps are needed and invokes the appropriate tool, and calls can also run in parallel.
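
To make the ideas of chaining and parallel calls more concrete, here is a minimal local sketch. It does not call xAI or reproduce the Agent Tools API: the functions `search_x`, `run_python`, `dispatch`, and `chain` are hypothetical stand-ins invented for illustration, and the real tools run server-side.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for server-side tools; none of these are xAI functions.
def search_x(query: str) -> str:
    return f"posts matching '{query}'"

def run_python(source: str) -> str:
    # A real sandbox would execute the code; this stub only reports its size.
    return f"executed {len(source)} characters of code"

TOOLS = {"search_x": search_x, "run_python": run_python}

def dispatch(calls):
    """Run independent tool calls in parallel; results keep request order."""
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(TOOLS[name], arg) for name, arg in calls]
        return [future.result() for future in futures]

def chain(query: str) -> str:
    """Feed one tool's output into the next, as a chained call would."""
    found = search_x(query)
    return run_python(f"summarize({found!r})")

results = dispatch([("search_x", "Grok 4.1 Fast"), ("run_python", "print(1 + 1)")])
chained = chain("Grok 4.1 Fast")
```

In this toy version the caller decides which tools to run; according to the article, with the Agent Tools API that decision is made by Grok itself.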

Two model variants are currently available: a reasoning variant for tasks that require deep analysis and a non-reasoning variant for instant responses. The Agent Tools API is free of charge for xAI users, and the model will be free until December 3.
