Record performance: 17,000 tokens per second with a new solution from startup Taalas

watch 1m, 30s
views 2

14:03, 20.02.2026

Canadian startup Taalas recently announced its first product, the HC1 chip with Llama 3.1 8B. The company's approach is unique in that the model is not loaded into memory but is directly soldered into the silicon during the manufacturing stage. As a result, it is possible to achieve a record result of 17,000 tokens per second per user. This result is almost 10 times faster than GPU solutions, and also achieves significant energy savings and minimizes production costs.

About Taalas

The startup was founded by Ljubiša Bajić, former director of integrated circuit design at AMD, his wife Leila Bajić (former technology manager and engineer at AMD, ATI, Altera), and Drago Ignjatović (former director of ASIC design at AMD).

The company's main approach can be described as total specification. The company plans to produce a separate chip for each model. The microchip will consist of approximately 100 layers, and only the top two will be customized as needed, with mask ROM recall fabric embedded in them. This will make it possible to produce a chip in two months instead of six. Computing and memory will also be combined on a single crystal.

At this stage, such aggressive quantization reduces quality when compared to GPU benchmarks. The startup acknowledges this fact, which is why it positions the product as a beta service. The chip's minimum flexibility is preserved due to the possibility of retraining via LaRA adapters and the presence of a context window.

The company has raised $200 million in investments and plans to release a new mid-sized chip soon, with the launch of an advanced LLM on the HC2 platform possible towards the end of the year.

Hope you found this article helpful - what do you think? Like and subscribe to our blog for more practical insights and the latest tech news from HostZealot.

Share

Was this article helpful to you?

VPS popular offers

-16.3%

CPU
CPU
4 Xeon Cores
RAM
RAM
2 GB
Space
Space
30 GB SSD
Bandwidth
Bandwidth
40 Mbps
DDoS Protected SSD-KVM 2048 Linux

48 /mo

/mo

Billed annually

-9.3%

CPU
CPU
6 Epyc Cores
RAM
RAM
16 GB
Space
Space
150 GB NVMe
Bandwidth
Bandwidth
Unlimited
wKVM-NVMe 16384 Windows

54.49 /mo

/mo

Billed annually

-10%

CPU
CPU
6 Xeon Cores
RAM
RAM
16 GB
Space
Space
400 GB HDD
Bandwidth
Bandwidth
Unlimited
KVM-HDD 16384 Linux

50 /mo

/mo

Billed annually

-10.2%

CPU
CPU
6 Xeon Cores
RAM
RAM
16 GB
Space
Space
150 GB SSD
Bandwidth
Bandwidth
100 Mbps
DDoS Protected SSD-KVM 16384 Linux

123 /mo

/mo

Billed semiannually

-4.5%

CPU
CPU
4 Xeon Cores
RAM
RAM
4 GB
Space
Space
100 GB HDD
Bandwidth
Bandwidth
300 Gb
wKVM-HDD HK 4096 Windows

16.83 /mo

/mo

Billed annually

-10%

CPU
CPU
8 Epyc Cores
RAM
RAM
32 GB
Space
Space
200 GB NVMe
Bandwidth
Bandwidth
Unlimited
Keitaro KVM 32768
OS
CentOS
Software
Software
Keitaro

77.54 /mo

/mo

Billed annually

-21.5%

CPU
CPU
2 Xeon Cores
RAM
RAM
4 GB
Space
Space
100 GB SSD
Bandwidth
Bandwidth
300 GB
wKVM-SSD 4096 HK Windows

40 /mo

/mo

Billed annually

-7.9%

CPU
CPU
6 Xeon Cores
RAM
RAM
8 GB
Space
Space
200 GB HDD
Bandwidth
Bandwidth
300 Gb
wKVM-HDD HK 8192 Windows

25.69 /mo

/mo

Billed annually

-5.6%

CPU
CPU
4 Xeon Cores
RAM
RAM
2 GB
Space
Space
60 GB HDD
Bandwidth
Bandwidth
Unlimited
wKVM-HDD 2048 Windows

13.7 /mo

/mo

Billed annually

-10%

CPU
CPU
6 Epyc Cores
RAM
RAM
8 GB
Space
Space
100 GB NVMe
Bandwidth
Bandwidth
Unlimited
aiKVM-NVMe 8192 Linux

26.62 /mo

/mo

Billed annually

Other articles on this topic

cookie

Accept cookies & privacy policy?

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we'll assume that you are happy to receive all cookies on the HostZealot website.