Verkor Launches Industry's First TurboQuant LLM Inference Accelerator Silicon IP

Unusual Whales

2026.05.19 14:06

Verkor has introduced VerTQ, a new accelerator chip that incorporates Google's TurboQuant algorithm. The chip reduces the KV cache memory usage of large language models by a factor of 4.3 while maintaining or even improving the performance of the models it supports. VerTQ was developed autonomously by Conductor 2.0, Verkor's secondary project.