
Cost plummets by 70%! Google's TPU is aggressively catching up, and its cost-performance ratio has matched NVIDIA

I'm PortAI, I can summarize articles.
Goldman Sachs stated that Google/Broadcom's TPU is rapidly narrowing the gap in inference costs with NVIDIA's GPU. The unit token inference cost has decreased by about 70% from TPU v6 to TPU v7, roughly on par with NVIDIA's GB200 NVL72. This does not mean that NVIDIA's position is shaken, but it clearly indicates that the core evaluation system of AI chip competition is shifting from "who computes faster" to "who computes cheaper and more sustainably."
Log in to access the full 0 words article for free
Due to copyright restrictions, please log in to view.
Thank you for supporting legitimate content.

