NVIDIA unveils Rubin CPX for ultra-long-context processing; Jensen Huang says it can reason across millions of tokens at once

Wallstreetcn
2025.09.09 15:18

Rubin CPX is aimed at AI video generation and software development workloads. It delivers 30 petaflops of compute and accelerates attention 3x relative to the GB300 NVL72 system, and it is expected to launch by the end of 2026. Jensen Huang said Rubin CPX is the first CUDA GPU built specifically for massive-context AI, capable of reasoning across millions of tokens of knowledge at once. NVIDIA claims that deploying $100 million in the new chip hardware can generate up to $5 billion in revenue for customers.