Alibaba DeepSeek moment! Open-source new architecture model: inference 10 times faster, cost reduced by 90%

Wallstreetcn
2025.09.12 00:15
portai
I'm PortAI, I can summarize articles.

Alibaba open-sourced a new architecture model Qwen3-Next-80B-A3B this morning, which adopts a hybrid attention mechanism and high sparsity MoE, reducing training costs by 90% compared to Qwen3-32B and improving inference efficiency by 10 times. This model performs excellently in handling ultra-long texts, with performance comparable to Alibaba's flagship model Qwen3-235B, and surpassing Google's Gemini-2.5-Flash, becoming one of the low-energy consumption open-source models. Netizens have praised its architecture, considering its design outstanding