Xiaomi's latest large model achievement! Luo Fuli has appeared

Wallstreetcn
2025.10.17 06:00
portai
I'm PortAI, I can summarize articles.

The Xiaomi AI team, in collaboration with Peking University, has released a paper on MoE and reinforcement learning, with Luo Fuli as the corresponding author. The paper proposes an approach to improve the efficiency and stability of large model reinforcement learning within the MoE architecture, addressing instability issues during the training process. The research indicates that reinforcement learning is crucial for driving breakthroughs in large model capabilities, especially when pre-training encounters bottlenecks