Xiaomi launches its first inference open-source large model Mimo! With 7 billion parameters, it surpasses OpenAI o1-mini and Alibaba QwQ-32B-Preview

Wallstreetcn
2025.04.30 05:59
portai
I'm PortAI, I can summarize articles.

Under the same reinforcement learning (RL) training data conditions, MiMo-7B demonstrates a significantly superior reinforcement learning potential in mathematics and coding compared to other widely used models in the industry, including well-known RL starter models such as DeepSeek-R1-Distill-7B and Qwen2.5-32B