The ultimate test scores have reached a new high, Google Gemini 3 has undergone a major upgrade in its deep thinking model, targeting scientific research and engineering applications

Wallstreetcn
2026.02.12 19:11
portai
I'm PortAI, I can summarize articles.

Without the aid of tools, the model achieved a 48.4% accuracy rate on the "Human's Last Exam" (HLE) benchmark test and scored 84.6% on the ARC-AGI-2 test; the written portions of the 2025 International Physics Olympiad and Chemistry Olympiad both reached gold medal level. Google stated that the new model is driving discoveries and helping researchers solve "intractable" problems—from identifying flaws in research papers to optimizing semiconductor crystal growth