OpenAI says GPT-5 stacks up to humans in a wide range of jobs

TechCrunch
2025.09.25 16:17
portai
I'm PortAI, I can summarize articles.

OpenAI has introduced a new benchmark, GDPval, to evaluate its GPT-5 model against human professionals across various industries. The results indicate that GPT-5 performs comparably to industry experts in 40.6% of tasks, while Anthropic's Claude Opus 4.1 scored 49%. Although the benchmark currently covers limited tasks, OpenAI plans to expand it to better reflect real-world job functions. The progress shown in GDPval suggests that AI can assist professionals in focusing on more meaningful work, with expectations for continued improvement in AI capabilities.