As Will points out, there have been two recent wins for OpenAI in its efforts to build AI that outcompetes humans. Its models took second place at a top-level coding competition and, alongside those from Google DeepMind, achieved gold-medal-level results at the 2025 International Math Olympiad.
People who believe that AI doesn’t pose genuine competition to human-level intelligence might actually take some comfort in that. AI is good at the mathematical and analytical, which are on full display in olympiads and coding competitions. That doesn’t mean it’s any good at grappling with the messiness of human emotions, making hard decisions, or creating art that resonates with anyone.
But that distinction, between machine-like reasoning and the ability to think creatively, is not one OpenAI’s heads of research are inclined to make.
“We’re talking about programming and math here,” said Pachocki. “But it’s really about creativity, coming up with novel ideas, connecting ideas from different places.”
That’s why, the researchers say, these testing grounds for AI will produce models that have an increasing ability to reason like a person, one of the most important goals OpenAI is working toward. Reasoning models break problems down into more discrete steps, but even the best have limited ability to chain pieces of information together and approach problems logically.
OpenAI is throwing a huge amount of money and talent at that problem not because its researchers think it will result in higher scores at math contests, but because they believe it will allow their AI models to come closer to human intelligence.

