The reported $100 billion revenue threshold we mentioned earlier conflates commercial success with cognitive capability, as if a system’s ability to generate income says anything meaningful about whether it can “think,” “reason,” or “understand” the world like a human.
Depending on your definition, we may already have AGI, or it may be physically impossible to achieve. If you define AGI as “AI that performs better than most humans at most tasks,” then current language models arguably meet that bar for certain types of work (which tasks, which humans, what counts as “better”?), but agreement on whether that’s true is far from universal. This says nothing of the even murkier concept of “superintelligence,” another nebulous term for a hypothetical, god-like intellect so far beyond human cognition that, like AGI, it defies any solid definition or benchmark.
Given this definitional chaos, researchers have tried to create objective benchmarks to measure progress toward AGI, but these attempts have revealed their own set of problems.
Why benchmarks keep failing us
The search for better AGI benchmarks has produced some interesting alternatives to the Turing Test. The Abstraction and Reasoning Corpus (ARC-AGI), introduced in 2019 by François Chollet, tests whether AI systems can solve novel visual puzzles that require deep and novel analytical reasoning.
“Almost all current AI benchmarks can be solved purely through memorization,” Chollet told Freethink in August 2024. A major problem with AI benchmarks currently stems from data contamination: when test questions end up in training data, models can appear to perform well without actually “understanding” the underlying concepts. Large language models serve as master imitators, mimicking patterns found in training data but not always originating novel solutions to problems.
But even sophisticated benchmarks like ARC-AGI face a fundamental problem: They’re still attempting to reduce intelligence to a score. And while improved benchmarks are essential for measuring empirical progress in a scientific framework, intelligence isn’t a single thing you can measure like height or weight; it’s a complex constellation of abilities that manifest differently in different contexts. Indeed, we don’t even have a complete functional definition of human intelligence, so defining artificial intelligence by any single benchmark score is likely to capture only a small part of the complete picture.

