
    This is the most misunderstood graph in AI

    By Editor Times Featured, February 5, 2026


    That was certainly the case for Claude Opus 4.5, the latest version of Anthropic’s most powerful model, which was released in late November. In December, METR announced that Opus 4.5 appeared to be capable of independently completing a task that would have taken a human about five hours, a vast improvement over what even the exponential trend would have predicted. One Anthropic safety researcher tweeted that he would change the direction of his research in light of those results; another employee at the company simply wrote, “mom come pick me up i’m scared.”

    Credit: METR.ORG

    But the truth is more complicated than those dramatic responses would suggest. For one thing, METR’s estimates of the abilities of specific models come with substantial error bars. As METR explicitly stated on X, Opus 4.5 might be able to regularly complete only tasks that take humans about two hours, or it might succeed on tasks that take humans as long as 20 hours. Given the uncertainties intrinsic to the method, it was impossible to know for sure.

    “There are a bunch of ways that people are reading too much into the graph,” says Sydney Von Arx, a member of METR’s technical staff.

    More fundamentally, the METR plot does not measure AI abilities writ large, nor does it claim to. To build the graph, METR tests the models primarily on coding tasks, evaluating the difficulty of each by measuring or estimating how long it takes humans to complete it, a metric that not everyone accepts. Claude Opus 4.5 might be able to complete certain tasks that take humans five hours, but that doesn’t mean it’s anywhere close to replacing a human worker.

    METR was founded to assess the risks posed by frontier AI systems. Though it is best known for the exponential trend plot, it has also worked with AI companies to evaluate their systems in greater detail and published several other independent research projects, including a widely covered July 2025 study suggesting that AI coding assistants might actually be slowing software engineers down.

    But the exponential plot has made METR’s reputation, and the organization appears to have a complicated relationship with that graph’s often breathless reception. In January, Thomas Kwa, one of the lead authors on the paper that introduced it, wrote a blog post responding to some criticisms and making clear its limitations, and METR is currently working on a more extensive FAQ document. But Kwa isn’t optimistic that these efforts will meaningfully shift the discourse. “I think the hype machine will basically, whatever we do, just strip out all the caveats,” he says.

    Nevertheless, the METR team does think that the plot has something meaningful to say about the trajectory of AI progress. “You should absolutely not tie your life to this graph,” says Von Arx. “But also,” she adds, “I bet that this trend is gonna hold.”


