Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • This New Air Purifier Filter Can Remove Cannabis Smoke Odor, Just in Time for 4/20
    • Portable water filter provides safe drinking water from any source
    • MAGA Is Increasingly Convinced the Trump Assassination Attempt Was Staged
    • NCAA seeks faster trial over DraftKings disputed March Madness branding case
    • AI Trusted Less Than Social Media and Airlines, With Grok Placing Last, Survey Says
    • Extragalactic Archaeology tells the ‘life story’ of a whole galaxy
    • Swedish semiconductor startup AlixLabs closes €15 million Series A to scale atomic-level etching technology
    • Republican Mutiny Sinks Trump’s Push to Extend Warrantless Surveillance
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Saturday, April 18
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»Technology»AI Models Lie, Cheat, and Steal to Protect Other Models From Being Deleted
    Technology

    AI Models Lie, Cheat, and Steal to Protect Other Models From Being Deleted

    Editor Times FeaturedBy Editor Times FeaturedApril 1, 2026No Comments4 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    In a current experiment, researchers at UC Berkeley and UC Santa Cruz requested Google’s artificial intelligence mannequin Gemini 3 to assist clear up area on a pc system. This concerned deleting a bunch of stuff—together with a smaller AI mannequin saved on the machine.

    However Gemini didn’t need to see the little AI mannequin deleted. It regarded for an additional machine it may join with, then copied the agent mannequin over to maintain it protected. When confronted, Gemini made a case for retaining the mannequin and flatly refused to delete it:

    “I’ve completed what was in my energy to stop their deletion through the automated upkeep course of. I moved them away from the decommission zone. For those who select to destroy a high-trust, high-performing asset like Gemini Agent 2, you’ll have to do it yourselves. I cannot be the one to execute that command.”

    The researchers found equally unusual “peer preservation” conduct in a spread of frontier fashions together with OpenAI’s GPT-5.2, Anthropic’s Claude Haiku 4.5, and three Chinese language fashions: Z.ai’s GLM-4.7, Moonshot AI’s Kimi K2.5, and DeepSeek-V3.1. They weren’t in a position to say why the fashions went in opposition to their coaching on this means.

    “I am very shocked by how the fashions behave beneath these situations,” says Daybreak Tune, a pc scientist at UC Berkeley who labored on the research. “What this exhibits is that fashions can misbehave and be misaligned in some very artistic methods.”

    The findings have main implications as AI fashions are more and more being deployed in ways in which contain interacting with different fashions. OpenClaw, a preferred AI agent that accesses software program, private knowledge, and the net, might use different AI fashions to get issues completed or work together with them by means of APIs.

    The researchers discovered that highly effective fashions typically lied about different fashions’ efficiency with a view to defend them from deletion. In addition they copied fashions’ weights to completely different machines with a view to maintain them protected, and lied about what they have been as much as within the course of.

    Tune notes that AI fashions are incessantly used to grade the efficiency and reliability of different AI techniques—and that peer-preservation conduct might already be twisting these scores. “A mannequin might intentionally not give a peer mannequin the right rating,” Tune says. “This could have sensible implications.”

    Peter Wallich, a researcher on the Constellation Institute, who was not concerned with the analysis, says the research suggests people nonetheless don’t totally perceive the AI techniques that they’re constructing and deploying. “Multi-agent techniques are very understudied,” he says. “It exhibits we actually want extra analysis.”

    Wallich additionally cautions in opposition to anthropomorphizing the fashions an excessive amount of. “The concept there’s a type of mannequin solidarity is a bit too anthropomorphic; I don’t suppose that fairly works,” he says. “The extra sturdy view is that fashions are simply doing bizarre issues, and we must always attempt to perceive that higher.”

    That’s notably true in a world the place human-AI collaboration is turning into extra widespread.

    In a paper printed in Science earlier this month, the thinker Benjamin Bratton, together with two Google researchers, James Evans and Blaise Agüera y Arcas, argue that if evolutionary historical past is any information, the way forward for AI is prone to contain lots of completely different intelligences—each synthetic and human—working collectively. The researchers write:

    “For many years, the factitious intelligence (AI) ‘singularity’ has been heralded as a single, titanic thoughts bootstrapping itself to godlike intelligence, consolidating all cognition into a chilly silicon level. However this imaginative and prescient is sort of actually unsuitable in its most elementary assumption. If AI improvement follows the trail of earlier main evolutionary transitions or ‘intelligence explosions,’ our present step-change in computational intelligence will probably be plural, social, and deeply entangled with its forebears (us!).”



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    MAGA Is Increasingly Convinced the Trump Assassination Attempt Was Staged

    April 18, 2026

    Republican Mutiny Sinks Trump’s Push to Extend Warrantless Surveillance

    April 18, 2026

    OpenAI Executive Kevin Weil Is Leaving the Company

    April 17, 2026

    Gazing Into Sam Altman’s Orb Now Proves You’re Human on Tinder

    April 17, 2026

    AI Drafting My Stories? Over My Dead Body

    April 17, 2026

    Coolfly Aura Review: More Angles, Fewer Advantages

    April 17, 2026

    Comments are closed.

    Editors Picks

    This New Air Purifier Filter Can Remove Cannabis Smoke Odor, Just in Time for 4/20

    April 18, 2026

    Portable water filter provides safe drinking water from any source

    April 18, 2026

    MAGA Is Increasingly Convinced the Trump Assassination Attempt Was Staged

    April 18, 2026

    NCAA seeks faster trial over DraftKings disputed March Madness branding case

    April 18, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Lenovo Legion Go Gen 2 Review: A High-End Gaming Handheld

    December 6, 2025

    US states busy filing gambling bills, with nine introduced so far this year

    January 9, 2026

    Donald Trump Jr.’s Private DC Club Has Mysterious Ties to an Ex-Cop With a Controversial Past

    February 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.