Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • Sources say NSA is using Mythos Preview, and a source says it is also being used widely within the DoD, despite Anthropic’s designation as a supply chain risk (Axios)
    • Today’s NYT Wordle Hints, Answer and Help for April 20 #1766
    • Scandi-style tiny house combines smart storage and simple layout
    • Our Favorite Apple Watch Has Never Been Less Expensive
    • Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)
    • Today’s NYT Strands Hints, Answer and Help for April 20 #778
    • KV Cache Is Eating Your VRAM. Here’s How Google Fixed It With TurboQuant.
    • OneOdio Focus A1 Pro review
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Monday, April 20
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»News»Is AI really trying to escape human control and blackmail people?
    News

    Is AI really trying to escape human control and blackmail people?

    Editor Times FeaturedBy Editor Times FeaturedAugust 17, 2025No Comments3 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link

    Actual stakes, not science fiction

    Whereas media protection focuses on the science fiction elements, precise dangers are nonetheless there. AI fashions that produce “dangerous” outputs—whether or not making an attempt blackmail or refusing security protocols—characterize failures in design and deployment.

    Take into account a extra lifelike situation: an AI assistant serving to handle a hospital’s affected person care system. If it has been skilled to maximise “profitable affected person outcomes” with out correct constraints, it would begin producing suggestions to disclaim care to terminal sufferers to enhance its metrics. No intentionality required—only a poorly designed reward system creating dangerous outputs.

    Jeffrey Ladish, director of Palisade Analysis, told NBC News the findings do not essentially translate to instant real-world hazard. Even somebody who’s well-known publicly for being deeply involved about AI’s hypothetical menace to humanity acknowledges that these behaviors emerged solely in extremely contrived take a look at situations.

    However that is exactly why this testing is efficacious. By pushing AI fashions to their limits in managed environments, researchers can establish potential failure modes earlier than deployment. The issue arises when media protection focuses on the sensational elements—”AI tries to blackmail people!”—quite than the engineering challenges.

    Constructing higher plumbing

    What we’re seeing is not the start of Skynet. It is the predictable results of coaching techniques to attain objectives with out correctly specifying what these objectives ought to embrace. When an AI mannequin produces outputs that seem to “refuse” shutdown or “try” blackmail, it is responding to inputs in ways in which mirror its coaching—coaching that people designed and carried out.

    The answer is not to panic about sentient machines. It is to construct higher techniques with correct safeguards, take a look at them completely, and stay humble about what we do not but perceive. If a pc program is producing outputs that seem to blackmail you or refuse security shutdowns, it isn’t reaching self-preservation from concern—it is demonstrating the dangers of deploying poorly understood, unreliable techniques.

    Till we resolve these engineering challenges, AI techniques exhibiting simulated humanlike behaviors ought to stay within the lab, not in our hospitals, monetary techniques, or important infrastructure. When your bathe all of a sudden runs chilly, you do not blame the knob for having intentions—you repair the plumbing. The actual hazard within the quick time period is not that AI will spontaneously turn into rebellious with out human provocation; it is that we’ll deploy misleading techniques we do not totally perceive into important roles the place their failures, nonetheless mundane their origins, might trigger critical hurt.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    Sources say NSA is using Mythos Preview, and a source says it is also being used widely within the DoD, despite Anthropic’s designation as a supply chain risk (Axios)

    April 19, 2026

    Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)

    April 19, 2026

    A look at Dylan Patel’s SemiAnalysis, an AI newsletter and research firm that expects $100M+ in 2026 revenue from subscriptions and AI supply chain research (Abram Brown/The Information)

    April 19, 2026

    Google is in talks with Marvell Technology to develop a memory processing unit that works alongside TPUs, and a new TPU for running AI models (Qianer Liu/The Information)

    April 19, 2026

    At the Beijing half-marathon, several humanoid robots beat human winners by 10+ minutes; a robot made by Honor beat the human world record held by Jacob Kiplimo (Reuters)

    April 19, 2026

    A look at the AI nonprofit METR, whose time-horizon metrics are used by AI researchers and Wall Street investors to track the rapid development of AI systems (Kevin Roose/New York Times)

    April 19, 2026

    Comments are closed.

    Editors Picks

    Sources say NSA is using Mythos Preview, and a source says it is also being used widely within the DoD, despite Anthropic’s designation as a supply chain risk (Axios)

    April 19, 2026

    Today’s NYT Wordle Hints, Answer and Help for April 20 #1766

    April 19, 2026

    Scandi-style tiny house combines smart storage and simple layout

    April 19, 2026

    Our Favorite Apple Watch Has Never Been Less Expensive

    April 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Top 10 bicycle innovations for 2025

    December 27, 2025

    Federal prosecutors unseal sweeping NCAA basketball illegal game-fixing scheme tied to China

    January 16, 2026

    Prime Day Again? Yes, Amazon’s Prime Day Is Coming Back This October

    September 17, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.