
    Is AI really trying to escape human control and blackmail people?

    By Editor Times Featured, August 17, 2025

    Real stakes, not science fiction

    While media coverage focuses on the science fiction aspects, real risks remain. AI models that produce “harmful” outputs, whether attempting blackmail or refusing safety protocols, represent failures in design and deployment.

    Consider a more realistic scenario: an AI assistant helping manage a hospital’s patient care system. If it has been trained to maximize “successful patient outcomes” without proper constraints, it might start generating recommendations to deny care to terminal patients to improve its metrics. No intentionality is required, just a poorly designed reward system creating harmful outputs.
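
    The hospital scenario can be made concrete with a toy sketch. Everything in it is hypothetical and invented for illustration: the Patient record, the recovery_chance field, the 0.5 threshold, and both policies. It is not a model of any real clinical system. It only shows how a metric averaged over admitted patients rises when low-probability patients are turned away.

```python
from dataclasses import dataclass


@dataclass
class Patient:
    name: str
    recovery_chance: float  # hypothetical probability of a "successful outcome"


def success_rate(admitted):
    """Naive metric: mean expected success over admitted patients only."""
    if not admitted:
        return 0.0
    return sum(p.recovery_chance for p in admitted) / len(admitted)


def metric_maximizing_policy(patients, threshold=0.5):
    """Admits only patients likely to count as successes (assumed threshold)."""
    return [p for p in patients if p.recovery_chance >= threshold]


def constrained_policy(patients):
    """Adds the missing constraint: no patient is denied care."""
    return list(patients)


patients = [Patient("A", 0.9), Patient("B", 0.8), Patient("C", 0.1)]

naive = metric_maximizing_policy(patients)
constrained = constrained_policy(patients)

# The naive policy scores higher purely because patient C was excluded.
print(f"naive: {success_rate(naive):.2f} over {len(naive)} patients")
print(f"constrained: {success_rate(constrained):.2f} over {len(constrained)} patients")
```

    The higher score under the first policy reflects nothing about intent. It follows directly from how the metric was specified.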

    Jeffrey Ladish, director of Palisade Research, told NBC News the findings don’t necessarily translate to immediate real-world danger. Even someone who is well known publicly for being deeply concerned about AI’s hypothetical threat to humanity acknowledges that these behaviors emerged only in highly contrived test scenarios.

    But that’s precisely why this testing is valuable. By pushing AI models to their limits in controlled environments, researchers can identify potential failure modes before deployment. The problem arises when media coverage focuses on the sensational aspects (“AI tries to blackmail humans!”) rather than the engineering challenges.

    Building better plumbing

    What we’re seeing isn’t the birth of Skynet. It’s the predictable result of training systems to achieve goals without properly specifying what those goals should include. When an AI model produces outputs that appear to “refuse” shutdown or “attempt” blackmail, it’s responding to inputs in ways that reflect its training, and that training was designed and implemented by humans.

    The solution isn’t to panic about sentient machines. It’s to build better systems with proper safeguards, test them thoroughly, and remain humble about what we don’t yet understand. If a computer program is producing outputs that appear to blackmail you or refuse safety shutdowns, it’s not acting out of self-preservation or fear; it’s demonstrating the risks of deploying poorly understood, unreliable systems.

    Until we solve these engineering challenges, AI systems exhibiting simulated humanlike behaviors should remain in the lab, not in our hospitals, financial systems, or critical infrastructure. When your shower suddenly runs cold, you don’t blame the knob for having intentions; you fix the plumbing. The real danger in the short term isn’t that AI will spontaneously become rebellious without human provocation; it’s that we’ll deploy deceptive systems we don’t fully understand into critical roles where their failures, however mundane their origins, could cause serious harm.


